- Amazon Web Services
- https://liangfu.org
Pinned
- apache/tvm (Public): Open deep learning compiler stack for cpu, gpu and specialized accelerators
- vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs
75 contributions in the last year
Activity overview
Contributed to vllm-project/vllm, aws-neuron/transformers-neuronx, awslabs/slapo, and 8 other repositories
Contribution activity
April 2025
Created 2 commits in 1 repository
Created a pull request in vllm-project/vllm that received 7 comments
[Neuron][kernel] Fuse kv cache into a single tensor
Fusing the KV cache into a single tensor helps eliminate unnecessary slice operators on the K/V cache tensors.
%p11.224 = bf16[2,17487,4,32,64]{4,3,2,1,0}…
+46 −56 lines changed • 7 comments
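The fused layout mentioned in the pull request description can be illustrated with a minimal NumPy sketch. This is not the code from the pull request: the axis meanings (a leading axis of size 2 selecting K or V, followed by blocks, slots, heads, and head dimension), the small sizes, and the helper names `make_fused_kv_cache`, `write_kv`, and `read_kv` are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical, small dimensions for illustration only; the meaning of each
# axis is an assumption and is not taken from the pull request.
NUM_BLOCKS, BLOCK_SIZE, NUM_KV_HEADS, HEAD_DIM = 8, 4, 2, 16


def make_fused_kv_cache(dtype=np.float32):
    """Allocate one tensor holding both K and V (axis 0: 0 = K, 1 = V)."""
    shape = (2, NUM_BLOCKS, BLOCK_SIZE, NUM_KV_HEADS, HEAD_DIM)
    return np.zeros(shape, dtype=dtype)


def write_kv(kv_cache, block_idx, slot_idx, key, value):
    """Store one token's key and value with a single indexed update."""
    # np.stack yields shape (2, NUM_KV_HEADS, HEAD_DIM), matching the target.
    kv_cache[:, block_idx, slot_idx] = np.stack([key, value])


def read_kv(kv_cache, block_idx):
    """Return K and V for one block by indexing the fused tensor directly."""
    return kv_cache[0, block_idx], kv_cache[1, block_idx]


kv_cache = make_fused_kv_cache()
key = np.ones((NUM_KV_HEADS, HEAD_DIM), dtype=np.float32)
value = np.full((NUM_KV_HEADS, HEAD_DIM), 2.0, dtype=np.float32)

write_kv(kv_cache, block_idx=3, slot_idx=1, key=key, value=value)
k_block, v_block = read_kv(kv_cache, block_idx=3)
```

In NumPy, basic indexing on the leading axis returns views of the same buffer, so K and V are read without copying. Whether a compiler backend removes the corresponding slice operators depends on that backend; the sketch only shows the data layout.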
Reviewed 3 pull requests in 1 repository
vllm-project/vllm
3 pull requests
- Update BASE_IMAGE to 2.22 release of Neuron (Apr 8)
- fix neuron config override (Apr 4)
- [Neuron][kernel] Fuse kv cache into a single tensor (Apr 3)