vllm/cmake at c20ef40fd0e8663e82911f53d00a64f53beb98aa - vllm

mirror of https://github.com/vllm-project/vllm.git synced 2025-10-20 23:03:52 +08:00

Files

yexin(叶鑫) b22980a1dc [Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457 )

Signed-off-by: cynthieye <yexin93@qq.com>
Co-authored-by: MagnetoWang <magnetowang@outlook.com>

2025-04-25 14:52:28 +08:00

2025-04-25 14:52:28 +08:00

cpu_extension.cmake

2025-04-05 11:00:12 +00:00

hipify.py

2025-02-03 11:16:59 -08:00

utils.cmake

2025-02-12 19:51:51 -08:00