mirror of
https://github.com/vllm-project/vllm-ascend.git
synced 2025-10-20 21:53:54 +08:00
### What this PR does / why we need it?
Add custom AscendC kernel support to vllm-ascend. This PR has three main parts:
- An AscendC implementation of rotary_embedding, with its unit test.
- A CMakeLists.txt that compiles the AscendC kernel and the torch library binding for it.
- Build and packaging changes that bundle all compiled .so files into the vllm_ascend package.

For now, this rotary embedding kernel does not support the `neoxStyle=False` scenario, so it is not used in the actual modeling parts. We will soon add this implementation to vllm-ascend and integrate it into the modeling parts.

### Does this PR introduce _any_ user-facing change?
No change at all

---------

Signed-off-by: ganyi <pleaplusone.gy@gmail.com>
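
For reference, the difference between the two layouts behind the `neoxStyle` flag can be sketched in plain Python. This is only an illustrative reference under the usual rotary embedding definition, not the AscendC kernel from this PR: the function name `apply_rope`, the `neox_style` parameter, and the default `base=10000.0` are assumptions made for the example. The NeoX layout pairs element `i` with element `i + d/2`, while the non-NeoX (interleaved) layout pairs elements `2i` and `2i + 1`.

```python
import math


def apply_rope(x, position, neox_style=True, base=10000.0):
    """Rotate one head vector `x` (a list of floats) for a token position.

    Hypothetical reference helper, not part of vllm-ascend.
    """
    rot_dim = len(x)
    if rot_dim % 2 != 0:
        raise ValueError("rotary dimension must be even")
    half = rot_dim // 2

    # One rotation angle per pair of elements.
    angles = [position * base ** (-2.0 * i / rot_dim) for i in range(half)]
    cos = [math.cos(a) for a in angles]
    sin = [math.sin(a) for a in angles]

    if neox_style:
        # NeoX layout: first half paired with second half.
        x1, x2 = x[:half], x[half:]
    else:
        # Interleaved layout: even indices paired with odd indices.
        x1, x2 = x[0::2], x[1::2]

    o1 = [a * c - b * s for a, b, c, s in zip(x1, x2, cos, sin)]
    o2 = [b * c + a * s for a, b, c, s in zip(x1, x2, cos, sin)]

    if neox_style:
        return o1 + o2

    # Write the rotated pairs back into their interleaved slots.
    out = [0.0] * rot_dim
    out[0::2] = o1
    out[1::2] = o2
    return out
```

With `x = [1.0, 0.0, 0.0, 0.0]` at position 1, the NeoX layout moves the rotated component to index 2, while the interleaved layout moves it to index 1, which is why a kernel written for one layout cannot be reused unchanged for the other.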