vllm/csrc at e74b1736a1f1673e7823a68442cbf574d4493390 - vllm

mirror of https://github.com/vllm-project/vllm.git synced 2025-10-20 23:03:52 +08:00

Files

History

Yanming W e0c6f556e8 [Build] Avoid building too many extensions (#1624 )

2023-11-23 16:31:19 -08:00

2023-10-31 15:19:30 -07:00

Support SqueezeLLM (#1326 )

2023-10-21 23:14:59 -07:00

activation_kernels.cu

2023-11-03 14:12:48 -07:00

cache_kernels.cu

2023-10-31 15:19:30 -07:00

cache.h

2023-11-23 16:31:19 -08:00

cuda_utils_kernels.cu

2023-09-26 22:27:13 -07:00

cuda_utils.h

2023-11-23 16:31:19 -08:00

dispatch_utils.h

2023-09-02 14:59:47 +09:00

layernorm_kernels.cu

2023-11-18 18:18:02 -08:00

ops.h

2023-11-23 16:31:19 -08:00

pos_encoding_kernels.cu

2023-11-03 14:12:48 -07:00

pybind.cpp

2023-11-23 16:31:19 -08:00

reduction_utils.cuh

2023-06-17 03:07:40 -07:00