A high-throughput and memory-efficient inference and serving engine for LLMs
Updated 2025-10-20 03:47:19 +08:00
Train transformer language models with reinforcement learning.
Updated 2025-10-20 01:27:03 +08:00
Community-maintained hardware plugin for vLLM on Ascend
Updated 2025-10-19 17:06:05 +08:00
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Updated 2025-10-17 22:24:46 +08:00
A high-throughput and memory-efficient inference and serving engine for LLMs
Updated 2025-10-11 16:48:30 +08:00
NPU fused RMSNorm for transformers kernel
Updated 2025-09-24 16:23:46 +08:00
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Updated 2025-09-22 11:27:37 +08:00