Gitea: Git for Me

frozenleaves/ vllm

Python 0 0

A high-throughput and memory-efficient inference and serving engine for LLMs

Updated 2025-10-20 03:47:19 +08:00

frozenleaves/ trl

Python 0 0

Train transformer language models with reinforcement learning.

Updated 2025-10-20 01:27:03 +08:00

frozenleaves/ transformers

Python 0 0

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated 2025-10-19 19:36:25 +08:00

frozenleaves/ vllm-ascend

Python 0 0

Community maintained hardware plugin for vLLM on Ascend

ascend inference llm llm-serving llmops mlops model-serving transformer vllm

Updated 2025-10-19 17:06:05 +08:00

frozenleaves/ LLaMA-Factory

Python 0 0

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Updated 2025-10-18 18:02:14 +08:00

frozenleaves/ DeepSpeed

Python 0 0

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

billion-parameters compression data-parallelism deep-learning gpu inference machine-learning mixture-of-experts model-parallelism pipeline-parallelism pytorch trillion-parameters zero

Updated 2025-10-15 09:58:53 +08:00

frozenleaves/ accelerate

Python 0 0

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Updated 2025-10-14 20:11:32 +08:00

frozenleaves/ vllm-dev

Python 0 0

A high-throughput and memory-efficient inference and serving engine for LLMs

Updated 2025-10-11 16:48:30 +08:00