Default Branch

8a81d776ce · Fix typo in ValueError message: use kv_role instead of kv_disagg_role (#27166) · Updated 2025-10-20 03:47:19 +08:00

Branches

6f47333c4e · [Misc] Allow override VLLM_DISTRIBUTED_INIT_METHOD_OVERRIDE · Updated 2025-10-19 09:47:13 +08:00

9
1

14299bfcaf · Derive auto max model len state from original value · Updated 2025-10-19 02:49:36 +08:00

35
1

dcf059ab84 · deepep HT dispatch no abstraction · Updated 2025-10-18 09:42:27 +08:00

237
6

99c02cce50 · update using local · Updated 2025-10-18 04:37:15 +08:00

33
9

a2599dca0f · fix missing removal · Updated 2025-10-18 02:35:42 +08:00

55
2

3565e693c6 · Merge branch 'main' into wentao-fix-mypy-v1 · Updated 2025-10-18 00:53:05 +08:00

37
3

69c9a01538 · disable flashinfer warmup · Updated 2025-10-17 00:49:29 +08:00

108
4

01e389cd94 · fix · Updated 2025-10-17 00:48:51 +08:00

108
14

6f30ab9ab3 · [Performance] Run shared_experts on a separate cuda stream (in parallel with the FusedMoE) · Updated 2025-10-17 00:10:35 +08:00

86
1

c72d44ba4a · Add test for batched triton fallback behavior · Updated 2025-10-16 11:46:02 +08:00

100
3

2797adb329 · cleanup · Updated 2025-10-16 02:07:49 +08:00

115
2

c3a722fcb2 · [CI Failure] Fix tests with missing TinyLlama-1.1B-Chat-v1.0-FP8-e2e (#26816) · Updated 2025-10-15 02:38:59 +08:00

167
0
Included

38cf8237d4 · Fix pytest verbosity for prime-rl ci · Updated 2025-10-14 09:06:33 +08:00

205
1

22bf5c5077 · fix · Updated 2025-10-12 02:38:33 +08:00

251
4

37d0a00b16 · [CI] Skip lm-format-enforcer test cases · Updated 2025-10-11 03:14:25 +08:00    frozenleaves

269
1

b8b302cde4 · Update CUDA architecture list in build pipeline for 12.9.1 wheels (#26592) · Updated 2025-10-11 02:15:45 +08:00    frozenleaves

668
34

01efc7ef78 · [ci] fix wheel names for arm wheels (#24898) · Updated 2025-10-08 04:40:13 +08:00    frozenleaves

1206
8

944913c0fa · docs: clarify remaining v0 references · Updated 2025-10-07 01:59:13 +08:00    frozenleaves

421
1

920db41128 · [Quantization/NVFP4] Speed up TRTLLM NVFP4 MOE weight loading and fix K/V scale loading for MLA Attn (#25968) · Updated 2025-10-04 04:35:58 +08:00    frozenleaves

937
454

6f62c94d7e · updated · Updated 2025-10-04 01:47:16 +08:00    frozenleaves

491
2