Default Branch

14b4326b94 · v1: Support KV events from connectors (#19737) · Updated 2025-09-01 09:13:21 +08:00

Branches

9d762c3aa5 · updated · Updated 2025-07-15 10:09:43 +08:00

1386
5

94e7c6dac7 · updated · Updated 2025-07-13 06:38:42 +08:00

1394
5

32e4481626 · [Attention] MLA - cutlass decode with unresticted num_heads · Updated 2025-07-12 01:37:33 +08:00

1418
1

ab153be252 · take 2 · Updated 2025-07-11 22:42:44 +08:00

1447
1

45c02abd72 · updated · Updated 2025-07-11 08:57:50 +08:00

1878
37

37cf1f27f2 · hack 2 · Updated 2025-07-11 06:56:08 +08:00

1485
6

1db4b78a13 · Mock gguf in doc build · Updated 2025-07-11 04:39:35 +08:00

1442
1

9e011d3954 · Update mistaken usage of GREATER to GREATER_EQUAL · Updated 2025-07-10 01:41:55 +08:00

1484
5

a5dd03c1eb · Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (#20412)" · Updated 2025-07-07 05:02:36 +08:00

1555
1

8209f9057d · i honestly can't believe i spelled it that way · Updated 2025-07-05 03:14:03 +08:00

1576
3

7d092fc32c · revert skip-merge-desc · Updated 2025-07-04 04:30:45 +08:00

1597
3

f8768f5244 · Remove executable flag on a few files · Updated 2025-07-02 21:58:53 +08:00

1625
1

8d6f411247 · fix · Updated 2025-07-02 02:24:59 +08:00

1650
2

b801bf30d7 · iterate · Updated 2025-06-29 06:21:17 +08:00

1706
2

e53382cc2e · Sage Moore fixes for full cuda graph support for DeepEP+DeepGEMM LL · Updated 2025-06-24 23:21:52 +08:00

1779
1

fcec8c8827 · add debug cruft · Updated 2025-06-21 04:37:37 +08:00

1866
12

86bfededba · [Do not merge] Cache model info · Updated 2025-06-19 13:31:33 +08:00

1850
1

e17250f0d2 · fix precommit · Updated 2025-06-19 12:17:43 +08:00

1853
1

ca15f0afe6 · ci(Mergify): configuration update · Updated 2025-06-09 15:44:44 +08:00

2038
1

d3b51c9bba · fix build · Updated 2025-06-09 08:38:37 +08:00

2283
10