### What this PR does / why we need it?
The kimi-k2 model is architecturally similar to the deepseek model, so only a few changes are needed to support it. What this PR does:
1. Add a kimi-k2-w8a8 deployment doc (a hedged launch sketch follows this list)
2. Update the quantization doc
3. Update the torchair support list
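As context for the new deployment doc, a minimal launch sketch for serving a kimi-k2 w8a8 checkpoint on vLLM Ascend might look like the following. The model path and parallelism values are illustrative assumptions, not taken from this PR; the deployment doc it adds is the authoritative reference.

```bash
# Hypothetical launch command: serve a w8a8-quantized kimi-k2 checkpoint
# using the Ascend quantization backend. The model path and the
# --tensor-parallel-size value are placeholders chosen for illustration.
vllm serve /path/to/Kimi-K2-Instruct-W8A8 \
    --quantization ascend \
    --tensor-parallel-size 16 \
    --trust-remote-code
```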
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.10.0
- vLLM main: 9edd1db02b
---------
Signed-off-by: wangli <wangli858794774@gmail.com>
# vLLM Ascend Plugin documents

Live doc: https://vllm-ascend.readthedocs.io
## Build the docs

```bash
# Install dependencies.
pip install -r requirements-docs.txt

# Build the docs.
make clean
make html

# Build the docs with translation.
make intl

# Open the docs with your browser.
python -m http.server -d _build/html/
```
Launch your browser and open:
- English version: http://localhost:8000
- Chinese version: http://localhost:8000/zh_CN
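To sanity-check that both language builds are being served, you can probe the two URLs above. This sketch only assumes the server started by the `python -m http.server` step above is still running.

```bash
# Expect an HTTP 200 status line for each language build served above.
curl -sI http://localhost:8000/ | head -n 1
curl -sI http://localhost:8000/zh_CN/ | head -n 1
```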