Examples

vLLM's examples are split into three categories:

  • If you are using vLLM from within Python code, see the Offline Inference section.
  • If you are using vLLM from an HTTP application or client, see the Online Serving section.
  • For examples of vLLM's advanced features (e.g. LMCache or Tensorizer) that are not specific to either of the above use cases, see the Others section.