vllm/test_cpu_offload.py at 43721bc67f9f30dcd1c893af370727b835d3d6dd - vllm - Gitea: Git for Me

frozenleaves/vllm

mirror of https://github.com/vllm-project/vllm.git synced 2025-10-20 23:03:52 +08:00

Files

Tahsin Tunan 43721bc67f [CI] Replace large models with tiny alternatives in tests (#24057 )

Signed-off-by: Tahsin Tunan <tahsintunan@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-10-16 15:51:27 +01:00

11 lines

285 B

Python

Raw Blame History

 # SPDX-License-Identifier: Apache-2.0
 # SPDX-FileCopyrightText: Copyright contributors to the vLLM project
 from ..utils import compare_two_settings
 def test_cpu_offload():
     compare_two_settings(
         "hmellor/tiny-random-LlamaForCausalLM", [], ["--cpu-offload-gb", "1"]
     )