mirror of
https://github.com/vllm-project/vllm-ascend.git
synced 2025-10-20 21:53:54 +08:00
This pr mainly focuses on: 1. adapting to new quantization_config generated from msmodelslim. 2. removing unnecessary imports. 3. disable warning when loading BasevLLMParameter --------- Signed-off-by: angazenn <zengyanjia@huawei.com> Co-authored-by: angazenn <zengyanjia@huawei.com>