Mirror of https://github.com/vllm-project/vllm.git (synced 2025-10-20 14:53:52 +08:00)
[Bugfix][Qwen] fixes the weights dtype in qwen3_next: it is actually a bfloat16 (#27030)
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
@@ -325,7 +325,6 @@ class Qwen3NextGatedDeltaNet(nn.Module, MambaBase):
         self.A_log = nn.Parameter(
             torch.empty(
                 divide(self.num_v_heads, self.tp_size),
-                dtype=torch.float32,
             )
         )
 
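For context, dropping the explicit `dtype=torch.float32` means `A_log` is allocated in whatever default dtype is active when the module is built, so it lines up with the bfloat16 checkpoint weights instead of being pinned to float32. Below is a minimal standalone sketch of that behavior, not vLLM's actual loader code; it assumes model construction happens under a bfloat16 default dtype, and the head/TP numbers are made up for illustration.

```python
import torch
import torch.nn as nn

# Sketch only: emulate building the module under a bfloat16 default
# dtype (an assumption about how the model is constructed).
torch.set_default_dtype(torch.bfloat16)

num_v_heads, tp_size = 32, 4  # hypothetical values for illustration

# After the fix: no explicit dtype, so the parameter follows the
# default dtype and matches bfloat16 checkpoint weights.
a_log_new = nn.Parameter(torch.empty(num_v_heads // tp_size))
print(a_log_new.dtype)  # torch.bfloat16

# Before the fix: the parameter was pinned to float32, mismatching
# the bfloat16 weights stored in the checkpoint.
a_log_old = nn.Parameter(
    torch.empty(num_v_heads // tp_size, dtype=torch.float32)
)
print(a_log_old.dtype)  # torch.float32
```

Any precision-sensitive math on `A_log` can presumably still be done by upcasting at use time; the stored parameter just needs to match the checkpoint dtype so weight loading does not hit a dtype mismatch.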