[V1] TPU - Fix fused MOE (#15834)

Signed-off-by: Alexander Matveev <amatveev@redhat.com>
This commit is contained in:
Alexander Matveev
2025-04-01 01:58:07 -04:00
committed by GitHub
parent 63d8eabed0
commit 7e4e709b43

View File

@ -309,7 +309,7 @@ class UnquantizedFusedMoEMethod(FusedMoEMethodBase, CustomOp):
expert_map=expert_map,
renormalize=renormalize)
forward_native = forward_cuda
forward_native = forward_tpu if current_platform.is_tpu else forward_cuda
def determine_expert_map(