vllm/moe_ops.h at 5f6d10c14c17122e6d711a4829ee0ca672e07f6f

frozenleaves/vllm

mirror of https://github.com/vllm-project/vllm.git synced 2025-10-20 14:53:52 +08:00

Michael Goin 5f6d10c14c [CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722)

2024-05-22 07:18:41 +00:00

8 lines

224 B

C

#pragma once
#include <torch/extension.h>
void topk_softmax(torch::Tensor& topk_weights, torch::Tensor& topk_indices,
                  torch::Tensor& token_expert_indices,
                  torch::Tensor& gating_output);
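
The header only declares `topk_softmax`; the definition lives elsewhere in the repository. As a rough illustration of what the name suggests (a softmax over each token's gating logits followed by a top-k selection of experts), here is a minimal CPU sketch in plain C++ without a libtorch dependency. It is not vLLM's implementation. The function name `topk_softmax_reference`, the `TopkResult` struct, the row-major `[num_tokens, num_experts]` input layout, and the `[num_tokens, topk]` output layout are all assumptions made for this example. The third output, `token_expert_indices`, is left out because the header does not describe its contents.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical result type for this sketch; not part of vLLM.
struct TopkResult {
  std::vector<float> weights;  // assumed shape [num_tokens, topk], row-major
  std::vector<int> indices;    // assumed shape [num_tokens, topk], row-major
};

// CPU reference sketch: softmax over each row of gating logits, then keep the
// topk largest probabilities together with their expert ids.
// `gating` is assumed row-major [num_tokens, num_experts], topk <= num_experts.
TopkResult topk_softmax_reference(const std::vector<float>& gating,
                                  std::size_t num_tokens,
                                  std::size_t num_experts, std::size_t topk) {
  TopkResult out;
  out.weights.resize(num_tokens * topk);
  out.indices.resize(num_tokens * topk);

  std::vector<float> probs(num_experts);
  std::vector<int> order(num_experts);

  for (std::size_t t = 0; t < num_tokens; ++t) {
    const float* row = gating.data() + t * num_experts;

    // Numerically stable softmax: subtract the row maximum before exp.
    const float row_max = *std::max_element(row, row + num_experts);
    float sum = 0.0f;
    for (std::size_t e = 0; e < num_experts; ++e) {
      probs[e] = std::exp(row[e] - row_max);
      sum += probs[e];
    }
    for (float& p : probs) p /= sum;

    // Order the first topk expert ids by descending probability.
    std::iota(order.begin(), order.end(), 0);
    std::partial_sort(order.begin(),
                      order.begin() + static_cast<std::ptrdiff_t>(topk),
                      order.end(),
                      [&](int a, int b) { return probs[a] > probs[b]; });

    for (std::size_t k = 0; k < topk; ++k) {
      out.indices[t * topk + k] = order[k];
      out.weights[t * topk + k] = probs[order[k]];
    }
  }
  return out;
}
```

The selected weights are the raw softmax probabilities and are not renormalized to sum to one over the chosen experts; whether a caller wants that extra step is a separate design choice. Ties between equal probabilities are broken arbitrarily, since `std::partial_sort` is not stable.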