DeepSpeed/docs/_posts/2023-11-06-deepspeed-fastgen.md
Olatunji Ruwase fd40516923 Update GH org references (#6998)
Signed-off-by: Olatunji Ruwase <olruwase@microsoft.com>
Signed-off-by: Logan Adams <loadams@microsoft.com>
Signed-off-by: Fabien Dupont <fdupont@redhat.com>
Co-authored-by: Fabien Dupont <fabiendupont@fabiendupont.fr>
2025-02-05 00:56:50 +00:00

254 B

---
title: "DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference"
excerpt: ""
link: https://github.com/deepspeedai/DeepSpeed/tree/master/blogs/deepspeed-fastgen
date: 2023-11-06 00:00:00
tags: inference English
---