mirror of
https://github.com/deepspeedai/DeepSpeed.git
synced 2025-10-20 15:33:51 +08:00
Signed-off-by: Olatunji Ruwase <olruwase@microsoft.com> Signed-off-by: Logan Adams <loadams@microsoft.com> Signed-off-by: Fabien Dupont <fdupont@redhat.com> Co-authored-by: Fabien Dupont <fabiendupont@fabiendupont.fr>
254 B
254 B
title, excerpt, link, date, tags
title | excerpt | link | date | tags |
---|---|---|---|---|
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference | https://github.com/deepspeedai/DeepSpeed/tree/master/blogs/deepspeed-fastgen | 2023-11-06 00:00:00 | inference English |