Files
DeepSpeed/docs/_posts/2023-08-24-ulysses.md
Olatunji Ruwase fd40516923 Update GH org references (#6998)
Signed-off-by: Olatunji Ruwase <olruwase@microsoft.com>
Signed-off-by: Logan Adams <loadams@microsoft.com>
Signed-off-by: Fabien Dupont <fdupont@redhat.com>
Co-authored-by: Fabien Dupont <fabiendupont@fabiendupont.fr>
2025-02-05 00:56:50 +00:00

282 B

title, excerpt, link, date, tags
title excerpt link date tags
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models https://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepspeed-ulysses/README.md 2023-08-24 00:00:00 training ZeRO English