Files
DeepSpeed/docs/_posts/2023-08-24-ulysses-japanese.md
Olatunji Ruwase fd40516923 Update GH org references (#6998)
Signed-off-by: Olatunji Ruwase <olruwase@microsoft.com>
Signed-off-by: Logan Adams <loadams@microsoft.com>
Signed-off-by: Fabien Dupont <fdupont@redhat.com>
Co-authored-by: Fabien Dupont <fabiendupont@fabiendupont.fr>
2025-02-05 00:56:50 +00:00

292 B

title, excerpt, link, date, tags
title excerpt link date tags
DeepSpeed Ulysses: Transformerモデルを非常に長いシーケンスで訓練するための最適化 https://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepspeed-ulysses/japanese/README.md 2023-08-24 00:00:00 training ZeRO Japanese