DeepSpeed/blogs/deepspeed-ucp/media/image2.png
Sam Ade Jacobs 121efdbd5c DeepSpeed Universal Checkpointing: Blog and Tutorial (#5711)
Train {GPT, LLaMA, Phi}-like models (or any model) at ultra-low cost with
DeepSpeed Universal Checkpointing (UCP). UCP abstracts away the
complexities of saving and loading model states. See the arXiv paper, blog,
and tutorial in this PR for details.

---------

Co-authored-by: Masahiro Tanaka <mtanaka@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <loadams@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
2024-07-01 14:37:24 -07:00

175 KiB
984x519px
