This is the DeepSpeed counterpart of https://github.com/snowflakedb/ArcticTraining/pull/45, as the new features require changes on both sides.

For PR reviewers, readiness status:

- [x] Code
- [x] Tests
- [ ] Docs - working on it

Features:

- [x] add support for delaying grad addition via the `param.ds_grad_is_ready` flag, used when performing tiled compute in an autograd function (see the `TiledMLP` sketch below)
- [x] add a light SP-only `mpu` version (Jeff Rasley)
- [x] improved debug utilities
- [x] added `all_gather_object` to `dist`
- [x] `UlyssesSPAttentionHF` - a port of UlyssesAttention from Megatron-DeepSpeed, plus modern MHA variations (see the sketch below)
- [x] `UlyssesSPDataLoaderAdapter` - a DataLoader adapter that shards the normal DataLoader batches for consumption by `UlyssesSPAttentionHF` (see the sketch below)
- [x] `SequenceTiledCompute` - a generic autograd function to perform compute after tiling on the sequence dimension
- [x] `TiledMLP` - a specific autograd function to perform a tiled MLP (it's much easier to understand before trying to grok `SequenceTiledCompute`; see the sketch below)
- [x] added a differentiable `_DimZeroAllToAll` (Samyam Rajbhandari; see the sketch below)
- [x] torch-dist-check now allows `torch.distributed.nn`, which is needed since DeepSpeed's `dist` is not yet up to date with `torch.distributed.nn`

---------

Signed-off-by: Stas Bekman <stas.bekman@snowflake.com>
Signed-off-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas.bekman@snowflake.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
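For reviewers unfamiliar with the building blocks, a few minimal sketches follow; names and signatures are illustrative, not the exact code in this PR. First, the differentiable dim-0 all-to-all. Since all-to-all is its own transpose as a communication pattern, the backward pass is simply another all-to-all applied to the incoming gradients:

```python
import torch
import torch.distributed as dist

class _DimZeroAllToAll(torch.autograd.Function):
    """Differentiable all-to-all exchanging equal-sized shards
    along dim 0 across the ranks of `group`."""

    @staticmethod
    def forward(ctx, group, input):
        ctx.group = group
        output = torch.empty_like(input)
        # each rank sends chunk i of dim 0 to rank i and
        # receives its own chunk index from every rank
        dist.all_to_all_single(output, input.contiguous(), group=group)
        return output

    @staticmethod
    def backward(ctx, grad_output):
        # the gradient of an all-to-all is the same all-to-all
        # applied to the gradients flowing back
        return None, _DimZeroAllToAll.apply(ctx.group, grad_output.contiguous())
```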
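`UlyssesSPAttentionHF` follows the DeepSpeed-Ulysses scheme: activations arrive sharded on the sequence dimension, one all-to-all re-shards them on the head dimension so each rank holds the full sequence for a subset of heads, core attention runs locally, and a second all-to-all restores sequence sharding. A shape-level sketch, assuming a `[seq, batch, heads, head_dim]` layout and the `_DimZeroAllToAll` sketch above (the real class handles more layouts plus the modern MHA variations mentioned in the list):

```python
import torch.distributed as dist

def seq_to_head(t, group):
    """[s/p, b, h, d] -> [s, b, h/p, d]: gather the full sequence,
    scatter the heads across the sequence-parallel group."""
    p = dist.get_world_size(group)
    s_shard, b, h, d = t.shape
    # split heads into p groups and move that axis to dim 0
    t = t.reshape(s_shard, b, p, h // p, d).permute(2, 0, 1, 3, 4).contiguous()
    t = _DimZeroAllToAll.apply(group, t)  # dim 0 now indexes seq shards
    return t.reshape(p * s_shard, b, h // p, d)

def head_to_seq(t, group):
    """[s, b, h/p, d] -> [s/p, b, h, d]: the inverse exchange."""
    p = dist.get_world_size(group)
    s, b, h_shard, d = t.shape
    t = t.reshape(p, s // p, b, h_shard, d)  # dim 0 indexes seq shards
    t = _DimZeroAllToAll.apply(group, t)     # dim 0 now indexes head shards
    t = t.permute(1, 2, 0, 3, 4).contiguous()
    return t.reshape(s // p, b, p * h_shard, d)

def ulysses_attn(q, k, v, core_attn, group):
    # q/k/v arrive sharded on the sequence dimension
    q, k, v = (seq_to_head(t, group) for t in (q, k, v))
    out = core_attn(q, k, v)  # full sequence, local subset of heads
    return head_to_seq(out, group)
```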
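`UlyssesSPDataLoaderAdapter` exists because the attention above expects every rank in the SP group to start from its own sequence shard of the same batch. Conceptually it does something like the following per batch (the actual adapter wraps a DataLoader and handles the bookkeeping):

```python
import torch.distributed as dist

def shard_batch_for_sp(batch, group):
    """Slice each per-sample tensor on the sequence dim (dim 1) so
    every rank in the SP group gets its contiguous sequence shard."""
    rank = dist.get_rank(group)
    world = dist.get_world_size(group)
    out = {}
    for name, t in batch.items():  # e.g. input_ids, labels, position_ids
        shard_len = t.shape[1] // world
        out[name] = t[:, rank * shard_len:(rank + 1) * shard_len]
    return out
```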
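Finally, the tiled-compute pieces trade a second forward pass for activation memory: the forward runs shard by shard under `no_grad`, and the backward recomputes each shard with grad enabled. Because each shard's backward contributes only a partial weight gradient, ZeRO must not reduce/partition the gradient until the last shard has run, which is what `param.ds_grad_is_ready` signals. A sketch in the spirit of `TiledMLP`, assuming `x` is laid out `[batch, seq, hidden]` (the PR's implementation differs in signature and detail):

```python
import torch

class TiledMLP(torch.autograd.Function):
    @staticmethod
    def forward(ctx, mlp, x, shards):
        ctx.mlp, ctx.shards = mlp, shards
        ctx.save_for_backward(x)
        with torch.no_grad():
            # only one shard's activations are alive at a time
            return torch.cat([mlp(c) for c in x.chunk(shards, dim=1)], dim=1)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        x_grad = torch.zeros_like(x)
        x_shards = list(x.chunk(ctx.shards, dim=1))
        g_shards = list(grad_output.chunk(ctx.shards, dim=1))
        offset = 0
        for i, (xs, gs) in enumerate(zip(x_shards, g_shards)):
            # hold off ZeRO's grad reduction until the final shard
            # has added its partial contribution to param.grad
            for p in ctx.mlp.parameters():
                p.ds_grad_is_ready = (i == len(x_shards) - 1)
            xs = xs.detach().requires_grad_(True)
            with torch.enable_grad():
                out = ctx.mlp(xs)
            out.backward(gs)  # accumulates weight grads shard by shard
            x_grad[:, offset:offset + xs.shape[1]] = xs.grad
            offset += xs.shape[1]
        return None, x_grad, None
```

Invoked as `y = TiledMLP.apply(mlp, x, num_shards)`; `SequenceTiledCompute` generalizes the same recompute-per-shard idea to an arbitrary compute function.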