DeepSpeed/deepspeed
JoshWoo2003 7cb1b88ec4 Add ZenFlow code for Stage 3 (#7516)
This PR completes the ZenFlow integration for DeepSpeed ZeRO Stage 3. 

Highlights:

- ZenFlowSelectiveAdamW_stage3: Optimizer with importance-aware
selective parameter updates for ZeRO Stage 3.
- ZenFlowZeroOptimizer_Stage3: Full Stage 3 optimizer integration with
partitioned parameters and CPU offload.
- Configurable via ZenFlowConfig, fully integrated with
DeepSpeedZeroConfig for Stage 3.
- Unit tests for Stage 3 cases ensuring correctness and compatibility.

Note: Integration with ZeRO Stages 1 and 2 was introduced in #7391.

---------

Signed-off-by: Yusen Wu <xrn4ub@virginia.edu>
Co-authored-by: Ma, Guokai <guokai.ma@intel.com>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Co-authored-by: Tingfeng Lan <erc8gx@virginia.edu>
2025-10-13 12:19:18 -04:00