DeepSpeed

mirror of https://github.com/deepspeedai/DeepSpeed.git synced 2025-10-21 16:48:52 +08:00

Author	SHA1	Message	Date
Michael Wyatt	b361c72761	Update DeepSpeed copyright license to Apache 2.0 (#3111 ) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>	2023-03-30 17:14:38 -07:00
Jeff Rasley	91d63e0228	update formatter version and style settings (#3098 )	2023-03-27 07:55:19 -04:00
Conglong Li	7c99def0f0	Data efficiency library update (#2866 ) * data efficiency library update * data efficiency library update * data efficiency update * data efficiency update	2023-02-21 14:43:29 -08:00
Ma, Guokai	98cc35b6a8	Abstract accelerator (step 3) (#2677 ) * Integrate accelerator abstraction interface into deepspeed/ * Fix error message in fp16/fused_optimizer * fix error message in fp16/unfused_optimizer.py * assign get_accelerator().pin_memory() result to input Tensor name * no need to check cuda and whether nvtx supported * move try-except into inner most block * call Event() and Stream() in get_accelerator() for data type * Make Stream and Event as properties of abstract interface so they can be used as data type in deepspeed * Apply op_builder backend api change from #2705 from @jeffra * fix tests where Builder NAME is used * keep original ...Builder.NAME interface instead of ...Builder().NAME interface * fix builder closure for installation * fix randomltd builder * add comments to clarify create_op_builder and get_op_builder * fix compatibility with pip install -e Co-authored-by: Cheng Li <pistasable@gmail.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>	2023-01-26 06:03:12 -08:00
Conglong Li	ef869377e9	DeepSpeed Data Efficiency Library (#2585 ) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>	2022-12-12 16:55:18 -08:00
Aman Sanger	ae198e20f7	DataLoader Length Fix (#1718 )	2022-06-15 22:20:14 -07:00
Hari Prasad	c0b27fb019	Added drop_last to DeepSpeedDataLoader (#1321 ) * Added drop_last to DeepSpeedDataLoader This solves issue #326 * Updated drop_last in engine.py added drop_last as a ds_config as mentioned by @tjruwase * Update engine.py * Update engine.py * updated config.py and constants.py * Update constants.py * added dataloader_ prefix * Update dataloader.py * corrected yapf test errors * Update test_data.py Added dataloader_drop_last unit test * Corrected yapf and formatting issues * updated simple_model.py and test_data.py * Update simple_model.py * pre-commit fix * corrected issues * Update test_data.py * Update test_data.py * Update test_data.py * Update test_data.py * removed batch_size from test_data.py * Update simple_model.py * Update test_data.py * Update test_data.py * Fix unit test issues * Use fp32 to make things work Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>	2021-09-01 12:12:20 -07:00
Cheng Li	c5700bc09f	Relax dataset type check in deepspeed io (#1012 ) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>	2021-04-30 10:03:07 -07:00
Shaden Smith	65c2f974d8	Pipeline parallel training engine. (#392 ) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>	2020-09-09 23:14:55 -07:00
Jeff Rasley	e5bbc2e559	Sparse attn + ops/runtime refactor + v0.3.0 (#343 ) * Sparse attn + ops/runtime refactor + v0.3.0 Co-authored-by: Arash Ashari <arashari@microsoft.com> Co-authored-by: Arash Ashari <arashari@microsoft.com>	2020-09-01 18:06:15 -07:00

10 Commits