18 Commits

Author SHA1 Message Date
fd40516923 Update GH org references (#6998)
Signed-off-by: Olatunji Ruwase <olruwase@microsoft.com>
Signed-off-by: Logan Adams <loadams@microsoft.com>
Signed-off-by: Fabien Dupont <fdupont@redhat.com>
Co-authored-by: Fabien Dupont <fabiendupont@fabiendupont.fr>
2025-02-05 00:56:50 +00:00
baa95c6256 remove benchmarks (now in DSE) and add links (#3157)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-04-07 08:51:49 -07:00
b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-30 17:14:38 -07:00
a6317eb509 ♻️ replace deprecated functions for communication (#2995)
* 💩 drop dead code

* ♻️ replace has_all_gather_base with has_all_gather_into_tensor

* ♻️ remove deprecated _all_gather_base

* ♻️ remove deprecated _reduce_scatter_base

* 🎨 reformat files

* 🔧 fix _six

* Trigger CI

* Trigger CI

* Trigger CI

* 🎨 formatting

* incorporate suggestion

* incorporate suggestion

---------

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-29 22:27:15 -04:00
9726bd4650 Fix comms benchmark import issues and support MPI/slurm launching (#2932)
* Fix benchmark import issues and support MPI launching with pure torch.dist

* Formatting

* Update comms benchmark README

* Formatting

* Added better error handling and support MPI torch.dist backend

* Update formatting versions

* Formatting again

* Trigger CI

---------

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-27 17:15:18 -04:00
91d63e0228 update formatter version and style settings (#3098)
2023-03-27 07:55:19 -04:00
0acf7e9c48 [RFC] add device abstraction to allow other device than CUDA be used (#2221)
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-07 09:40:17 -08:00
da84e60d98 add missing license info to top of all source code (#2889)
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Conglong Li <conglong.li@gmail.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
2023-02-27 11:20:41 -08:00
e4b3b610ba Refactor DS inference API. No longer need replace_method. (#2831)
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
2023-02-15 23:17:02 +00:00
2076bf2311 get mask token from tokenizer (#2592)
2022-12-09 17:10:03 -08:00
5eafb8c78d Make error regex more generic in collect_results.py (#2415)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-10-14 12:47:37 -07:00
eed40324db Capture error message during sweep tests (#2351)
* Collect error messages in results.csv

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
2022-09-27 15:54:40 -07:00
993264388d Inference profiling updates/fixes (#2348) (#2349)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
2022-09-23 14:38:09 -07:00
1592381018 Add more options to inference benchmark (#2325)
2022-09-16 09:58:28 -07:00
9595dff6d7 add inference eval scripts (#2303)
2022-09-12 13:25:00 -07:00
316c4a43e0 Add flake8 to pre-commit checks (#2051)
2022-07-25 16:48:08 -07:00
8413b7f83d DS Benchmarks QoL Improvements (#2120)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-07-22 15:47:56 -07:00
9b70ce56e7 Comms Benchmarks (#2040)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-06-29 10:49:20 -07:00