6 Commits

Author SHA1 Message Date
cbefaf2a37 [AOTI] Move c10/util ostream function implementations to their headers (#123847)
Summary: AOTInductor-generated code for CPU models may reference these c10-implemented data types directly; see _inductor/codegen/cpp_prefix.h. To ensure the AOTI-generated code is ABI backward compatible, we need to change those headers to a header-only implementation. The next PR in this stack will add tests that use those data types without linking against libtorch.
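A minimal sketch of the header-only pattern described above (illustrative names, not the actual diff): an out-of-line definition that used to live in a `.cpp` compiled into libtorch becomes an `inline` definition in the header, so generated code that includes the header needs no libtorch symbol.

```cpp
#include <iostream>

// Illustrative stand-in for a c10-implemented scalar type; the real types
// carry raw bits plus conversion logic.
struct ScalarStub {
  float value;
};

// Header-only: `inline` lets this definition appear in every translation
// unit that includes the header without violating the one-definition rule,
// so AOTI-generated code can stream the type without linking libtorch.
inline std::ostream& operator<<(std::ostream& out, const ScalarStub& s) {
  return out << s.value;
}

int main() {
  std::cout << ScalarStub{1.5f} << std::endl; // prints 1.5
  return 0;
}
```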
Pull Request resolved: https://github.com/pytorch/pytorch/pull/123847
Approved by: https://github.com/jansel
2024-04-19 00:51:24 +00:00
be33d31ae2 add std::ostream& operator<< for BFloat16 in BFloat16.h (#121302)
This PR moves `operator<<` of `BFloat16` to `BFloat16.h`.

Previously, this function lived in `TensorDataContainer.h`. To `std::cout` a `BFloat16` variable when debugging, `TensorDataContainer.h` had to be included, which is inconvenient and counterintuitive.

Other dtypes, such as `Half`, define their `operator<<` in the headers where they themselves are defined (e.g. `Half.h`). Therefore, I think it makes more sense to move `operator<<` of `BFloat16` to `BFloat16.h`.
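For illustration, after this change a debugging print compiles with only `BFloat16.h` included; a minimal sketch assuming a PyTorch include path (`debug_print` is a hypothetical helper, not part of the PR):

```cpp
#include <iostream>

#include <c10/util/BFloat16.h> // now sufficient for operator<< on its own

// Hypothetical helper: before this PR, the same code also needed
// TensorDataContainer.h just for the stream operator.
void debug_print(const c10::BFloat16& v) {
  std::cout << "bf16 value = " << v << std::endl;
}
```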

Pull Request resolved: https://github.com/pytorch/pytorch/pull/121302
Approved by: https://github.com/ezyang
2024-03-13 06:47:34 +00:00
cyy
7663ffb673 [10/N] Fixes clang-tidy warnings in c10/util/*.h (#116326)
Continued work on cleaning up c10/util/*.h.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/116326
Approved by: https://github.com/Skylion007
2023-12-23 04:59:55 +00:00
b64bd4a5dd Add torch.float8_e5m2 and torch.float8_e4m3 data types (#104242)
Proposal of two float8 variants, e5m2 and e4m3, based on https://arxiv.org/pdf/2209.05433.pdf.

Hide all Float8 operator implementations behind an `#if !defined(C10_MOBILE)` guard to keep the Android build size almost unchanged.
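A sketch of the guard pattern (a hypothetical `Float8Stub`, not the real `c10::Float8_e5m2`):

```cpp
#include <cstdint>

// Hypothetical stand-in for a float8 type; e5m2 packs 1 sign, 5 exponent,
// and 2 mantissa bits into one byte (e4m3: 4 exponent, 3 mantissa).
struct Float8Stub {
  uint8_t bits_;
};

#if !defined(C10_MOBILE)
// Operator implementations are compiled only when C10_MOBILE is not defined,
// so Android builds (which define it) keep their binary size almost unchanged.
inline bool operator==(Float8Stub a, Float8Stub b) {
  return a.bits_ == b.bits_;
}
#endif
```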

TODO:
 - Refactor duplicated code
 - Cleanup unbalanced pragma pop in dtype utils
 - Add a native implementation on the CUDA side

Co-authored-by: Nikita Shulga <nshulga@meta.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104242
Approved by: https://github.com/albanD
2023-07-20 16:09:11 +00:00
f2b15772ff Revert "Add torch.float8_e5m2 and torch.float8_e4m3 data types (#104242)"
This reverts commit a9804130e5a9a982d82934fa9702abd08d6903ce.

Reverted https://github.com/pytorch/pytorch/pull/104242 on behalf of https://github.com/PaliC due to breaks lint (run lintrunner and remerge) ([comment](https://github.com/pytorch/pytorch/pull/104242#issuecomment-1644150284))
2023-07-20 15:37:53 +00:00
a9804130e5 Add torch.float8_e5m2 and torch.float8_e4m3 data types (#104242)
Proposal of two float8 variants, e5m2 and e4m3, based on https://arxiv.org/pdf/2209.05433.pdf.

Hide all Float8 operator implementations behind an `#if !defined(C10_MOBILE)` guard to keep the Android build size almost unchanged.

TODO:
 - Refactor duplicated code
 - Cleanup unbalanced pragma pop in dtype utils
 - Add a native implementation on the CUDA side

Co-authored-by: Nikita Shulga <nshulga@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104242
Approved by: https://github.com/albanD
2023-07-20 09:45:45 +00:00