pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 05:34:18 +08:00

Author	SHA1	Message	Date
Scott Wolchok	e816e17655	[PyTorch] Add native fast path for transformer encoder inference (#76333 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/76333 The current PyTorch multi-head attention and transformer implementations are slow. This should speed them up for inference. ghstack-source-id: 154737857 (Note: this ignores all push blocking failures!) Test Plan: CI Reviewed By: cpuhrsch Differential Revision: D35239925 fbshipit-source-id: 5a7eb8ff79bc6afb4b7d45075ddb2a24a6e2df28	2022-04-26 12:58:03 -04:00
Jon Janzen	2387efd356	Revert "[PyTorch] Add native fast path for transformer encoder inference" This reverts commit b369b89f235f54bc9de85d768fb62ac4579681dc. This has internal changes and should not have been landed via mergebot. Ref: https://github.com/pytorch/pytorch/pull/75809#issuecomment-1108717166	2022-04-25 11:40:02 -04:00
Scott Wolchok	b369b89f23	[PyTorch] Add native fast path for transformer encoder inference Pull Request resolved: https://github.com/pytorch/pytorch/pull/75809 The current PyTorch multi-head attention and transformer implementations are slow. This should speed them up for inference. Differential Revision: [D35239925](https://our.internmc.facebook.com/intern/diff/D35239925/) NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D35239925/)! Approved by: https://github.com/ezyang	2022-04-25 06:11:36 +00:00
Scott Wolchok	97c993ca7a	[PyTorch] Add NestedTensor support functions for transformers Pull Request resolved: https://github.com/pytorch/pytorch/pull/75491 Here are the NestedTensor kernels we'll need for the improved transformer implementation. Differential Revision: [D35409275](https://our.internmc.facebook.com/intern/diff/D35409275/) NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D35409275/)! Approved by: https://github.com/cpuhrsch	2022-04-14 16:30:23 +00:00
Shintaro Iwasaki	4fa70a2483	[pytorch] fix hipify_python (#70619 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/70619 This Diff improves `hipify_python`, which is needed for AMD GPUs. Change 1: ``` if (c == "," or ind == len(kernel_string) - 1) and closure == 0: ``` This is needed to deal with the following case (ex: https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/test/cuda_vectorized_test.cu#L111) ``` kernel<<<val, func()>>>(...) // In this case, kernel_string is "val, func()" // so closure gets 0 when ind == len(kernel_string) - 1. ``` Change 2: ``` mask_comments() ``` This is needed to deal with a case where "<<<" is included in a comment or a string literal (ex: https://github.com/pytorch/pytorch/blob/master/torch/csrc/deploy/interpreter/builtin_registry.cpp#L71) ``` abc = "<<<XYZ>>>" // Though this <<<XYZ>>> is irrelevant to CUDA kernels, // the current script attempts to hipify this and fails. ``` Test Plan: This patch fixes errors I encountered by running ``` python3 tools/amd_build/build_amd.py ``` I confirmed, with Linux `diff`, that this patch does not change HIP code that was generated successfully with the original script. Reviewed By: hyuen Differential Revision: D33407743 fbshipit-source-id: bec822e040a154be4cda1c294536792ca8d596ae	2022-01-06 13:27:43 -08:00
Ivan Yashchuk	2267a984eb	[ROCm] Add sparse mappings for CUDA->HIP translation (#67323 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67323 Applied patch proposed by Jeff https://github.com/pytorch/pytorch/pull/63948#issuecomment-952166982. In PyTorch, we map cuBLAS->rocBLAS and cuSPARSE->hipSPARSE. Note the prefix, roc versus hip. The 'hip' APIs offer a more direct CUDA-friendly mapping, but calling rocBLAS directly has better performance. Unfortunately, the `roc` types and `hip` types differ, i.e., `rocblas_float_complex` versus `hipComplex`. In the case of SPARSE, we must use the hip types for complex instead of the roc types, but the pytorch mappings assume roc. Therefore, we create a new SPARSE mapping that has a higher priority. Its mappings will trigger first, and only when a miss occurs will the lower-priority pytorch mapping take place. When a file contains "sparse" in the filename, a mapping marked with API_SPARSE is preferred over other choices. cc jeffdaily sunway513 jithunnair-amd ROCmSupport KyleCZH Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D31969246 Pulled By: cpuhrsch fbshipit-source-id: 4ce1b35eaf9ef0d146a0955ce70c354ddd8f4669	2021-10-27 16:28:37 -07:00
Arpan Abhishek	6401658b08	fix type error in hipify_python.py (#66164 ) Summary: - [x] Fixed the Pyre type checking errors in `torch/utils/hipify/hipify_python.py`: ``` torch/utils/hipify/hipify_python.py:196:8 Incompatible variable type [9]: clean_ctx is declared to have type `GeneratedFileCleaner` but is used as type `None`. torch/utils/hipify/hipify_python.py:944:4 Incompatible variable type [9]: clean_ctx is declared to have type `GeneratedFileCleaner` but is used as type `None`. ``` Fixing the issue: https://github.com/MLH-Fellowship/pyre-check/issues/78 Pull Request resolved: https://github.com/pytorch/pytorch/pull/66164 Reviewed By: onionymous Differential Revision: D31411443 Pulled By: 0xedward fbshipit-source-id: c69f8fb839ad1d5ba5e4a223e1322ae7207e1574	2021-10-13 10:33:49 -07:00
Peter Bell	560cd88195	Kill THCUNN (#63429 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63429 Test Plan: Imported from OSS Reviewed By: mruberry Differential Revision: D30441308 Pulled By: ngimel fbshipit-source-id: 3ae342a2f8d5c7f8827b637c4055c5d1b0a1be26	2021-08-23 12:07:16 -07:00
Sam Estep	2e26976ad3	Disallow versionless Python shebangs (#58275 ) Summary: Some machines don't have a versionless `python` on their PATH, which breaks these existing shebangs. I'm assuming that all the existing versionless `python` shebangs are meant to be `python3` and not `python2`; please let me know if my assumption was incorrect for any of these. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58275 Test Plan: CI. Reviewed By: zhouzhuojie Differential Revision: D28428143 Pulled By: samestep fbshipit-source-id: 6562be3d12924db72a92a0207b060ef740f61ebf	2021-05-14 08:26:02 -07:00
Nikita Shulga	75f7b22025	Fix hipify_python (#52709 ) Summary: Two changes: - Print a warning rather than fail if creating hipified file fails with permission denied error - Do not attempt to create /usr/include/libpng/png_hip.h in the first place Pull Request resolved: https://github.com/pytorch/pytorch/pull/52709 Reviewed By: walterddr Differential Revision: D26625033 Pulled By: malfet fbshipit-source-id: ff82dc24aee12eac2daaa6e5bc938811b49ebbc6	2021-02-23 19:19:13 -08:00
rraminen	8c185e62f9	torchvision hipify revamp fix (#51453 ) Summary: The torchvision build error from hipify revamp, "KeyError: '/usr/include/libpng16/png.h'" is fixed in this PR Description: Traceback (most recent call last): File "setup.py", line 471, in <module> ext_modules=get_extensions(), File "setup.py", line 329, in get_extensions extra_compile_args=extra_compile_args File "/opt/conda/lib/python3.6/site-packages/torch/utils/cpp_extension.py", line 892, in CUDAExtension is_pytorch_extension=True, File "/opt/conda/lib/python3.6/site-packages/torch/utils/hipify/hipify_python.py", line 978, in hipify clean_ctx=clean_ctx) File "/opt/conda/lib/python3.6/site-packages/torch/utils/hipify/hipify_python.py", line 212, in preprocess hip_clang_launch, is_pytorch_extension, clean_ctx, show_progress) File "/opt/conda/lib/python3.6/site-packages/torch/utils/hipify/hipify_python.py", line 175, in preprocess_file_and_save_result hip_clang_launch, is_pytorch_extension, clean_ctx, show_progress) File "/opt/conda/lib/python3.6/site-packages/torch/utils/hipify/hipify_python.py", line 792, in preprocessor output_source = RE_ANGLE_HEADER.sub(mk_repl('#include <{0}>', False), output_source) File "/opt/conda/lib/python3.6/site-packages/torch/utils/hipify/hipify_python.py", line 785, in repl value = HIPIFY_FINAL_RESULT[header_filepath]["hipified_path"] KeyError: '/usr/include/libpng16/png.h' Pull Request resolved: https://github.com/pytorch/pytorch/pull/51453 Reviewed By: agolynski Differential Revision: D26459979 Pulled By: fmassa fbshipit-source-id: f653f55fd34c71314e6c6682217f84b2d1e49335	2021-02-17 03:12:20 -08:00
Tyler Reddy	4a2d3d1cfd	MAINT: char class regex simplify (#50294 ) Summary: * remove some cases of single characters in character classes--these can incur the overhead of a character class with none of the benefits of a multi-character character class * for more details, see Chapter 6 of: Friedl, Jeffrey. Mastering Regular Expressions. 3rd ed., O’Reilly Media, 2009. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50294 Reviewed By: zhangguanheng66 Differential Revision: D25870912 Pulled By: malfet fbshipit-source-id: 9be5be9ed11fd49876213f0be8121b24739f1c13	2021-01-13 08:48:17 -08:00
Guilherme Leobas	211f35631f	Add type annotations to _tensorboard_vis.py and hipify_python.py (#49834 ) Summary: closes gh-49833 Pull Request resolved: https://github.com/pytorch/pytorch/pull/49834 Reviewed By: mruberry Differential Revision: D25725341 Pulled By: malfet fbshipit-source-id: 7454c7afe07a3ff829826afe02aba05b7f649d9b	2021-01-04 09:29:51 -08:00
Jithun Nair	5f62308739	Hipify revamp [REDUX] (#48715 ) Summary: [Refiled version of earlier PR https://github.com/pytorch/pytorch/issues/45451] This PR revamps the hipify module in PyTorch to overcome a long list of shortcomings in the original implementation. However, these improvements are applied only when using hipify to build PyTorch extensions, not for PyTorch or Caffe2 itself. Correspondingly, changes are made to cpp_extension.py to match these improvements. The list of improvements to hipify is as follows: 1. Hipify files in the same directory as the original file, unless there's a "cuda" subdirectory in the original file path, in which case the hipified file will be in the corresponding file path with "hip" subdirectory instead of "cuda". 2. Never hipify the file in-place if changes are introduced due to hipification i.e. always ensure the hipified file either resides in a different folder or has a different filename compared to the original file. 3. Prevent re-hipification of already hipified files. This avoids creation of unnecessary "hip/hip" etc. subdirectories and additional files which have no actual use. 4. Do not write out hipified versions of files if they are identical to the original file. This results in a cleaner output directory, with minimal number of hipified files created. 5. Update header rewrite logic so that it accounts for the previous improvement. 6. Update header rewrite logic so it respects the rules for finding header files depending on whether "" or <> is used. 7. Return a dictionary of mappings of original file paths to hipified file paths from hipify function. 8. Introduce a version for hipify module to allow extensions to contain back-compatible code that targets a specific point in PyTorch where the hipify functionality changed. 9. Update cuda_to_hip_mappings.py to account for the ROCm component subdirectories inside /opt/rocm/include. This also results in cleanup of the Caffe2_HIP_INCLUDE path to remove unnecessary additions to the include path. The list of changes to cpp_extension.py is as follows: 1. Call hipify when building a CUDAExtension for ROCm. 2. Prune the list of source files to CUDAExtension to include only the hipified versions of any source files in the list (if both original and hipified versions of the source file are in the list) 3. Add subdirectories of /opt/rocm/include to the include path for extensions, so that ROCm headers for subcomponent libraries are found automatically cc jeffdaily sunway513 ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/48715 Reviewed By: bdhirsh Differential Revision: D25272824 Pulled By: ezyang fbshipit-source-id: 8bba68b27e41ca742781e1c4d7b07c6f985f040e	2020-12-02 18:03:23 -08:00
Nikita Shulga	8af9f2cc23	Revert D24924736: [pytorch][PR] Hipify revamp Test Plan: revert-hammer Differential Revision: D24924736 (`10b490a3e0`) Original commit changeset: 4af42b8ff4f2 fbshipit-source-id: 7f8f90d55d8a69a2890ec73622fcea559189e381	2020-11-18 11:48:30 -08:00
Jithun Nair	10b490a3e0	Hipify revamp (#45451 ) Summary: This PR revamps the hipify module in PyTorch to overcome a long list of shortcomings in the original implementation. However, these improvements are applied only when using hipify to build PyTorch extensions, not for PyTorch or Caffe2 itself. Correspondingly, changes are made to `cpp_extension.py` to match these improvements. The list of improvements to hipify is as follows: 1. Hipify files in the same directory as the original file, unless there's a "cuda" subdirectory in the original file path, in which case the hipified file will be in the corresponding file path with "hip" subdirectory instead of "cuda". 2. Never hipify the file in-place if changes are introduced due to hipification i.e. always ensure the hipified file either resides in a different folder or has a different filename compared to the original file. 3. Prevent re-hipification of already hipified files. This avoids creation of unnecessary "hip/hip" etc. subdirectories and additional files which have no actual use. 4. Do not write out hipified versions of files if they are identical to the original file. This results in a cleaner output directory, with minimal number of hipified files created. 5. Update header rewrite logic so that it accounts for the previous improvement. 6. Update header rewrite logic so it respects the rules for finding header files depending on whether `""` or `<>` is used. 7. Return a dictionary of mappings of original file paths to hipified file paths from `hipify` function. 8. Introduce a version for hipify module to allow extensions to contain back-compatible code that targets a specific point in PyTorch where the hipify functionality changed. 9. Update `cuda_to_hip_mappings.py` to account for the ROCm component subdirectories inside `/opt/rocm/include`. This also results in cleanup of the `Caffe2_HIP_INCLUDE` path to remove unnecessary additions to the include path. The list of changes to `cpp_extension.py` is as follows: 1. Call `hipify` when building a CUDAExtension for ROCm. 2. Prune the list of source files to CUDAExtension to include only the hipified versions of any source files in the list (if both original and hipified versions of the source file are in the list) 3. Add subdirectories of /opt/rocm/include to the include path for extensions, so that ROCm headers for subcomponent libraries are found automatically cc jeffdaily sunway513 hgaspar lcskrishna ashishfarmer Pull Request resolved: https://github.com/pytorch/pytorch/pull/45451 Reviewed By: ezyang Differential Revision: D24924736 Pulled By: malfet fbshipit-source-id: 4af42b8ff4f21c3782dedb8719b8f9f86b34bd2d	2020-11-18 08:37:49 -08:00
Guilherme Leobas	375a83e6c1	Annotate torch.utils.(tensorboard/show_pickle/hypify) (#44216 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/44215 Pull Request resolved: https://github.com/pytorch/pytorch/pull/44216 Reviewed By: gchanan Differential Revision: D23963216 Pulled By: ezyang fbshipit-source-id: b3fed51b2a1cbd05e3cd0222c89c38d61d8968c1	2020-09-29 18:14:26 -07:00
Xiang Gao	20ac736200	Remove py2 compatible future imports (#44735 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44735 Reviewed By: mruberry Differential Revision: D23731306 Pulled By: ezyang fbshipit-source-id: 0ba009a99e475ddbe22981be8ac636f8a1c8b02f	2020-09-16 12:55:57 -07:00
Jithun Nair	28be3ef2f2	Fix hipify script for pytorch extensions (#43528 ) Summary: PyTorch extensions can have .cpp or .h files which contain CUDA code that needs to be hipified. The current hipify script logic has overly strict conditions to determine which files get considered for hipification: https://github.com/pytorch/pytorch/blob/master/torch/utils/hipify/hipify_python.py#L146 These conditions might apply well to pytorch/caffe2 source code, but are overconstrained for third-party extensions. `is_pytorch_file` conditions: https://github.com/pytorch/pytorch/blob/master/torch/utils/hipify/hipify_python.py#L549 `is_caffe2_gpu_file` conditions: https://github.com/pytorch/pytorch/blob/master/torch/utils/hipify/hipify_python.py#L561 This PR relaxes these conditions if we're hipifying a pytorch extension (specified by `is_pytorch_extension=True`) and considers all the file extensions specified using the `extensions` parameter: https://github.com/pytorch/pytorch/blob/master/torch/utils/hipify/hipify_python.py#L820 Pull Request resolved: https://github.com/pytorch/pytorch/pull/43528 Reviewed By: mruberry Differential Revision: D23328272 Pulled By: ngimel fbshipit-source-id: 1e9c3a54ae2da65ac596a7ecd5539f3e14eeed88	2020-08-26 18:41:48 -07:00
Priyanshu	5aa2b572ff	replace black list with block (#42091 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/41729 Pull Request resolved: https://github.com/pytorch/pytorch/pull/42091 Reviewed By: pbelevich Differential Revision: D22792096 Pulled By: ezyang fbshipit-source-id: caafa42d12cbad377b67ddbaba8f84a2b8c98066	2020-07-28 10:23:51 -07:00
wudenggang	9600ed9af3	typo fixes (#41632 ) Summary: typo fixes Pull Request resolved: https://github.com/pytorch/pytorch/pull/41632 Reviewed By: ezyang Differential Revision: D22617827 Pulled By: mrshenli fbshipit-source-id: c2bfcb7cc36913a8dd32f13fc9adc3aa0a9b682f	2020-07-20 07:23:00 -07:00
Jithun Nair	0e6b750288	Insert parentheses around kernel name argument to hipLaunchKernelGGL (#41022 ) Summary: This is to workaround an issue in hipclang wrt templated kernel name arguments to hipLaunchKernelGGL. Pull Request resolved: https://github.com/pytorch/pytorch/pull/41022 Differential Revision: D22404183 Pulled By: ngimel fbshipit-source-id: 63135ccb9e087f4c8e8663ed383979f7e2c1ba06	2020-07-07 11:31:45 -07:00
Xiang Gao	e22dd561ad	Migrate pow kernel to c10::complex (#39286 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39286 Test Plan: Imported from OSS Differential Revision: D21999454 Pulled By: pbelevich fbshipit-source-id: c8f1ba4ff4ec66ffbc283700cabb6794e6b2896a	2020-06-11 15:49:30 -07:00
Huang Shuang	fedb70a8fb	Fix encoding errors for hipify tool (#37906 ) Summary: Encoding errors occur when using anaconda python 3.6.10 to run hipify_python.py, e.g., "' ASCII 'codec can't decode byte 0xc3". Pull Request resolved: https://github.com/pytorch/pytorch/pull/37906 Differential Revision: D21549531 Pulled By: ezyang fbshipit-source-id: 2ffb5787e192a5c03711baa5c7e2577cb5bcab5a	2020-05-14 08:07:04 -07:00
David Reiss	e75fb4356b	Remove (most) Python 2 support from Python code (#35615 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35615 Python 2 has reached end-of-life and is no longer supported by PyTorch. Now we can clean up a lot of cruft that we put in place to support it. These changes were all done manually, and I skipped anything that seemed like it would take more than a few seconds, so I think it makes sense to review it manually as well (though using side-by-side view and ignoring whitespace change might be helpful). Test Plan: CI Differential Revision: D20842886 Pulled By: dreiss fbshipit-source-id: 8cad4e87c45895e7ce3938a88e61157a79504aed	2020-04-22 09:23:14 -07:00
Thomas Viehmann	d070c0bcf0	ROCm: enable cpp_extensions.load/load_inline (#35897 ) Summary: This enables cpp_extensions.load/load_inline. This works by hipify-ing cuda sources. Also enable tests. CuDNN/MIOpen extensions aren't yet supported, I propose to not do this in this PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35897 Differential Revision: D20983279 Pulled By: ezyang fbshipit-source-id: a5d0f5ac592d04488a6a46522c58e2ee0a6fd57c	2020-04-13 11:44:08 -07:00
ashish	616beb1412	[ROCm] Added support for pytorch extensions to use HIP (#32669 ) Summary: This pull request has changes for: 1. Enabling a torch module with HIP code to be compiled by cpp_extensions.py 2. Fixes for hipify module to be able to be used by a torch extension cc: ezyang iotamudelta jeffdaily Pull Request resolved: https://github.com/pytorch/pytorch/pull/32669 Differential Revision: D20033893 Pulled By: zou3519 fbshipit-source-id: fd6ddc8cdcd3930f41008636bb2bc9dd26cdb008	2020-02-21 12:10:02 -08:00
Hong Xu	a6a72ac68f	Fix all occurrences of C416. (#33429 ) Summary: C416: Unnecessary (list/set) comprehension - rewrite using list/set(). See https://pypi.org/project/flake8-comprehensions/ Pull Request resolved: https://github.com/pytorch/pytorch/pull/33429 Differential Revision: D19972858 Pulled By: ezyang fbshipit-source-id: faac042a94c59d737bd5ae983121a0a029346e23	2020-02-21 08:32:22 -08:00
Serhat Yilmaz	4f5a4be45f	Add native/quantized to the list of header rewrites (#31151 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31151 same as title. I am not sure why this was not added in the first place. Test Plan: wait for build to succeed. Reviewed By: bddppq, xw285cornell Differential Revision: D18880216 fbshipit-source-id: 8b17d4fbd5dd08c28c52df8b1da77b69d56d65dc	2019-12-11 21:59:29 -08:00
Your Name	4bd8ae13c6	Move hipify to torch/utils to bundle them into torch package (#27425 ) Summary: Similar to https://github.com/pytorch/pytorch/pull/27418 but try to put it under "torch" namespace Pull Request resolved: https://github.com/pytorch/pytorch/pull/27425 Differential Revision: D17779490 Pulled By: bddppq fbshipit-source-id: 688338d143509b37dfc110df17af3331db48a42b	2019-10-07 17:25:45 -07:00

30 Commits