pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 13:44:15 +08:00

Author	SHA1	Message	Date
Nikita Shulga	7e274964d3	[BE] Disamntle pyramid of doom in _check_cuda_version (#81602 ) Replace `if stmt: doSmth; else: raise_or_return` with `if not stmt: raise_or_return; doSmth` Pull Request resolved: https://github.com/pytorch/pytorch/pull/81602 Approved by: https://github.com/ezyang	2022-07-17 05:49:39 +00:00
Jithun Nair	71ee384924	[ROCm] Use torch._C._cuda_getArchFlags to get list of gfx archs pytorch was built for (#80498 ) even if no GPUs are available When building PyTorch extensions for ROCm Pytorch, if the user doesn't specify a list of archs using PYTORCH_ROCM_ARCH env var, we would like to use the list of gfx archs that PyTorch was built for as the default value. To do this successfully even in an environment where no GPUs are available eg. a build-only CPU node, we need to be able to get the list of archs. `torch.cuda.get_arch_list()` doesn't work here because it calls `torch.cuda.available()` first: `0922cc024e/torch/cuda/__init__.py (L463)`, which will return `False` if no GPUs are available, resulting in an empty list being returned by `torch.cuda.get_arch_list()`. To get around this issue, we call the underlying API `torch._C._cuda_getArchFlags()` Pull Request resolved: https://github.com/pytorch/pytorch/pull/80498 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-07-07 16:06:12 +00:00
PyTorch MergeBot	ec4be38ba9	Revert "To add hipify_torch as a submodule in pytorch/third_party (#74704 )" This reverts commit 93b0fec39dd112d5c06106ad0186d55d61f1531a. Reverted https://github.com/pytorch/pytorch/pull/74704 on behalf of https://github.com/malfet due to broke torchvision	2022-06-21 23:54:00 +00:00
Bhavya Medishetty	93b0fec39d	To add hipify_torch as a submodule in pytorch/third_party (#74704 ) `hipify_torch` as a submodule in `pytorch/third_party` Pull Request resolved: https://github.com/pytorch/pytorch/pull/74704 Approved by: https://github.com/jeffdaily, https://github.com/malfet	2022-06-21 18:56:49 +00:00
Xiao Wang	ef0332e36d	Allow relocatable device code linking in pytorch CUDA extensions (#78225 ) Close https://github.com/pytorch/pytorch/issues/57543 Doc: check `Relocatable device code linking:` in https://docs-preview.pytorch.org/78225/cpp_extension.html#torch.utils.cpp_extension.CUDAExtension Pull Request resolved: https://github.com/pytorch/pytorch/pull/78225 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-06-02 21:35:56 +00:00
Michael Suo	fb0f285638	[lint] upgrade mypy to latest version Fixes https://github.com/pytorch/pytorch/issues/75927. Had to fix some bugs and add some ignores. To check if clean: ``` lintrunner --paths-cmd='git grep -Il .' --take MYPY,MYPYSTRICT ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76753 Approved by: https://github.com/malfet	2022-05-03 20:51:34 +00:00
PyTorch MergeBot	3d7428d9ac	Revert "[lint] upgrade mypy to latest version" This reverts commit 9bf18aab94943f5352604a39340ad57ad4d0c5a4. Reverted https://github.com/pytorch/pytorch/pull/76753 on behalf of https://github.com/suo	2022-05-03 20:01:18 +00:00
Michael Suo	9bf18aab94	[lint] upgrade mypy to latest version Fixes https://github.com/pytorch/pytorch/issues/75927. Had to fix some bugs and add some ignores. To check if clean: ``` lintrunner --paths-cmd='git grep -Il .' --take MYPY,MYPYSTRICT ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76753 Approved by: https://github.com/malfet	2022-05-03 19:43:28 +00:00
rraminen	7422ccea8b	Hipify fixes for a successful DeepSpeed build These commits are required to build DeepSpeed on ROCm without the hipify errors. `a41829d9ed` `663c718462` cc: @jeffdaily Pull Request resolved: https://github.com/pytorch/pytorch/pull/76141 Approved by: https://github.com/jeffdaily, https://github.com/pruthvistony, https://github.com/albanD	2022-04-28 13:19:59 +00:00
Min Si	9562aedb58	ROCm: add HIP_HOME/include,lib in cpp_extensions (#75548 ) Summary: hip/hip_runtime.h and libamdhip64.so may be required to compile extension such as torch_ucc. They are in $ROCM_HOME/hip by default, and may not be symlinked to $ROCM_HOME/include and $ROCM_HOME/lib. This commit defines $ROCM_HOME/hip as $HIP_HOME, and adds its include and lib paths when building hipified extension. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75548 Test Plan: ## Verify OSS pytorch + TorchUCC on an AMD GPU machine (MI100) - step 1. Install OSS pytorch ``` export ROCM_PATH=/opt/rocm-4.5.2 git clone https://github.com/pytorch/pytorch.git cd pytorch python3 tools/amd_build/build_amd.py USE_NCCL=0 USE_RCCL=0 USE_KINETO=0 with-proxy python3 setup.py develop USE_NCCL=0 USE_RCCL=0 USE_KINETO=0 with-proxy python3 setup.py install ``` - step2. Install torchUCC extension ``` # /opt/rocm-4.5.2/include/hip does not exist, need include /opt/rocm-4.5.2/hip/include at compile time export ROCM_PATH=/opt/rocm-4.5.2 export RCCL_INSTALL_DIR=/opt/rccl-rocm-rel-4.4-rdc git clone https://github.com/facebookresearch/torch_ucc.git cd torch_ucc UCX_HOME=$RCCL_INSTALL_DIR UCC_HOME=$RCCL_INSTALL_DIR WITH_CUDA=$ROCM_PATH python setup.py ``` Build log before fix (error "hip/hip_runtime.h: No such file or directory"): P493038915 Build log after fix: P493037572 Reviewed By: ezyang Differential Revision: D35506098 Pulled By: minsii fbshipit-source-id: 76cbb6d4eaa6549a00898c9d9ebaca47a55330e9 (cherry picked from commit d684c080edf1fbd293e3321151976812c1da8533)	2022-04-19 20:51:37 +00:00
provefar	7a243ddd19	Add import to `importlib.abc` Fixes #70525 ``` --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) <ipython-input-3-334d309cf512> in <module> ----> 1 lltm_cpp = load(name="lltm_cpp", sources=["lltm.cpp"]) /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in load(name, sources, extra_cflags, extra_cuda_cflags, extra_ldflags, extra_include_paths, build_directory, verbose, with_cuda, is_python_module, is_standalone, keep_intermediates) 1122 verbose=True) 1123 ''' -> 1124 return _jit_compile( 1125 name, 1126 [sources] if isinstance(sources, str) else sources, /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in _jit_compile(name, sources, extra_cflags, extra_cuda_cflags, extra_ldflags, extra_include_paths, build_directory, verbose, with_cuda, is_python_module, is_standalone, keep_intermediates) 1360 return _get_exec_path(name, build_directory) 1361 -> 1362 return _import_module_from_library(name, build_directory, is_python_module) 1363 1364 /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in _import_module_from_library(module_name, path, is_python_module) 1751 spec = importlib.util.spec_from_file_location(module_name, filepath) 1752 module = importlib.util.module_from_spec(spec) -> 1753 assert isinstance(spec.loader, importlib.abc.Loader) 1754 spec.loader.exec_module(module) 1755 return module AttributeError: module 'importlib' has no attribute 'abc' ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/75736 Approved by: https://github.com/ezyang	2022-04-14 03:32:30 +00:00
Edgar Andrés Margffoy Tuay	86deecd7be	Check clang++/g++ version when compiling CUDA extensions (#63230 ) Summary: See https://github.com/pytorch/pytorch/issues/55267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/63230 Reviewed By: soulitzer Differential Revision: D34159119 Pulled By: malfet fbshipit-source-id: 6eef7582388bf6a42dcc1d82b6e4b1f40f418dd7 (cherry picked from commit 2056d0a0be7951602de22f8d3b4efc28dd71b6c2)	2022-02-24 08:32:32 +00:00
Andrey Talman	46f9e16afe	Documenting cuda 11.5 windows issue (#73013 ) Summary: Adding documentation about compiling extension with CUDA 11.5 and Windows Example of failure: https://github.com/pytorch/pytorch/runs/4408796098?check_suite_focus=true Note: Don't use torch/extension.h In CUDA 11.5 under windows in your C++ code: Use aten instead of torch interface in all cuda 11.5 code under windows. It has been failing with errors, due to a bug in nvcc. Example use: >>> #include <ATen/ATen.h> >>> at::Tensor SigmoidAlphaBlendForwardCuda(....) Instead of: >>> #include <torch/extension.h> >>> torch::Tensor SigmoidAlphaBlendForwardCuda(...) Currently open issue for nvcc bug: https://github.com/pytorch/pytorch/issues/69460 Complete Workaround code example: `cb170ac024` Pull Request resolved: https://github.com/pytorch/pytorch/pull/73013 Reviewed By: malfet, seemethere Differential Revision: D34306134 Pulled By: atalman fbshipit-source-id: 3c5b9d7a89c91bd1920dc63dbd356e45dc48a8bd (cherry picked from commit 87098e7f17fca1b98c90fafe2dde1defb6633f49)	2022-02-19 02:34:59 +00:00
Jithun Nair	8dfdc3df82	[ROCm] Refactor how to specify AMD gpu targets using PYTORCH_ROCM_ARCH (#61706 ) Summary: Remove all hardcoded AMD gfx targets PyTorch build and Magma build will use rocm_agent_enumerator as backup if PYTORCH_ROCM_ARCH env var is not defined PyTorch extensions will use same gfx targets as the PyTorch build, unless PYTORCH_ROCM_ARCH env var is defined torch.cuda.get_arch_list() now works for ROCm builds PyTorch CI dockers will continue to be built for gfx900 and gfx906 for now. PYTORCH_ROCM_ARCH env var can be a space or semicolon separated list of gfx archs eg. "gfx900 gfx906" or "gfx900;gfx906" cc jeffdaily sunway513 jithunnair-amd ROCmSupport KyleCZH Pull Request resolved: https://github.com/pytorch/pytorch/pull/61706 Reviewed By: seemethere Differential Revision: D32735862 Pulled By: malfet fbshipit-source-id: 3170e445e738e3ce373203e1e4ae99c84e645d7d	2021-12-13 15:41:40 -08:00
Nikita Shulga	bede18b061	Add support for C++ frontend wrapper on Linux (#69094 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69094 Partially addresses https://github.com/pytorch/pytorch/issues/68768 Test Plan: Imported from OSS Reviewed By: seemethere Differential Revision: D32730079 Pulled By: malfet fbshipit-source-id: 854e4215ff66e087bdf354fed7a17e87f2649c87	2021-12-02 16:47:00 -08:00
Nikita Shulga	c08e95dd9c	Introduce `IS_LINUX` and `IS_MACOS` global vars (#69093 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69093 Test Plan: Imported from OSS Reviewed By: samdow Differential Revision: D32730080 Pulled By: malfet fbshipit-source-id: aa3f218d09814b4edd96b01c7b57b85fd58c47fc	2021-12-01 09:47:38 -08:00
Nikita Shulga	f6f1b580f8	Fix mypy in cpp_extension.py (#69101 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69101 Test Plan: Imported from OSS Reviewed By: atalman, janeyx99 Differential Revision: D32730081 Pulled By: malfet fbshipit-source-id: 76ace65b51850b74b175a3c4688c05e107873e8d	2021-11-30 16:01:55 -08:00
Jane Xu	78f970568c	Add dummy op to use instead of searchsorted (#66964 ) Summary: Would help unblock https://github.com/pytorch/pytorch/issues/66818 if this actually works Pull Request resolved: https://github.com/pytorch/pytorch/pull/66964 Reviewed By: mruberry Differential Revision: D31817942 Pulled By: janeyx99 fbshipit-source-id: 9e9a2bcb0c0479ec7000ab8760a2e64bf0e85e95	2021-10-21 12:56:22 -07:00
Pruthvi Madugundu	085e2f7bdd	[ROCm] Changes not to rely on CUDA_VERSION or HIP_VERSION (#65610 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65610 - Replace HIP_PLATFORM_HCC with USE_ROCM - Dont rely on CUDA_VERSION or HIP_VERSION and use USE_ROCM and ROCM_VERSION. - In the next PR - Will be removing the mapping from CUDA_VERSION to HIP_VERSION and CUDA to HIP in hipify. - HIP_PLATFORM_HCC is deprecated, so will add HIP_PLATFORM_AMD to support HIP host code compilation on gcc. cc jeffdaily sunway513 jithunnair-amd ROCmSupport amathews-amd Reviewed By: jbschlosser Differential Revision: D30909053 Pulled By: ezyang fbshipit-source-id: 224a966ebf1aaec79beccbbd686fdf3d49267e06	2021-09-29 09:55:43 -07:00
peterjc123	e6dc7bc61b	Subprocess encoding fixes for cpp extension (#63756 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/63584 Pull Request resolved: https://github.com/pytorch/pytorch/pull/63756 Reviewed By: bdhirsh Differential Revision: D30485046 Pulled By: ezyang fbshipit-source-id: 4f0ac383da4e8843e2a602dceae85f389d7434ee	2021-08-24 10:46:11 -07:00
Nikita Shulga	9679fa7f30	Update cpp_extension.py (#61484 ) Summary: By default, majority of Python-3.[6789] installation comes with `pkg_resources.packaging` version 16.8 (or `setuptool` older than 49.6.0), which does not have major/minor properties on Version package, as one can observe in https://github.com/pypa/setuptools/blob/v49.5.0/pkg_resources/_vendor/packaging/version.py On the other hand, compare operators exists, so why not use it to check for version equality Fixes https://github.com/pytorch/pytorch/issues/61036 Pull Request resolved: https://github.com/pytorch/pytorch/pull/61484 Reviewed By: walterddr, seemethere Differential Revision: D29643883 Pulled By: malfet fbshipit-source-id: 3db9168c1b009ac3a278709083ea8c5b417471b8	2021-07-13 07:11:58 -07:00
Rong Rong (AI Infra)	2d0c6e60a7	going back to use packaging.version.parse instead (#61053 ) Summary: I think this may be related to https://app.circleci.com/pipelines/github/pytorch/vision/9352/workflows/9c8afb1c-6157-4c82-a5c8-105c5adac57d/jobs/687003 Apparently `pkg_resource.parse_version` returns a type of `pkg_resources.extern.packaging.version.Version` instead of `packaging.version.Version` and seems on some older version of the setuptools it doesn't support `.major/minor` operation. changing it back to using `packaging.version.parse` Pull Request resolved: https://github.com/pytorch/pytorch/pull/61053 Test Plan: CI Reviewed By: samestep Differential Revision: D29494322 Pulled By: walterddr fbshipit-source-id: 294572a10b167677440d7404e5ebe007ab59d299	2021-06-30 16:23:59 -07:00
Edgar Andrés Margffoy Tuay	d46eb77b04	Improve CUDA extension building error/warning messages (#59665 ) Summary: See https://github.com/pytorch/pytorch/issues/55267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59665 Reviewed By: mruberry Differential Revision: D29462248 Pulled By: ezyang fbshipit-source-id: 9de13a284a14a7cd24200b9684151ce652e1eb1e	2021-06-29 13:03:30 -07:00
Edgar Andrés Margffoy Tuay	6322f66878	Add python version and cuda-specific folder to store extensions (#60592 ) Summary: See https://github.com/pytorch/pytorch/issues/55267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60592 Reviewed By: albanD Differential Revision: D29353368 Pulled By: ezyang fbshipit-source-id: 1fbcd021f1030132c0f950f33ce4a3a2fef351e0	2021-06-25 10:27:04 -07:00
albanD	0a0e024648	use importlib instead of imp as it support python 3.5+ (#57160 ) Summary: Prevent some annoying deprecation warning when importing cpp_extensions Pull Request resolved: https://github.com/pytorch/pytorch/pull/57160 Reviewed By: astaff Differential Revision: D28096751 Pulled By: albanD fbshipit-source-id: f169ad4c4945b0fff54c0339052a29f95b9f1831	2021-05-03 05:56:25 -07:00
davidriazati@fb.com	4b96fc060b	Remove distutils (#57040 ) Summary: [distutils](https://docs.python.org/3/library/distutils.html) is on its way out and will be deprecated-on-import for Python 3.10+ and removed in Python 3.12 (see [PEP 632](https://www.python.org/dev/peps/pep-0632/)). There's no reason for us to keep it around since all the functionality we want from it can be found in `setuptools` / `sysconfig`. `setuptools` includes a copy of most of `distutils` (which is fine to use according to the PEP), that it uses under the hood, so this PR also uses that in some places. Fixes #56527 Pull Request resolved: https://github.com/pytorch/pytorch/pull/57040 Pulled By: driazati Reviewed By: nikithamalgifb Differential Revision: D28051356 fbshipit-source-id: 1ca312219032540e755593e50da0c9e23c62d720	2021-04-29 12:10:11 -07:00
Sam Estep	75024e228c	Add lint for unqualified `type: ignore` (#56290 ) Summary: The other half of https://github.com/pytorch/pytorch/issues/56272. Pull Request resolved: https://github.com/pytorch/pytorch/pull/56290 Test Plan: CI should pass on the tip of this PR, and we know that the lint works because the following CI runs (before this PR was finished) failed: - https://github.com/pytorch/pytorch/runs/2384511062 - https://github.com/pytorch/pytorch/actions/runs/765036024 Reviewed By: seemethere Differential Revision: D27867219 Pulled By: samestep fbshipit-source-id: e648f07b6822867e70833e23ddafe7fb7eaca235	2021-04-21 08:07:23 -07:00
Pruthvi Madugundu	b383b63550	[ROCm] Updating ROCM_HOME handling for >ROCm 4.0 (#55968 ) Summary: - This change is required to handle the case when hipcc is updated to the latest using update-alternatives. - Update-alternatives support for few ROCm binaries is available from ROCm 4.1 onwards. - This change doesnt not affect any previous versions of ROCm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/55968 Reviewed By: mruberry Differential Revision: D27785123 Pulled By: ezyang fbshipit-source-id: 8467e468d8d51277fab9b0c8cbd57e80bbcfc7f7	2021-04-15 07:48:36 -07:00
Aleksei Kashapov	0b8bd22614	Fix bug with rebuilding extensions every import (#56015 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56015 Reviewed By: mruberry Differential Revision: D27765934 Pulled By: ezyang fbshipit-source-id: 65cace951fce5f2284ab91d8bd687ac89a2311fb	2021-04-14 13:25:01 -07:00
Jeff Daily	e5b97777e3	[ROCm] allow PYTORCH_ROCM_ARCH in cpp_extension.py (#54341 ) Summary: Allows extensions to override ROCm gfx arch targets. Reuses the same env var used during cmake build for consistency. Pull Request resolved: https://github.com/pytorch/pytorch/pull/54341 Reviewed By: bdhirsh Differential Revision: D27244010 Pulled By: heitorschueroff fbshipit-source-id: 279e1a41ee395a0596aa7f696b6e908cf7f5bb83	2021-03-23 13:06:00 -07:00
Colin Gravill	65087dd1d4	Fix broken link from load_inline to new test location (#53701 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53701 Reviewed By: jbschlosser Differential Revision: D27047406 Pulled By: ezyang fbshipit-source-id: 0be6e669cf41527d3ffeb101e5f36db07e41b4af	2021-03-15 13:53:15 -07:00
peterjc123	44ff79d849	Automatically set BUILD_SPLIT_CUDA for cpp exts (#52503 ) Summary: Fixes https://github.com/pytorch/vision/pull/3418#issuecomment-781673110 Pull Request resolved: https://github.com/pytorch/pytorch/pull/52503 Reviewed By: malfet Differential Revision: D26546857 Pulled By: janeyx99 fbshipit-source-id: a100b408e7cd28695145a1dda7f2fa081bb7f21f	2021-02-19 12:22:55 -08:00
Jane Xu	550c965b2e	Re-enable test_standalone_load for Windows 11.1 (#51596 ) Summary: This fixes the previous erroring out by adding stricter conditions in cpp_extension.py. To test, run a split torch_cuda build on Windows with export BUILD_SPLIT_CUDA=ON && python setup.py develop and then run the following test: python test/test_utils.py TestStandaloneCPPJIT.test_load_standalone. It should pass. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51596 Reviewed By: malfet Differential Revision: D26213816 Pulled By: janeyx99 fbshipit-source-id: a752ce7f9ab9d73dcf56f952bed2f2e040614443	2021-02-03 08:58:34 -08:00
Nikita Shulga	f7313b3105	Fix Python.h discovery logic on some MacOS platforms (#51586 ) Summary: On all non-Windows platforms we should use 'posix_prefix' schema to discover location of Python.h header Pull Request resolved: https://github.com/pytorch/pytorch/pull/51586 Reviewed By: ezyang Differential Revision: D26208684 Pulled By: malfet fbshipit-source-id: bafa6d79de42231629960c642d535f1fcf7a427f	2021-02-02 21:38:37 -08:00
Jane Xu	88af2149e1	Add build option to split torch_cuda library into torch_cuda_cu and torch_cuda_cpp (#49050 ) Summary: Because of the size of our `libtorch_cuda.so`, linking with other hefty binaries presents a problem where 32bit relocation markers are too small and end up overflowing. This PR attempts to break up `torch_cuda` into `torch_cuda_cu` and `torch_cuda_cpp`. `torch_cuda_cu`: all the files previously in `Caffe2_GPU_SRCS` that are * pure `.cu` files in `aten`match * all the BLAS files * all the THC files, except for THCAllocator.cpp, THCCachingHostAllocator.cpp and THCGeneral.cpp * all files in`detail` * LegacyDefinitions.cpp and LegacyTHFunctionsCUDA.cpp * RegisterCUDA.cpp CUDAHooks.cpp * CUDASolver.cpp * TensorShapeCUDA.cpp `torch_cuda_cpp`: all other files in `Caffe2_GPU_SRCS` Accordingly, TORCH_CUDA_API and TORCH_CUDA_BUILD_MAIN_LIB usages are getting split as well to TORCH_CUDA_CU_API and TORCH_CUDA_CPP_API. To test this locally, you can run `export BUILD_SPLIT_CUDA=ON && python setup.py develop`. In your `build/lib` folder, you should find binaries for both `torch_cuda_cpp` and `torch_cuda_cu`. To see that the SPLIT_CUDA option was toggled, you can grep the Summary of running cmake and make sure `Split CUDA` is ON. This build option is tested on CI for CUDA 11.1 builds (linux for now, but windows soon). Pull Request resolved: https://github.com/pytorch/pytorch/pull/49050 Reviewed By: walterddr Differential Revision: D26114310 Pulled By: janeyx99 fbshipit-source-id: 0180f2519abb5a9cdde16a6fb7dd3171cff687a6	2021-02-01 18:42:35 -08:00
Jithun Nair	327539ca79	Fix bug in hipify if include_dirs is not specified in setup.py (#50703 ) Summary: Bugs: 1) would introduce -I* in compile commands 2) wouldn't hipify source code directly in build_dir, only one level down or more Pull Request resolved: https://github.com/pytorch/pytorch/pull/50703 Reviewed By: mrshenli Differential Revision: D25949070 Pulled By: ngimel fbshipit-source-id: 018c2a056b68019a922e20e5db2eb8435ad147fe	2021-01-19 16:30:17 -08:00
Ralf Gommers	e29082b2a6	Run mypy over test/test_utils.py (#50278 ) Summary: _resubmission of gh-49654, which was reverted due to a cross-merge conflict_ This caught one incorrect annotation in `cpp_extension.load`. xref gh-16574. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50278 Reviewed By: walterddr Differential Revision: D25865278 Pulled By: ezyang fbshipit-source-id: 25489191628af5cf9468136db36f5a0f72d9d54d	2021-01-11 08:16:23 -08:00
Rong Rong (AI Infra)	e3c56ddde6	Revert D25757691: [pytorch][PR] Run mypy over test/test_utils.py Test Plan: revert-hammer Differential Revision: D25757691 (`c86cfcd81d`) Original commit changeset: 145ce3ae532c fbshipit-source-id: 3dfd68f0c42fc074cde15c6213a630b16e9d8879	2021-01-05 13:40:13 -08:00
Ralf Gommers	c86cfcd81d	Run mypy over test/test_utils.py (#49654 ) Summary: This caught one incorrect annotation in `cpp_extension.load`. xref gh-16574. Pull Request resolved: https://github.com/pytorch/pytorch/pull/49654 Reviewed By: heitorschueroff Differential Revision: D25757691 Pulled By: ezyang fbshipit-source-id: 145ce3ae532cc585d9ca3bbd5381401bad0072e2	2021-01-05 09:32:06 -08:00
Samuel Marks	e6779d4357	[*.py] Rename "Arguments:" to "Args:" (#49736 ) Summary: I've written custom parsers and emitters for everything from docstrings to classes and functions. However, I recently came across an issue when I was parsing/generating from the TensorFlow codebase: inconsistent use of `Args:` and `Arguments:` in its docstrings. ```sh (pytorch#c348fae)$ for name in 'Args:' 'Arguments:'; do printf '%-10s %04d\n' "$name" "$(rg -IFtpy --count-matches "$name" \| paste -s -d+ -- \| bc)"; done Args: 1095 Arguments: 0336 ``` It is easy enough to extend my parsers to support both variants, however it looks like `Arguments:` is wrong anyway, as per: - https://google.github.io/styleguide/pyguide.html#doc-function-args @ [`ddccc0f`](https://github.com/google/styleguide/blob/ddccc0f/pyguide.md) - https://chromium.googlesource.com/chromiumos/docs/+/master/styleguide/python.md#describing-arguments-in-docstrings @ [`9fc0fc0`](https://chromium.googlesource.com/chromiumos/docs/+/9fc0fc0/styleguide/python.md) - https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html @ [`c0ae8e3`](https://github.com/sphinx-contrib/napoleon/blob/c0ae8e3/docs/source/example_google.rst) Therefore, only `Args:` is valid. This PR replaces them throughout the codebase. PS: For related PRs, see tensorflow/tensorflow/pull/45420 PPS: The trackbacks automatically appearing below are sending the same changes to other repositories in the [PyTorch](https://github.com/pytorch) organisation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/49736 Reviewed By: albanD Differential Revision: D25710534 Pulled By: soumith fbshipit-source-id: 61e8ff01abb433e9f78185c2d1d0cbd7c22c1619	2020-12-28 09:34:47 -08:00
Stas Bekman	60b4c40101	[extensions] fix `is_ninja_available` during cuda extension building (#49443 ) Summary: tldr: current version of `is_ninja_available` of `torch/utils/cpp_extension.py` fails to run in the recent incarnations of pip w/ new build isolation feature which is now a default. This PR fixes this problem. The full story follows: -------------------------- Currently trying to build https://github.com/facebookresearch/fairscale/ which builds cuda extensions fails with the recent pip versions. The build is failing to perform `is_ninja_available`, which runs a simple subprocess to run `ninja --version` but does it with some /dev/null stream override which seems to break with the new pip versions. Currently I have `pip==20.3.3`. The recent pip performs build isolation which first fetches all dependencies to somewhere under /tmp/pip-install-xyz and then builds the package. If I build: ``` pip install fairscale --no-build-isolation ``` everything works. When building normally (i.e. without `--no-build-isolation`), the failure is a long long trace, <details> <summary>Full log</summary> <pre> pip install fairscale Collecting fairscale Downloading fairscale-0.1.1.tar.gz (83 kB) \|████████████████████████████████\| 83 kB 562 kB/s Installing build dependencies ... done Getting requirements to build wheel ... error ERROR: Command errored out with exit status 1: command: /home/stas/anaconda3/envs/main-38/bin/python /home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/pip/_vendor/pep517/_in_process.py get_requires_for_build_wheel /tmp/tmpjvw00c7v cwd: /tmp/pip-install-1wq9f8fp/fairscale_347f218384a64f24b8d5ce846641213e Complete output (55 lines): running egg_info writing fairscale.egg-info/PKG-INFO writing dependency_links to fairscale.egg-info/dependency_links.txt writing requirements to fairscale.egg-info/requires.txt writing top-level names to fairscale.egg-info/top_level.txt Traceback (most recent call last): File "/home/stas/anaconda3/envs/main-38/bin/ninja", line 5, in <module> from ninja import ninja ModuleNotFoundError: No module named 'ninja' Traceback (most recent call last): File "/home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/pip/_vendor/pep517/_in_process.py", line 280, in <module> main() File "/home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/pip/_vendor/pep517/_in_process.py", line 263, in main json_out['return_val'] = hook(hook_input['kwargs']) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/pip/_vendor/pep517/_in_process.py", line 114, in get_requires_for_build_wheel return hook(config_settings) File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/build_meta.py", line 149, in get_requires_for_build_wheel return self._get_build_requires( File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/build_meta.py", line 130, in _get_build_requires self.run_setup() File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/build_meta.py", line 145, in run_setup exec(compile(code, __file__, 'exec'), locals()) File "setup.py", line 56, in <module> setuptools.setup( File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/__init__.py", line 153, in setup return distutils.core.setup(attrs) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/core.py", line 148, in setup dist.run_commands() File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/dist.py", line 966, in run_commands self.run_command(cmd) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/dist.py", line 985, in run_command cmd_obj.run() File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/command/egg_info.py", line 298, in run self.find_sources() File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/command/egg_info.py", line 305, in find_sources mm.run() File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/command/egg_info.py", line 536, in run self.add_defaults() File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/setuptools/command/egg_info.py", line 572, in add_defaults sdist.add_defaults(self) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/command/sdist.py", line 228, in add_defaults self._add_defaults_ext() File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/command/sdist.py", line 311, in _add_defaults_ext build_ext = self.get_finalized_command('build_ext') File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/cmd.py", line 298, in get_finalized_command cmd_obj = self.distribution.get_command_obj(command, create) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/distutils/dist.py", line 858, in get_command_obj cmd_obj = self.command_obj[command] = klass(self) File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/torch/utils/cpp_extension.py", line 351, in __init__ if not is_ninja_available(): File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/torch/utils/cpp_extension.py", line 1310, in is_ninja_available subprocess.check_call('ninja --version'.split(), stdout=devnull) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/subprocess.py", line 364, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command '['ninja', '--version']' returned non-zero exit status 1. ---------------------------------------- ERROR: Command errored out with exit status 1: /home/stas/anaconda3/envs/main-38/bin/python /home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/pip/_vendor/pep517/_in_process.py get_requires_for_build_wheel /tmp/tmpjvw00c7v Check the logs for full command output. </pre> </details> and the middle of it is what we want: ``` File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/torch/utils/cpp_extension.py", line 351, in __init__ if not is_ninja_available(): File "/tmp/pip-build-env-a5x2icen/overlay/lib/python3.8/site-packages/torch/utils/cpp_extension.py", line 1310, in is_ninja_available subprocess.check_call('ninja --version'.split(), stdout=devnull) File "/home/stas/anaconda3/envs/main-38/lib/python3.8/subprocess.py", line 364, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command '['ninja', '--version']' returned non-zero exit status 1. ``` For some reason pytorch fails to run this simple code: ``` # torch/utils/cpp_extension.py def is_ninja_available(): r''' Returns ``True`` if the `ninja <https://ninja-build.org/>`_ build system is available on the system, ``False`` otherwise. ''' with open(os.devnull, 'wb') as devnull: try: subprocess.check_call('ninja --version'.split(), stdout=devnull) except OSError: return False else: return True ``` I suspect that pip does something to `os.devnull` and that's why it fails. This PR proposes a simpler code which doesn't rely on anything but `subprocess.check_output`: ``` def is_ninja_available(): r''' Returns ``True`` if the `ninja <https://ninja-build.org/>`_ build system is available on the system, ``False`` otherwise. ''' try: subprocess.check_output('ninja --version'.split()) except Exception: return False else: return True ``` which doesn't use `os.devnull` and performs the same function. There could be a whole bunch of different exceptions there I think, so I went for the generic one - we don't care why it failed, since this function's only purpose is to suggest whether ninja can be used or not. Let's check ``` python -c "import torch.utils.cpp_extension; print(torch.utils.cpp_extension.is_ninja_available())" True ``` Look ma - no std noise to take care of. (i.e. no need for /dev/null). I was editing the installed environment-wide `cpp_extension.py` file directly, so didn't need to tweak `PYTHONPATH` - I made sure to replace `'ninja --version'.` with something that should fail and I did get `False` for the above command line. I next did a somewhat elaborate cheat to re-package an already existing binary wheel with this corrected version of `cpp_extension.py`, rather than building from source: ``` mkdir /tmp/pytorch-local-channel cd /tmp/pytorch-local-channel # get the latest nightly wheel wget https://download.pytorch.org/whl/nightly/cu110/torch-1.8.0.dev20201215%2Bcu110-cp38-cp38-linux_x86_64.whl # unpack it unzip torch-1.8.0.dev20201215+cu110-cp38-cp38-linux_x86_64.whl # edit torch/utils/cpp_extension.py to fix the python code with the new version as in this PR emacs torch/utils/cpp_extension.py & # pack the files back zip -r torch-1.8.0.dev20201215+cu110-cp38-cp38-linux_x86_64.whl caffe2 torch torch-1.8.0.dev20201215+cu110.dist-info ``` Now I tell pip to use my local channel, plus `--pre` for it to pick up the pre-release as an acceptable wheel ``` # install using this local channel git clone https://github.com/facebookresearch/fairscale/ cd fairscale pip install -v --disable-pip-version-check -e . -f file:///tmp/pytorch-local-channel --pre ``` and voila all works. ``` [...] Successfully installed fairscale ``` I noticed a whole bunch of ninja not found errors in the log, which I think is the same problem with other parts of the build system packages which also use this old check copied all over various projects and build tools, and which the recent pip breaks. ``` writing manifest file '/tmp/pip-modern-metadata-_nsdesbq/fairscale.egg-info/SOURCES.txt' Traceback (most recent call last): File "/home/stas/anaconda3/envs/main-38/bin/ninja", line 5, in <module> from ninja import ninja ModuleNotFoundError: No module named 'ninja' [...] /tmp/pip-build-env-fqflyevr/overlay/lib/python3.8/site-packages/torch/utils/cpp_extension.py:364: UserWarning: Attempted to use ninja as the BuildExtension backend but we could not find ninja.. Falling back to using the slow distutils backend. warnings.warn(msg.format('we could not find ninja.')) ``` but these don't prevent from the build completing and installing. I suppose these need to be identified and reported to various other projects, but that's another story. The new pip does something to `os.devnull` I think which breaks any code relying on it - I haven't tried to figure out what happens to that stream object, but this PR which removes its usage solves the problem. Also do notice that: ``` git clone https://github.com/facebookresearch/fairscale/ cd fairscale python setup.py bdist_wheel pip install dist/fairscale-0.1.1-cp38-cp38-linux_x86_64.whl ``` works too. So it is really a pip issue. Apologies if the notes are too many, I tried to give the complete picture and probably other projects will need those details as well. Thank you for reading. Pull Request resolved: https://github.com/pytorch/pytorch/pull/49443 Reviewed By: mruberry Differential Revision: D25592109 Pulled By: ezyang fbshipit-source-id: bfce4420c28b614ead48e9686f4153c6e0fbe8b7	2020-12-16 18:02:11 -08:00
Gao, Xiang	d409da0677	Fix CUDA extension ninja build (#49344 ) Summary: I am submitting this PR on behalf of Janne Hellsten(nurpax) from NVIDIA, for the convenience of CLA. Thanks Janne a lot for the contribution! Currently, the ninja build decides whether to rebuild a .cu file or not pretty randomly. And there are actually two issues: First, the arch list in the building command is ordered randomly. When the order changes, it will unconditionally rebuild regardless of the timestamp. Second, the header files are not included in the dependency list, so if the header file changes, it is possible that ninja will not rebuild. This PR fixes both issues. The fix for the second issue requires nvcc >= 10.2. nvcc < 10.2 can still build CUDA extension as it used to be, but it will be unable to see the changes in header files. Pull Request resolved: https://github.com/pytorch/pytorch/pull/49344 Reviewed By: glaringlee Differential Revision: D25540157 Pulled By: ezyang fbshipit-source-id: 197541690d7f25e3ac5ebe3188beb1f131a4c51f	2020-12-16 17:45:12 -08:00
Stas Bekman	02b63858f2	[CUDAExtension] support all visible cards when building a cudaextension (#48891 ) Summary: Currently CUDAExtension assumes that all cards are of the same type on the same machine and builds the extension with compute capability of the 0th card. This breaks later at runtime if the machine has cards of different types. Specifically resulting in: ``` RuntimeError: CUDA error: no kernel image is available for execution on the device ``` when the cards of the types that weren't compiled for are used. (and the error is far from telling what the problem is to the uninitiated) My current setup is: ``` $ CUDA_VISIBLE_DEVICES=0 python -c "import torch; print(torch.cuda.get_device_capability())" (8, 6) $ CUDA_VISIBLE_DEVICES=1 python -c "import torch; print(torch.cuda.get_device_capability())" (6, 1) ``` but the extension was getting built with `-gencode=arch=compute_80,code=sm_80`. This PR: * [x] introduces a loop over all visible at build time devices to ensure the extension will run on all of them (it sorts the new list generated by the loop, so that the output is easier to debug should a card with lower capacity come last) * [x] adds `+PTX` to the last entry of ccs derived from local cards (`if not _arch_list:`) to support other archs * [x] adds a digest of my conversation with ptrblck on slack in the form of docs which hopefully can help others know which archs to support, how to override defaults, when and how to add PTX, etc. Please kindly review that my prose is clear and easy to understand. ptrblck Pull Request resolved: https://github.com/pytorch/pytorch/pull/48891 Reviewed By: ngimel Differential Revision: D25358285 Pulled By: ezyang fbshipit-source-id: 8160f3adebffbc8e592ddfcc3adf153a9dc91557	2020-12-08 14:57:10 -08:00
Jithun Nair	5f62308739	Hipify revamp [REDUX] (#48715 ) Summary: [Refiled version of earlier PR https://github.com/pytorch/pytorch/issues/45451] This PR revamps the hipify module in PyTorch to overcome a long list of shortcomings in the original implementation. However, these improvements are applied only when using hipify to build PyTorch extensions, not for PyTorch or Caffe2 itself. Correspondingly, changes are made to cpp_extension.py to match these improvements. The list of improvements to hipify is as follows: 1. Hipify files in the same directory as the original file, unless there's a "cuda" subdirectory in the original file path, in which case the hipified file will be in the corresponding file path with "hip" subdirectory instead of "cuda". 2. Never hipify the file in-place if changes are introduced due to hipification i.e. always ensure the hipified file either resides in a different folder or has a different filename compared to the original file. 3. Prevent re-hipification of already hipified files. This avoids creation of unnecessary "hip/hip" etc. subdirectories and additional files which have no actual use. 4. Do not write out hipified versions of files if they are identical to the original file. This results in a cleaner output directory, with minimal number of hipified files created. 5. Update header rewrite logic so that it accounts for the previous improvement. 6. Update header rewrite logic so it respects the rules for finding header files depending on whether "" or <> is used. 7. Return a dictionary of mappings of original file paths to hipified file paths from hipify function. 8. Introduce a version for hipify module to allow extensions to contain back-compatible code that targets a specific point in PyTorch where the hipify functionality changed. 9. Update cuda_to_hip_mappings.py to account for the ROCm component subdirectories inside /opt/rocm/include. This also results in cleanup of the Caffe2_HIP_INCLUDE path to remove unnecessary additions to the include path. The list of changes to cpp_extension.py is as follows: 1. Call hipify when building a CUDAExtension for ROCm. 2. Prune the list of source files to CUDAExtension to include only the hipified versions of any source files in the list (if both original and hipified versions of the source file are in the list) 3. Add subdirectories of /opt/rocm/include to the include path for extensions, so that ROCm headers for subcomponent libraries are found automatically cc jeffdaily sunway513 ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/48715 Reviewed By: bdhirsh Differential Revision: D25272824 Pulled By: ezyang fbshipit-source-id: 8bba68b27e41ca742781e1c4d7b07c6f985f040e	2020-12-02 18:03:23 -08:00
Eli Uriegas	780f2b9a9b	torch: Stop using _nt_quote_args from distutils (#48618 ) Summary: They removed the specific function in Python 3.9 so we should just remake the function here and use our own instead of relying on hidden functions from the stdlib Signed-off-by: Eli Uriegas <eliuriegas@fb.com> Fixes https://github.com/pytorch/pytorch/issues/48617 Pull Request resolved: https://github.com/pytorch/pytorch/pull/48618 Reviewed By: samestep Differential Revision: D25230281 Pulled By: seemethere fbshipit-source-id: 57216af40a4ae4dc8bafcf40d2eb3ba793b9b6e2	2020-12-02 16:53:25 -08:00
Taylor Robie	022c929145	Revert "Revert D25199264: Enable callgrind collection for C++ snippets" (#48720 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48720 This reverts commit 6646ff122d3215b77909f669fc26cf6a927030db. Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D25273994 Pulled By: malfet fbshipit-source-id: 61743176dc650136622e1b8f2384bbfbd7a46294	2020-12-02 11:10:11 -08:00
Taylor Robie	07f038aa9d	Add option for cpp_extensions to compile standalone executable (#47862 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/47862 Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D25199265 Pulled By: robieta fbshipit-source-id: eceb04dea60b82eb10434099639fa3afa61000ca	2020-12-01 20:03:08 -08:00
Nikita Shulga	8af9f2cc23	Revert D24924736: [pytorch][PR] Hipify revamp Test Plan: revert-hammer Differential Revision: D24924736 (`10b490a3e0`) Original commit changeset: 4af42b8ff4f2 fbshipit-source-id: 7f8f90d55d8a69a2890ec73622fcea559189e381	2020-11-18 11:48:30 -08:00
Jithun Nair	10b490a3e0	Hipify revamp (#45451 ) Summary: This PR revamps the hipify module in PyTorch to overcome a long list of shortcomings in the original implementation. However, these improvements are applied only when using hipify to build PyTorch extensions, not for PyTorch or Caffe2 itself. Correspondingly, changes are made to `cpp_extension.py` to match these improvements. The list of improvements to hipify is as follows: 1. Hipify files in the same directory as the original file, unless there's a "cuda" subdirectory in the original file path, in which case the hipified file will be in the corresponding file path with "hip" subdirectory instead of "cuda". 2. Never hipify the file in-place if changes are introduced due to hipification i.e. always ensure the hipified file either resides in a different folder or has a different filename compared to the original file. 3. Prevent re-hipification of already hipified files. This avoids creation of unnecessary "hip/hip" etc. subdirectories and additional files which have no actual use. 4. Do not write out hipified versions of files if they are identical to the original file. This results in a cleaner output directory, with minimal number of hipified files created. 5. Update header rewrite logic so that it accounts for the previous improvement. 6. Update header rewrite logic so it respects the rules for finding header files depending on whether `""` or `<>` is used. 7. Return a dictionary of mappings of original file paths to hipified file paths from `hipify` function. 8. Introduce a version for hipify module to allow extensions to contain back-compatible code that targets a specific point in PyTorch where the hipify functionality changed. 9. Update `cuda_to_hip_mappings.py` to account for the ROCm component subdirectories inside `/opt/rocm/include`. This also results in cleanup of the `Caffe2_HIP_INCLUDE` path to remove unnecessary additions to the include path. The list of changes to `cpp_extension.py` is as follows: 1. Call `hipify` when building a CUDAExtension for ROCm. 2. Prune the list of source files to CUDAExtension to include only the hipified versions of any source files in the list (if both original and hipified versions of the source file are in the list) 3. Add subdirectories of /opt/rocm/include to the include path for extensions, so that ROCm headers for subcomponent libraries are found automatically cc jeffdaily sunway513 hgaspar lcskrishna ashishfarmer Pull Request resolved: https://github.com/pytorch/pytorch/pull/45451 Reviewed By: ezyang Differential Revision: D24924736 Pulled By: malfet fbshipit-source-id: 4af42b8ff4f21c3782dedb8719b8f9f86b34bd2d	2020-11-18 08:37:49 -08:00
Chester Liu	17a6bc7c1b	Cleanup unused code for Python < 3.6 (#47822 ) Summary: I think these can be safely removed since the min version of supported Python is now 3.6 Pull Request resolved: https://github.com/pytorch/pytorch/pull/47822 Reviewed By: smessmer Differential Revision: D24954936 Pulled By: ezyang fbshipit-source-id: 5d4b2aeb78fc97d7ee4abaf5fb2aae21bf765e8b	2020-11-13 21:37:01 -08:00

1 2 3 4 5 ...

288 Commits