pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Author	SHA1	Message	Date
Xuehai Pan	2e0e08588e	[BE][PYFMT] migrate PYFMT for `torch/[e-n]*/` to `ruff format` (#144553 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144553 Approved by: https://github.com/ezyang ghstack dependencies: #144551	2025-06-17 08:18:47 +00:00
cyy	ee97d80be2	Apply Ruff fixes and pyupgrade to torch/jit (#144208 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/144208 Approved by: https://github.com/davidberard98	2025-01-16 00:28:50 +00:00
Tom Ritchford	c0582fd0f8	Remove unused Python variables in torch/[b-z]* (#136963 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/136963 Approved by: https://github.com/ezyang	2024-10-19 16:45:22 +00:00
Xuehai Pan	f3fce597e9	[BE][Easy][17/19] enforce style for empty lines in import segments in `torch/[a-c]/` and `torch/[e-n]/` (#129769 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129769 Approved by: https://github.com/ezyang	2024-08-04 10:24:09 +00:00
Aaron Orenstein	038b927590	Flip default value for mypy disallow_untyped_defs [7/11] (#127844 ) See #127836 for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127844 Approved by: https://github.com/oulgen ghstack dependencies: #127842, #127843	2024-06-08 18:49:45 +00:00
andrewor14	6e6891e843	[jit] Fix _batch_norm_with_update shape function (#122430 ) Summary: We used `native_batch_norm`'s shape function before, but the schemas are actually different. We need to create new shape functions for `_batch_norm_with_update` specifically. Test Plan: buck2 test '@fbcode//mode/opt-tsan' fbcode//caffe2/test/cpp/jit:jit -- --exact 'caffe2/test/cpp/jit:jit - TestShapeGraphLinting.Basic' Reviewers: bdhirsh, davidberard98, eellison Differential Revision: [D55211182](https://our.internmc.facebook.com/intern/diff/D55211182) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122430 Approved by: https://github.com/eellison, https://github.com/bdhirsh	2024-03-22 14:21:57 +00:00
andrewor14	773ae817f7	Batch Norm Consolidation (#116092 ) Summary: This commit simplifies the existing decomposition hierarchy of batch norm ops by adding a single, backend agnostic op: `batch_norm_with_update`. The existing hierarchy looks like: ``` aten.batch_norm -> aten._batch_norm_impl_index -> [ aten.native_batch_norm -> aten._native_batch_norm_legit (export only) -> _batch_norm_legit_cpu/cuda (kernels, export only) -> _batch_norm_cpu/cuda (kernels) ] OR [ aten.cudnn_batch_norm ] OR [ aten.miopen_batch_norm ] ``` Aside from complexity, an important problem with the above decomposition hierarchy is cuda numerics in export flows. We observed significantly worse convergence when training a mobilenetv2-like model when using the `_batch_norm_cuda` kernel instead of the `cudnn_batch_norm` kernel. This means users who export their models on CPU first then move the models to cuda later may silently see worse accuracies even when cudnn is installed, because they are using the worse kernel. This issue is summarized in https://github.com/pytorch/pytorch/issues/111384. Instead, the new hierarchy proposed by consolidating existing batch norm ops will look like: ``` aten.batch_norm -> aten.batch_norm_with_update -> [ _batch_norm_cpu (kernel) ] OR [ _batch_norm_cuda (kernel) ] OR [ cudnn_batch_norm (kernel) ] OR [ miopen_batch_norm (kernel) ] ``` The new op `batch_norm_with_update` hides backend implementation details and automatically picks the right kernel based on what is installed. This commit also adds the following variants to this op: ``` batch_norm_with_update_functional batch_norm_with_update.out batch_norm_no_update batch_norm_no_update.out batch_norm_backward ``` Note that this commit only adds this op and its variants, but does not actually change the decomps to produce these ops in the graph. This will be done after the 2 week FC window, and the ops used in the old stack is planned to be removed after the 6 month BC window. Test Plan: `OpInfo` tests for `batch_norm_with_update`. Reviewers: albanD, bdhirsh Subscribers: albanD, bdhirsh, supriyar Tasks: https://github.com/pytorch/pytorch/issues/111384 Differential Revision: [D54805279](https://our.internmc.facebook.com/intern/diff/D54805279) Co-authored-by: Tugsbayasgalan Manlaibaatar <tmanlaibaatar@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/116092 Approved by: https://github.com/bdhirsh, https://github.com/albanD	2024-03-18 21:01:30 +00:00
PyTorch MergeBot	fd0dbcd891	Revert "Batch Norm Consolidation (#116092 )" This reverts commit 7b4f70eda519ccd7f28de17689edd43c52743bc9. Reverted https://github.com/pytorch/pytorch/pull/116092 on behalf of https://github.com/osalpekar due to Causes build failure in //caffe2:aten-hip (AMD build) target. See [D54707318](https://www.internalfb.com/diff/D54707318) for more details, may require internal build system changes to resolve. ([comment](https://github.com/pytorch/pytorch/pull/116092#issuecomment-1989542965))	2024-03-11 22:22:41 +00:00
andrewor14	7b4f70eda5	Batch Norm Consolidation (#116092 ) Summary: This commit simplifies the existing decomposition hierarchy of batch norm ops by adding a single, backend agnostic op: `batch_norm_with_update`. The existing hierarchy looks like: ``` aten.batch_norm -> aten._batch_norm_impl_index -> [ aten.native_batch_norm -> aten._native_batch_norm_legit (export only) -> _batch_norm_legit_cpu/cuda (kernels, export only) -> _batch_norm_cpu/cuda (kernels) ] OR [ aten.cudnn_batch_norm ] OR [ aten.miopen_batch_norm ] ``` Aside from complexity, an important problem with the above decomposition hierarchy is cuda numerics in export flows. We observed significantly worse convergence when training a mobilenetv2-like model when using the `_batch_norm_cuda` kernel instead of the `cudnn_batch_norm` kernel. This means users who export their models on CPU first then move the models to cuda later may silently see worse accuracies even when cudnn is installed, because they are using the worse kernel. This issue is summarized in https://github.com/pytorch/pytorch/issues/111384. Instead, the new hierarchy proposed by consolidating existing batch norm ops will look like: ``` aten.batch_norm -> aten.batch_norm_with_update -> [ _batch_norm_cpu (kernel) ] OR [ _batch_norm_cuda (kernel) ] OR [ cudnn_batch_norm (kernel) ] OR [ miopen_batch_norm (kernel) ] ``` The new op `batch_norm_with_update` hides backend implementation details and automatically picks the right kernel based on what is installed. This commit also adds the following variants to this op: ``` batch_norm_with_update_functional batch_norm_with_update.out batch_norm_no_update batch_norm_no_update.out batch_norm_backward ``` Note that this commit only adds this op and its variants, but does not actually change the decomps to produce these ops in the graph. This will be done after the 2 week FC window, and the ops used in the old stack is planned to be removed after the 6 month BC window. Test Plan: `OpInfo` tests for `batch_norm_with_update`. Reviewers: albanD, bdhirsh Subscribers: albanD, bdhirsh, supriyar Tasks: https://github.com/pytorch/pytorch/issues/111384 Co-authored-by: Tugsbayasgalan Manlaibaatar <tmanlaibaatar@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/116092 Approved by: https://github.com/bdhirsh, https://github.com/albanD	2024-03-08 15:07:15 +00:00
PyTorch MergeBot	b529c19bdf	Revert "Batch Norm Consolidation (#116092 )" This reverts commit 5680f565d5b7d4aa412a3988d3d91ca4c5679303. Reverted https://github.com/pytorch/pytorch/pull/116092 on behalf of https://github.com/jeffdaily due to broke ROCm, PR signal was clean but trunk was not, the merge should have been blocked but wasn't ([comment](https://github.com/pytorch/pytorch/pull/116092#issuecomment-1981373237))	2024-03-06 17:10:01 +00:00
Tugsbayasgalan Manlaibaatar	5680f565d5	Batch Norm Consolidation (#116092 ) Summary: This commit simplifies the existing decomposition hierarchy of batch norm ops by adding a single, backend agnostic op: `batch_norm_with_update`. The existing hierarchy looks like: ``` aten.batch_norm -> aten._batch_norm_impl_index -> [ aten.native_batch_norm -> aten._native_batch_norm_legit (export only) -> _batch_norm_legit_cpu/cuda (kernels, export only) -> _batch_norm_cpu/cuda (kernels) ] OR [ aten.cudnn_batch_norm ] OR [ aten.miopen_batch_norm ] ``` Aside from complexity, an important problem with the above decomposition hierarchy is cuda numerics in export flows. We observed significantly worse convergence when training a mobilenetv2-like model when using the `_batch_norm_cuda` kernel instead of the `cudnn_batch_norm` kernel. This means users who export their models on CPU first then move the models to cuda later may silently see worse accuracies even when cudnn is installed, because they are using the worse kernel. This issue is summarized in https://github.com/pytorch/pytorch/issues/111384. Instead, the new hierarchy proposed by consolidating existing batch norm ops will look like: ``` aten.batch_norm -> aten.batch_norm_with_update -> [ _batch_norm_cpu (kernel) ] OR [ _batch_norm_cuda (kernel) ] OR [ cudnn_batch_norm (kernel) ] OR [ miopen_batch_norm (kernel) ] ``` The new op `batch_norm_with_update` hides backend implementation details and automatically picks the right kernel based on what is installed. This commit also adds the following variants to this op: ``` batch_norm_with_update_functional batch_norm_with_update.out batch_norm_no_update batch_norm_no_update.out batch_norm_backward ``` Note that this commit only adds this op and its variants, but does not actually change the decomps to produce these ops in the graph. This will be done after the 2 week FC window, and the ops used in the old stack is planned to be removed after the 6 month BC window. Test Plan: `OpInfo` tests for `batch_norm_with_update`. Reviewers: albanD, bdhirsh Subscribers: albanD, bdhirsh, supriyar Tasks: https://github.com/pytorch/pytorch/issues/111384 Co-authored-by: Tugsbayasgalan Manlaibaatar <tmanlaibaatar@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/116092 Approved by: https://github.com/bdhirsh, https://github.com/albanD	2024-03-06 04:50:46 +00:00
Edward Z. Yang	3bf922a6ce	Apply UFMT to low traffic torch modules (#106249 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106249 Approved by: https://github.com/Skylion007	2023-07-29 23:37:30 +00:00
Justin Chu	4cc1745b13	[BE] f-stringify torch/ and scripts (#105538 ) This PR is a follow up on the pyupgrade series to convert more strings to use f-strings using `flynt`. - https://docs.python.org/3/reference/lexical_analysis.html#f-strings - https://pypi.org/project/flynt/ Command used: ``` flynt torch/ -ll 120 flynt scripts/ -ll 120 flynt tools/ -ll 120 ``` and excluded `collect_env.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105538 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-07-21 19:35:24 +00:00
Nikita Karetnikov	b1c31b1d26	[pt2] metas and `SymInt` support for `max_pool` ops (#103951 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103951 Approved by: https://github.com/Chillee, https://github.com/kulinseth	2023-07-01 01:33:35 +00:00
ecao	223f232928	Fix shape function for transpose convolution (#102139 ) Fixes #98129. Fixes the shape function for jit conv_transpose, as defined by the documentation https://pytorch.org/docs/stable/generated/torch.nn.ConvTranspose2d.html#torch.nn.ConvTranspose2d, includes output_padding. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102139 Approved by: https://github.com/mingfeima, https://github.com/davidberard98	2023-06-21 17:50:56 +00:00
PyTorch MergeBot	380ccfd442	Revert "Added round_with_scale_factor arg to ATen (#97868 )" This reverts commit aa99c5b4eda345f792687c490e72c8575110977a. Reverted https://github.com/pytorch/pytorch/pull/97868 on behalf of https://github.com/osalpekar due to Caused breakages in the glow compiler - see [D45374622](https://www.internalfb.com/diff/D45374622) for more details	2023-04-28 20:47:00 +00:00
vfdev-5	aa99c5b4ed	Added round_with_scale_factor arg to ATen (#97868 ) Addresses #62396 following the strategy described in https://github.com/pytorch/pytorch/pull/64983#issuecomment-1026177629. Fixing output size to match opencv, scikit-image, scipy if scale factor is specified on ATen side only due to JIT FC. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97868 Approved by: https://github.com/lezcano, https://github.com/mikaylagawarecki	2023-04-26 18:48:37 +00:00
Vivek Khandelwal	bb4998b531	Add shape function for `aten::cross_entropy_loss` (#97875 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/97875 Approved by: https://github.com/davidberard98	2023-04-12 22:11:56 +00:00
Vivek Khandelwal	5810f5ad1a	Fix `aten::squeeze.dims` shape function (#98078 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Fixes https://github.com/llvm/torch-mlir/issues/1690#issuecomment-1491931180. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98078 Approved by: https://github.com/davidberard98	2023-03-31 20:24:09 +00:00
Vivek Khandelwal	428540001d	Add shape function for squeeze.dims op (#93919 ) Changes to `_native_batch_norm_legit` and `upsample_nearest2d` in `serialized_shape_function_registry.cpp` are made just because this file is auto-generated, and the file was not auto-generated after the changes in `_shape_functions.py` for those two ops. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93919 Approved by: https://github.com/davidberard98	2023-03-28 14:55:00 +00:00
lijiahao	3d5eba811a	Add shape function for stack op (#92205 ) As @ramiro050 requested in https://github.com/llvm/torch-mlir/pull/1747, this PR moved the shape code for stack op from torch-mlir to pytorch upstream. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92205 Approved by: https://github.com/eellison	2023-03-07 20:45:56 +00:00
Vivek Khandelwal	f77a9a585c	Add shape function for movedim op (#91696 ) Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91696 Approved by: https://github.com/davidberard98	2023-01-06 18:24:52 +00:00
Jane Xu	8695f0cced	Rectify `native_batch_norm` schema by splitting it into two legit schemas (#88697 ) Using the same repro from the issue (but with BatchNorm2D) Rectifies native_batch_norm schema by splitting the schema into 2: 1. one will have NON-optional alias-able running_mean and running_var inputs 2. the other will just not have those parameters at all (no_stats variation) Calling for name suggestions! ## test plan I've added tests in test_functionalization.py as well as an entry in common_method_invocations.py for `native_batch_norm_legit` CI should pass. ## next steps Because of bc/fc reasons, we reroute native_batch_norm to call our new schemas ONLY through the python dispatcher, but in 2 weeks or so, we should make `native_batch_norm_legit` the official batch_norm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88697 Approved by: https://github.com/albanD	2022-11-23 23:23:17 +00:00
Prashant Kumar	7ddf167ba5	Move the asserts in shape functions upsample_nearest_2d op. (#85801 ) The assert check are moved to top and the function now returns out. This is needed by the downstream torch-mlir project to correctly determine the output type. Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/85801 Approved by: https://github.com/eellison	2022-09-30 18:30:06 +00:00
George Petterson	35d4fa444b	Fix for transposed convolution shape functions (#83557 ) This fixes an issue with #80860 when in channels and out channels are different. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83557 Approved by: https://github.com/Gamrix	2022-08-22 19:05:41 +00:00
John Clow	652fb03355	Symbolic Shape Analaysis: Add Generalized List of Tensor Shape Support (#78679 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/78679 Approved by: https://github.com/davidberard98	2022-08-17 19:13:26 +00:00
Ramiro Leal-Cavazos	59fccab857	[Shape Fns] Fix handling of empty dim list in sum_mean_dim shape fn (#83357 ) The current implementation of the `sum_mean_dim` shape function takes `dim=[]` and `dim=None` to mean "no reduction". However, in the ops `torch.sum` and `torch.mean`, both `dim=[]` and `dim=None` are equivalent to "reduce along all dimensions". This commit fixes the handling of `dim` in the `sum_mean_dim` shape function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83357 Approved by: https://github.com/Gamrix	2022-08-16 17:13:21 +00:00
John Clow	c177a7124c	Adding additional debug logging and documentation for shape functions (#77115 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/77115 Approved by: https://github.com/eellison	2022-08-15 23:39:28 +00:00
George Petterson	a7e7fbab82	Add shape functions for conv_transpose2d.input and convolution (#80860 ) As @silvasean requested in [this issue](https://github.com/llvm/torch-mlir/pull/917#discussion_r896154545) here is the shape code from Torch-MLIR for conv_transpose2d.input and convolution (updated for the transposed case). Pull Request resolved: https://github.com/pytorch/pytorch/pull/80860 Approved by: https://github.com/Gamrix	2022-08-13 01:19:59 +00:00
John Clow	c8eae2de52	[Shape Fns] Fix optional None for the actual shape functions (#83092 ) I think the person who edited `mean.dim` edited the python and the associated Torchscript version manually in two different ways. This diff fixes that all up. It also fixes the inconsistencies in the `torch.nonzero` generated file Pull Request resolved: https://github.com/pytorch/pytorch/pull/83092 Approved by: https://github.com/eellison	2022-08-10 18:20:18 +00:00
Kurt Mohler	2bfae07a79	Enable `dim=None` for `torch.mean` (#81286 ) Part of #79525 This will require coordination with XLA before merging, just like #79881 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81286 Approved by: https://github.com/albanD	2022-07-28 22:34:56 +00:00
Kurt Mohler	23bdb570cf	Reland: Enable `dim=None` for `torch.sum` (#79881 ) Part of #29137 Reland of #75845 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79881 Approved by: https://github.com/albanD, https://github.com/kulinseth	2022-07-09 00:54:42 +00:00
PyTorch MergeBot	ee6ebfc06b	Revert "Enable `dim=None` for `torch.sum` (#75845 )" This reverts commit e79a51f7db181be2e6e196d6d9d90403022bc465. Reverted https://github.com/pytorch/pytorch/pull/75845 on behalf of https://github.com/malfet due to Breaks MacOS builds, see `e79a51f7db`	2022-06-16 22:01:41 +00:00
Kurt Mohler	e79a51f7db	Enable `dim=None` for `torch.sum` (#75845 ) Part of #29137 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75845 Approved by: https://github.com/ezyang	2022-06-16 20:17:07 +00:00
John Clow	dbee7e5499	Adding SSA support for convolution_backward Pull Request resolved: https://github.com/pytorch/pytorch/pull/77283 Approved by: https://github.com/Krovatkin	2022-05-20 18:39:47 +00:00
John Clow	2a99018147	Adding a way to register both upper and lower bound functions Pull Request resolved: https://github.com/pytorch/pytorch/pull/77388 Approved by: https://github.com/eellison	2022-05-18 17:34:07 +00:00
Sean Silva	1136965aa1	Upstream remaining shape functions from Torch-MLIR. (#76889 ) Follow-on to https://github.com/pytorch/pytorch/pull/76592 adding the rest. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76889 Approved by: https://github.com/eellison	2022-05-13 17:46:58 +00:00
John Clow	db21e22b4b	[EASY] Quick Fix for broken shape function autogen. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76703 Approved by: https://github.com/eellison	2022-05-03 17:34:05 +00:00
Sean Silva	6b6c63ce5e	Upstream `argmax` shape function. Keeping this first commit simple to test out the flow. Will bulk-add the rest once this one goes through. Shape function taken from: `5192a4e9f3/python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/shape_lib_gen.py (L488)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76592 Approved by: https://github.com/eellison	2022-05-03 16:52:09 +00:00
Elias Ellison	f65eb09d6b	[JIT] Move Shape Function definition to python Moves jit shape function registration to python. Like jit decompositions, a script must be run after adding new definitions which serializes them in a c++ file. This was a request so that torch-mlir could define functions in python and upstream their shape functions. cc @silvasean @makslevental Pull Request resolved: https://github.com/pytorch/pytorch/pull/75546 Approved by: https://github.com/davidberard98	2022-04-19 20:59:44 +00:00

40 Commits