pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-22 06:11:27 +08:00

Author	SHA1	Message	Date
PyTorch MergeBot	d1c157c598	Revert "[reland] Update custom Function preserve torch function when inputs r… (#110679 )" This reverts commit 563728f61c39379070661af3a431aa49eaf5c8ac. Reverted https://github.com/pytorch/pytorch/pull/110679 on behalf of https://github.com/kit1980 due to The diff has Meta-internal changes, please land from Phabricator ([comment](https://github.com/pytorch/pytorch/pull/110679#issuecomment-1753523182))	2023-10-09 19:09:01 +00:00
soulitzer	563728f61c	[reland] Update custom Function preserve torch function when inputs r… (#110679 ) …eturned as-is reland of https://github.com/pytorch/pytorch/pull/109825#issuecomment-1749803837 Opening this without ghstack to do codev. In our PR, we changed the signature of `_wrap_outputs`. There is some internal code that calls `_wrap_outputs` directly, so we also need to update that callsite. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110679 Approved by: https://github.com/albanD	2023-10-07 00:27:45 +00:00
PyTorch MergeBot	236afe73a2	Revert "Update custom Function preserve torch function when inputs returned as-is (#109825 )" This reverts commit 4e73eee93f411596fcabb32cc8e7686890d1c7fb. Reverted https://github.com/pytorch/pytorch/pull/109825 on behalf of https://github.com/PaliC due to causing a plethora of internal failures ([comment](https://github.com/pytorch/pytorch/pull/109825#issuecomment-1749802739))	2023-10-05 23:49:41 +00:00
soulitzer	4e73eee93f	Update custom Function preserve torch function when inputs returned as-is (#109825 ) Fixes https://github.com/pytorch/pytorch/issues/109805 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109825 Approved by: https://github.com/albanD	2023-10-04 22:45:11 +00:00
cyy	d0ad848aa5	Enable misc clang-tidy checks (#110283 ) This PR enables the misc-XX checks in clang-tidy. Meanwhile, I excluded some of them that require a lot of code changes and have no immediate benefits. Some additional fixes and suppression were also given. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110283 Approved by: https://github.com/albanD	2023-09-30 10:39:52 +00:00
Pritam Damania	550b0ec3d4	Release GIL around VariableInfo::zeros to avoid deadlocks (#109454 ) See https://github.com/pytorch/pytorch/issues/109074#issue-1891369807 and https://github.com/pytorch/pytorch/issues/109074#issuecomment-1718825855 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109454 Approved by: https://github.com/albanD	2023-09-18 22:28:48 +00:00
cyy	a14d30d8d1	[1/N] apply clang-tidy in torch/csrc/autograd (#109032 ) This PR begins a new series of patches for enabling clang-tidy checks in torch/csrc/augograd Pull Request resolved: https://github.com/pytorch/pytorch/pull/109032 Approved by: https://github.com/albanD, https://github.com/Skylion007	2023-09-15 23:28:43 +00:00
cyy	36b8ca4e48	[2/N] apply clang-tidy in torch/csrc/autograd (#109277 ) This PR follows the work of PR #109032. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109277 Approved by: https://github.com/albanD	2023-09-15 00:39:12 +00:00
Alex Settle	9ba0558d48	Add sequence_nr to aot_autograd to map forward ops to their corresponding backward ops (#103129 ) Fixes #102375 Sequence_nr increments in the forward pass and decrements in the backward pass. Backward ops with the same sequence_nr as a forward op represent the backward implementation for the op. The long term goal is to make this information available to the profiler so users can observe which ops are fused by the inductor openai triton kernels. Added a test for this feature test/dynamo/test_aot_autograd.py::AotAutogradFallbackTests::test_aot_sequence_nr. The test case uses aot_export_module() to create a joint fwd/bwd fx graph. Then it walks all the nodes in fx graph using fx_graph.graph.nodes. The seq_nr of each node is recorded in node.meta. During the fwd pass the seq_nr increments and it decrements during the bwd pass. This allows the user to map forward ops to their corresponding bwd ops which is useful for performance analysis. Expected output from the test case. SeqNr\|OrigAten\|SrcFn 0\|aten.convolution.default\|l__self___conv1 0\|aten.add.Tensor\|l__self___bn1 1\|aten._native_batch_norm_legit_functional.default\|l__self___bn1 2\|aten.relu.default\|l__self___relu1 3\|aten.add.Tensor\|add 4\|aten.view.default\|flatten 5\|aten.t.default\|l__self___fc1 6\|aten.unsqueeze.default\|l__self___fc1 7\|aten.mm.default\|l__self___fc1 8\|aten.squeeze.dim\|l__self___fc1 9\|aten.add.Tensor\|l__self___fc1 10\|aten.sub.Tensor\|l__self___loss_fn 11\|aten.abs.default\|l__self___loss_fn 12\|aten.mean.default\|l__self___loss_fn 12\|aten.ones_like.default\| 12\|aten.expand.default\| 12\|aten.div.Scalar\| 11\|aten.sgn.default\| 11\|aten.mul.Tensor\| 8\|aten.unsqueeze.default\| 7\|aten.t.default\| 7\|aten.mm.default\| 7\|aten.t.default\| 7\|aten.t.default\| 7\|aten.mm.default\| 6\|aten.squeeze.dim\| 5\|aten.t.default\| 4\|aten.view.default\| 2\|aten.threshold_backward.default\| 1\|aten.native_batch_norm_backward.default\| 0\|aten.convolution_backward.default\| 0\|aten.add.Tensor\| Pull Request resolved: https://github.com/pytorch/pytorch/pull/103129 Approved by: https://github.com/soulitzer	2023-08-02 00:52:52 +00:00
Jason Ansel	457d01bcfd	[Compiled Autograd] Remove TORCH_API from generated autograd nodes (#105286 ) This works around the Windows symbol count issues in #103822. Unfortunately, removing TORCH_API only works on Windows, but causes build issues on Linux, so we need the `#ifdef`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105286 Approved by: https://github.com/albanD	2023-07-27 02:33:14 +00:00
Jason Ansel	5a114f72bf	[Compiled Autograd] Move to torch::dynamo::autograd namespace (#105854 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105854 Approved by: https://github.com/albanD	2023-07-27 00:36:47 +00:00
PyTorch MergeBot	e60af5c8e4	Revert "[Compiled Autograd] Move to torch::dynamo::autograd namespace (#105854 )" This reverts commit 26e3b4020f01d4fc2b7f63e1de4c94d2c8b362b5. Reverted https://github.com/pytorch/pytorch/pull/105854 on behalf of https://github.com/PaliC due to breaking internal embedded device tests (details shared with author) ([comment](https://github.com/pytorch/pytorch/pull/105854#issuecomment-1650559375))	2023-07-25 21:09:18 +00:00
Jason Ansel	26e3b4020f	[Compiled Autograd] Move to torch::dynamo::autograd namespace (#105854 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105854 Approved by: https://github.com/albanD	2023-07-25 01:14:04 +00:00
Jason Ansel	c902b84e0b	Compiled autograd (#103822 ) This branch: 1) converts the autograd tape into an FX graph 2) caches that conversion using a "shadow" graph 3) compiles and runs the generated FX graph instead of the normal autograd What works currently: 1) Caching, capture, and initial integration 2) Backwards hooks 3) Inlining AotAutograd generated subgraphs 4) torch.compiling the generated FX graph 5) Auto-detecting dynamic shapes based on changes Future work 1) Larger scale testing 1) Boxed calling convention, so memory can be freed incrementally 1) Support hooks on SavedTensor 1) Additional testing by running eager autograd tests under compiled_autograd.enable() Pull Request resolved: https://github.com/pytorch/pytorch/pull/103822 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-07-24 21:12:05 +00:00
soulitzer	c85468a94c	[autograd Function] Add private API to not materialize grads for non-differentiable outputs (#104291 ) Fixes https://github.com/pytorch/pytorch/issues/104272 This PR adds a new private API `materialize_non_diff_grads` (default True) such that when set to False, grad outputs corresponding to outputs marked non-differentiable would receive None instead of a zero-filled tensor. This is overrides the setting of `materialize_grads`, i.e. grad outputs corresponding non-differentiable outputs would still be None even if `materialize_grads=True` (the default). Pull Request resolved: https://github.com/pytorch/pytorch/pull/104291 Approved by: https://github.com/albanD	2023-07-08 14:53:54 +00:00
Thiago Crepaldi	3834582327	[ONNX] Add autograd_inlining flag to torch.onnx.export (#104067 ) Fixes #88286, Fixes #97160 Repro: ```python import torch import io from torch.utils.checkpoint import checkpoint class A(torch.nn.Module): # A supported module. def __init__(self): super(A, self).__init__() self.l1 = torch.nn.Linear(2, 2) def forward(self, x): return self.l1(x) class B(torch.nn.Module): # This module is not exportable to ONNX because it # uses gradient-checkpointing. However, its two sub-module's # are exportable, so ORTModule should be used to compute them. def __init__(self): super(B, self).__init__() self.l1 = torch.nn.Linear(2, 2) self.a = A() def forward(self, x): def custom(): def custom_forward(x_): return self.a(x_) return custom_forward z = self.l1(checkpoint(custom(), x)) return z torch.onnx.export( B(), (torch.randn(2, 2),), io.BytesIO(), autograd_inlining=True ) ``` `torch.onnx.export(autograd_inlining=True)` should repro the user error as this is the original execution path. ```bash Traceback (most recent call last): File "repro88286.py", line 36, in <module> torch.onnx.export( File "<@beartype(torch.onnx.utils.export) at 0x7f0f011faee0>", line 385, in export File "/opt/pytorch/torch/onnx/utils.py", line 511, in export _export( File "/opt/pytorch/torch/onnx/utils.py", line 1576, in _export graph, params_dict, torch_out = _model_to_graph( File "<@beartype(torch.onnx.utils._model_to_graph) at 0x7f0f01187dc0>", line 11, in _model_to_graph File "/opt/pytorch/torch/onnx/utils.py", line 1130, in _model_to_graph graph, params, torch_out, module = _create_jit_graph(model, args) File "/opt/pytorch/torch/onnx/utils.py", line 1006, in _create_jit_graph graph, torch_out = _trace_and_get_graph_from_model(model, args) File "/opt/pytorch/torch/onnx/utils.py", line 910, in _trace_and_get_graph_from_model trace_graph, torch_out, inputs_states = torch.jit._get_trace_graph( File "/opt/pytorch/torch/jit/_trace.py", line 1269, in _get_trace_graph outs = ONNXTracedModule(f, strict, _force_outplace, return_inputs, _return_inputs_states)(args, kwargs) File "/opt/pytorch/torch/nn/modules/module.py", line 1502, in _wrapped_call_impl return self._call_impl(args, *kwargs) File "/opt/pytorch/torch/nn/modules/module.py", line 1511, in _call_impl return forward_call(args, *kwargs) File "/opt/pytorch/torch/jit/_trace.py", line 128, in forward graph, out = torch._C._create_graph_by_tracing( File "/opt/pytorch/torch/jit/_trace.py", line 119, in wrapper outs.append(self.inner(trace_inputs)) File "/opt/pytorch/torch/nn/modules/module.py", line 1502, in _wrapped_call_impl return self._call_impl(args, kwargs) File "/opt/pytorch/torch/nn/modules/module.py", line 1511, in _call_impl return forward_call(args, *kwargs) File "/opt/pytorch/torch/nn/modules/module.py", line 1492, in _slow_forward result = self.forward(input, *kwargs) File "repro88286.py", line 32, in forward z = self.l1(checkpoint(custom(), x)) File "/opt/pytorch/torch/utils/checkpoint.py", line 412, in checkpoint return CheckpointFunction.apply(function, preserve, args) File "/opt/pytorch/torch/autograd/function.py", line 506, in apply return super().apply(args, *kwargs) # type: ignore[misc] RuntimeError: _Map_base::at ``` By using `autograd_inlining=False`, the export still fail with a different error because autograd inlining is not enabled: ```bash Traceback (most recent call last): File "repro88286.py", line 36, in <module> torch.onnx.export( File "<@beartype(torch.onnx.utils.export) at 0x7f6088b32ee0>", line 385, in export File "/opt/pytorch/torch/onnx/utils.py", line 511, in export _export( File "/opt/pytorch/torch/onnx/utils.py", line 1615, in _export ) = graph._export_onnx( # type: ignore[attr-defined] RuntimeError: ONNX export failed: Couldn't export Python operator CheckpointFunction ``` To allow `CheckpointFunction` into the onnx graph, `operator_export_type=torch.onnx.OperatorExportTypes.ONNX_FALLTHROUGH` flag can be added to `torch.onnx.export`, which would lead to the following ONNX graph: ```bash Exported graph: graph(%prim::PythonOp_0 : Float(2, 2, strides=[2, 1], requires_grad=0, device=cpu), %l1.weight : Float(2, 2, strides=[2, 1], requires_grad=1, device=cpu), %l1.bias : Float(2, strides=[1], requires_grad=1, device=cpu)): %/PythonOp_output_0 : Float(2, 2, strides=[2, 1], requires_grad=0, device=cpu) = ^CheckpointFunction[inplace=0, module="torch.utils.checkpoint", onnx_name="/PythonOp"](<function B.forward.<locals>.custom.<locals>.custom_forward at 0x7fdf9182f670>, True)(%prim::PythonOp_0), scope: __main__.B:: # /opt/pytorch/torch/autograd/function.py:506:0 %6 : Float(2, 2, strides=[2, 1], requires_grad=1, device=cpu) = onnx::Gemm[alpha=1., beta=1., transB=1, onnx_name="/l1/Gemm"](%/PythonOp_output_0, %l1.weight, %l1.bias), scope: __main__.B::/torch.nn.modules.linear.Linear::l1 # /opt/pytorch/torch/nn/modules/linear.py:114:0 return (%6) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/104067 Approved by: https://github.com/BowenBao, https://github.com/kit1980	2023-07-05 15:27:36 +00:00
soulitzer	896d997dd0	Remove incorrect THP{Cpp,}Function_traverse PyObject traversals (#102860 ) Fixes https://github.com/pytorch/pytorch/issues/102174 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102860 Approved by: https://github.com/albanD	2023-06-02 22:05:25 +00:00
PandaNinjas	f0786ad776	Use %zu instead of %ld when formatting size_t (#101412 ) This fixes compiling on systems where `size_t` is an `unsigned int` instead of an `unsigned long int` (32 bit Raspberry Pi OS is one example). `%ld` expects an `unsigned long int`, while `%zu` specifies that it's an unsigned size_t. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101412 Approved by: https://github.com/albanD	2023-05-16 02:45:55 +00:00
soulitzer	abe96654de	[reland][BE][autograd Function] Raise an error if input is returned a… (#98051 ) …s-is and saved for forward or backward in setup_context Fixes #ISSUE_NUMBER Relanding this in a new non-ghstack PR so I can import this to do co-dev Pull Request resolved: https://github.com/pytorch/pytorch/pull/98051 Approved by: https://github.com/zou3519	2023-04-11 15:42:54 +00:00
PyTorch MergeBot	45acfc8574	Revert "[BE][autograd Function] Raise an error if input is returned as-is and saved for forward or backward in setup_context (#97212 )" This reverts commit 313db584f33991c8c2520c79b6dbe11fd93d4179. Reverted https://github.com/pytorch/pytorch/pull/97212 on behalf of https://github.com/soulitzer due to Internally someone is rely on _wrap_outputs and we updated its signature	2023-03-30 22:03:07 +00:00
soulitzer	313db584f3	[BE][autograd Function] Raise an error if input is returned as-is and saved for forward or backward in setup_context (#97212 ) Fixes https://github.com/pytorch/pytorch/issues/96887 We error out in BOTH the case when graph is created and when it is not created. Still bc-breaking, but not as severe because we are limiting to the case where someone uses setup_context. This makes setup_context and non-setup_context versions diverge in their behavior - With the non-setup_context version, saved variables are assumed to have the grad_fn of the inputs. - But now with the setup_context version, we produce an error for this case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97212 Approved by: https://github.com/zou3519	2023-03-29 17:54:00 +00:00
PyTorch MergeBot	2ef6ffdfa1	Revert "[BE][autograd Function] Raise an error if input is returned as-is and saved for forward or backward in setup_context (#97212 )" This reverts commit f3aca45a163cf1aafd4f5fa65a0adce53b33abfa. Reverted https://github.com/pytorch/pytorch/pull/97212 on behalf of https://github.com/soulitzer due to TestAutogradFunctionCUDA.test_function_returns_input_inner_requires_grad_True_save_for_vjp_save_tensors_output_mark_dirty_True_cuda leaks	2023-03-28 18:30:51 +00:00
soulitzer	f3aca45a16	[BE][autograd Function] Raise an error if input is returned as-is and saved for forward or backward in setup_context (#97212 ) Fixes https://github.com/pytorch/pytorch/issues/96887 We error out in BOTH the case when graph is created and when it is not created. Still bc-breaking, but not as severe because we are limiting to the case where someone uses setup_context. This makes setup_context and non-setup_context versions diverge in their behavior - With the non-setup_context version, saved variables are assumed to have the grad_fn of the inputs. - But now with the setup_context version, we produce an error for this case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97212 Approved by: https://github.com/zou3519	2023-03-28 03:14:32 +00:00
Aaron Gokaslan	8c8cd9539d	Add missing moves to torch autograd (#92772 ) Applies some additional std::move functions to torch/csrc/autograd to opportunities that were found via static analysis. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92772 Approved by: https://github.com/ezyang	2023-01-24 02:01:52 +00:00
soulitzer	a112814a7f	Simplify retains grad hook implementation (#92604 ) How the old retains_grad hooks was implemented: - retains_grad hooks are stored on the autograd_meta, as entries in a vector - upon registration, a wrapper hook CppFunctionTensorPreHook is created to wrap that vector, and then that wrapper hook is registered to the grad_fn, i.e., by appending it to a vector of retains_grad hooks on the grad_fn - upon in-place, for the old grad_fn we set the retains_grad hook to nullptr, so that even though the old grad_fn still references the vector, the vector contains a single nullptr. For the new grad_fn, we create a new wrapper hook around the vector (storing the single retains_grad hook) on autograd_meta. The new retains_grad hook implementation: - we store std::function by value, and we store it on the grad_fn rather than the autograd_meta - a single grad_fn can have multiple outputs, so it can potentially hold multiple retains_grad hooks. We use an unordered_map (previously a vector). - on in-place we remove the hook from the old grad_fn and put it in the new grad_fn (small implication of this change is that we we now need to have access to both the old grad_fn and new grad_fn, this isn't a problem) Other details: - CppFunctionTensorPreHook took a shared_ptr to vector of std::function. In our new implementation, we add a new wrapper hook CppFunctionSingleTensorPreHook, which takes a single std::function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92604 Approved by: https://github.com/albanD	2023-01-23 20:10:46 +00:00
soulitzer	1bc60c6b31	[reland] Improve hooks ordering behavior (#92559 ) This reverts commit e525f433e15de1f16966901604a8c4c662828a8a. Original PR: #85849 Fixes #ISSUE_NUMBER In addition to reverting the revert, this PR: - defines the virtual destructor of FunctionPreHook in the header. Why? Presumably the internal build imports the header from somewhere, but does not have function_hooks.cpp (where the virtual destructor was previously defined) in the same compilation unit. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92559 Approved by: https://github.com/albanD	2023-01-19 08:17:32 +00:00
PyTorch MergeBot	e525f433e1	Revert "Improve hooks ordering behavior (#85849 )" This reverts commit 049838f2496bd1d29e4e8292714acb0042cc706e. Reverted https://github.com/pytorch/pytorch/pull/85849 on behalf of https://github.com/albanD due to fails internal build	2023-01-18 15:27:22 +00:00
Richard Zou	98b78aa11c	[autograd.Function] setup_context always appears on the Function (#92312 ) Previously, we used the existence of setup_context to switch between if forward should take a ctx object or not. To be consistent with all other staticmethod (which always exist on the autograd.Function), this PR change it so that we use IF setup_context gets overriden by the user to switch between if forward should take a ctx object or not. Fixes https://github.com/pytorch/pytorch/issues/91451 Test Plan: - existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/92312 Approved by: https://github.com/albanD, https://github.com/soulitzer	2023-01-18 02:55:42 +00:00
soulitzer	049838f249	Improve hooks ordering behavior (#85849 ) Addresses: https://github.com/pytorch/pytorch/issues/35802 Design doc: https://docs.google.com/document/d/19xSib7FFknRQ5f3ptGFUmiOt3BrgXSUlTQH2xMcZJYg/edit# ### Changes in this PR #### Implementation - We have now have 3 fields: pre_hooks, retains_grad_hooks, and tensor_pre_hooks so that we can more precisely define their ordering and when they are executed. - Since retains grad uses an entirely new field, we cannot reuse the old retains grad, logic. We refactor retains grad to call directly into the variable.cpp logic. Other logic in variable.cpp that handle cpp hooks must also be updated. #### Hooks ordering and execution: - Defines pre-hooks registered on tensor to run before pre-hooks registered on grad_fn - Updates pre-hooks registered on tensor to always run, even if they are the inputs= to .grad() - Post hooks (and pre hooks) can now observe the modifications to gradient by the tensor pre hook #### Retains grad hooks - retains grad hooks always execute last, even if there are other tensor pre-hooks registered #### Unchanged: - pre_hooks registered to grad_fn aren't expected to execute if they are the inputs= to .grad() Follow ups: - simplify retains_grad field to not be a vector, since it always holds a single hook - potentially merge capture hooks with tensor pre hooks, this would involve some additional refactoring since - python hooks registered to tensor behavior on in-place is still wrong Pull Request resolved: https://github.com/pytorch/pytorch/pull/85849 Approved by: https://github.com/albanD	2023-01-17 16:23:21 +00:00
Richard Zou	81cc9bba5e	[autograd.Function] Kill the extension feature flag (#92026 ) This PR removes the autograd.Function extension feature flag. This was previously used for development of the functorch <> autograd.Function interaction. It's been in master for long enough with the feature flag defaulting to True, so it's time to remove it. Test Plan: - existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/92026 Approved by: https://github.com/soulitzer	2023-01-17 13:36:42 +00:00
Richard Zou	7aaad0b832	Rename flag that enables/disables _SingleLevelFunction for functorch (#92025 ) functorch used to have a switch that enables/disables autograd.Function. That switch now enables/disables torch.autograd.function._SingleLevelFunction, so I've renamed it accordingly. We could just delete the switch because users should not be directly working with torch.autograd.function._SingleLevelFunction. However, it was useful for debugging when something went wrong when I was implementing the autograd.Function <> functorch interaction, so I want to keep it around as a debugging tool for a while since the code is already there. Test Plan: - updated tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/92025 Approved by: https://github.com/soulitzer	2023-01-17 13:36:41 +00:00
PyTorch MergeBot	b3603f8129	Revert "Deduplicate c10 error and PyTorchError hierarchy (#87855 )" This reverts commit 34f2d3e6ae56744c20c2f859f97101dff291bbbc. Reverted https://github.com/pytorch/pytorch/pull/87855 on behalf of https://github.com/osalpekar due to perf regression in quantization tests	2023-01-06 19:56:35 +00:00
William Phetsinorath	34f2d3e6ae	Deduplicate c10 error and PyTorchError hierarchy (#87855 ) Fixes #53370 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87855 Approved by: https://github.com/albanD	2023-01-02 15:53:36 +00:00
soulitzer	1b2ee4d0e1	Update functorch supported autograd.Function to allow mark_dirty (#91222 ) Fixes https://github.com/pytorch/pytorch/issues/90225 Uses what was originally in `32a57bcdb6` Pull Request resolved: https://github.com/pytorch/pytorch/pull/91222 Approved by: https://github.com/zou3519	2022-12-28 03:53:47 +00:00
soulitzer	b66862ba87	[autograd Function] Don't materialize forward grad for non-differentiable types (#91183 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91183 Approved by: https://github.com/zou3519	2022-12-21 05:05:44 +00:00
Richard Zou	7342251281	functorch.grad support for autograd.Function (#89860 ) Happy to split this PR more if it helps. This PR adds functorch.grad support for autograd.Function. There's a lot going on; here is the high level picture and there are more details as comments in the code. Mechanism (PyOperator) - Somehow, autograd.Function needs to dispatch with functorch. This is necessary because every layer of functorch needs to see the autograd.Function; grad layers need to preserve the backward pass. - The mechanism for this is via PyOperator. If functorch transforms are active, then we wrap the autograd.Function in a `custom_function_call` PyOperator where we are able to define various rules for functorch transforms. - `custom_function_call` has a rule for the functorch grad transform. autograd.Function changes - I needed to make some changes to autograd.Function to make this work. - First, this PR splits autograd.Function into a _SingleLevelFunction (that works with a single level of functorch transform) and autograd.Function (which works with multiple levels). This is necessary because functorch's grad rule needs some way of specifying a backward pass for that level only. - This PR changes autograd.Function's apply to eitehr call `custom_function_call` (if functorch is active) or super().apply (if functorch isn't active). Testing - Most of this PR is just testing. It creates an autograd.Function OpInfo database that then gets passed to the functorch grad-based tests (grad, vjp, vjpvjp). - Since functorch transform tests are autogenerated from OpInfo tests, this is the easiest way to test various autograd.Function with functorch. Future - jvp and vmap support coming next - better error message (functorch only supports autograd.Function that have the optional setup_context staticmethod) - documentation to come when we remove the feature flag Pull Request resolved: https://github.com/pytorch/pytorch/pull/89860 Approved by: https://github.com/soulitzer	2022-12-08 19:31:04 +00:00
Richard Zou	eb314f9b1a	Add setup_context staticmethod to autograd.Function (#89859 ) Adds a setup_context staticmethod to autograd.Function. If it exists, then the user splits the ctx-specific logic from the forward() and puts it in the setup_context staticmethod. Docs will come later when we remove the feature flag. Test Plan: - some light tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/89859 Approved by: https://github.com/soulitzer	2022-12-08 19:31:04 +00:00
Nikita Shulga	a268b9e53c	Fix yet another C++17 Windows build issue (#90228 ) Not sure why, but top-level `using namespace` directive causes VC++ fail with (if C++17 standard is used, but everything is fine with C++14): ``` C:\actions-runner\_work\pytorch\pytorch\third_party\pybind11\include\pybind11\detail\../pytypes.h(1520): error C2872: 'attr': ambiguous symbol C:\actions-runner\_work\pytorch\pytorch\aten\src\ATen/core/interned_strings.h(349): note: could be 'c10::attr' C:\actions-runner\_work\pytorch\pytorch\torch/csrc/jit/ir/ir.h(75): note: or 'torch::jit::attr' C:\actions-runner\_work\pytorch\pytorch\cmake\..\third_party\pybind11\include\pybind11/pybind11.h(1094): note: see reference to function template instantiation 'pybind11::str pybind11::str::format<_Ty1&>(_Ty1 &) const' being compiled with [ _Ty1=pybind11::handle ] ``` Solve this by replacing global `using namespace torch::jit;` with specific usages of objects/methods from namespaces Another prep change for https://github.com/pytorch/pytorch/70188 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90228 Approved by: https://github.com/kit1980, https://github.com/albanD	2022-12-06 01:35:19 +00:00
soulitzer	b567742038	Add ability to register prehooks to grad_fn (#83226 ) This simply replicates the implementation of PyFunctionPostHooks Fixes https://github.com/pytorch/pytorch/issues/83120 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83226 Approved by: https://github.com/albanD	2022-08-13 00:05:07 +00:00
BowenBao	cb2cb94074	[ONNX] Look at owningBlock instead of graph when recording autograd subgraph (#82852 ) Small adjustment to ensure the node always exists. `graph->nodes()` might not contain the autograd node, if it resides in additional subgraphs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82852 Approved by: https://github.com/shubhambhokare1, https://github.com/abock, https://github.com/malfet	2022-08-12 23:25:14 +00:00
Horace He	ea51e87b52	Added list clearing codegen to AOTAutograd (hidden behind config.aot_clear_list (#83137 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83137 Approved by: https://github.com/jansel, https://github.com/albanD	2022-08-12 22:52:16 +00:00
soulitzer	ccb7d56a18	Rename PyFunctionPreHook to PyFunctionTensorPreHook (#83225 ) Now that there will be two types of Python function prehooks, I prefer have the PyFunction hook taking all grad_outputs and returning all grad_inputs as the more "canonical" one Pull Request resolved: https://github.com/pytorch/pytorch/pull/83225 Approved by: https://github.com/albanD	2022-08-12 22:14:32 +00:00
shubhambhokare1	95d873855e	[ONNX] Inline prim::PythonOp for Autograd Function Export (#74765 ) Add flag (inline_autograd) to enable inline export of model consisting of autograd functions. Currently, this flag should only be used in TrainingMode.EVAL and not for training. An example: If a model containing ``autograd.Function`` is as follows ``` class AutogradFunc(torch.autograd.Function): @staticmethod def forward(ctx, i): result = i.exp() result = result.log() ctx.save_for_backward(result) return result ``` Then the model is exported as ``` graph(%0 : Float): %1 : Float = ^AutogradFunc(%0) return (%1) ``` If inline_autograd is set to True, this will be exported as ``` graph(%0 : Float): %1 : Float = onnx::Exp(%0) %2 : Float = onnx::Log(%1) return (%2) ``` If one of the ops within the autograd module is not supported, that particular node is exported as is mirroring ONNX_FALLTHROUGH mode Fixes: #61813 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74765 Approved by: https://github.com/BowenBao, https://github.com/malfet	2022-08-03 23:30:19 +00:00
Edward Z. Yang	df69660832	Revert "Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )"" (#82599 ) This reverts commit 532b8a9e00d7eea2636e67621bfcfa34d9c85bcb. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82599 Approved by: https://github.com/albanD	2022-08-02 19:37:02 +00:00
PyTorch MergeBot	532b8a9e00	Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )" This reverts commit 9465c0e0b50f3c37bc150ef0016238ba33eca6f4. Reverted https://github.com/pytorch/pytorch/pull/82552 on behalf of https://github.com/zengk95 due to This seems to be breaking windows binary wheels	2022-08-01 20:25:35 +00:00
Edward Z. Yang	9465c0e0b5	Add a lint rule for torch/csrc/util/pybind.h include (#82552 ) We define specializations for pybind11 defined templates (in particular, PYBIND11_DECLARE_HOLDER_TYPE) and consequently it is important that these specializations always be #include'd when making use of pybind11 templates whose behavior depends on these specializations, otherwise we can cause an ODR violation. The easiest way to ensure that all the specializations are always loaded is to designate a header (in this case, torch/csrc/util/pybind.h) that ensures the specializations are defined, and then add a lint to ensure this header is included whenever pybind11 headers are included. The existing grep linter didn't have enough knobs to do this conveniently, so I added some features. I'm open to suggestions for how to structure the features better. The main changes: - Added an --allowlist-pattern flag, which turns off the grep lint if some other line exists. This is used to stop the grep lint from complaining about pybind11 includes if the util include already exists. - Added --match-first-only flag, which lets grep only match against the first matching line. This is because, even if there are multiple includes that are problematic, I only need to fix one of them. We don't /really/ need this, but when I was running lintrunner -a to fixup the preexisting codebase it was annoying without this, as the lintrunner overall driver fails if there are multiple edits on the same file. I excluded any files that didn't otherwise have a dependency on torch/ATen, this was mostly caffe2 and the valgrind wrapper compat bindings. Note the grep replacement is kind of crappy, but clang-tidy lint cleaned it up in most cases. See also https://github.com/pybind/pybind11/issues/4099 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82552 Approved by: https://github.com/albanD	2022-08-01 17:16:58 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
alexmsettle	c0a6add7ee	Changes to support input sequence ID tracking (#70264 ) Summary: in the NVTX markers. This feature adds additional information to the NVTX marker string eg seq_ids=[101, 102, 103]. This indicates the sequence id of the op which produced the input tensor based on its position index in the array. In the above example input tensor 0 was produced by the node with sequence id 101, input tensor 1 is from node 102, input tensor 2 is from node with sequence id 103. This is the same way the sizes array is organized. If you know the sequence id of the node and the sequence ids of the input edges, then you have enough information to construct the network graph. Fixes https://github.com/pytorch/pytorch/issues/66105 Pull Request resolved: https://github.com/pytorch/pytorch/pull/70264 Reviewed By: chaekit Differential Revision: D34792707 Pulled By: robieta fbshipit-source-id: 4407b853c929a737505803b0db77a8ecd966cce2 (cherry picked from commit cd3c0c8c9d4d63d7897f60521c407883240d1d5b)	2022-03-31 22:15:39 +00:00
Alban Desmaison	b2a5507654	Fix deadlock in some edge case in autograd (#73961 ) Summary: Minimal example that deadlocks before but not after: ```python import torch from torch.autograd import Function class Foo(Function): staticmethod def forward(ctx, x): return x.clone() staticmethod def forward(ctx, gO): return gO.clone() def get_out(): inp = torch.rand(2, requires_grad=True) # The python function is first so that it runs # last in the backward pass right = Foo.apply(inp) # An op that creates new memory left1 = inp.clone() # An op that saves its input left2 = left1 ** 2 # Inplace modify so that the backward for # left2 always raises an error left1 += 1 # An op that takes both side as input. # After running, both side's last op will be in # the ready queue # And the op for left will run first as it was # executed last during the forward out = left2 + right return out # Nothing should be global variables here as, from what # I can see, python leaks all the global objects get_out().sum().backward() ``` Since this requires the python interpreter to die, it is hard to test in CI. Let me know if you have an idea how to do it though. Pull Request resolved: https://github.com/pytorch/pytorch/pull/73961 Reviewed By: malfet Differential Revision: D34752747 Pulled By: albanD fbshipit-source-id: 1a537b1f733e161e8d3ff053cd432b37b34d432a (cherry picked from commit 17943e4c04c782d81deab439e010195f04e75bbd)	2022-03-09 20:42:15 +00:00
BowenBao	341e20a1b6	[ONNX] Add module name as PythonOp attribute (#67193 ) (#73281 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73281 * Add module name as pythonOp attr * Move to trace_post_record * Add tests * Code compactness Test Plan: Imported from OSS Reviewed By: jbschlosser Differential Revision: D34625647 Pulled By: malfet fbshipit-source-id: b04b2a4f1dc2cf733fcf50a3b022337f80d6eead (cherry picked from commit 56e8658974e0a5f7faab211d51b3e425886bff8a)	2022-03-09 14:26:18 +00:00

1 2 3 4 5 ...

285 Commits