pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-11-11 22:34:53 +08:00

Author	SHA1	Message	Date
Edward Z. Yang	e6ec0efaf8	Apply UFMT to all non test/torch files (#106205 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106205 Approved by: https://github.com/albanD	2023-07-29 02:56:24 +00:00
Richard Zou	4068c5467d	[Reland] Move functorch/_src to torch/_functorch (#88756 ) (#90091 ) This will be the last disruptive functorch internals change. Why are we moving these files? - As a part of rationalizing functorch we are moving the code in functorch/_src to torch/_functorch - This is so that we can offer the functorch APIs as native PyTorch APIs (coming soon) and resolve some internal build issues. Why are we moving all of these files at once? - It's better to break developers all at once rather than many times Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/90091 Approved by: https://github.com/anijain2305, https://github.com/ezyang	2022-12-03 14:17:15 +00:00
Edward Z. Yang	8d333761a9	When dealing with dupe arguments, prefer leafifying if possible (#89896 ) See code comment for details. I also had to do some extra fixes: * `run_functionalized_fw_and_collect_metadata` now is able to handle duplicated arguments * `aot_wrapper_dedupe` now always returns boxed compiled functions * `aot_wrapper_dedupe` is now applied to inference compiler along with autograd compiler (preexisting) Fixes https://github.com/pytorch/torchdynamo/issues/1939 Fixes DebertaV2ForQuestionAnswering DebertaForMaskedLM DebertaForQuestionAnswering DebertaV2ForMaskedLM Repro command: ``` python benchmarks/dynamo/huggingface.py --performance --float32 -dcuda --training --inductor --no-skip --dashboard --only DebertaForQuestionAnswering --cold_start_latency ``` Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89896 Approved by: https://github.com/bdhirsh	2022-12-01 13:42:29 +00:00
PyTorch MergeBot	218d9c6e09	Revert "Move functorch/_src to torch/_functorch (#88756 )" This reverts commit 52bc5c1cfe098fd4b4b13902b4fea83b455b9773. Reverted https://github.com/pytorch/pytorch/pull/88756 on behalf of https://github.com/clee2000 due to broke imports in tests `52bc5c1cfe` https://github.com/pytorch/pytorch/actions/runs/3574742513/jobs/6010814968 probably a landrace	2022-11-29 17:17:11 +00:00
Richard Zou	52bc5c1cfe	Move functorch/_src to torch/_functorch (#88756 ) This will be the last disruptive functorch internals change. Why are we moving these files? - As a part of rationalizing functorch we are moving the code in functorch/_src to torch/_functorch - This is so that we can offer the functorch APIs as native PyTorch APIs (coming soon) and resolve some internal build issues. Why are we moving all of these files at once? - It's better to break developers all at once rather than many times Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/88756 Approved by: https://github.com/ezyang	2022-11-29 13:55:42 +00:00
Brian Hirsh	e20ec44544	fixes for inductor <> batch norm (#89603 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89603 Approved by: https://github.com/albanD	2022-11-29 02:16:52 +00:00
WeberXie	c18da597e0	[skip ci] documentation update for the kwargs defaults section of fun… (#89719 ) In this doc, it's better to multiply the scale instead of the constant 4.0 to illustrate the default of kwargs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89719 Approved by: https://github.com/kit1980, https://github.com/malfet	2022-11-28 21:49:26 +00:00
Edward Z. Yang	e8643ded6d	Revert "Don't allow recomputing a node that must be materialized in the backwards pass (#89171 )" (#89770 ) This reverts commit e36d68af8885f27d8c0b4727ab078bf53e55e7a0. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89770 Approved by: https://github.com/anijain2305	2022-11-28 20:02:07 +00:00
Edward Z. Yang	b87c45d5a7	Make aot_module_simplified accept fake tensors (#89670 ) Strategy taken from voz's #89392 but my implementation strategy is a bit different. If a fake tensor is provided, we use its FakeTensorMode (and more importantly, its ShapeEnv--this is what is tested in the new unit test). Only one tensor needs to be fake; if nothing is fake we just make a fresh mode as before. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89670 Approved by: https://github.com/voznesenskym	2022-11-28 18:39:18 +00:00
Edward Z. Yang	abf91562bd	Change aot_module_simplified to take take arguments directly (#89669 ) This is extracted from voz's #89392 Previously, the implementation did some half-assed caching where it returned a callable, that when invoked for the first time, actually performed the compilation. Delaying the compilation like this... seems totally unnecessary? To make matters worse, this has cost (we have to check if we hit the cache) and unsound (because the compiled function may not be valid for other arguments.) So instead, we ask user to provide arguments, and compile everything immediately. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89669 Approved by: https://github.com/voznesenskym, https://github.com/Chillee	2022-11-28 18:39:15 +00:00
Edward Z. Yang	b589e726d9	Refactor how AOTAutograd backends are defined (#89736 ) There was a lot of strangeness in how AOTAutograd backends were previously defined. This refactor replaces the strangeness with something simple and straightforward. The improvements: - There is no longer a footgun aot_autograd "backend" which doesn't actually work. No more mistyping `torch._dynamo.optimize("aot_autograd")` when you meant "aot_eager" - Deleted aot_print because it's annoying and anyway there's no uses of it - Instead of having BOTH the backend Subgraph and AotAutogradStrategy, there is now only an aot_autograd function which takes the kwargs to configure AOTAutograd, and then gives you a compiler function that does AOTAutograd given those kwargs. Easy. - The primary downside is that we are now eagerly populating all of the kwargs, and that can get us into import cycle shenanigans. Some cycles I resolved directly (e.g., we now no longer manually disable the forward function before passing it to aot_autograd; aot_autograd it does it for us), but for getting inductor decompositions I had to make it take a lambda so I could lazily populate the decomps later. New code is 130 lines shorter! Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89736 Approved by: https://github.com/anjali411, https://github.com/albanD	2022-11-28 18:39:12 +00:00
Michael Voznesensky	b5616cd5f4	Add simple assert to detect fake tensors on modules (#89723 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89723 Approved by: https://github.com/ezyang	2022-11-28 08:57:33 +00:00
Edward Z. Yang	db1f1144f1	Beef up AOTAutograd logging with aot_id and input descriptions (#89710 ) A few things in this PR, that I found useful while debugging some recent issues: - We now allocate an aot_id to each aot_function/aot_module invocation, and print it whenever we report error messages and graph output logging. Check the comment for why this sort of thing is useful, and also why it's different from nth_graph. This number is now incorporated into aot_graph_name - I noticed that nth_graph only gets incremented when backwards is compiled. Because backwards is compiled lazily, this means that multiple forward graphs would have gotten the same ID! I change nth_graph to always increment to avoid confusion here. - I added a simple describe_input function, which makes use of num_params_buffers to tell the user if the input index they're looking at is a param/buffer or an input. With the help of https://github.com/pytorch/pytorch/pull/89709 we could give even more detailed information about inputs (we could also easily give detailed information about parameters if we stored a mapping of index to parameter name, but I didn't need this when debugging so I'll let someone else add it if they need it.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89710 Approved by: https://github.com/bdhirsh	2022-11-28 04:52:05 +00:00
Edward Z. Yang	0e7c100c9b	Add debug asserts to AOTAutograd for input consistency with compilation (#89702 ) Fixes https://github.com/pytorch/torchdynamo/issues/1927 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89702 Approved by: https://github.com/bdhirsh	2022-11-28 00:36:58 +00:00
Edward Z. Yang	1f95f24d30	Factor input deduplication into a separate function (#89701 ) It turns out that instead of having a giant blobby aot_dispatch_autograd function, we can factor it into a series of wrapper functions, each of which successively guarantees more invariants on the inner compilation function until the final inner function is quite trivial. How exactly you have to wrap the input user functions and the output compiled functions can be expressed concisely in Haskell, so I've included the Haskell formulation in code comments. This PR shows how to do this for input deduplication. Dealing with the rest of the view handling is left to future work. This PR should also be a slight performance improvement as deduplicating is skipped entirely when there are no duplicate inputs. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89701 Approved by: https://github.com/bdhirsh	2022-11-28 00:36:58 +00:00
Horace He	e36d68af88	Don't allow recomputing a node that must be materialized in the backwards pass (#89171 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89171 Approved by: https://github.com/ngimel	2022-11-27 19:09:24 +00:00
Edward Z. Yang	c9a0cc8640	Simplify aot_module_simplified by removing top_args/top_kwargs (#89666 ) This makes good on Chillee's CR comment at `af30d351cc (r843315222)` which was never done in the original PR. There is no logic change, just unpack the args/kwargs at the top level and remove the inner function indirection. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89666 Approved by: https://github.com/voznesenskym	2022-11-25 20:43:13 +00:00
Edward Z. Yang	6168f22fae	Don't support kwargs at runtime in aot_module_simplified (#89664 ) The preexisting logic here added in https://github.com/pytorch/functorch/pull/970 was very peculiar: if top_kwargs was non-empty, then the inner compiled function supports kwargs. Naively, this would leave you to expect that there is some sort of correlation between top_kwargs and kwargs. But in fact, they're completely unrelated! top_kwargs is the AOTAutograd configuration knobs (e.g., fw_compiler/bw_compiler), but kwargs is the RUNTIME kwargs that are to be passed to the compiled function. But (1) we don't support this (the function to be compiled only takes a list of tensors) and (2) even if we did support it, conditioning on whether or not you had passed AOTAutograd configuration kwargs to support kwargs at runtime is bonkers. So delete it. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89664 Approved by: https://github.com/voznesenskym	2022-11-25 20:43:13 +00:00
Jason Ansel	95ea47ef0c	torchdynamo to torch._dynamo in aot_autograd.py (#89385 ) Test Plan: Run torchbench models Differential Revision: D41429573 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89385 Approved by: https://github.com/soumith, https://github.com/malfet	2022-11-25 04:28:36 +00:00
Jane Xu	8695f0cced	Rectify `native_batch_norm` schema by splitting it into two legit schemas (#88697 ) Using the same repro from the issue (but with BatchNorm2D) Rectifies native_batch_norm schema by splitting the schema into 2: 1. one will have NON-optional alias-able running_mean and running_var inputs 2. the other will just not have those parameters at all (no_stats variation) Calling for name suggestions! ## test plan I've added tests in test_functionalization.py as well as an entry in common_method_invocations.py for `native_batch_norm_legit` CI should pass. ## next steps Because of bc/fc reasons, we reroute native_batch_norm to call our new schemas ONLY through the python dispatcher, but in 2 weeks or so, we should make `native_batch_norm_legit` the official batch_norm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88697 Approved by: https://github.com/albanD	2022-11-23 23:23:17 +00:00
Brian Hirsh	57353c9608	first draft of input mutation handling for aot autograd (#88817 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88817 Approved by: https://github.com/ezyang, https://github.com/wconstab	2022-11-23 19:20:11 +00:00
Sean Ross-Ross	cdb798faef	_get_nested_attr should return a value in the general case (#88822 ) Fixes https://github.com/pytorch/functorch/issues/1053 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88822 Approved by: https://github.com/zou3519	2022-11-14 18:39:45 +00:00
Michael Voznesensky	06ce1338bc	[dynamo] Port all pytorch/dynamo and test/dynamo pieces over from symbolic-shapes branch (#88768 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88768 Approved by: https://github.com/jansel, https://github.com/ezyang	2022-11-13 04:50:21 +00:00
Edward Z. Yang	0e3031f7e7	Functionalize and compute joint simultaneously. (#88063 ) This also comes with some bug fixes that were uncovered from doing this: - Forward device calls to inner tensor in FunctionalTensorWrapper - Make legacyExtractDispatchKey exclude Functionalize, so that it can get at the real device type key. This is noncontroversial. - Stop stripping dense from key set. The reason for this is FunctionalWrapperTensor may be used in contexts where people query if it is dense or not. If it doesn't report this correctly (from the dispatch key), it will cause errors. This caused some torchbench models to fail when I did one-pass tracing. - Save and restore reapply views TLS correctly Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/88063 Approved by: https://github.com/bdhirsh	2022-11-05 03:52:40 +00:00
samdow	9946041a3e	[functorch] make hessian docs actually use hessian function (#88451 ) I was going through the hessian docs to find an example and noticed that these docs don't actually use the hessian function.... Pull Request resolved: https://github.com/pytorch/pytorch/pull/88451 Approved by: https://github.com/zou3519, https://github.com/Skylion007	2022-11-03 21:50:52 +00:00
Elias Ellison	ce961b3443	Dont hold onto references of saved tensors in backward (#88247 ) This improves memory compression of resnet18 on inductor non-cudagraphs from .78 -> .0.84. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88247 Approved by: https://github.com/ezyang	2022-11-03 21:24:32 +00:00
Edward Z. Yang	97d3b200ca	Unconditionally enable python dispatcher in AOTAutograd (#88365 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/88365 Approved by: https://github.com/Chillee	2022-11-03 12:52:19 +00:00
Edward Z. Yang	1ff52225f1	Unify SymIntNode and SymFloatNode into SymNode (#87817 ) This refactor was prompted by challenges handling mixed int/float operations in C++. A previous version of this patch added overloads for each permutation of int/float and was unwieldy https://github.com/pytorch/pytorch/pull/87722/ This PR takes a different approach. The general outline of the patch is to combine the C++ types SymIntNode and SymFloatNode into a single type, SymNode. This is type erased; we no longer know statically at C++ if we have an int/float and have to test it with the is_int()/is_float() virtual methods. This has a number of knock on effects. - We no longer have C++ classes to bind to Python. Instead, we take an entirely new approach to our Python API, where we have a SymInt/SymFloat class defined entirely in Python, which hold a SymNode (which corresponds to the C++ SymNode). However, SymNode is not pybind11-bound; instead, it lives as-is in Python, and is wrapped into C++ SymNode using PythonSymNode when it goes into C++. This implies a userland rename. In principle, it is also possible for the canonical implementation of SymNode to be written in C++, and then bound to Python with pybind11 (we have this code, although it is commented out.) However, I did not implement this as we currently have no C++ implementations of SymNode. Because we do return SymInt/SymFloat from C++ bindings, the C++ binding code needs to know how to find these classes. Currently, this is done just by manually importing torch and getting the attributes. - Because SymInt/SymFloat are easy Python wrappers, __sym_dispatch__ now takes SymInt/SymFloat, rather than SymNode, bringing it in line with how __torch_dispatch__ works. Some miscellaneous improvements: - SymInt now has a constructor that takes SymNode. Note that this constructor is ambiguous if you pass in a subclass of SymNode, so an explicit downcast is necessary. This means toSymFloat/toSymInt are no more. This is a mild optimization as it means rvalue reference works automatically. - We uniformly use the caster for c10::SymInt/SymFloat, rather than going the long way via the SymIntNode/SymFloatNode. - Removed some unnecessary toSymInt/toSymFloat calls in normalize_* functions, pretty sure this doesn't do anything. - guard_int is now a free function, since to guard on an int you cannot assume the method exists. A function can handle both int and SymInt inputs. - We clean up the magic method definition code for SymInt/SymFloat/SymNode. ONLY the user classes (SymInt/SymFloat) get magic methods; SymNode gets plain methods; this is to help avoid confusion between the two types. Signed-off-by: Edward Z. Yang <ezyang@fb.com> cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87817 Approved by: https://github.com/albanD, https://github.com/anjali411	2022-10-27 20:56:02 +00:00
Richard Zou	0826863962	[functorch][docs] Downgrade the warning about forward-mode AD coverage (#87383 ) Previously we claimed that "forward-mode AD coverage is not that good". We've since improved it so I clarified the statement in our docs and downgraded the warning to a note. Test Plan: - view docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/87383 Approved by: https://github.com/samdow	2022-10-20 18:51:13 +00:00
Edward Z. Yang	c97ffcff46	[discussion] fix for aot autograd outputs that dont require grad (#86838 ) Fixes https://github.com/pytorch/functorch/issues/1052 I got here after some discussion with Alban. Today, if you aot_function() trace a program where some of its inputs have `requires_grad=True`, but some outputs are expected to have `requires_grad=False`, we will incorrectly set all outputs to have `requires_grad=True`. A simple solution is to use autograd.function's API for marking outputs as non-differentiable, based on what we witnessed when we traced the forward. This will make the `autograd.Function` that we return wrong, if you created it using inputs that required grad, and tried to re-use it with inputs that have different `requires_grad` field. But as long as we're hiding behind dynamo, which should guard on requires_grad, then we'll re-run `aot_function()` and get out a new compiled function that does the right thing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86838 Approved by: https://github.com/ezyang	2022-10-19 23:41:54 +00:00
Horace He	2c1bc216b8	Fixed partitioner issue with getitem and made metadata a storage more consistent (#87012 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87012 Approved by: https://github.com/ngimel	2022-10-15 17:58:55 +00:00
Ivan Yashchuk	2cfc4cb367	Add optional recomputable_ops argument for the min cut partitioner (#86686 ) `min_cut_rematerialization_partition` has a default set of hard-coded operations that are allowed to be recomputed in the backward pass. This PR adds customization ability to this function allowing users to control the behavior by passing `recomputable_ops` instead of relying on the default setting. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86686 Approved by: https://github.com/Chillee	2022-10-14 12:15:30 +00:00
Horace He	b3b9786fdd	Unified symbolic shape variables between AOTAutograd and Inductor (#86659 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86659 Approved by: https://github.com/wconstab	2022-10-14 00:24:43 +00:00
Elias Ellison	c4f0b93f86	Disable autocast in aot autograd (#86515 ) Fix for https://github.com/pytorch/torchdynamo/issues/1368 From comment: > When we invoke a Composite Implicit autograd operator that has an autocast rule, such as Einsum, autocast is disabled during its invocation. When we trace out the operators in an implicit op, re-applying on autocast rules on those operators might yield divergence from what was executed at runtime. This pass checks for divergence. If divergence is found, we will disable autocast. We would like to avoid disabling autocast if possible because accessing TLS is slow. Concretely, the problem found was when invoked `sum` in `einsum`: As seen by the following divergence: ``` >>> with torch.cuda.amp.autocast(enabled=True): ... print(torch.ops.aten.sum.dim_IntList(torch.rand([2, 2, 2], device="cuda", dtype=torch.half), [1, 2]).dtype) ... torch.float32 >>> print(torch.ops.aten.sum.dim_IntList(torch.rand([2, 2, 2], device="cuda", dtype=torch.half), [1, 2]).dtype) torch.float16 ``` Edit: we've decided to accept the overhead of universally disabling autocast instead Pull Request resolved: https://github.com/pytorch/pytorch/pull/86515 Approved by: https://github.com/bdhirsh, https://github.com/Chillee	2022-10-12 01:43:35 +00:00
Will Constable	d8b971ed25	Fixes for partitioner with symbolic shapes (#86425 ) - supports saving symint (and symfloat..) values between fw/bwd, using sketchy logic that probably needs to be improved but seems to work so far - sets a correct weight=1 for sym nodes for cost purposes - lets user functions return symints/floats (but if the same symfloat is saved for backward, that gets duplicated annoyingly) - makes partitioning decisions based on observed trace-time sizes without guarding! (this is sketchy, but it isn't clear that it will lead to bad partitioning choices either) - improves infra for tracking symint-family of types: is_sym_node() and _py_sym_types Pull Request resolved: https://github.com/pytorch/pytorch/pull/86425 Approved by: https://github.com/ezyang	2022-10-11 01:42:28 +00:00
Elias Ellison	d04889323e	Add Context Manager for Disabling Multithreading in Backwards, use in aot autograd (#86245 ) We were running into a few issues with running multithreaded backwards in aot_autograd: such as https://github.com/pytorch/pytorch/issues/86136, and `FakeTensorMode` getting into a weird state as a result of not executing functions completely sequentially. The multithreaded backwards is lost in translation when we trace out the backwards anyway, and adds a lot of additional complexity. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86245 Approved by: https://github.com/albanD, https://github.com/yf225	2022-10-06 03:27:42 +00:00
Horace He	0e03dc5f1e	Remove softmax from recomputable ops (#86268 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86268 Approved by: https://github.com/ezyang	2022-10-05 14:16:53 +00:00
Richard Zou	a262ccea58	Change torch.autograd.graph.disable_saved_tensors_hooks to be public API (#85994 ) Also addresses some comments from the review in https://github.com/pytorch/pytorch/pull/85971 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85994 Approved by: https://github.com/albanD, https://github.com/soulitzer	2022-10-03 16:25:01 +00:00
Horace He	9bf9db57be	Refactored recomputable ops a bit and added a bunch more ops (#85993 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85993 Approved by: https://github.com/ngimel	2022-09-30 23:50:20 +00:00
Horace He	e09a84a184	Removed debug output that doesn't work with faketensors (#85992 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85992 Approved by: https://github.com/ngimel	2022-09-30 23:50:20 +00:00
Richard Zou	7238ca4c2e	Disallow saved tensor hooks in functorch transforms (#85972 ) Technically they may only be a problem with the grad transform. Though the branch cut is soon, this is the more conservative change, it also lets us disable checkpointing for functorch (which definitely doesn't work with all transforms) and not a lot of people use saved tensor hooks with functorch (I discovered this while testing). Test Plan: - new tests Differential Revision: [D39970934](https://our.internmc.facebook.com/intern/diff/D39970934) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85972 Approved by: https://github.com/samdow	2022-09-30 20:03:58 +00:00
samdow	1fae890a07	fix grad silent correctness issue from view fn followed by an inplace fn (#85374 ) From https://github.com/pytorch/functorch/issues/1007, which was an issue where we would wrap aliases of unwrapped tensors and miss the inplace error message where we should have gotten it. Instead of keeping aliases unwrapped like I had originally wanted, this simplifies it slightly such that: (1) All tensors that were previously wrapped are still wrapped. This is occasionally important because of the 1-1 relationship between a tensor and autograd meta. By keeping the same number of wrapper tensors before, we'll never have autograd try to write multiple autograd metas to the same tensor when it wouldn't before (2) The tensors that either were unwrapped tensors or aliases of unwrapped tensors now get a flag on them (now called `alias_of_unwrapped`). This way, they are still wrapper tensors (and don't have to potentially break autograd) but we can identify that they should be treated like an unwrapped tensor Pull Request resolved: https://github.com/pytorch/pytorch/pull/85374 Approved by: https://github.com/zou3519	2022-09-30 19:33:11 +00:00
PyTorch MergeBot	24adadd4db	Revert "Disallow saved tensor hooks in functorch transforms (#85829 )" This reverts commit d8277d9075396a3188490c322648605927384ba5. Reverted https://github.com/pytorch/pytorch/pull/85829 on behalf of https://github.com/atalman due to Reverting since failed build-fisp-diff-linux_platform010-opt	2022-09-30 14:32:51 +00:00
Richard Zou	d8277d9075	Disallow saved tensor hooks in functorch transforms (#85829 ) Technically they may only be a problem with the grad transform. Though the branch cut is soon, this is the more conservative change, it also lets us disable checkpointing for functorch (which definitely doesn't work with all transforms) and not a lot of people use saved tensor hooks with functorch (I discovered this while testing). Test Plan: - new tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/85829 Approved by: https://github.com/samdow	2022-09-28 22:49:28 +00:00
Horace He	0e256c2550	removed compile cache and static argnums (#85783 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85783 Approved by: https://github.com/wconstab	2022-09-28 08:33:59 +00:00
Richard Zou	848437590f	Delete functorch's monkeypatching (#85430 ) By upstreaming functorch's tensor printing logic into PyTorch. There's no way of creating a custom print function for a TensorImpl subclass (as opposed to a torch_dispatch or torch_function tensor subclass, which can just override repr()) right now, so we need to directly interpose inside regular Tensor printing in PyTorch. Monkey patching is bad; users do not expect `import blah` to change something about another library. Fixes https://github.com/pytorch/functorch/issues/900 Test Plan: - existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/85430 Approved by: https://github.com/ezyang	2022-09-22 18:47:12 +00:00
Richard Zou	5e5c319549	Move functorch python bindings to torch/csrc (#85426 ) This moves functorch's python bindings to torch/csrc/functorch/init.cpp. Coming next is the torchdim move. I didn't do torchdim yet because moving functorch's python bindings unblocks some other things that I want to do first. Test Plan: - tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/85426 Approved by: https://github.com/ezyang	2022-09-22 18:47:12 +00:00
Edward Z. Yang	25a5ada426	Typofix (#85421 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85421 Approved by: https://github.com/wconstab, https://github.com/malfet	2022-09-21 18:27:54 +00:00
Elias Ellison	d9aa6dfe88	Add Fake Cross Ref Mode, migrate sparse to it (#85382 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85382 Approved by: https://github.com/ezyang	2022-09-21 17:15:47 +00:00
Horace He	2f4a517d67	Ported matmul compositeimplicitautograd impl into core (#85239 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85239 Approved by: https://github.com/ezyang, https://github.com/lezcano	2022-09-21 09:25:24 +00:00

1 2

63 Commits