This fixes most of https://github.com/huggingface/diffusers/issues/10795,
except for `torch.Tensor._make_subclass`, which will be fixed in a
subsequent patch.
The relevant tensor subclass from the aforementioned issue is defined
here: fbf6b856cc/src/diffusers/quantizers/gguf/utils.py (L398-L435).
There are three things to note about the tensor subclass:
1. it calls `super().__torch_function__`, which is
`torch._C._disabled_torch_function_impl`, so this patch updates
`SuperVariable.call_method` to handle it (we can't use a simpler
polyfill due to a bug where `var_getattr` raises
`NotImplementedError` without restoring the symbolic context).
2. it sets and reads attributes (`quant_type`), and
defines new methods (`as_data`), so this patch adds support for those.
3. it has an `__init__`, which Dynamo needs to trace through in
`TensorSubclassVariable.call_function` (see the sketch below).
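For illustration, here is a minimal sketch of a subclass exercising all three points. This is not the actual `GGUFParameter` from diffusers; the class name and the `q4_0` string are made up, and only the three traced behaviors matter:
```python
import torch

class QuantParameter(torch.nn.Parameter):
    # Hypothetical stand-in for the GGUF parameter subclass.
    def __new__(cls, data, quant_type=None, requires_grad=False):
        return super().__new__(cls, data, requires_grad=requires_grad)

    def __init__(self, data, quant_type=None, requires_grad=False):
        # (3) Dynamo traces through this __init__
        super().__init__()
        self.quant_type = quant_type  # (2) sets a new attribute

    def as_data(self):  # (2) defines a new method
        return self.detach()

    @classmethod
    def __torch_function__(cls, func, types, args=(), kwargs=None):
        # (1) super().__torch_function__ resolves to nn.Parameter's
        # torch._C._disabled_torch_function_impl
        return super().__torch_function__(func, types, args, kwargs or {})

p = QuantParameter(torch.ones(3), quant_type="q4_0")
print(p.quant_type, p.as_data())
```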
Differential Revision: [D71906140](https://our.internmc.facebook.com/intern/diff/D71906140)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/149482
Approved by: https://github.com/jansel, https://github.com/mlazos
This is for "for some large number Z, make sure the error messages are readable English." - beginning to audit all `unimplemented` sites and making sure that all messages are at least English-readable. Hints may not necessarily be provided.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/147385
Approved by: https://github.com/jansel
This PR adds support for list subclasses. Among other things, it includes:
1) Tracking mutations on internal vts like `_dict_vt` and `_list_vt` using sources. This helps identify whether there was a mutation in the underlying data structures, so we know when to reconstruct them.
2) `UserDefinedObjectVariable` now has a new method, `is_modified`, which the `side_effect` infra relies on to check mutations in the underlying vts (like `_dict_vt`).
3) The reconstruction logic ensures that we use the `dict.__getitem__` and `list.__getitem__` methods. This is important because we don't want to call any overridden `__getitem__` methods (see the sketch below).
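For intuition, here is a hypothetical list subclass where invoking the overridden `__getitem__` during reconstruction would be observable (the class and the side effect are made up for illustration):
```python
import torch

class LoggingList(list):
    # Hypothetical list subclass with an overridden __getitem__.
    def __getitem__(self, idx):
        print("side effect we must not trigger during reconstruction")
        return list.__getitem__(self, idx)

@torch.compile(backend="eager")
def fn(xs, t):
    xs.append(t + 1)  # mutation on the list subclass that Dynamo must reconstruct
    return t * 2

xs = LoggingList([torch.zeros(3)])
fn(xs, torch.ones(3))
# When emitting the bytecode that rebuilds/updates `xs`, reading elements via
# list.__getitem__ avoids running the overridden method above.
```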
If this PR is hard to review, please let me know. I can break it into several small PRs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/146819
Approved by: https://github.com/StrongerXi, https://github.com/jansel
Previously we never replayed side effects on a `DequeVariable` with a
source; the bug was already present in the `test_deque_input` test, but went
unnoticed because we didn't check the deque objects.
This patch adds limited but practical support for this (see comments in
`side_effects.py` for why limited), and updates the deque tests to check
for this.
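As a hypothetical illustration (not the actual test), this is the kind of input-deque mutation whose side effect now gets replayed:
```python
import collections
import torch

def fn(x, dq):
    dq.appendleft(x.sum())  # mutation of a deque that has a source (it is an input)
    return x + 1

dq = collections.deque([1, 2])
out = torch.compile(fn, backend="eager")(torch.ones(3), dq)
# After the call, `dq` should contain the appended element, matching eager semantics.
```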
Pull Request resolved: https://github.com/pytorch/pytorch/pull/141714
Approved by: https://github.com/jansel
ghstack dependencies: #141713
This patch
1. removes `AutoDerefLocalSource` in favor of `LocalSource`, thereby
removing its special handling in guards.
2. introduces a `LocalCellSource` for cells from the root frame, with
only `reconstruct` implemented, to programmatically enforce that these
cells should never be used by other components like guards.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/141629
Approved by: https://github.com/jansel
ghstack dependencies: #141628
In addition to `NewCellVariable`, Dynamo has 3 ways of modeling cell objects:
1. For cells captured and created by the root frame, represent them as
their contents in `root_tx.symbolic_locals`, which `LOAD_DEREF` and
`STORE_DEREF` update directly, without going through `SideEffects`.
2. `ClosureVariable`: this is created when cells from (1) are captured
by a newly created function Dynamo is about to inline. It's a handle
with a name that redirects `LOAD_DEREF` and `STORE_DEREF` back to (1),
to make `root_tx.symbolic_locals` up-to-date.
3. For cells that are captured by both the root frame and some
pre-existing function Dynamo is about to inline, represent those
cells as contents, and do not allow writes to them.
Note that (2) and (3) are mainly to conform with (1) -- to make sure
Dynamo has a consistent modeling of cells for the same cell objects.
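For orientation, a small illustrative example of the cells in question (cases (1) and (2) above):
```python
import torch

@torch.compile(backend="eager")
def root(x):
    y = x + 1          # `y` lives in a cell created by the root frame (case 1)

    def inner():
        return y * 2   # `inner` captures that cell; when Dynamo inlines it,
                       # the handle it uses is a ClosureVariable (case 2)

    return inner()

root(torch.ones(3))
```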
In this patch, we represent all of these cells as `NewCellVariable`. The
main new code paths introduced are:
- using `NewCellVariable` to model cell objects created by the root
  frame (the cells are passed in as input to `InstructionTranslator`);
  this is what allows us to get rid of all 3 legacy paths above.
- adding a new `AutoDerefLocalSource` to deal with the python-code
level (guards) and bytecode level (codegen) auto-dereferencing
behavior, when accessing pre-existing python cells. This also
involves a tiny update to guard manager generation.
- plumbing some extra info into `LocalSource` and `CellVariable` so that
we can still emit `LOAD_DEREF`, `STORE_DEREF`, `LOAD_CLOSURE` (instead
of `make_cell`, `cell_contents` attribute access, and `LOAD_FAST`),
which is important for readability, performance, and some
assumptions `bytecode_transformation.py` makes.
As a result, this patch removes a lot of the now-dead code paths and
TODOs. Notably, it significantly simplified the `prune_dead_locals`
function, which was duplicating a lot of the logic from
`prune_dead_object_new`; this conveniently closes #137123.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/140153
Approved by: https://github.com/jansel
ghstack dependencies: #140330, #140152, #140436, #140435
This patch fixes 2 things which are exposed if we have `NewCellVariable`
rather than `ClosureVariable` to model python cells:
1. `codegen_save_tempvars` must run first, to establish `source` for
objects; otherwise they can't be reconstructed.
2. `prune_dead_object_new` must account for `OutputGraph.backward_state`
as well, since it also contains variables that must live.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/140201
Approved by: https://github.com/jansel
ghstack dependencies: #140035, #140036, #140149, #140150, #140151
In `UserFunctionVariable.bind_args`, there's a rare case when the
underlying function satisfies all conditions below
1. The function captures a pre-existing cell
2. The cell isn't captured by the root frame
3. `UserFunctionVariable.source` is `None`
In such cases, Dynamo would model the cell as its content (just like
what we do for cells in the root frame). However, this could break in
two cases:
- We could have multiple instances of `UserFunctionVariable`, where some
have source and others don't. This means sometimes we'll model the
cell as a `NewCellVariable`, and sometimes as its content. This
causes issues because writes to the `NewCellVariable` would be
buffered in `SideEffects` and never get picked up by the other
modeling.
- Only when `UserFunctionVariable` has a source, do we check whether we
already had a `NewCellVariable` for the captured cell. This again causes
Dynamo to potentially have multiple representations for the same cell
object, resulting in a similar "buffered writes not reflected" issue
as above (a hypothetical example of such a function is sketched below).
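A hypothetical function shape that hits the three conditions above (illustrative only; whether the traced `UserFunctionVariable` actually ends up sourceless depends on Dynamo internals, as the TODO at the end of this description notes):
```python
import torch

def make_adder():
    bias = 1.0           # pre-existing cell, created before compilation starts

    def add(x):
        return x + bias  # captures `bias`; the compiled root frame does not

    return add

add = make_adder()

@torch.compile(backend="eager")
def fn(x):
    return add(x)        # Dynamo inlines `add` and must model its captured cell

fn(torch.ones(3))
```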
This patch fixes the above 2 issues by
1. modeling captured cells of sourceless `UserFunctionVariable` as
immutable `NewCellVariable`, and adds a few lines in `SideEffects` to
account for its immutability.
2. always checking whether we already had a `NewCellVariable` for the
captured cell, before constructing a new one.
Tests are added for each aforementioned case.
I also left a TODO to investigate why exactly we would lose source
information for `UserFunctionVariable`. Some cases are easily fixable,
but others not so much.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/140150
Approved by: https://github.com/jansel
ghstack dependencies: #140035, #140036, #140149
We added an unboxing optimization to avoid writes to cells that existed
before Dynamo tracing (such writes interfere with HOPs). However, the
avoided write shouldn't be there in the first place, since we were
basically creating an empty `NewCellVariable`, and then writing the
pre-existing content into the variable.
This patch
1. adds logic to bypass the initial write for pre-existing cells
without undermining correctness.
2. removes the unboxing optimization and the restart code path.
Fixes #137456, #138491; also see those issues for more historical
context.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/140149
Approved by: https://github.com/ezyang, https://github.com/jansel
ghstack dependencies: #140035, #140036
This patch
1. Adds documentation to `PyCodegen.__call__`, `PyCodegen.tempvars` and
the `allow_cache` flag.
2. Merges a few existing code paths in `PyCodegen.__call__`.
3. Removes the `elif var in cg.tempvars` code path in
`codegen_save_tempvars`, because it's no longer needed after #113725,
as we have up-to-date `VariableTracker.source` now.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/139670
Approved by: https://github.com/jansel
ghstack dependencies: #139538
This patch adds 2 simple methods `VariableTracker.is_mutable()` and
`VariableTracker.is_immutable()`, which help clarify intent. For
instance, rather than writing
```python
if var.mutation_type:
    ...
```
After this patch one can write
```python
if var.is_mutable():
    ...
```
This patch also simplifies `mutation_type` propagation in some
`ListVariable` methods.
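A rough sketch of what the two helpers amount to (assuming `mutation_type` is `None` exactly when the variable is immutable; the real class has more going on):
```python
class VariableTracker:
    # Simplified stand-in: mutation_type holds a ValueMutation*/AttributeMutation*
    # instance for mutable variables and None otherwise.
    mutation_type = None

    def is_mutable(self) -> bool:
        return self.mutation_type is not None

    def is_immutable(self) -> bool:
        return self.mutation_type is None
```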
Pull Request resolved: https://github.com/pytorch/pytorch/pull/139341
Approved by: https://github.com/mlazos, https://github.com/anijain2305
ghstack dependencies: #139339, #139340
This patch addresses the renaming part of #133027, specifically, it
renames the following and adds documentation for relevant classes.
1. `VariableTracker.mutable_local` to `mutation_type`
2. `MutableLocal` to `ValueMutationNew`
3. `MutableSideEffects` to `ValueMutationExisting`
4. `MutableLocalSource` to `SourceType`
5. `MutableLocalSource.Local` to `New`
Note that (2), (3) and (5) are mainly to bring consistency between them
and `AttributeMutationNew`, `AttributeMutationExisting`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/139339
Approved by: https://github.com/jansel, https://github.com/mlazos, https://github.com/anijain2305
This patch addresses parts of the side-effect refactor proposed in #133027;
specifically, it does 3 things:
1. Change `SideEffects.store_attr_mutations` and `PyCodegen.tempvars`
to index on `VariableTracker` rather than `MutableLocalBase`.
2. Remove the `source` field from `MutableSideEffects` and
`AttributeMutation`, and use `VariableTracker.source` instead.
3. Plumb an `overridden_sources: Dict[Source, Source]` from
`handle_aliases_for_stolen_lists` to `PyCodegen` so that we don't
update `VariableTracker.source` in place, while still preserving what
`handle_aliases_for_stolen_lists` needed (i.e., modifying codegen for
   certain `VariableTracker`); a small sketch of the idea follows below.
(1) and (2) are merged in one patch because of a dependency between
a. `OutputGraph.handle_aliases_for_stolen_lists` which iterates over
`SideEffects.store_attr_mutations.keys()`, and potentially updates
its source field to be completely different.
b. `SideEffects.codegen_update_mutated`, which happens after the above
and uses `cg(var.mutable_local.source)`.
where if we apply (1) only, (b) breaks, and if we apply (2) only, (a)
breaks.
(3) is needed for correctness; see comments in the PR for details.
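A hedged sketch of the idea behind (3): codegen consults the mapping instead of mutating `VariableTracker.source` in place. The types and the helper name here are illustrative stand-ins, not the actual Dynamo API:
```python
from typing import Dict, Optional

class Source:  # stand-in for Dynamo's Source
    pass

class VariableTracker:  # stand-in carrying only a source
    def __init__(self, source: Optional[Source] = None):
        self.source = source

def source_for_codegen(
    var: VariableTracker, overridden_sources: Dict[Source, Source]
) -> Optional[Source]:
    # Use the overridden source when one was recorded by
    # handle_aliases_for_stolen_lists, leaving var.source itself untouched.
    if var.source is None:
        return None
    return overridden_sources.get(var.source, var.source)
```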
Pull Request resolved: https://github.com/pytorch/pytorch/pull/137905
Approved by: https://github.com/jansel, https://github.com/anijain2305, https://github.com/mlazos
This PR implements tracing of `with` contexts with TorchFunction modes which have the default enter/exit behavior (i.e., pushing/popping the mode).
Typically the bytecode for a context manager looks like this during a graph break:
1. graph call
2. enter context
3. unsupported code
4. exit context
5. resume call
resume fn structure:
1. enter context
2. jump
...
3. exit context
The issue with torch function modes is that side effects will replay any mutations to the torch function stack performed during tracing. So, we do not need to enter and exit around the unsupported code in the original function (doing so would result in a duplicate torch function mode entry during execution of the unsupported code), and we don't need to enter again in the resume function (the mode that was pushed from the side effects bytecode would still be on the stack).
So for torch function modes the structure of our output code is this:
1. graph call
2. mutate tf mode stack to replay mutations
3. unsupported code
4. on exception restore stack
5. resume function
Then our resume fn looks like this:
1. no-op enter torch function mode
2. jump
3. exit tf mode
To implement the no-op enter of the torch function mode, I added a torch function mode polyfill which no-op enters but exits normally. This is needed because we still want to trace the `with` context in the resume function and exit properly (the exit instructions will still be in the function, so we need to generate instructions to set up the context).
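A hedged sketch of that polyfill idea (illustrative only, not the actual implementation; it assumes `TorchFunctionMode`'s default `__enter__`/`__exit__` push and pop the mode):
```python
from torch.overrides import TorchFunctionMode

class NoEnterTorchFunctionMode(TorchFunctionMode):
    """Illustrative: skip the push on enter, but exit normally.

    This only makes sense when the mode is already on the stack (replayed by
    the side-effects bytecode); entering again would duplicate it, while the
    exit at the end of the resume function must still pop it.
    """

    def __enter__(self):
        return self  # no-op: do not push the mode a second time

    def __exit__(self, exc_type, exc_val, exc_tb):
        return super().__exit__(exc_type, exc_val, exc_tb)  # normal exit pops the mode
```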
Separately from the bytecode, Dynamo also tracks contexts on the block stack, which is how the SETUP_* instructions are implemented. Naturally, at a graph break we exit these block stacks to properly reset the contexts entirely, so that we can re-enter around the unsupported code soundly.
However, once again, in the torch function mode case we do not want to perform any exit side effects in the event of a graph break, because we want to preserve the state of the mode stack as-is, so that we will properly update the stack with the bytecode mentioned in the first section. If we exited here, Dynamo would pop the mode off of the symbolic stack and not update the true Python torch function mode stack with the suffix bytecode.
All in all, for torch function modes we enter exactly once, update the global torch function mode stack with side effects bytecode, re-read this stack when compiling the resume function, and exit exactly once in the resume function. This matches the semantics of eager exactly.
Approved by: https://github.com/williamwen42
ghstack dependencies: #134732, #133137, #135443, #135444
Pull Request resolved: https://github.com/pytorch/pytorch/pull/137114
Approved by: https://github.com/yanboliang
This reverts commit 7743149b2be4a9eba7e0997ccdc6abe552bec266.
Reverts
* https://github.com/pytorch/pytorch/pull/135503
* https://github.com/pytorch/pytorch/pull/135502
* https://github.com/pytorch/pytorch/pull/135422
This makes the following test pass. Earlier, the `getitem` would stay as a `getitem` in the FX graph, but now fake tensor propagation fails, saying that `.item` is called. It seems that torch function is not getting triggered during fake tensor propagation.
```
import torch
from torch.nn.attention.flex_attention import BlockMask, _mask_mod_signature, _score_mod_signature, flex_attention
from torch._inductor.lowering import make_pointwise, register_lowering
from torch._inductor.virtualized import ops
from torch.nn.attention.flex_attention import create_block_mask
torch.set_default_device('cuda')
flex_attention = torch.compile(flex_attention, dynamic=False)
prefix_lengths = torch.arange(8)
def prefix_lm(b, h, q, kv):
    return prefix_lengths[b] >= kv
mask = create_block_mask(prefix_lm, 8, None, 512, 512, _compile=True)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/136590
Approved by: https://github.com/Chillee
This PR changes how `reconstruct` is done for a ConstDict. As of today, it works as follows:
(1) codegen(...) each pair of key/value
(2) create a new dictionary to hold the new items
(3) clear the original dictionary
(4) update the original dict with the one created in (2)
We do a micro-optimization in the generated bytecode (a Python-level sketch follows this list) to:
- Only codegen the items that changed.
- Only clear the original dictionary if a key was removed.
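A Python-level sketch of the before/after behavior (hedged: the actual change is in the emitted bytecode, and the helper names here are illustrative):
```python
def reconstruct_before(original: dict, traced_items: dict):
    new = dict(traced_items)  # (1) + (2): codegen every key/value pair into a new dict
    original.clear()          # (3): always clear the original
    original.update(new)      # (4): copy everything back

def reconstruct_after(original: dict, traced_items: dict):
    if any(k not in traced_items for k in original):
        original.clear()               # only clear if a key was removed
        original.update(traced_items)
    else:
        # only re-emit the items that changed
        changed = {k: v for k, v in traced_items.items() if original.get(k) is not v}
        original.update(changed)
```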
Fixes: #133487
Pull Request resolved: https://github.com/pytorch/pytorch/pull/134876
Approved by: https://github.com/zou3519
> Ignore FSDP2 forward hook side-effects in AC
Under AC, FSDP2 does not rely on the forward hook to all-gather weights for recomputation; instead, it relies on the pre-backward hook to do this job:
451eaf0ff2/torch/distributed/_composable/fsdp/_fsdp_state.py (L219-L220)
So when we use `speculate_subgraph` to trace the utils.checkpoint AC region, we don't actually need to worry about the FSDP2 forward hook's side effects and can safely ignore them, because we do not, and don't expect to, re-run the FSDP2 forward hook during backward recomputation.
----
Test commands:
- `pytest -rA test/distributed/_composable/fsdp/test_fully_shard_compile.py::TestFullyShardCompile::test_nested_fully_shard_backend_inductor`
- `pytest -rA test/distributed/_composable/fsdp/test_fully_shard_compile.py::TestFullyShardCompile::test_transformer_backend_inductor`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/134997
Approved by: https://github.com/zou3519
ghstack dependencies: #135727
This PR implements tracing of `with` contexts with TorchFunction modes which have the default enter/exit behavior (i.e., pushing/popping the mode).
Typically the bytecode for a context manager looks like this during a graph break:
1. graph call
2. enter context
3. unsupported code
4. exit context
5. resume call
resume fn structure:
1. enter context
2. jump
...
3. exit context
The issue with torch function modes is that side effects will replay any mutations to the torch function stack performed during tracing. So, we do not need to enter and exit around the unsupported code in the original function (doing so would result in a duplicate torch function mode entry during execution of the unsupported code), and we don't need to enter again in the resume function (the mode that was pushed from the side effects bytecode would still be on the stack).
So for torch function modes the structure of our output code is this:
1. graph call
2. mutate tf mode stack to replay mutations
3. unsupported code
4. on exception restore stack
5. resume function
Then our resume fn looks like this:
1. no-op enter torch function mode
2. jump
3. exit tf mode
To implement the no-op enter of the torch function mode, I added a torch function mode polyfill which no-op enters but exits normally. This is needed because we still want to trace the `with` context in the resume function and exit properly (the exit instructions will still be in the function, so we need to generate instructions to set up the context).
Separately from the bytecode, Dynamo also tracks contexts on the block stack, which is how the SETUP_* instructions are implemented. Naturally, at a graph break we exit these block stacks to properly reset the contexts entirely, so that we can re-enter around the unsupported code soundly.
However, once again, in the torch function mode case we do not want to perform any exit side effects in the event of a graph break, because we want to preserve the state of the mode stack as-is, so that we will properly update the stack with the bytecode mentioned in the first section. If we exited here, Dynamo would pop the mode off of the symbolic stack and not update the true Python torch function mode stack with the suffix bytecode.
All in all, for torch function modes we enter exactly once, update the global torch function mode stack with side effects bytecode, re-read this stack when compiling the resume function, and exit exactly once in the resume function. This matches the semantics of eager exactly.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/135422
Approved by: https://github.com/williamwen42
ghstack dependencies: #134732, #133137, #135443, #135444