pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 21:49:24 +08:00

Author	SHA1	Message	Date
Edward Z. Yang	d01ee10b25	Add detect_fake_mode (#98321 ) This replaces fake_mode_from_tensors but it preferentially looks for fake_mode in TracingContext and also if there is an active fake mode on the dispatch stack, before groveling in tensors to find it. This advances PegasusForCausalLM, which was previously failing because we generated a graph that had a parameter (non-fake) and a SymInt, and thus previously we failed to detect the correct fake mode. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98321 Approved by: https://github.com/voznesenskym	2023-04-05 22:15:16 +00:00
Will Constable	c1a6dde79e	Make dynamo-FSDP skip guards (#97463 ) Create a new GuardSource for FSDP modules, and use it to opt out of guard installation. Based on @awgu's work in https://github.com/pytorch/pytorch/pull/97091 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97463 Approved by: https://github.com/voznesenskym, https://github.com/jansel, https://github.com/awgu	2023-03-28 04:04:34 +00:00
Michael Voznesensky	f9ce593267	Extend aot autograd dedup guards to params, stop using positions (#96774 ) The purpose of this PR is to remove reliance on argument positions in dedup guards, AND extend the functionality to params. A version of this PR was stamped prior https://github.com/pytorch/pytorch/pull/95831 - but was kinda gross, because it was based on an underlying PR that did way too much with source names. This PR leaves most of that alone, in favor of just reusing the same name standardization logic that dynamo module registration does. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96774 Approved by: https://github.com/ezyang	2023-03-21 05:59:33 +00:00
Avik Chaudhuri	e4e761b277	record caller frame instead of function frame (#96882 ) Previously, when starting to trace a function, we would record a frame summary recording the definition loc. This would lead to an unconventional-looking stack trace when used for debugging, e.g., shape guards. ``` File ".../scripts/avik/pt2/example.py", line 407, in forward def forward(self, x): ... File ".../transformers/models/bert/modeling_bert.py", line 912, in forward @add_start_docstrings_to_model_forward(BERT_INPUTS_DOCSTRING.format("batch_size, sequence_length")) ... File ".../transformers/models/bert/modeling_bert.py", line 562, in forward def forward( ... File ".../transformers/models/bert/modeling_bert.py", line 484, in forward def forward( ... File ".../transformers/models/bert/modeling_bert.py", line 416, in forward def forward( ... File ".../transformers/models/bert/modeling_bert.py", line 275, in forward def forward( ... File ".../transformers/models/bert/modeling_bert.py", line 351, in forward attention_scores = attention_scores + attention_mask ``` As noted in https://github.com/pytorch/pytorch/pull/95848#discussion_r1134397096, we would like to change this to record function calls instead, like conventional stack traces do. This diff makes this change. The above stack now looks like the following, which is way more helpful at a glance to understand what's going on. ``` File ".../scripts/avik/pt2/example.py", line 408, in forward bert_out = self.bert(**x) ... File ".../transformers/models/bert/modeling_bert.py", line 1021, in forward encoder_outputs = self.encoder( ... File ".../transformers/models/bert/modeling_bert.py", line 610, in forward layer_outputs = layer_module( ... File ".../transformers/models/bert/modeling_bert.py", line 496, in forward self_attention_outputs = self.attention( ... File ".../transformers/models/bert/modeling_bert.py", line 426, in forward self_outputs = self.self( ... File ".../transformers/models/bert/modeling_bert.py", line 351, in forward attention_scores = attention_scores + attention_mask ``` Differential Revision: [D44101882](https://our.internmc.facebook.com/intern/diff/D44101882/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96882 Approved by: https://github.com/ezyang	2023-03-17 00:06:16 +00:00
Avik Chaudhuri	178d2a38e0	debug shape guards (#95848 ) Adds logging when shape guards are added and when symbols are specialized to constants. Differential Revision: [D43719743](https://our.internmc.facebook.com/intern/diff/D43719743/) Differential Revision: [D43719743](https://our.internmc.facebook.com/intern/diff/D43719743) Pull Request resolved: https://github.com/pytorch/pytorch/pull/95848 Approved by: https://github.com/ezyang	2023-03-14 16:05:28 +00:00
Michael Voznesensky	d7db5b05b4	Context manager to push/pop frame summaries (#96054 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96054 Approved by: https://github.com/avikchaudhuri, https://github.com/ezyang	2023-03-08 04:01:49 +00:00
Andrew Gu	cbac56e244	[BE] Simplify `Source.is_nn_module`; add some types (#95292 ) I am still reading Dynamo source code... This is an easy PR to simplify `Source.is_nn_module()` to reuse `GuardSource.is_nn_module()` instead of having the `in (...)` check implemented twice. While simplifying that, I thought I might as well add some type annotations for `Source` methods. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95292 Approved by: https://github.com/ezyang	2023-02-22 22:33:58 +00:00
Edward Z. Yang	89e16c4f18	Assume sympy is always installed (#94903 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/94903 Approved by: https://github.com/Skylion007, https://github.com/malfet	2023-02-16 14:09:58 +00:00
Edward Z. Yang	f8740db410	Properly resolve source_ref when constructing shape guards (#91058 ) Whenever you guard on something, you're supposed to tell GuardBuilder about it, so GuardBuilder knows that it has to actually bind it in scope when it creates the guard function. But shape env guards bypass that mechanism completely. Well, now they don't. For the most part, this didn't matter in practice, because we usually had a `TENSOR_MATCH` guard floating around that made sure that the guard stayed live. But if we ever eliminate those guards (e.g., because we build it into the shape guard directly; something we'll probably want to do when https://github.com/pytorch/pytorch/pull/89707 goes online) then this will indeed matter. One complication: some of the shape env guards are on globals. You have to make sure to shunt the usage to the correct guard builder in that case. Maybe it would be better if we refactored things so there is only one GuardBuilder. Not sure. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91058 Approved by: https://github.com/voznesenskym	2022-12-30 05:56:56 +00:00
Edward Z. Yang	bcf15cd93b	Store source, not sname, in Symbol (#91057 ) I'm going to need this in the follow up PR. Instead of storing only Source.name() in Symbol, I now store a full on Source. Lots of replumbing reoccurs. In particular: - Move Source to torch._guards to break cycles - I have to add TensorPropertySource and NegateSource to handle x.size()[0] and -x codegen that I was doing with string manipulation previously - I tighten up invariants so that I never pass source=None; instead I pass ConstantSource (these are constant sources right) and test for that rather than source being missing. I think this is more parsimonious - Some mypy wobbles from new imports I didn't move LocalSource and friends to torch._guards, but I ended up needing to access them in a few places. The main annoyance with moving these is that then I also need to move the bytecode codegen stuff, and that's not so easy to move without bringing in the kitchen sink. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91057 Approved by: https://github.com/albanD, https://github.com/voznesenskym, https://github.com/zou3519	2022-12-30 05:56:56 +00:00
PyTorch MergeBot	b68fd7e319	Revert "Store source, not sname, in Symbol (#91057 )" This reverts commit 88c581be87ac59ea1251f35a57b610ae81b9362d. Reverted https://github.com/pytorch/pytorch/pull/91057 on behalf of https://github.com/atalman due to causing internal build failures	2022-12-21 22:33:15 +00:00
Edward Z. Yang	88c581be87	Store source, not sname, in Symbol (#91057 ) I'm going to need this in the follow up PR. Instead of storing only Source.name() in Symbol, I now store a full on Source. Lots of replumbing reoccurs. In particular: - Move Source to torch._guards to break cycles - I have to add TensorPropertySource and NegateSource to handle x.size()[0] and -x codegen that I was doing with string manipulation previously - I tighten up invariants so that I never pass source=None; instead I pass ConstantSource (these are constant sources right) and test for that rather than source being missing. I think this is more parsimonious - Some mypy wobbles from new imports I didn't move LocalSource and friends to torch._guards, but I ended up needing to access them in a few places. The main annoyance with moving these is that then I also need to move the bytecode codegen stuff, and that's not so easy to move without bringing in the kitchen sink. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91057 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2022-12-21 04:51:51 +00:00
Edward Z. Yang	57390116e0	Restructure ShapeEnv so it uses GuardBuilder.SHAPE_ENV directly (#91055 ) The idea is to make ShapeEnv guards less of a one-off special snowflake, and integrate it more closely with the regular builder infrastructure. But it is not so easy: the shape env code has to live after tensor match code, because we need to know that the values in question are tensors before we start matching on them. So we introduce a new `shape_env_code` field to put the special shape env code, so we can add it to the final constructed code after tensor. Everything else works the obvious way. There's a new ShapeEnvSource for constructing the singleton SHAPE_ENV guard that drives the shape env guard construction. I added some more docs and also made the printed code for guards include the enclosing lambda for more clarity. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91055 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2022-12-21 03:50:47 +00:00
Michael Voznesensky	b72caf311d	Introduce guardexpr, aot autograd guarding of duplicates into torch._guards (#90955 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90955 Approved by: https://github.com/ezyang	2022-12-18 03:05:47 +00:00
Michael Voznesensky	53e71fad8f	Add shape_env guards to tracing context (#90876 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90876 Approved by: https://github.com/Chillee, https://github.com/ezyang	2022-12-16 09:05:05 +00:00
Edward Z. Yang	eef019c14a	Lint rule to forbid direct use of logging.info/etc APIs (#90907 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90907 Approved by: https://github.com/jansel	2022-12-16 05:13:51 +00:00
Michael Voznesensky	6c8ef6a4c2	Add tracing context, Integrate dynamo guards into torch._guards (#90647 ) As defined here: https://docs.google.com/document/d/1oniZEgAaHE1IMByPRWRKbUHeaW06E2HMfCTCQyMRLek/edit# This PR creates a new structure, a TracingContext, whose lifecycle matches that of the traced frame. It carries on it a GuardsContext, and eventually, a FakeTensorMode. It is the source of truth of all accumulated guards. In this PR, we create the structure, and integrate it into dynamo. We do so by mapping OutputGraph's guards structure to its guard structure. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90647 Approved by: https://github.com/ezyang	2022-12-14 07:35:32 +00:00
Michael Voznesensky	5adc18dcbc	Shape guard structure (#90679 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/90679 Approved by: https://github.com/ezyang	2022-12-12 09:50:00 +00:00
Michael Voznesensky	11442accc6	Make torch._guards, shuffle structures around for migration (#90636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90636 Approved by: https://github.com/ezyang	2022-12-11 23:16:07 +00:00
PyTorch MergeBot	15a4c60383	Revert "Make torch._guards, shuffle structures around for migration (#90636 )" This reverts commit 933b6c4eed675d33274d0bc1dfcb9d8446f412d8. Reverted https://github.com/pytorch/pytorch/pull/90636 on behalf of https://github.com/huydhn due to Breaking lint on master. Please rebase and run lintrunner -a before re-merging the PR	2022-12-11 10:15:47 +00:00
Michael Voznesensky	933b6c4eed	Make torch._guards, shuffle structures around for migration (#90636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90636 Approved by: https://github.com/ezyang	2022-12-11 06:04:17 +00:00

21 Commits