pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 05:34:18 +08:00

Author	SHA1	Message	Date
Avik Chaudhuri	03de15806e	group constraints by arg (#101815 ) Before, we would emit a soup of specializations / constraints without any obvious order to guide readability. With this diff, we group such results by arg, and add comments preceding each group. Empirically, the results read much better. Differential Revision: [D45995199](https://our.internmc.facebook.com/intern/diff/D45995199/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101815 Approved by: https://github.com/tugsbayasgalan	2023-05-20 06:01:14 +00:00
Aaron Gokaslan	3e2ea32dab	[BE]: Enable ruff rule TRY302 and apply fixes (#101874 ) Removes useless try statements and unreachable code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101874 Approved by: https://github.com/malfet	2023-05-19 17:30:52 +00:00
Avik Chaudhuri	e3c66ded86	remove default lower bound in dynamic_dim suggestions (#101636 ) So instead of `2 <= dynamic_dim(x, 0)` simply suggest `dynamic_dim(x, 0)`. This has exactly the same effect. Differential Revision: [D45933273](https://our.internmc.facebook.com/intern/diff/D45933273/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101636 Approved by: https://github.com/tugsbayasgalan, https://github.com/ydwu4	2023-05-17 19:55:04 +00:00
Avik Chaudhuri	41d668c9dc	work around precision error in constraint solver (#101607 ) In https://github.com/pytorch/pytorch/pull/101307 we tried to fix https://github.com/pytorch/pytorch/issues/101093 using `nsimplify` to convert floats into rationals, but the fix is not reliable: it is possible for `nsimplify` to pick constants that don't work. Currently, constraint solving is only used by `export`, but constraints are added in all modes. This means that we can hit this issue even in non-`export` modes. This diff works around this issue for such modes by delaying raising such failures until constraint solving. Differential Revision: [D45922797](https://our.internmc.facebook.com/intern/diff/D45922797/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101607 Approved by: https://github.com/ezyang	2023-05-17 03:25:04 +00:00
Avik Chaudhuri	9b6ccde0e6	fix precision error in constraint solver (#101307 ) When adding guards to the constraint solver, we check that they are consistent, i.e., they do not simplify to false when their free symbols are substituted with the corresponding concrete values. However this check may "spuriously" fail because it doesn't take into account precision errors when comparing floats. Since the symbols involved are all positive integers, we try to approximate floats in the guards with rationals, providing concrete values as hints: `sympy.nsimplify` does the job. As an alternative approach, we considered using `sympy.evalf` to compare with reduced precision. But we did not pursue it because * the choice of what is a good reduced precision feels arbitrary (`sympy` uses `1e15` by default); * more importantly, there is no guarantee that we will not encounter the same problem when solving downstream. Differential Revision: [D45826951](https://our.internmc.facebook.com/intern/diff/D45826951/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101307 Approved by: https://github.com/ezyang	2023-05-15 11:03:24 +00:00
Avik Chaudhuri	ca9f55f79d	misc. fixes to constraints warnings and errors (#100745 ) 1. Move constraint violation error after constraint discovery warning, and attach them when we have both. 2. Remove verbose internal traceback for relevant guard in constraint violation error. 3. Remove mention of `assume_static_by_default` in specialization warning. 4. Fix indenting of `specializations` body and make it assert individually instead of returning a conjunction. 5. Remove return annotation on signature used in generated `specializations` and `specify_constraints` functions. 6. Split `&` ranges because we don't support them yet. Differential Revision: [D45619852](https://our.internmc.facebook.com/intern/diff/D45619852/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100745 Approved by: https://github.com/tugsbayasgalan	2023-05-06 18:22:31 +00:00
ydwu4	61917a006d	Make DimConstraints create actionable message (#100103 ) This pr makes summary of dimension constraints actionable. Before the pr, it will print: ``` torch.fx.experimental.symbolic_shapes: [WARNING] Summary of dimension constraints: The following dimensions have been specialized and CANNOT be dynamic. NOTE: Specializations will happen by default with `assume_static_by_default=True`. L['c'].size()[1] == 3 L['a'].size()[2] == 3 L['a'].size()[1] == 3 L['b'].size()[2] == 2 L['b'].size()[1] == 2 L['c'].size()[2] == 3 The following dimensions CAN be dynamic. You can use the following code to specify the constraints they must satisfy: ''' constraints=[ dynamic_dim(L['c'], 0) == dynamic_dim(L['a'], 0), 2 <= dynamic_dim(L['b'], 0), 2 <= dynamic_dim(L['a'], 0), ] ''' ``` Users need to initialize the L environment manually and copy the constraints over. After the pr, we have: ``` [2023-04-26 05:43:12,849] torch._dynamo.eval_frame: [WARNING] Summary of dimension constraints: The following dimensions have been specialized and CANNOT be dynamic. NOTE: Specializations will happen by default with `assume_static_by_default=True`. ''' def specializations(a, b, c): return (a.size()[2] == 3 and c.size()[1] == 3 and a.size()[1] == 3 and c.size()[2] == 3 and b.size()[2] == 2 and b.size()[1] == 2) ''' The following dimensions CAN be dynamic. You can use the following code to specify the constraints they must satisfy: ''' def specify_constraints(a, b, c): return [ 2 <= dynamic_dim(b, 0), dynamic_dim(c, 0) == dynamic_dim(a, 0), 2 <= dynamic_dim(a, 0), ] ''' ``` , where dynamic_constraints has the same input signature as users code. This allow users to copy-paste and run the code to generate the constraints before exporting as shown below: ``` def specify_constraints(a, b, c): return [ 2 <= dynamic_dim(b, 0), dynamic_dim(c, 0) == dynamic_dim(a, 0), 2 <= dynamic_dim(a, 0), ] torch._dynamo.export(my_dyn_fn, x, y, z, constraints=specify_constriants(x, y, z)) ``` Implementation-wise, this pr also 1. changes shape_env.produce_guards to produce_guards_and_constraints, 2. adds contraints_export_fn hooks, The purpose is to surface the DimConstraints to dynamo.export, where we could reliably get the original function's signature. The alternative to the above is to get the function signature before creating SHAPE_ENV guard (https://github.com/pytorch/pytorch/blob/main/torch/_dynamo/output_graph.py#L227) and pass it to DimConstraints, but I couldn't recover the signature before creating SHAPE_ENV because the frame's f_globals/locals don't contain the original function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100103 Approved by: https://github.com/guangy10, https://github.com/tugsbayasgalan	2023-04-27 21:24:18 +00:00
Avik Chaudhuri	f6f35135a4	suggest constraints to specify for export based on generated shape guards (#98463 ) The design of export API expects constraints to be specified on dynamic dimensions, while assuming all other dimensions are static by default. However a user who wishes to export a model may not be fully familiar with the code to plan what to specify. This diff provides support for discovering constraints to specify. The basic idea is to take the set of generated shape guards and convert them into appropriate constraints. However, we usually generate a LOT of shape guards, and there is often a LOT of redundancy in them. Thus, we also need to simplify the guards so that our suggested constraints are concise yet capture the information content in the guards. The algorithm for simplification uses `sympy` under the hood, but very surgically to avoid any risk of blowing up. See comments inline for a full description. Briefly, 1. We consider only univariate inequalities, and among them, solve for equalities first. 2. We substitute these exact solutions to convert multivariate inequalities progressively into univariate. 3. Remaining univariate inequalities are solved using `sympy.solvers.inequalities.reduce_inequalities`. 4. As pre-processing, we also eliminate all `//` and `%` operations to generate a set of linear congruence guards, and solve these using `sympy.ntheory.modular.solve_congruence`. The results are quite dramatic. For example, an internal model produced several hundreds of guards with `dynamic_shapes=True`, which were pretty much inscrutable for humans. The summary contains around 30 dimensions that were specialized and 3 constraints on dynamic dimensions. The output format looks like this: ``` The following dimensions have been specialized and CANNOT be dynamic. NOTE: Specializations will happen by default with `assume_static_by_default=True`. L['foo']['bar'].size()[0] == 4 ... L['baz']['qux'].size()[3] == 96 The following dimensions CAN be dynamic. You can use the following code to specify the constraints they must satisfy: constraints=[ dynamic_dim(L['blah']['bleh'], 1) == dynamic_dim(L['blah']['bloh'], 1), ..., 2 <= dynamic_dim(L['blah']['bloh'], 1), ] ``` Differential Revision: [D44731747](https://our.internmc.facebook.com/intern/diff/D44731747/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98463 Approved by: https://github.com/voznesenskym, https://github.com/ezyang	2023-04-19 21:56:36 +00:00
Edward Z. Yang	8372c5dc68	Refactor dynamic dims api, stateless internals, higher level export API (#96699 ) The purpose of this API is to execute a few large components of work: 1) Refactor all the internals of plumbing dynamic dimension information after dynamo to be stateless 2) Decouple allocation controls around dynamic dimensions from verification 3) For (2), for allocation, create an enum that dictates whether we are in DUCK (default today), STATIC (aka assume_static_default in the past), or DYNAMIC (aka user constrained, do not duck shape) 4) For (2), for verification, we separate out the list of dynamic ranges entirely from allocation. This means shape_env does not tracking for what we verify on, and instead, it is the callers job to invoke produce_guards() with the various things they want verified, specifically, with the valid ranges. We do use constrain ranges to refine value ranges when doing analysis. 5) We have decided, therefore, as an extension of (4) to double down on "late" checks versus "eager" checks, primarily because the mechanisms for gathering what actually matters happens during guards, and should be a purview of the caller seeking guards, not the shape env. However, for dynamo, these structures are essentially one and the same. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96699 Approved by: https://github.com/avikchaudhuri, https://github.com/ezyang	2023-03-29 16:55:49 +00:00
Brian Hirsh	35c9ea89fa	dont bake in defaults when tracing *_like factories (#97564 ) quick fix for https://github.com/pytorch/pytorch/issues/97541. letting CI run to see if there's any fallout Pull Request resolved: https://github.com/pytorch/pytorch/pull/97564 Approved by: https://github.com/ezyang	2023-03-27 22:53:44 +00:00
Edward Z. Yang	847d6520ed	Don't guard on the exact int value on conversion to bool (#96008 ) Fixes https://github.com/pytorch/pytorch/issues/95981 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/96008 Approved by: https://github.com/ngimel	2023-03-07 00:40:06 +00:00
Edward Z. Yang	d78274b759	Automatically guard when SymInt is converted to int (#95479 ) During enablement, we disabled int() conversions because they were any easy way to footgun guards. We have enough of dynamic shapes working now that this is now causing spurious errors; e.g., if you feed a symbolic int to x.size(symint). We now allow for implicit conversions of SymInt to int here, posting a guard. We expect guard provenance to help people debug overspecialization. Fixes https://github.com/pytorch/pytorch/issues/95328 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95479 Approved by: https://github.com/wconstab, https://github.com/voznesenskym, https://github.com/ngimel	2023-02-25 19:41:51 +00:00
Edward Z. Yang	af202aea34	Add knobs for globally turning off 0/1 specialization and duck shaping (#95352 ) They're not wired up to anything right now but the most logical wiring would be to add torch._dynamo.config to toggle them. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95352 Approved by: https://github.com/voznesenskym	2023-02-23 16:29:10 +00:00
Edward Z. Yang	89e16c4f18	Assume sympy is always installed (#94903 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/94903 Approved by: https://github.com/Skylion007, https://github.com/malfet	2023-02-16 14:09:58 +00:00
Natalia Gimelshein	5bc72bd019	sym_int simplification for integer args, attempt 3 (#94799 ) Per title, now propagates to inductor codegen. Where should I put the test and how should test look like? Pull Request resolved: https://github.com/pytorch/pytorch/pull/94799 Approved by: https://github.com/ezyang	2023-02-15 00:31:19 +00:00
Edward Z. Yang	c1c7eaf52b	Prevent sym_int from showing up in FX graph (#94595 ) Apply the optimization to floor instead of sym_int Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/94595 Approved by: https://github.com/ngimel, https://github.com/bdhirsh	2023-02-11 01:43:05 +00:00
Natalia Gimelshein	715f3733ef	don't call floor for symint unless necessary (#94365 ) Per @ezyang's advice, added magic sym_int method. This works for 1.0 * s0 optimization, but can't evaluate `a>0` for some args, and still misses some optimization that model rewrite achieves, so swin still fails (rewrite replaces `B = int(windows.shape[0] / (H * W / window_size / window_size))` with `B = (windows.shape[0] // int(H * W / window_size / window_size))` and model passes) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94365 Approved by: https://github.com/ezyang	2023-02-10 07:17:11 +00:00
PyTorch MergeBot	490c8f67c5	Revert "WIP: don't call floor for symint unless necessary (#94365 )" This reverts commit 8a9ea44985725e57cb82f0d978fafae31577ae6d. Reverted https://github.com/pytorch/pytorch/pull/94365 on behalf of https://github.com/ZainRizvi due to This looks like it caused some inductor test to start failing: `8a9ea44985`	2023-02-09 17:42:23 +00:00
Natalia Gimelshein	8a9ea44985	WIP: don't call floor for symint unless necessary (#94365 ) Per @ezyang's advice, added magic sym_int method. This works for 1.0 * s0 optimization, but can't evaluate `a>0` for some args, and still misses some optimization that model rewrite achieves, so swin still fails (rewrite replaces `B = int(windows.shape[0] / (H * W / window_size / window_size))` with `B = (windows.shape[0] // int(H * W / window_size / window_size))` and model passes) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94365 Approved by: https://github.com/ezyang	2023-02-09 10:05:49 +00:00
Edward Z. Yang	dc70b00d0b	Track and record hint on SymNode and use when possible (#94201 ) Historically, we work out `size_hint` by working it out on the fly by doing a substitution on the sympy expression with the `var_to_val` mapping. With this change, we also maintain the hint directly on SymNode (in `expr._hint`) and use it in lieu of Sympy substitution when it is available (mostly guards on SymInt, etc; in particular, in idiomatic Inductor code, we typically manipulate Sympy expressions directly and so do not have a way to conveniently maintain hints.) While it's possible this will give us modest performance improvements, this is not the point of this PR; the goal is to make it easier to carefully handle unbacked SymInts, where hints are expected not to be available. You can now easily test if a SymInt is backed or not by checking `symint.node.hint is None`. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/94201 Approved by: https://github.com/voznesenskym	2023-02-09 00:00:44 +00:00
Nikita Karetnikov	162e3ca58e	[fx] fix type promotion in `binary_magic_impl` (#91376 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91376 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-03 15:06:40 +00:00
Nikita Karetnikov	34bcbfbd6a	[fx] throw exceptions on invalid input in `FloorDiv` (#93143 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93143 Approved by: https://github.com/ezyang	2023-02-03 15:06:40 +00:00
Nikita Karetnikov	ba614f3a32	[fx] test `FloorDiv` against Python impl (#93142 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93142 Approved by: https://github.com/ezyang	2023-02-03 15:06:38 +00:00
Nikita Karetnikov	e7c63b962b	[fx] add SymPy assumptions to `FloorDiv` (#93185 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93185 Approved by: https://github.com/ezyang	2023-02-03 15:06:36 +00:00
Edward Z. Yang	306dc2ed1a	Make ShapeEnv deepcopy'able (#93403 ) We sometimes put ShapeEnv on GraphModule, and code in our testing utils assume that you can deepcopy a GraphModule, so it's good for ShapeEnv to be deepcopy'able too. This is done by making the TLS module-wide rather than per-ShapeEnv. We never really have multiple ShapeEnv so this is a good trade. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93403 Approved by: https://github.com/jbschlosser	2023-02-02 02:50:23 +00:00
PyTorch MergeBot	d37bc6d04e	Revert "[fx] add SymPy assumptions to `FloorDiv` (#93185 )" This reverts commit c4ccf7e12147671fdc3535a222260d687c2128a2. Reverted https://github.com/pytorch/pytorch/pull/93185 on behalf of https://github.com/ezyang due to appears to be breaking people outside of ci	2023-02-02 02:26:11 +00:00
Nikita Karetnikov	c4ccf7e121	[fx] add SymPy assumptions to `FloorDiv` (#93185 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93185 Approved by: https://github.com/ezyang	2023-02-01 13:50:59 +00:00
Edward Z. Yang	434eb16deb	Correctly restore pybind11 error_already_set (#93238 ) We would handle py::error_already_set correctly from pybind11 bindings, but not from our regular TH bindings, which meant that anything from an inner pybind11 function call was getting unconditionally transformed into a RuntimeError. Not too many cases where we do this, but PySymNodeImpl was one of them. To test this, I need to raise a non-RuntimeError from a function which is invoked from pybind11 and then propagated to a non-pybind11 call site. I introduce GuardOnDataDependentSymNode for expressly this purpose (this is how I discovered the bug anyway.) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93238 Approved by: https://github.com/Skylion007, https://github.com/albanD	2023-01-30 16:43:01 +00:00
Edward Z. Yang	5c6f5439b7	Implement SymBool (#92149 ) We have known for a while that we should in principle support SymBool as a separate concept from SymInt and SymFloat ( in particular, every distinct numeric type should get its own API). However, recent work with unbacked SymInts in, e.g., https://github.com/pytorch/pytorch/pull/90985 have made this a priority to implement. The essential problem is that our logic for computing the contiguity of tensors performs branches on the passed in input sizes, and this causes us to require guards when constructing tensors from unbacked SymInts. Morally, this should not be a big deal because, we only really care about the regular (non-channels-last) contiguity of the tensor, which should be guaranteed since most people aren't calling `empty_strided` on the tensor, however, because we store a bool (not a SymBool, prior to this PR it doesn't exist) on TensorImpl, we are forced to immediately compute these values, even if the value ends up not being used at all. In particular, even when a user allocates a contiguous tensor, we still must compute channels-last contiguity (as some contiguous tensors are also channels-last contiguous, but others are not.) This PR implements SymBool, and makes TensorImpl use SymBool to store the contiguity information in ExtraMeta. There are a number of knock on effects, which I now discuss below. * I introduce a new C++ type SymBool, analogous to SymInt and SymFloat. This type supports logical and, logical or and logical negation. I support the bitwise operations on this class (but not the conventional logic operators) to make it clear that logical operations on SymBool are NOT short-circuiting. I also, for now, do NOT support implicit conversion of SymBool to bool (creating a guard in this case). This does matter too much in practice, as in this PR I did not modify the equality operations (e.g., `==` on SymInt) to return SymBool, so all preexisting implicit guards did not need to be changed. I also introduced symbolic comparison functions `sym_eq`, etc. on SymInt to make it possible to create SymBool. The current implementation of comparison functions makes it unfortunately easy to accidentally introduce guards when you do not mean to (as both `s0 == s1` and `s0.sym_eq(s1)` are valid spellings of equality operation); in the short term, I intend to prevent excess guarding in this situation by unit testing; in the long term making the equality operators return SymBool is probably the correct fix. * ~~I modify TensorImpl to store SymBool for the `is_contiguous` fields and friends on `ExtraMeta`. In practice, this essentially meant reverting most of the changes from https://github.com/pytorch/pytorch/pull/85936 . In particular, the fields on ExtraMeta are no longer strongly typed; at the time I was particularly concerned about the giant lambda I was using as the setter getting a desynchronized argument order, but now that I have individual setters for each field the only "big list" of boolean arguments is in the constructor of ExtraMeta, which seems like an acceptable risk. The semantics of TensorImpl are now that we guard only when you actually attempt to access the contiguity of the tensor via, e.g., `is_contiguous`. By in large, the contiguity calculation in the implementations now needs to be duplicated (as the boolean version can short circuit, but the SymBool version cannot); you should carefully review the duplicate new implementations. I typically use the `identity` template to disambiguate which version of the function I need, and rely on overloading to allow for implementation sharing. The changes to the `compute_` functions are particularly interesting; for most of the functions, I preserved their original non-symbolic implementation, and then introduce a new symbolic implementation that is branch-less (making use of our new SymBool operations). However, `compute_non_overlapping_and_dense` is special, see next bullet.~~ This appears to cause performance problems, so I am leaving this to an update PR. * (Update: the Python side pieces for this are still in this PR, but they are not wired up until later PRs.) While the contiguity calculations are relatively easy to write in a branch-free way, `compute_non_overlapping_and_dense` is not: it involves a sort on the strides. While in principle we can still make it go through by using a data oblivious sorting network, this seems like too much complication for a field that is likely never used (because typically, it will be obvious that a tensor is non overlapping and dense, because the tensor is contiguous.) So we take a different approach: instead of trying to trace through the logic computation of non-overlapping and dense, we instead introduce a new opaque operator IsNonOverlappingAndDenseIndicator which represents all of the compute that would have been done here. This function returns an integer 0 if `is_non_overlapping_and_dense` would have returned `False`, and an integer 1 otherwise, for technical reasons (Sympy does not easily allow defining custom functions that return booleans). The function itself only knows how to evaluate itself if all of its arguments are integers; otherwise it is left unevaluated. This means we can always guard on it (as `size_hint` will always be able to evaluate through it), but otherwise its insides are left a black box. We typically do NOT expect this custom function to show up in actual boolean expressions, because we will typically shortcut it due to the tensor being contiguous. It's possible we should apply this treatment to all of the other `compute_` operations, more investigation necessary. As a technical note, because this operator takes a pair of a list of SymInts, we need to support converting `ArrayRef<SymNode>` to Python, and I also unpack the pair of lists into a single list because I don't know if Sympy operations can actually validly take lists of Sympy expressions as inputs. See for example `_make_node_sizes_strides` * On the Python side, we also introduce a SymBool class, and update SymNode to track bool as a valid pytype. There is some subtlety here: bool is a subclass of int, so one has to be careful about `isinstance` checks (in fact, in most cases I replaced `isinstance(x, int)` with `type(x) is int` for expressly this reason.) Additionally, unlike, C++, I do NOT define bitwise inverse on SymBool, because it does not do the correct thing when run on booleans, e.g., `~True` is `-2`. (For that matter, they don't do the right thing in C++ either, but at least in principle the compiler can warn you about it with `-Wbool-operation`, and so the rule is simple in C++; only use logical operations if the types are statically known to be SymBool). Alas, logical negation is not overrideable, so we have to introduce `sym_not` which must be used in place of `not` whenever a SymBool can turn up. To avoid confusion with `__not__` which may imply that `operators.__not__` might be acceptable to use (it isn't), our magic method is called `__sym_not__`. The other bitwise operators `&` and `\|` do the right thing with booleans and are acceptable to use. * There is some annoyance working with booleans in Sympy. Unlike int and float, booleans live in their own algebra and they support less operations than regular numbers. In particular, `sympy.expand` does not work on them. To get around this, I introduce `safe_expand` which only calls expand on operations which are known to be expandable. TODO: this PR appears to greatly regress performance of symbolic reasoning. In particular, `python test/functorch/test_aotdispatch.py -k max_pool2d` performs really poorly with these changes. Need to investigate. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/92149 Approved by: https://github.com/albanD, https://github.com/Skylion007	2023-01-21 02:21:56 +00:00
Edward Z. Yang	c4501593c3	Delete get_pyobj() entirely (#92638 ) Opt for the shorter and more direct node attribute access. I need to do this because I'm going to publicly document SymInt and SymFloat but I don't want to doc get_pyobj(). Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/92638 Approved by: https://github.com/Chillee, https://github.com/albanD, https://github.com/voznesenskym, https://github.com/bdhirsh	2023-01-20 19:06:56 +00:00
Edward Z. Yang	6420fecdc4	Introduce sym_min and sym_max (#92107 ) It turns out our old max/min implementation didn't do anything, because `__max__` and `__min__` are not actually magic methods in Python. So I give 'em the `sym_` treatment, similar to the other non-overrideable builtins. NB: I would like to use `sym_max` when computing contiguous strides but this appears to make `python test/functorch/test_aotdispatch.py -v -k test_aot_autograd_symbolic_exhaustive_nn_functional_max_pool2d_cpu_float32` run extremely slowly. Needs investigating. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/92107 Approved by: https://github.com/albanD, https://github.com/voznesenskym, https://github.com/Skylion007	2023-01-18 20:57:27 +00:00
PyTorch MergeBot	5c9c39a83f	Revert "[fx] rewrite `FloorDiv` to match Python better (#90906 )" This reverts commit d13207c7adf7f94620b1228dab547ff253c46d0b. Reverted https://github.com/pytorch/pytorch/pull/90906 on behalf of https://github.com/malfet due to eca_halonext26ts started failing after 2nd PR from the stack was landed, see `88b3810c94`, not sure which one of the two caused it	2023-01-17 19:26:38 +00:00
PyTorch MergeBot	013afc5abe	Revert "[fx] fix type promotion in `binary_magic_impl` (#91376 )" This reverts commit 88b3810c94b45f5982df616e2bc4c471d173f491. Reverted https://github.com/pytorch/pytorch/pull/91376 on behalf of https://github.com/malfet due to eca_halonext26ts started failing after this was landed, see `88b3810c94`	2023-01-17 19:04:04 +00:00
Nikita Karetnikov	88b3810c94	[fx] fix type promotion in `binary_magic_impl` (#91376 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91376 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-01-17 10:04:38 +00:00
Nikita Karetnikov	d13207c7ad	[fx] rewrite `FloorDiv` to match Python better (#90906 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90906 Approved by: https://github.com/ezyang	2023-01-17 10:04:38 +00:00
Edward Z. Yang	bcf15cd93b	Store source, not sname, in Symbol (#91057 ) I'm going to need this in the follow up PR. Instead of storing only Source.name() in Symbol, I now store a full on Source. Lots of replumbing reoccurs. In particular: - Move Source to torch._guards to break cycles - I have to add TensorPropertySource and NegateSource to handle x.size()[0] and -x codegen that I was doing with string manipulation previously - I tighten up invariants so that I never pass source=None; instead I pass ConstantSource (these are constant sources right) and test for that rather than source being missing. I think this is more parsimonious - Some mypy wobbles from new imports I didn't move LocalSource and friends to torch._guards, but I ended up needing to access them in a few places. The main annoyance with moving these is that then I also need to move the bytecode codegen stuff, and that's not so easy to move without bringing in the kitchen sink. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91057 Approved by: https://github.com/albanD, https://github.com/voznesenskym, https://github.com/zou3519	2022-12-30 05:56:56 +00:00
PyTorch MergeBot	b68fd7e319	Revert "Store source, not sname, in Symbol (#91057 )" This reverts commit 88c581be87ac59ea1251f35a57b610ae81b9362d. Reverted https://github.com/pytorch/pytorch/pull/91057 on behalf of https://github.com/atalman due to causing internal build failures	2022-12-21 22:33:15 +00:00
Edward Z. Yang	88c581be87	Store source, not sname, in Symbol (#91057 ) I'm going to need this in the follow up PR. Instead of storing only Source.name() in Symbol, I now store a full on Source. Lots of replumbing reoccurs. In particular: - Move Source to torch._guards to break cycles - I have to add TensorPropertySource and NegateSource to handle x.size()[0] and -x codegen that I was doing with string manipulation previously - I tighten up invariants so that I never pass source=None; instead I pass ConstantSource (these are constant sources right) and test for that rather than source being missing. I think this is more parsimonious - Some mypy wobbles from new imports I didn't move LocalSource and friends to torch._guards, but I ended up needing to access them in a few places. The main annoyance with moving these is that then I also need to move the bytecode codegen stuff, and that's not so easy to move without bringing in the kitchen sink. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/91057 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2022-12-21 04:51:51 +00:00
Edward Z. Yang	49c674e155	Revert guaranteed symint allocation (#90381 ) So, uh, I have a new strategy for generating dupe guards, one where I don't actually need to allocate symints for every tensor that is fakeified. So I'm reverting the changes I made from earlier PRs in this one. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90381 Approved by: https://github.com/voznesenskym	2022-12-10 13:17:34 +00:00
Edward Z. Yang	b68dead20c	Keep track of source name on all allocated SymInts (#90295 ) Wow, I had to sweat so much to get this PR out lol. This PR enforces the invariant that whenever we allocate SymInts as part of fakeification, the SymInt is associated with a Source, and in fact we store the string source name on SymbolWithSourceName. We use 'sname' as the shorthand for source name, as 'name' is already used by sympy to name symbols. In order to store source names, we have to plumb source names from Dynamo to PyTorch. This made doing this PR a bit bone crushing, because there are many points in the Dynamo codebase where we are improperly converting intermediate tensors into fake tensors, where there is no source (and there cannot be, because it's a frickin' intermediate tensor). I've fixed all of the really awful cases in earlier PRs in the stack. This PR is just plumbing in source names from places where we do have it. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90295 Approved by: https://github.com/voznesenskym	2022-12-10 13:17:34 +00:00
Edward Z. Yang	a1ab06ab65	ShapeEnv.create_symbolic_sizes_strides_storage_offset (#89962 ) Instead of having storage offset hang out on its own, allocate all of these symbols all in one go. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89962 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2022-12-06 21:27:02 +00:00
Edward Z. Yang	e03cde07e4	Guarantee symbol allocation for all sizes/strides/storage offset (#89879 ) We may need to express guards on the size/stride/storage offset of a tensor, but we cannot do this if it's already been duck sized. This PR guarantees that we allocate a symbol (or negation of the symbol) whenever we ask to create a SymInt, and propagates this symbol to SymNode so that Dynamo can look at it (not in this PR). This PR doesn't actually add guards, nor does Dynamo do anything with these symbols. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89879 Approved by: https://github.com/albanD	2022-12-01 13:43:10 +00:00
Sherlock Huang	caf3d5319f	Symintify numel(), infer_size, prims.elementwise_meta (#88956 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88956 Approved by: https://github.com/ezyang	2022-11-20 00:42:03 +00:00
Edward Z. Yang	46796fe5e9	Fix XLA symbolic shapes binding (#88928 ) Obsoletes https://github.com/pytorch/pytorch/pull/88772 Mostly revolves around NOT assuming that the inside is a SymNode, but instead duck-typed to be a SymNode. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/88928 Approved by: https://github.com/SherlockNoMad	2022-11-13 00:31:27 +00:00
Sherlock Huang	d9ad08ce8a	Symbolic shape: sym_floor , sym_sqrt, sym_int (#88760 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88760 Approved by: https://github.com/ezyang	2022-11-10 23:41:33 +00:00
albanD	fcc2883476	Clean up SymFloat binding to cover all functions (#88370 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88370 Approved by: https://github.com/ezyang	2022-11-08 14:32:47 +00:00
albanD	6abaa5946d	Fix categorization of sym_int method (#88369 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88369 Approved by: https://github.com/ezyang, https://github.com/bdhirsh, https://github.com/anjali411	2022-11-08 14:32:47 +00:00
albanD	98f09c9ab3	[WIP] Add symnode magic method testing (#88119 ) There are failures that need to be addressed before landing: - Some issue with handling of booleans. - Most functions return wrong result when mixing int/float Pull Request resolved: https://github.com/pytorch/pytorch/pull/88119 Approved by: https://github.com/ezyang	2022-11-02 19:41:09 +00:00
Edward Z. Yang	1ff52225f1	Unify SymIntNode and SymFloatNode into SymNode (#87817 ) This refactor was prompted by challenges handling mixed int/float operations in C++. A previous version of this patch added overloads for each permutation of int/float and was unwieldy https://github.com/pytorch/pytorch/pull/87722/ This PR takes a different approach. The general outline of the patch is to combine the C++ types SymIntNode and SymFloatNode into a single type, SymNode. This is type erased; we no longer know statically at C++ if we have an int/float and have to test it with the is_int()/is_float() virtual methods. This has a number of knock on effects. - We no longer have C++ classes to bind to Python. Instead, we take an entirely new approach to our Python API, where we have a SymInt/SymFloat class defined entirely in Python, which hold a SymNode (which corresponds to the C++ SymNode). However, SymNode is not pybind11-bound; instead, it lives as-is in Python, and is wrapped into C++ SymNode using PythonSymNode when it goes into C++. This implies a userland rename. In principle, it is also possible for the canonical implementation of SymNode to be written in C++, and then bound to Python with pybind11 (we have this code, although it is commented out.) However, I did not implement this as we currently have no C++ implementations of SymNode. Because we do return SymInt/SymFloat from C++ bindings, the C++ binding code needs to know how to find these classes. Currently, this is done just by manually importing torch and getting the attributes. - Because SymInt/SymFloat are easy Python wrappers, __sym_dispatch__ now takes SymInt/SymFloat, rather than SymNode, bringing it in line with how __torch_dispatch__ works. Some miscellaneous improvements: - SymInt now has a constructor that takes SymNode. Note that this constructor is ambiguous if you pass in a subclass of SymNode, so an explicit downcast is necessary. This means toSymFloat/toSymInt are no more. This is a mild optimization as it means rvalue reference works automatically. - We uniformly use the caster for c10::SymInt/SymFloat, rather than going the long way via the SymIntNode/SymFloatNode. - Removed some unnecessary toSymInt/toSymFloat calls in normalize_* functions, pretty sure this doesn't do anything. - guard_int is now a free function, since to guard on an int you cannot assume the method exists. A function can handle both int and SymInt inputs. - We clean up the magic method definition code for SymInt/SymFloat/SymNode. ONLY the user classes (SymInt/SymFloat) get magic methods; SymNode gets plain methods; this is to help avoid confusion between the two types. Signed-off-by: Edward Z. Yang <ezyang@fb.com> cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87817 Approved by: https://github.com/albanD, https://github.com/anjali411	2022-10-27 20:56:02 +00:00
Horace He	569eebb43c	Add get_guard_expr to symbolic_shapes which returns all guards in a single expression (#87665 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87665 Approved by: https://github.com/ezyang, https://github.com/voznesenskym	2022-10-25 16:58:18 +00:00

... 2 3 4 5 6

285 Commits