pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Author	SHA1	Message	Date
Ryan Spring	534ae6ae47	[primTorch] Implement group norm reference (#87054 ) Add group norm reference Split from #81191 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87054 Approved by: https://github.com/mruberry	2022-11-11 01:08:20 +00:00
kshitij12345	fe3a226d74	[minor] use set_default_dtype instead of try and finally (#88295 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88295 Approved by: https://github.com/mruberry	2022-11-03 19:28:33 +00:00
soulitzer	4c20c0509d	Split out forward AD tests from test_ops_gradients and reenable slow gradcheck CI (#88216 ) Fixes: https://github.com/pytorch/pytorch/issues/88010 This PR does a couple things to stop slow gradcheck from timing out: - Splits out test_ops_fwd_gradients from test_ops_gradients, and factors out TestFwdGradients and TestBwdGradients which both inherit from TestGradients, now situated in common_utils (maybe there is a better place?) - Skips CompositeCompliance (and several other test files) for slow gradcheck CI since they do not use gradcheck - because test times for test_ops_fwd_gradients and test_ops_gradients are either unknown or wrong, we hardcode them for now to prevent them from being put together. We can undo the hack after we see actual test times are updated. ("def calculate_shards" randomly divides tests with unknown test times in a round-robin fashion.) - Updates references to test_ops_gradients and TestGradients - Test files that are skipped for slow gradcheck CI are now centrally located in in run_tests.py, this reduces how fine-grained we can be with the skips, so for some skips (one so far) we still use the old skipping mechanism, e.g. for test_mps Pull Request resolved: https://github.com/pytorch/pytorch/pull/88216 Approved by: https://github.com/albanD	2022-11-03 00:20:45 +00:00
Sean Ross-Ross	1a9edc8136	Changing from sample_inputs to reference_inputs in test_compare_cpu (#86462 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86462 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-31 20:06:03 +00:00
lezcano	fd27246c16	Fix decomposition for std (#87181 ) The previous implementation was lacking a few features and incurred on a pretty large error cc @ezyang @mruberry @ngimel @Lezcano @fdrocha Pull Request resolved: https://github.com/pytorch/pytorch/pull/87181 Approved by: https://github.com/ngimel, https://github.com/peterbell10	2022-10-28 00:50:29 +00:00
Natalia Gimelshein	f1b78224ca	Fix type promotion for 2 wrapped scalar args (#87845 ) Fixes #76801 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87845 Approved by: https://github.com/SherlockNoMad, https://github.com/mruberry	2022-10-27 15:53:11 +00:00
Nikita Karetnikov	59b9d29260	[primTorch] Check `error_regex` in `test_python_ref_errors` (#86987 ) cc @ezyang @mruberry @ngimel @Lezcano @fdrocha Pull Request resolved: https://github.com/pytorch/pytorch/pull/86987 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-26 23:34:34 +00:00
Bin Bao	2c1efe7472	Enable some PyTorch core tests with inductor (#87490 ) Summary: 1) Graph break on torch.random.set_rng_state since it blocks running inductor core tests; 2) Add several inductor-specific skips; 3) Enable several core tests for inductor CI; cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87490 Approved by: https://github.com/eellison	2022-10-26 18:58:33 +00:00
Sherlock Huang	eb99c1efce	Prefer python meta function over c++ meta function (#87426 ) This is a policy update for meta registration. We now prefer python meta implementation over C++ meta function. This is a flip of the previous policy, where we prefer C++ meta function over python meta function if they both exist. Here's the meta registration process: 1. register_meta and register_decomposition will place the python meta/decomp functions into the `global_decomp_table`. However, they will NOT register them into dispatcher. 2. After global_decomp_table is populated, we will compile an `active_meta_table`. For a given op, we pick the most specific decomp function from `global_decomp_table` in the preference order of Meta > PostAutograd > PreAutograd. 3. We will unconditionally register all of them into python dispatcher. And register them into C++ dispatcher, unless it one of the following 3 cases - 1. the op is a CompositeImplicitAutograd, and should rely on decomposed op's meta - 2. the op is a view op, as the MetaTensor doesn't support aliased storage - 3. the op is in the blocklist (due to UT failures, and we will burn down this list op by op) Over the long run, we wish to implement all meta functions in python. With this PR, 321 op_overloads will have cpp meta overridden by python meta. There are still 400 op_overloads is using cpp meta. The exact list can be found here https://gist.github.com/SherlockNoMad/d20bb736178df8eebd3b054c8bb7cdc5 cc @ngimel @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang Pull Request resolved: https://github.com/pytorch/pytorch/pull/87426 Approved by: https://github.com/ezyang, https://github.com/jansel	2022-10-25 16:49:02 +00:00
Nikita Karetnikov	1b8af28fe8	[primTorch] Add refs for `softmax`, `softmin`, `log_softmax` (#84956 ) cc @ezyang @mruberry @ngimel @Lezcano @fdrocha Pull Request resolved: https://github.com/pytorch/pytorch/pull/84956 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-20 12:29:04 +00:00
PyTorch MergeBot	cd21613526	Revert "[primTorch] Add refs for `softmax`, `softmin`, `log_softmax` (#84956 )" This reverts commit c09ca93e4733fdf0183433114dda2fc30a846700. Reverted https://github.com/pytorch/pytorch/pull/84956 on behalf of https://github.com/ZainRizvi due to This is causing the MPS test test_output_match_log_softmax_with_dtype_cpu_float32 (__main__.TestConsistencyCPU) to fail	2022-10-19 20:36:55 +00:00
Nikita Karetnikov	c09ca93e47	[primTorch] Add refs for `softmax`, `softmin`, `log_softmax` (#84956 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84956 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-19 18:45:40 +00:00
Nikita Karetnikov	b886cd15f5	[primTorch] Add a ref for NumPy-style `T` (#86850 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86850 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-18 10:19:47 +00:00
Nikita Karetnikov	841995d53b	[primTorch] Add refs for data conversion ops (#86561 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86561 Approved by: https://github.com/lezcano, https://github.com/mruberry, https://github.com/zou3519	2022-10-18 08:38:51 +00:00
Sean Ross-Ross	1bb609ad47	Added new test test_compare_cpu that checks if cpu and gpu results are consistent (#85011 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85011 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-14 20:15:16 +00:00
Ivan Yashchuk	fd80684784	Add nvFuser support for torch.Tensor.view (#84634 ) This is an alternative to https://github.com/pytorch/pytorch/pull/83739. While PrimTorch has `view` as a reference, we would like to use nvFuser's implementation for `view` for now. Later we might transition to PrimTorch's `torch._refs.view`. See `test_nvprims_view` for examples of things that are now sent to nvFuser. Note that nvFuser's `view` is a copy-like operation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84634 Approved by: https://github.com/kevinstephano, https://github.com/mruberry	2022-10-14 12:08:02 +00:00
Brian Hirsh	0feccda7d7	fix aliasing bug in pixel shuffle/unshuffle (#86608 ) Fixes https://github.com/pytorch/pytorch/issues/82235 cc @albanD - `at::pixel_shuffle` and `at::pixel_unshuffle` advertise as being non-aliasing, but they have a C++ decomposition that internally uses reshape(), which means that it might return an alias. I happened to notice this because a bunch of tests in `test/test_ops.py` failed when I ran locally with a `DEBUG=1` build. (P.S.: when are we finally gonna get a debug build test in CI? 😃) I fixed by adding an extra clone, which... is going to be an unnecessary perf hit in the case where the `reshape()` already properly cloned the input. My hope is that this is fine, because this only impacts the composite kernel- we already have a "fast" CPU kernel that does the right thing. Is `pixel_shuffle/unshuffle` commonly used with cuda? Maybe we should just add a fast cuda kernel for it if that's the case. Alternatively, it seems like it would be nice if `reshape()` accepted an optional argument to unconditionally return a copy. That seems like a rabbit hole that isn't worth going down for now though - I remember a discussion a while ago about making `reshape()` copy-on-write Pull Request resolved: https://github.com/pytorch/pytorch/pull/86608 Approved by: https://github.com/albanD	2022-10-13 14:14:26 +00:00
Peter Bell	73c43ce2e2	Display unexpected exceptions raised from test_dtypes (#86599 ) Currently `test_dtypes` swallows all exceptions which can make debugging failures more tricky. This changes the test to save the exceptions and print only the unexpected ones at the end e.g. ``` AssertionError: The supported dtypes for nn.functional._scaled_dot_product_attention on device type cuda are incorrect! The following dtypes did not work in backward but are listed by the OpInfo: {torch.bfloat16}. Unexpected failures raised the following errors: torch.bfloat16 - CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling [...] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/86599 Approved by: https://github.com/mruberry	2022-10-12 19:51:58 +00:00
Nikita Karetnikov	d56017a14f	[primTorch] Add ref for `triplet_margin_loss`, improve `triplet_margin_with_distance_loss` (#85614 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85614 Approved by: https://github.com/lezcano, https://github.com/mruberry	2022-10-12 18:37:58 +00:00
Khushi	2344135179	[primTorch] special: entr, expit (#86592 ) Add _refs for `entr` & `expit`. cc @mruberry @kshitij12345! Pull Request resolved: https://github.com/pytorch/pytorch/pull/86592 Approved by: https://github.com/mruberry	2022-10-12 07:00:40 +00:00
Elias Ellison	b409d1f65b	Turn on Data Dependent Throwing (#86480 ) This was already enabled in TorchDynamo, but was staged to make sure things don't break. Also makes backward single threaded for tests to fix a memory leak. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86480 Approved by: https://github.com/bdhirsh	2022-10-10 21:58:29 +00:00
Elias Ellison	d3f7c34cb3	Enable aten-aten decomps (#85921 ) Invokes aten-aten decomps with re-entrant FakeMode. These decomps are being used in other places, so it's good to unify the path static fake tensor takes / get additional testing etc. There is also an instance where we return different devices with cpu/cuda which this fixes ([batch_norm](https://github.com/pytorch/pytorch/blob/master/torch/_decomp/decompositions.py#L1374)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85921 Approved by: https://github.com/ezyang	2022-10-08 05:12:42 +00:00
PyTorch MergeBot	7ec12a559c	Revert "Enable aten-aten decomps (#85921 )" This reverts commit 62e4f51efdf98a3a91d29efa55e5665d5398b464. Reverted https://github.com/pytorch/pytorch/pull/85921 on behalf of https://github.com/huydhn due to Sorry for reverting your PR. I think it breaks a dynamo test in trunk `62e4f51efd`	2022-10-08 01:59:54 +00:00
Elias Ellison	62e4f51efd	Enable aten-aten decomps (#85921 ) Invokes aten-aten decomps with re-entrant FakeMode. These decomps are being used in other places, so it's good to unify the path static fake tensor takes / get additional testing etc. There is also an instance where we return different devices with cpu/cuda which this fixes ([batch_norm](https://github.com/pytorch/pytorch/blob/master/torch/_decomp/decompositions.py#L1374)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85921 Approved by: https://github.com/ezyang	2022-10-07 21:04:39 +00:00
Elias Ellison	9ceadcadb2	Fix unfold backward decomp aliasing for 0 dim input (#86428 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86428 Approved by: https://github.com/ngimel, https://github.com/ezyang	2022-10-07 03:55:31 +00:00
lezcano	c609768896	Add refs for torch.unfold and a decomposition for its backward. (#85629 ) It's not clear to me what's the difference between `unfold` and `unfold_copy`, as this latter one is codegen'd I also took this chance to clean the implementation of unfold and its reference Pull Request resolved: https://github.com/pytorch/pytorch/pull/85629 Approved by: https://github.com/mruberry	2022-10-05 12:15:49 +00:00
Elias Ellison	6a2b12dd65	Turn on aliasing tests for fake backwards, Fix Batch norm running mean/var decomp aliasing (#85471 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85471 Approved by: https://github.com/ezyang	2022-09-28 23:06:59 +00:00
Elias Ellison	0b93afb112	add amp tests (#85434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85434 Approved by: https://github.com/ngimel	2022-09-28 19:34:46 +00:00
samdow	18d8c548f4	[Modes] remove enable and rewrite mode stack (squashed) (#84774 ) Based on @ezyang's suggestion, mode stack now has "one true mode" which is the _only_ mode that can ever be active at the C++ level. That mode's torch dispatch is just to take the top mode in the stack, reenable itself (if we aren't at the end of the mode stack), and run the top mode's torch_{dispatch\|function} This maintains that in the middle of a mode's torch dispatch, the mode itself will not be active. It changes the function the user has to call to see what the current mode is (no longer queries the C++, it's python only) but allows the user to also see the entire mode stack easily Removes `enable_torch_dispatch_mode` and `.restore()` since neither makes sense in this new setup ### Background Why do we want this? Well, a pretty common pattern that was coming up was that users had to do something like ```python ## PRE-PR UX def f(mode): with mode.restore(): # user needs to understand this restore thing? ... with Mode() as m: pass f(m) ``` Many users were getting error from forgetting to call `.restore` or from forgetting to add the (tbh weird) "mode instantiation" step where they use the mode as a context manager with an empty body. Really, they wanted to treat modes like context managers and just write ```python ## FROM FEEDBACK, USER DESIRED CODE. POSSIBLE POST-PR def f(mode): with mode: ... f(Mode()) ``` Technical Details With the old mode stack, we basically had a linked list so the mode itself could only be used once and had a fixed parent. In this new design, the mode stack is just a python list that we're pushing to and popping from. There's only one mode that's ever active at the C++ level and it runs the next mode in the Python list. The modes don't have state on them anymore Pull Request resolved: https://github.com/pytorch/pytorch/pull/84774 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-09-27 01:04:35 +00:00
Elias Ellison	bcc544e9d7	Add FakeCrossRef tests for backwards, Fix Layer Norm Backward Decomp (#85417 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85417 Approved by: https://github.com/ezyang	2022-09-26 17:08:14 +00:00
PyTorch MergeBot	d10de31cc8	Revert "Add FakeCrossRef tests for backwards, Fix Layer Norm Backward Decomp (#85417 )" This reverts commit 78afa0cf0ca04ce437ca4b519f07c04e73fe0d4c. Reverted https://github.com/pytorch/pytorch/pull/85417 on behalf of https://github.com/clee2000 due to broke tests on trunk `78afa0cf0c`	2022-09-23 17:21:43 +00:00
PyTorch MergeBot	eb570ab7d0	Revert "add amp tests (#85434 )" This reverts commit c2f4bbe66918d167401ff5804c6b2d24fc6bda40. Reverted https://github.com/pytorch/pytorch/pull/85434 on behalf of https://github.com/clee2000 due to broke rocm and slow tests on trunk `c2f4bbe669`	2022-09-23 17:19:06 +00:00
PyTorch MergeBot	3b195fd33e	Revert "Turn on aliasing tests for fake backwards, Fix Batch norm running mean/var decomp aliasing (#85471 )" This reverts commit 1e92eb806865602be6d9c02a311108c4f88869b2. Reverted https://github.com/pytorch/pytorch/pull/85471 on behalf of https://github.com/clee2000 due to stacked prs https://github.com/pytorch/pytorch/pull/85417 and https://github.com/pytorch/pytorch/pull/85434 broke trunk, reverting this so i can revert the others	2022-09-23 17:13:35 +00:00
Elias Ellison	1e92eb8068	Turn on aliasing tests for fake backwards, Fix Batch norm running mean/var decomp aliasing (#85471 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85471 Approved by: https://github.com/ezyang	2022-09-23 16:02:15 +00:00
Elias Ellison	c2f4bbe669	add amp tests (#85434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85434 Approved by: https://github.com/ngimel	2022-09-23 15:57:37 +00:00
Elias Ellison	78afa0cf0c	Add FakeCrossRef tests for backwards, Fix Layer Norm Backward Decomp (#85417 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85417 Approved by: https://github.com/ezyang	2022-09-23 15:50:03 +00:00
PyTorch MergeBot	5043457a8e	Revert "Add FakeCrossRef tests for backwards, Fix Layer Norm Backward Decomp (#85417 )" This reverts commit 9c77083965e1283763a83f72a3adf299281761e3. Reverted https://github.com/pytorch/pytorch/pull/85417 on behalf of https://github.com/clee2000 due to broke tests on trunk (and pull somehow) `9c77083965`	2022-09-22 15:44:38 +00:00
Elias Ellison	9c77083965	Add FakeCrossRef tests for backwards, Fix Layer Norm Backward Decomp (#85417 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85417 Approved by: https://github.com/ezyang	2022-09-22 13:03:57 +00:00
Thomas Viehmann	764cba6848	add Python ref for isreal (#85361 ) Dipping my toes into prims waters Pull Request resolved: https://github.com/pytorch/pytorch/pull/85361 Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry	2022-09-21 18:53:34 +00:00
Ivan Yashchuk	35943f30cb	Reference implementation for torch.Tensor.sum_to_size (#85338 ) New ref: `torch._refs.sum_to_size`. View consistency validation is disabled because the ref returns a view instead of returning the input. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85338 Approved by: https://github.com/mruberry	2022-09-21 18:12:52 +00:00
Horace He	2f4a517d67	Ported matmul compositeimplicitautograd impl into core (#85239 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85239 Approved by: https://github.com/ezyang, https://github.com/lezcano	2022-09-21 09:25:24 +00:00
Elias Ellison	a3afb2c2f6	Fake: fix conv_transpose2d striding (#82846 ) The output striding channels-last preservation logic differs between cuda and cpu. For the meta kernel, we can peek at the fake tensor device and use that to determine whether to do cpu or cuda. You could argue there's a leaking of abstraction here but this seems like a pretty minimal leak and I'm not sure there's a much cleaner way forward for device-specific striding tracing logic. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82846 Approved by: https://github.com/ezyang	2022-09-20 18:00:59 +00:00
lezcano	5dd9610e9d	Refs and decompositions for index_{add,copy,select,fill} (#85002 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/85002 Approved by: https://github.com/ngimel	2022-09-17 19:57:34 +00:00
PyTorch MergeBot	e33b464ffc	Revert "Refs and decompositions for index_{add,copy,select,fill} (#85002 )" This reverts commit 2f0b3de443dd8d4477d70c5a56fa14496d1eebe3. Reverted https://github.com/pytorch/pytorch/pull/85002 on behalf of https://github.com/huydhn due to Broke trunk slow tests	2022-09-17 04:26:04 +00:00
lezcano	2f0b3de443	Refs and decompositions for index_{add,copy,select,fill} (#85002 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/85002 Approved by: https://github.com/ngimel	2022-09-16 23:59:35 +00:00
Horace He	4bdc0af53d	Added support for symbolic is_contiguous (#84829 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84829 Approved by: https://github.com/ezyang	2022-09-16 04:54:01 +00:00
Sherlock Huang	17925122d0	Rewrite new_zeros, new_ones, new_full decomp with aten.full (#84946 ) We should NOT introducing non-functional op for decomps of functional op. For example ``` make_fx(functionalize(lambda x: x.new_zeros(3)), decomposition_table=decomposition_table)(x) ``` is producing ``` def forward(self, x_1): empty = torch.ops.aten.empty.memory_format([3, 4], dtype = torch.float32, layout = torch.strided, device = device(type='cpu'), pin_memory = False) zero_ = torch.ops.aten.zero_.default(empty); empty = None return zero_ ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/84946 Approved by: https://github.com/ngimel	2022-09-15 05:45:40 +00:00
Ivan Yashchuk	6750946b82	Skip validate_view_consistency for nvFuser tests (#84858 ) nvFuser's execute function always returns a copy for now. Ref. https://github.com/pytorch/pytorch/pull/84629#discussion_r966375582 Pull Request resolved: https://github.com/pytorch/pytorch/pull/84858 Approved by: https://github.com/mruberry, https://github.com/ngimel	2022-09-14 12:03:11 +00:00
Ryan Spring	d09e8b23bf	[primTorch] Add repeat and unfold_copy references (#81374 ) Add References: - repeat - unfold - expand_as Pull Request resolved: https://github.com/pytorch/pytorch/pull/81374 Approved by: https://github.com/mruberry, https://github.com/ngimel	2022-09-12 22:19:06 +00:00
kshitij12345	4f6027b78a	[opinfo] narrow: add new sample for Tensor overload (#84785 ) `narrow` accepts `start` argument to be a Tensor. We add a sample to test this overload. NOTE: This leads to a bunch of failed tests and hence the skips and xfails Pull Request resolved: https://github.com/pytorch/pytorch/pull/84785 Approved by: https://github.com/zou3519	2022-09-12 16:59:08 +00:00

... 2 3 4 5 6 ...

443 Commits