Update ruff to 0.4.1.
This version fixes a lot of false negatives/false positives, is 20-40% faster, and includes various other bug fixes.
Below is a before-and-after table showing the execution time of `ruff lint` and `ruff format` in milliseconds, courtesy of https://astral.sh/blog/ruff-v0.4.0
| Repository | Linter (v0.3) | Linter (v0.4) | Formatter (v0.3) | Formatter (v0.4) |
|----------------------------------------------------|---------------|---------------|------------------|------------------|
| [pytorch/pytorch](https://github.com/pytorch/pytorch) | 328.7 | 251.8 | 351.1 | 274.9 |
Pull Request resolved: https://github.com/pytorch/pytorch/pull/124549
Approved by: https://github.com/ezyang
Simplifies and optimizes dict construction using the `fromkeys` classmethod constructor. This also makes it really obvious when all the keys will have the same static value, which could be a bug if unintentional. It is also significantly faster than using a dict comprehension. The rule is in preview, but I am adding a forward fix for when it becomes stable.
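For illustration, a minimal sketch of the pattern the rule rewrites (the variable names are made up):
```python
keys = ["a", "b", "c"]

# Before: a dict comprehension that assigns the same static value to every key
before = {k: None for k in keys}

# After: dict.fromkeys makes the shared static value explicit and is faster
after = dict.fromkeys(keys)  # equivalent to dict.fromkeys(keys, None)

assert before == after
```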
Pull Request resolved: https://github.com/pytorch/pytorch/pull/118637
Approved by: https://github.com/albanD
This replaces a bunch of unnecessary lambdas with the operator package. This is semantically equivalent, but the operator package is faster and arguably more readable. When the FURB rules are taken out of preview, I will enable them as ruff checks.
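A minimal sketch of the kind of rewrite involved (the example values are made up):
```python
import operator

items = [("b", 2), ("a", 1)]

# Before: a lambda that merely forwards to indexing
by_value = sorted(items, key=lambda item: item[1])

# After: operator.itemgetter is semantically equivalent and faster
by_value_op = sorted(items, key=operator.itemgetter(1))

assert by_value == by_value_op
```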
Pull Request resolved: https://github.com/pytorch/pytorch/pull/116027
Approved by: https://github.com/malfet
Related to the Reproducible Testing BE project. The goal is to print out the sample input that failed an OpInfo test.
Crazy idea: to avoid requiring widespread changes across tests that use OpInfo sample inputs, return a new special iterator type from `OpInfo.sample_inputs()`, etc. that tracks the most recent item seen. If a test fails later on, print out this info to identify the sample that failed the test.
This solves the problem that the test framework currently has no concept of which sample input is being operated on.
This PR contains the following changes:
* New `TrackedInputIter` that wraps a sample inputs func iterator and tracks the most recent input seen in a `TrackedInput` structure (a minimal sketch of the idea follows after this list)
* The information is stored in a dictionary on the test function itself, mapping `full test ID -> most recent TrackedInput`
* To determine the test function that is being run, we do some stack crawling hackery in `extract_test_fn_and_id()`
* The above applies only when one of the following is called: `OpInfo.sample_inputs()`, `OpInfo.error_inputs()`, `OpInfo.reference_inputs()`, or `OpInfo.conjugate_sample_inputs()`. This could easily be extended to `ModuleInfo`s and the sparse sample input funcs as well
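As a rough illustration of the tracking mechanism (a minimal sketch only; aside from `TrackedInputIter` and `TrackedInput`, the names are hypothetical and the real implementation in the test utilities differs in detail):
```python
from dataclasses import dataclass
from typing import Any, Callable, Iterator

@dataclass
class TrackedInput:
    index: int
    val: Any

class TrackedInputIter:
    """Wraps a sample inputs iterator and remembers the most recent item seen."""
    def __init__(self, child_iter: Iterator,
                 callback: Callable[[TrackedInput], None] = lambda tracked: None):
        self.child_iter = enumerate(child_iter)
        # callback that records the TrackedInput somewhere the test framework can
        # find it, e.g. a dict on the test function keyed by the full test ID
        self.callback = callback

    def __iter__(self):
        return self

    def __next__(self):
        idx, item = next(self.child_iter)
        self.callback(TrackedInput(index=idx, val=item))
        return item
```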
Example output when a sample input causes a failure:
```
======================================================================
ERROR: test_foo_add_cpu_uint8 (__main__.TestFakeTensorCPU)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/home/jbschlosser/branches/reproducible_testing/torch/testing/_internal/common_device_type.py", line 911, in test_wrapper
return test(*args, **kwargs)
File "/home/jbschlosser/branches/reproducible_testing/torch/testing/_internal/common_device_type.py", line 1097, in only_fn
return fn(slf, *args, **kwargs)
File "/home/jbschlosser/branches/reproducible_testing/test/test_ops.py", line 2211, in test_foo
self.fail('Example failure')
AssertionError: Example failure
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/home/jbschlosser/branches/reproducible_testing/torch/testing/_internal/common_utils.py", line 2436, in wrapper
method(*args, **kwargs)
File "/home/jbschlosser/branches/reproducible_testing/torch/testing/_internal/common_device_type.py", line 414, in instantiated_test
result = test(self, **param_kwargs)
File "/home/jbschlosser/branches/reproducible_testing/torch/testing/_internal/common_device_type.py", line 917, in test_wrapper
raise Exception(
Exception: Caused by sample input at index 2: SampleInput(input=Tensor[size=(5, 1), device="cpu", dtype=torch.uint8], args=TensorList[Tensor[size=(5,), device="cpu", dtype=torch.uint8]], kwargs={}, broadcasts_input=True, name='')
To execute this test, run the following from the base repo dir:
python test/test_ops.py -k test_foo_add_cpu_uint8
This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
----------------------------------------------------------------------
```
This notably doesn't print the actual `SampleInput` values, as that's hard without fully reproducible random sample generation. I went down this path for a while and it seems infeasible without adding an untenable amount of overhead to set the random seed per SampleInput (see https://github.com/pytorch/pytorch/issues/86694#issuecomment-1614943708 for more details). For now, I am settling for at least spitting out the index and some metadata of the `SampleInput`, as it seems better than nothing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99444
Approved by: https://github.com/janeyx99
#113340 was initially reverted due to a bad default parametrization name. The test looked like this:
```python
@common_utils.parametrize(
"type_fn",
[
type,
lambda obj: obj.__class__,
],
)
def test_access_class_method_from_user_class(self, type_fn):
```
This is a valid parametrization, but results in these default test names:
```bash
❯ pytest test/dynamo/test_export.py -k test_access_class_method_from_user_class --co -q
test/dynamo/test_export.py::ExportTests::test_access_class_method_from_user_class_type_fn_<class 'type'>
test/dynamo/test_export.py::ExportTests::test_access_class_method_from_user_class_type_fn_<function ExportTests_<lambda> at 0x7f3be5de0c10>
```
Ignoring the whitespace in the test names, which can lead to other issues down the line, the problem in #113340 was that the lambda parameter included a memory address. IIUC, internally the tests are not collected and run in the same process, meaning the address of the lambda, and in turn the test name, is no longer valid on the runner. This is fixed earlier in the stack by giving the parametrization an explicit name with `subtest`, but this PR is about preventing issues in the default case.
`pytest` solves this by simply using the name of the parameter plus its index as id in the test name:
```python
import pytest
class Foo:
def __repr__(self):
return str(id(self))
@pytest.mark.parametrize(
"bar",
[
pytest.param(type),
pytest.param(lambda obj: obj.__class__),
pytest.param(Foo()),
],
)
def test_foo(bar):
pass
```
```
❯ pytest main.py --co -q
main.py::test_foo[type]
main.py::test_foo[<lambda>]
main.py::test_foo[bar2]
```
`pytest` has better defaults for `type` and `lambda` than we do, but it has a safe default for custom objects.
This PR aligns our default test name with `pytest`. Using the parametrization from above again, we now collect
```bash
❯ pytest test/dynamo/test_export.py -k test_access_class_method_from_user_class --co -q
test/dynamo/test_export.py::ExportTests::test_access_class_method_from_user_class_type_fn0
test/dynamo/test_export.py::ExportTests::test_access_class_method_from_user_class_type_fn1
```
which might not be as expressive at first glance, but at least prevents bugs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113856
Approved by: https://github.com/malfet, https://github.com/huydhn
ghstack dependencies: #113855
Summary:
We are planning to lazily initialize CUPTI when profiling is actually performed. Therefore, we need to remove the profiler init dependency on CUPTI Callbacks' RESOURCE_CONTEXT_CREATED.
Instead, we can initialize the profilers during the torch profiler pybind, i.e. `THPAutograd_initExtension()`, and lazily in `profilerStep()`.
Test Plan:
CI and ran internally, see internal diff logs.
Differential Revision: D50894961
Pulled By: aaronenyeshi
Pull Request resolved: https://github.com/pytorch/pytorch/pull/112623
Approved by: https://github.com/albanD
Use [PEP-562](https://peps.python.org/pep-0562) to import `_dynamo` and `_inductor` only when needed.
- Remove redundant imports from tests
- Add `test_lazy_imports_are_lazy` to make sure they will not get imported by accident
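For reference, the module-level `__getattr__` mechanism from PEP 562 looks roughly like this (a simplified sketch, not the exact code added to `torch/__init__.py`):
```python
# torch/__init__.py (sketch)
import importlib

_lazy_submodules = {"_dynamo", "_inductor"}

def __getattr__(name):
    # PEP 562: invoked only when `torch.<name>` is not found via normal lookup,
    # so these submodules are imported on first access rather than at `import torch`.
    if name in _lazy_submodules:
        return importlib.import_module(f".{name}", __name__)
    raise AttributeError(f"module {__name__!r} has no attribute {name!r}")
```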
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104368
Approved by: https://github.com/msaroufim, https://github.com/albanD
This changes codegen of `torch.prod` from:
```python
tl.reduce(tmp2, 1, _prod_accumulate)[:, None]
```
where `_prod_accumulate` is defined elsewhere, to
```python
triton_helpers.prod(tmp2, 1)[:, None]
```
A quirk I uncovered though is that `TritonCodeCache` breaks if you
define any new symbol beginning with `triton_`, since it assumes that
must be the kernel name. Instead, I've made the kernel name an
explicit argument to `async_compile.triton` so it doesn't have to guess.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99880
Approved by: https://github.com/ngimel
We had some minimal tests for `torch.testing.make_tensor` before, but nothing exhaustive. This led to quite a few edge cases going undetected. This PR adds comprehensive tests and leaves a few FIXMEs in there for behavior that needs to be fixed in `make_tensor`. This will happen in later commits of this stack, meaning that by the end of this stack, there shouldn't be any FIXMEs left in the tests added here.
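For reference, typical calls exercised by the new tests look roughly like this (illustrative shapes and values only):
```python
import torch
from torch.testing import make_tensor

# float tensor with values bounded by low/high
t = make_tensor((2, 3), dtype=torch.float32, device="cpu", low=-1.0, high=1.0)

# dtypes such as bool are among the edge cases the tests now cover
b = make_tensor((4,), dtype=torch.bool, device="cpu")
```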
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96331
Approved by: https://github.com/mruberry
Applies some more harmless pyupgrade fixes. This one gets rid of deprecated aliases in unit tests and upgrades more `yield` for-loops into `yield from` generators, which are more performant and propagate more information/exceptions from the original generator. This is the modern recommended way of forwarding generators.
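A minimal sketch of the `yield from` upgrade (made-up example):
```python
# Before: forwarding a generator element by element with a for-loop
def chain_old(a, b):
    for x in a:
        yield x
    for x in b:
        yield x

# After: `yield from` delegates directly and also forwards sent values,
# thrown exceptions, and return values from the underlying generators
def chain_new(a, b):
    yield from a
    yield from b

assert list(chain_new([1, 2], [3])) == list(chain_old([1, 2], [3]))
```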
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94309
Approved by: https://github.com/albanD
Continuation of #79979.
Fixes #79161
This PR does the following:
* Expands the `parametrize_fn()` signature from returning a 3-tuple of `(test, test_name, param_kwargs)` to returning a 4-tuple of `(test, test_name, param_kwargs, decorator_fn)`. The expected signature for the addition is `decorator_fn(param_kwargs) -> List[decorator]`, i.e. given the full set of test params, return a list of decorators to apply (a minimal sketch follows after this list).
* `modules`, `ops`, and `parametrize` now fit the new signature, returning `decorator_fn`s instead of applying decorators themselves.
* `instantiate_parametrized_tests()` and `instantiate_device_type_tests()` now call the returned `decorator_fn`, passing in the full set of `param_kwargs` (after composition + `device` / `dtype` additions) and applying the returned decorators.
* Composing multiple `parametrize_fn`s also composes the corresponding `decorator_fn`s; the composed `decorator_fn` simply concatenates the decorator lists returned by the constituents.
* Expands `DecorateInfo.is_active` to support callables:
```python
DecorateInfo(
unittest.expectedFailure, "TestOps", "test_python_ref_executor",
device_type='cuda', active_if=lambda params: params['executor'] == 'nvfuser'
),
```
* Adds several tests to `test/test_testing.py` ensuring proper decoration using `@parametrize`, `@modules`, and `@ops`.
* (minor) Fixes a couple `ModuleInfo` naming oddities uncovered during testing.
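A minimal sketch of the new 4-tuple contract (heavily simplified; the generator signature and helper names below are approximations, not the real machinery in `common_utils`):
```python
import unittest

def example_parametrize_fn(test, generic_cls, device_cls):
    for value in (1, 2):
        test_name = f"{test.__name__}_value_{value}"
        param_kwargs = {"value": value}

        def decorator_fn(params, _value=value):
            # given the full set of test params, return a list of decorators to apply
            return [unittest.expectedFailure] if _value == 2 else []

        yield test, test_name, param_kwargs, decorator_fn
```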
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91658
Approved by: https://github.com/malfet
There was a lot of strangeness in how AOTAutograd backends were previously defined. This refactor replaces the strangeness with something simple and straightforward. The improvements:
- There is no longer a footgun aot_autograd "backend" which doesn't actually work. No more mistyping `torch._dynamo.optimize("aot_autograd")` when you meant "aot_eager"
- Deleted aot_print because it's annoying and anyway there's no uses of it
- Instead of having BOTH the backend Subgraph and AotAutogradStrategy, there is now only an aot_autograd function which takes the kwargs to configure AOTAutograd, and then gives you a compiler function that does AOTAutograd given those kwargs (a rough sketch of this pattern follows below). Easy.
- The primary downside is that we are now eagerly populating all of the kwargs, and that can get us into import cycle shenanigans. Some cycles I resolved directly (e.g., we now no longer manually disable the forward function before passing it to aot_autograd; aot_autograd does it for us), but for getting inductor decompositions I had to make it take a lambda so I could lazily populate the decomps later.
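The pattern described above is roughly the following closure shape (an illustrative sketch, not the actual torch API; names are made up):
```python
# Sketch: configuration kwargs go in once, and out comes a Dynamo-style
# backend function of (graph_module, example_inputs).
def make_aot_autograd_backend(**aot_kwargs):
    def backend(gm, example_inputs):
        # ... run AOTAutograd on `gm` using `aot_kwargs` ...
        return gm.forward  # placeholder for the compiled callable
    return backend

# e.g. an "aot_eager"-like backend would be built by passing eager compilers as kwargs
```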
New code is 130 lines shorter!
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89736
Approved by: https://github.com/anjali411, https://github.com/albanD
Hybrid sparse CSR tensors currently cannot be compared to strided ones since `.to_dense` does not work:
```py
import torch
from torch.testing._internal.common_utils import TestCase
assertEqual = TestCase().assertEqual
actual = torch.sparse_csr_tensor([0, 2, 4], [0, 1, 0, 1], [[1, 11], [2, 12] ,[3, 13] ,[4, 14]])
expected = torch.stack([actual[0].to_dense(), actual[1].to_dense()])
assertEqual(actual, expected)
```
```
main.py:4: UserWarning: Sparse CSR tensor support is in beta state. If you miss a functionality in the sparse tensor support, please submit a feature request to https://github.com/pytorch/pytorch/issues. (Triggered internally at ../aten/src/ATen/SparseCsrTensorImpl.cpp:54.)
actual = torch.sparse_csr_tensor([0, 2, 4], [0, 1, 0, 1], [[1, 11], [2, 12] ,[3, 13] ,[4, 14]])
Traceback (most recent call last):
File "/home/philip/git/pytorch/torch/torch/testing/_comparison.py", line 1098, in assert_equal
pair.compare()
File "/home/philip/git/pytorch/torch/torch/testing/_comparison.py", line 619, in compare
actual, expected = self._equalize_attributes(actual, expected)
File "/home/philip/git/pytorch/torch/torch/testing/_comparison.py", line 706, in _equalize_attributes
actual = actual.to_dense() if actual.layout != torch.strided else actual
RuntimeError: sparse_compressed_to_dense: Hybrid tensors are not supported
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "main.py", line 10, in <module>
assertEqual(actual, expected)
File "/home/philip/git/pytorch/torch/torch/testing/_internal/common_utils.py", line 2503, in assertEqual
msg=(lambda generated_msg: f"{generated_msg}\n{msg}") if isinstance(msg, str) and self.longMessage else msg,
File "/home/philip/git/pytorch/torch/torch/testing/_comparison.py", line 1112, in assert_equal
) from error
RuntimeError: Comparing
TensorOrArrayPair(
id=(),
actual=tensor(crow_indices=tensor([0, 2, 4]),
col_indices=tensor([0, 1, 0, 1]),
values=tensor([[ 1, 11],
[ 2, 12],
[ 3, 13],
[ 4, 14]]), size=(2, 2, 2), nnz=4,
layout=torch.sparse_csr),
expected=tensor([[[ 1, 11],
[ 2, 12]],
[[ 3, 13],
[ 4, 14]]]),
rtol=0.0,
atol=0.0,
equal_nan=True,
check_device=False,
check_dtype=True,
check_layout=False,
check_stride=False,
check_is_coalesced=False,
)
resulted in the unexpected exception above. If you are a user and see this message during normal operation please file an issue at https://github.com/pytorch/pytorch/issues. If you are a developer and working on the comparison functions, please except the previous error and raise an expressive `ErrorMeta` instead.
```
This adds a temporary hack to `TestCase.assertEqual` to enable this. Basically, we go through the individual CSR subtensors, call `.to_dense()` on them, and stack everything back together. I opted not to do this in the common machinery, so that users are not affected by this (undocumented) hack.
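The shape of the workaround mirrors the `torch.stack([actual[0].to_dense(), actual[1].to_dense()])` trick from the snippet above (a simplified sketch; the real logic lives inside `TestCase.assertEqual` and handles more cases):
```python
import torch

def densify_hybrid_csr(t: torch.Tensor) -> torch.Tensor:
    # densify each sub-tensor along the first dimension, then stack them back together
    return torch.stack([t[i].to_dense() for i in range(t.shape[0])])
```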
I also added an xfailed test that will trigger as soon as the behavior is supported natively so we don't forget to remove the hack when it is no longer needed.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88749
Approved by: https://github.com/mruberry, https://github.com/pearu
`Sparsity` as a term doesn't reflect the tools that are developed by the AO team. The `torch/ao/sparsity` module also has utilities for structured pruning, which internally we have always referred to simply as "pruning". To avoid any confusion, we renamed `Sparsity` to `Prune`. We will not be introducing backwards compatibility, as so far this toolset has been kept under silent development.
This change will reflect the changes in the documentation as well.
**TODO:**
- [ ] Change the tutorials
- [ ] Confirm no bc-breakages
- [ ] Reflect the changes in the trackers and RFC docs
Pull Request resolved: https://github.com/pytorch/pytorch/pull/84867
Approved by: https://github.com/supriyar
Splitting into a separate PR in case of bike shedding. We can't use
the normal fluent syntax `SampleInput(x).name("foo")` because `.name`
is already how the metadata is accessed. So instead, this adds a
single function where you pass keyword arguments to fill in the
metadata, e.g.
```
SampleInput(x).with_metadata(
name="foo", output_process_fn_grad=out_fn)
```
An alternative closer to the normal fluent style would be to adding a
prefix to the property's name, e.g.
```
(SampleInput(x)
.with_name("foo")
.with_output_process_fn_grad(out_fn))
```
However, I have a slight preference for the `with_metadata` style
because you don't need to add extra parentheses to break lines.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85890
Approved by: https://github.com/mruberry
Most SampleInput objects currently have no additional metadata,
meaning they have a 1:1 mapping with a normal function call. This adds
var arg forms of the `SampleInput` constructor such that you can just
call the `SampleInput` constructor as you would call the operator.
So, for example
```python
SampleInput(make_arg(shape), args=(2, 3), kwargs=dict(alpha=4))
```
becomes
```python
SampleInput(make_arg(shape), 2, 3, alpha=4)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85723
Approved by: https://github.com/mruberry
Ref #82518
Starting small to minimize merge conflicts, this moves the top-level
class definitions and some helper functions into the `opinfos` folder.
It also brings `common_methods_invocations.py` to just below 1MB.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/82540
Approved by: https://github.com/albanD
Lightning callback that enables post-training sparsity.
This callback aims to sparsify the model inside the lightning module after training.
**Note that the model is copied and then sparsified, so the existing model is not modified.**
The sparsified model can be used for comparison and can be accessed using `<callback_obj>.sparsified`.
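Conceptually, the callback does something like the following (a hypothetical, simplified sketch: it assumes a sparsifier object with `prepare`/`step`/`squash_mask` methods and a LightningModule that exposes its network as `pl_module.model`; the real callback under `torch/ao/sparsity/_experimental/data_sparsifier/lightning` differs in detail):
```python
import copy
import pytorch_lightning as pl

class PostTrainingSparsityCallback(pl.Callback):
    def __init__(self, sparsifier, sparsifier_config):
        self.sparsifier = sparsifier
        self.sparsifier_config = sparsifier_config
        self.sparsified = None

    def on_fit_end(self, trainer, pl_module):
        # copy the model so the trained module inside the lightning module is not modified
        self.sparsified = copy.deepcopy(pl_module.model)
        self.sparsifier.prepare(self.sparsified, config=self.sparsifier_config)
        self.sparsifier.step()
        self.sparsifier.squash_mask()
```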
Test Plan:
```
python torch/ao/sparsity/_experimental/data_sparsifier/lightning/tests/test_callbacks.py TestPostTrainingCallback
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80370
Approved by: https://github.com/z-a-f