Summary:
During development it is common practice to put `type: ignore` comments on lines that are correct, but that `mypy` doesn't recognize as such. This often stems from the fact that the `mypy` version in use wasn't able to handle the pattern in question.
With every new release `mypy` gets better at handling complex code. In addition to fixing all the previously accepted but now failing patterns, we should also revisit all `type: ignore` comments to see if they are still needed. Fortunately, we don't need to do this manually: by adding `warn_unused_ignores = True` to the configuration, `mypy` will error out whenever it encounters a `type: ignore` that is no longer needed.
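As a hedged illustration (not part of this PR's diff), this is the kind of comment `warn_unused_ignores` catches; the function below is hypothetical:
```python
# Hypothetical example: with warn_unused_ignores = True in the mypy config,
# the comment below is reported as 'Unused "type: ignore" comment' because
# the call already type-checks on current mypy.
def double(x: int) -> int:
    return x * 2

result = double(21)  # type: ignore[arg-type]
```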
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60006
Reviewed By: jbschlosser, malfet
Differential Revision: D29133237
Pulled By: albanD
fbshipit-source-id: 41e82edc5cd5affa7ccedad044b59b94dad4425a
Summary:
Like the title says. The OpInfo pattern can be confusing when first encountered, so this note links to the Developer Wiki and the tracking issue, and elaborates on the goals and structure of the OpInfo pattern.
cc imaginary-person, who I can't add as a reviewer, unfortunately
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57428
Reviewed By: SplitInfinity
Differential Revision: D29221874
Pulled By: mruberry
fbshipit-source-id: aa73228748c9c96eadf2b2397a8b2ec31383971e
Summary:
Simplifies the OpInfo dtype tests and produces nicer error messages, like:
```
AssertionError: Items in the first set but not the second:
torch.bfloat16
Items in the second set but not the first:
torch.int64 : Attempted to compare [set] types: Expected: {torch.float64, torch.float32, torch.float16, torch.bfloat16}; Actual: {torch.float64, torch.float32, torch.float16, torch.int64}.
The supported dtypes for logcumsumexp on cuda according to its OpInfo are
{torch.float64, torch.float32, torch.float16, torch.int64}, but the detected supported dtypes are {torch.float64, torch.float32, torch.float16, torch.bfloat16}.
The following dtypes should be added to the OpInfo: {torch.bfloat16}. The following dtypes should be removed from the OpInfo: {torch.int64}.
```
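For reference, a minimal sketch of how such a message can be assembled from the claimed and detected dtype sets; the helper below is illustrative only, not the actual test code:
```python
# Illustrative helper only; the real check lives in the OpInfo dtype tests.
def check_supported_dtypes(claimed, detected, op_name, device):
    if claimed == detected:
        return
    msg = (
        f"The supported dtypes for {op_name} on {device} according to its OpInfo are "
        f"{claimed}, but the detected supported dtypes are {detected}.\n"
    )
    if detected - claimed:
        msg += f"The following dtypes should be added to the OpInfo: {detected - claimed}. "
    if claimed - detected:
        msg += f"The following dtypes should be removed from the OpInfo: {claimed - detected}."
    raise AssertionError(msg)
```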
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60157
Reviewed By: ngimel
Differential Revision: D29188665
Pulled By: mruberry
fbshipit-source-id: e84c9892c6040ea47adb027cfef3a6c0fd2f9f3c
Summary:
Fixes https://github.com/pytorch/pytorch/issues/3025
## Background
This PR implements a function similar to numpy's [`isin()`](https://numpy.org/doc/stable/reference/generated/numpy.isin.html#numpy.isin).
The op supports integral and floating point types on CPU and CUDA (+ half & bfloat16 for CUDA). Inputs can be one of:
* (Tensor, Tensor)
* (Tensor, Scalar)
* (Scalar, Tensor)
Internally, one of two algorithms is selected based on the number of elements vs. test elements. The heuristic for deciding which algorithm to use is taken from [numpy's implementation](fb215c7696/numpy/lib/arraysetops.py (L575)): if `len(test_elements) < 10 * len(elements) ** 0.145`, then a naive brute-force checking algorithm is used. Otherwise, a stable-sort-based algorithm is used.
I've done some preliminary benchmarking on a devgpu to verify this heuristic, and determined for a limited set of tests that a power value of `0.407` instead of `0.145` is a better inflection point. For now, the heuristic has been left to match numpy's, but input is welcome on how best to tune it or whether it should stay the same as numpy's.
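A minimal sketch of the selection logic described above; the `isin_reference` helper and its brute-force branch are illustrative only, with the threshold mirroring numpy's heuristic:
```python
import torch

def isin_reference(elements: torch.Tensor, test_elements: torch.Tensor) -> torch.Tensor:
    # Heuristic taken from numpy: brute force when the test set is small
    # relative to the input, otherwise fall back to a sort-based path.
    if test_elements.numel() < 10 * elements.numel() ** 0.145:
        # Brute force: compare every element against every test element.
        return (elements.unsqueeze(-1) == test_elements.reshape(-1)).any(dim=-1)
    # Sort-based path; the real kernel uses a stable sort, here we simply
    # defer to the new op as a stand-in.
    return torch.isin(elements, test_elements)

# Semantics match numpy.isin:
# isin_reference(torch.tensor([1, 2, 3]), torch.tensor([2, 4])) -> tensor([False, True, False])
```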
Tests are adapted from numpy's [isin and in1d tests](7dcd29aaaf/numpy/lib/tests/test_arraysetops.py).
Note: my locally generated docs look terrible for some reason, so I'm not including the screenshot for them until I figure out why.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53125
Test Plan:
```
python test/test_ops.py # Ex: python test/test_ops.py TestOpInfoCPU.test_supported_dtypes_isin_cpu_int32
python test/test_sort_and_select.py # Ex: python test/test_sort_and_select.py TestSortAndSelectCPU.test_isin_cpu_int32
```
Reviewed By: soulitzer
Differential Revision: D29101165
Pulled By: jbschlosser
fbshipit-source-id: 2dcc38d497b1e843f73f332d837081e819454b4e
Summary:
Based from https://github.com/pytorch/pytorch/pull/50466
Adds the initial implementation of `torch.cov` similar to `numpy.cov`. For simplicity, we removed support for several `numpy.cov` parameters that are either redundant, such as `bias`, or have simple workarounds, such as `y` and `rowvar`.
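A hedged usage sketch (rows are variables, columns are observations, matching the fixed `rowvar=True` convention):
```python
import torch

x = torch.tensor([[0.0, 1.0, 2.0],   # variable 0
                  [2.0, 1.0, 0.0]])  # variable 1
print(torch.cov(x))  # default correction=1 (Bessel's correction), like numpy.cov
# tensor([[ 1., -1.],
#         [-1.,  1.]])
```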
cc PandaBoi
TODO
- [x] Improve documentation
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58311
Reviewed By: mruberry
Differential Revision: D28994140
Pulled By: heitorschueroff
fbshipit-source-id: 1890166c0a9c01e0a536acd91571cd704d632f44
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59711
This is the exact same PR as before.
It was previously reverted because the PR below it in the stack was faulty.
Test Plan: Imported from OSS
Reviewed By: zou3519
Differential Revision: D28995762
Pulled By: albanD
fbshipit-source-id: 65940ad93bced9b5f97106709d603d1cd7260812
Summary:
Related issue: https://github.com/pytorch/pytorch/issues/58833
__changes__
- slow-path tests: pass tensors of every dtype & device combination and compare the behavior with the corresponding regular functions, including in-place variants (see the sketch after this list)
- check the number of `cudaLaunchKernel` calls
- rename `ForeachUnaryFuncInfo` -> `ForeachFuncInfo`: This change is mainly for the future binary/pointwise test refactors
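As mentioned in the first bullet, a minimal sketch of the slow-path comparison, using `_foreach_exp` as a representative op; the real tests iterate over all dtypes, devices, and in-place variants:
```python
import torch

# Mixed dtypes force the foreach slow path, which should still match the
# regular per-tensor function.
tensors = [torch.randn(3, dtype=dtype) for dtype in (torch.float32, torch.float64)]
expected = [t.exp() for t in tensors]   # regular function applied per tensor
actual = torch._foreach_exp(tensors)    # foreach op on the whole list
for e, a in zip(expected, actual):
    torch.testing.assert_close(e, a)
```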
cc: ngimel ptrblck mcarilli
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58960
Reviewed By: ejguan
Differential Revision: D28926135
Pulled By: ngimel
fbshipit-source-id: 4eb21dcebbffffaf79259e31961626e0707fb8d1
Summary:
Echo on https://github.com/pytorch/pytorch/pull/58260#discussion_r637467625
Similar to `test_unsupported_dtype`, which only checks that an exception is raised for the first sample, we should do the same for unsupported backward as well. The goal of both tests is to remind developers to
1. add a new dtype to the supported list if it is fully runnable without failure (over all samples)
2. replace the skip mechanism, which would otherwise ignore tests indefinitely without warning (a hedged sketch of the backward check follows this list)
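A hedged sketch of what the new check does, with hypothetical argument names; the real test iterates the OpInfo samples and the op's claimed backward dtypes:
```python
import torch

def check_unsupported_backward(op, samples, dtype, device):
    # For a dtype the OpInfo does NOT claim backward support for, every sample
    # (not just the first) is expected to fail when backward is attempted.
    for sample in samples:
        try:
            inp = sample.input.detach().to(dtype=dtype, device=device).requires_grad_(True)
            out = op(inp, *sample.args, **sample.kwargs)
            out.sum().backward()
        except Exception:
            continue  # expected failure for an unsupported backward dtype
        raise AssertionError(f"backward unexpectedly succeeded for dtype {dtype}")
```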
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59455
Test Plan: CI.
Reviewed By: mruberry
Differential Revision: D28927169
Pulled By: walterddr
fbshipit-source-id: 2993649fc17a925fa331e27c8ccdd9b24dd22c20
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59553
Added a test for 0x0 sparse coo input for sparse_unary_ufuncs.
This test fails for `conj` on master.
Modified `unsupportedTypes` for test_sparse_consistency: complex dtypes
pass, but float16 doesn't pass for `conj` because `to_dense()` doesn't
work with float16.
Fixes https://github.com/pytorch/pytorch/issues/59549
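A hedged illustration of the 0x0 sparse COO case the new test covers, using `torch.neg` as a stand-in unary ufunc:
```python
import torch

indices = torch.empty((2, 0), dtype=torch.int64)       # no non-zero entries
values = torch.empty((0,), dtype=torch.complex64)
x = torch.sparse_coo_tensor(indices, values, size=(0, 0))
y = torch.neg(x)            # a sparse unary ufunc applied to the empty input
print(y.to_dense().shape)   # torch.Size([0, 0])
```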
Test Plan: Imported from OSS
Reviewed By: jbschlosser
Differential Revision: D28968215
Pulled By: anjali411
fbshipit-source-id: 44e99f0ce4aa45b760d79995a021e6139f064fea
Summary:
Implements an idea by ngimel to improve the performance of `torch.flip` via a clever hack into TI (TensorIterator) to bypass the fact that TI is not designed to work with negative indices.
Something that might be added is vectorisation support on CPU, given how simple the implementation is now.
Some low-hanging fruits that I did not implement:
- Write it as a structured kernel
- Migrate the tests to opinfos
- Have a look at `cumsum_backward` and `cumprod_backward`, as I think that they could be implemented faster with `flip`, now that `flip` is fast.
**Edit**
This operation already has OpInfos and it cannot be migrated to a structured kernel because it implements quantisation
Summary of the PR:
- x1.5-3 performance boost on CPU
- x1.5-2 performance boost on CUDA
- Comparable performance across dimensions, regardless of the strides (thanks TI)
- Simpler code
<details>
<summary>
Test Script
</summary>
```python
from itertools import product
import torch
from torch.utils.benchmark import Compare, Timer
def get_timer(size, dims, num_threads, device):
    x = torch.rand(*size, device=device)
    timer = Timer(
        "torch.flip(x, dims=dims)",
        globals={"x": x, "dims": dims},
        label=f"Flip {device}",
        description=f"dims: {dims}",
        sub_label=f"size: {size}",
        num_threads=num_threads,
    )
    return timer.blocked_autorange(min_run_time=5)

def get_params():
    sizes = ((1000,)*2, (1000,)*3, (10000,)*2)
    for size, device in product(sizes, ("cpu", "cuda")):
        threads = (1, 2, 4) if device == "cpu" else (1,)
        list_dims = [(0,), (1,), (0, 1)]
        if len(size) == 3:
            list_dims.append((0, 2))
        for num_threads, dims in product(threads, list_dims):
            yield size, dims, num_threads, device

def compare():
    compare = Compare([get_timer(*params) for params in get_params()])
    compare.trim_significant_figures()
    compare.colorize()
    compare.print()

compare()
```
</details>
<details>
<summary>
Benchmark PR
</summary>

</details>
<details>
<summary>
Benchmark master
</summary>

</details>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58747
Reviewed By: agolynski
Differential Revision: D28877076
Pulled By: ngimel
fbshipit-source-id: 4fa6eb519085950176cb3a9161eeb3b6289ec575
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54987
Based off of ezyang (https://github.com/pytorch/pytorch/pull/44799) and bdhirsh (https://github.com/pytorch/pytorch/pull/43702) 's prototype:
Here's a summary of the changes in this PR:
This PR adds a new dispatch key called Conjugate. This enables us to make conjugate operation a view and leverage the specialized library functions that fast path with the hermitian operation (conj + transpose).
1. Conjugate operation will now return a view with conj bit (1) for complex tensors and returns self for non-complex tensors as before. This also means `torch.view_as_real` will no longer be a view on conjugated complex tensors and is hence disabled. To fill the gap, we have added `torch.view_as_real_physical` which would return the real tensor agnostic of the conjugate bit on the input complex tensor. The information about conjugation on the old tensor can be obtained by calling `.is_conj()` on the new tensor.
2. NEW API (see the usage sketch after the notes below):
a) `.conj()` -- now returning a view.
b) `.conj_physical()` -- does the physical conjugate operation. If the conj bit for the input is set, you get `self.clone()`; otherwise you get a new tensor with the conjugated values in its memory.
c) `.conj_physical_()` and an `out=` variant
d) `.resolve_conj()` -- materializes the conjugation. Returns self if the conj bit is unset; otherwise returns a new tensor with conjugated values and the conj bit set to 0.
e) `.resolve_conj_()` -- in-place version of (d)
f) `view_as_real_physical` -- as described in (1), it's functionally the same as `view_as_real`, except that it doesn't error out on conjugated tensors.
g) `view_as_real` -- existing function, but now errors out on conjugated tensors.
3. Conjugate Fallback
a) The vast majority of PyTorch functions currently use this fallback when they are called on a conjugated tensor.
b) This fallback is well equipped to handle the following cases:
- functional operation e.g., `torch.sin(input)`
- Mutable inputs and in-place operations e.g., `tensor.add_(2)`
- out-of-place operation e.g., `torch.sin(input, out=out)`
- Tensorlist input args
- NOTE: Meta tensors don't work with conjugate fallback.
4. Autograd
a) `resolve_conj()` is an identity function w.r.t. autograd
b) Everything else works as expected.
5. Testing:
a) All method_tests run with conjugate view tensors.
b) OpInfo tests that run with conjugate views
- test_variant_consistency_eager/jit
- gradcheck, gradgradcheck
- test_conj_views (that only run for `torch.cfloat` dtype)
NOTE: functions like `empty_like`, `zeros_like`, `randn_like`, `clone` don't propagate the conjugate bit.
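A hedged usage sketch of the API described in (2) above:
```python
import torch

x = torch.tensor([1 + 2j, 3 - 4j])

y = x.conj()               # now a view; no data is copied
print(y.is_conj())         # True: the conj bit is set on the view
print(y.resolve_conj())    # tensor([1.-2.j, 3.+4.j]); new memory, conj bit cleared
print(x.conj_physical())   # eager conjugation into new memory, same values
```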
Follow up work:
1. conjugate view RFC
2. Add neg bit to re-enable view operation on conjugated tensors
3. Update linalg functions to call into specialized functions that fast path with the hermitian operation.
Test Plan: Imported from OSS
Reviewed By: VitalyFedyunin
Differential Revision: D28227315
Pulled By: anjali411
fbshipit-source-id: acab9402b9d6a970c6d512809b627a290c8def5f
Summary:
sample_inputs_diff constructs all five positional arguments for [diff ](https://pytorch.org/docs/stable/generated/torch.diff.html) but uses only the first three. This doesn't seem to be intentional.
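For reference, a hedged example that exercises all five of `torch.diff`'s arguments (`input`, `n`, `dim`, `prepend`, `append`):
```python
import torch

x = torch.tensor([[1, 3, 6],
                  [2, 5, 9]])
pre = torch.zeros(2, 1, dtype=x.dtype)          # prepended along dim=1
app = torch.full((2, 1), 10, dtype=x.dtype)     # appended along dim=1
print(torch.diff(x, n=1, dim=1, prepend=pre, append=app))
# tensor([[1, 2, 3, 4],
#         [2, 3, 4, 1]])
```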
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59181
Test Plan: This change expands coverage of diff's OpInfo sample inputs. Related tests still pass.
Reviewed By: mruberry
Differential Revision: D28878359
Pulled By: saketh-are
fbshipit-source-id: 1466f6c6c341490885c85bc6271ad8b3bcdf3a3e
Summary:
Resubmit of https://github.com/pytorch/pytorch/issues/59108, closes https://github.com/pytorch/pytorch/issues/24754, closes https://github.com/pytorch/pytorch/issues/24616
This reuses `linalg_vector_norm` to calculate the norms. I just add a new kernel that turns the norm into a normalization factor, then multiplies the original tensor by it using a normal broadcasted `mul` operator. The result is less code, and better performance to boot.
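A minimal sketch of the math described above, assuming `torch.nn.functional.normalize` as the reference (the kernel in this PR is native code; results should agree to within floating-point tolerance):
```python
import torch
import torch.nn.functional as F

x = torch.randn(50, 50, 50)
dim, p, eps = 1, 2.0, 1e-12

norm = torch.linalg.vector_norm(x, ord=p, dim=dim, keepdim=True)
factor = 1.0 / norm.clamp_min(eps)   # norm turned into a normalization factor
y = x * factor                       # plain broadcasted mul

torch.testing.assert_close(y, F.normalize(x, p=p, dim=dim, eps=eps))
```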
#### Benchmarks (CPU):
| Shape | Dim | Before | After (1 thread) | After (8 threads) |
|:------------:|:---:|--------:|-----------------:|------------------:|
| (10, 10, 10) | 0 | 11.6 us | 4.2 us | 4.2 us |
| | 1 | 14.3 us | 5.2 us | 5.2 us |
| | 2 | 12.7 us | 4.6 us | 4.6 us |
| (50, 50, 50) | 0 | 330 us | 120 us | 24.4 us |
| | 1 | 350 us | 135 us | 28.2 us |
| | 2 | 417 us | 130 us | 24.4 us |
#### Benchmarks (CUDA)
| Shape | Dim | Before | After |
|:------------:|:---:|--------:|--------:|
| (10, 10, 10) | 0 | 12.5 us | 12.1 us |
| | 1 | 13.1 us | 12.2 us |
| | 2 | 13.1 us | 11.8 us |
| (50, 50, 50) | 0 | 33.7 us | 11.6 us |
| | 1 | 36.5 us | 15.8 us |
| | 2 | 41.1 us | 15 us |
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59250
Reviewed By: mruberry
Differential Revision: D28820359
Pulled By: ngimel
fbshipit-source-id: 572486adabac8135d52a9b8700f9d145c2a4ed45
Summary:
Fixes https://github.com/pytorch/pytorch/issues/57508
Earlier, a few CUDA `gradgrad` checks (see the list of ops below) were disabled because they were too slow. There have been improvements (see https://github.com/pytorch/pytorch/issues/57508 for reference), and this PR aims to:
1. Measure the time taken by `gradgrad` checks on CUDA for the ops listed below.
2. Re-enable the tests if those times sound reasonable.
Ops considered: `addbmm, baddbmm, bmm, cholesky, symeig, inverse, linalg.cholesky, linalg.cholesky_ex, linalg.eigh, linalg.qr, lu, qr, solve, triangular_solve, linalg.pinv, svd, linalg.svd, pinverse, linalg.householder_product, linalg.solve`.
For numbers (on time taken) on a separate CI run: https://github.com/pytorch/pytorch/pull/57802#issuecomment-836169691.
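For context, a hedged example of the kind of second-order check being re-enabled, here on `torch.inverse`, one of the listed ops; it requires a CUDA device and double precision:
```python
import torch
from torch.autograd import gradgradcheck

# Well-conditioned input so the numerical Jacobian comparison stays stable.
a = torch.eye(3, dtype=torch.double, device="cuda")
a = a + 0.1 * torch.randn(3, 3, dtype=torch.double, device="cuda")
a.requires_grad_(True)
assert gradgradcheck(torch.inverse, (a,))
```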
cc: mruberry albanD pmeier
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57802
Reviewed By: ngimel
Differential Revision: D28784106
Pulled By: mruberry
fbshipit-source-id: 9b15238319f143c59f83d500e831d66d98542ff8