pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-11-03 15:35:04 +08:00

Author	SHA1	Message	Date
CaoE	9399e0b1ff	add fp16 support for gemm (#99498 ) ### Testing Native matmul vs. mkldnn matmul on SPR (with avx512_fp16 support) single core: Input \| Naïve impl / ms \| oneDNN / ms \| Speed up -- \| -- \| -- \| -- M: 128, N: 128, K: 128, trans_a: False, trans_b: False \| 2010.387 \| 64.700 \| 31.072 M: 128, N: 256, K: 128, trans_a: False, trans_b: False \| 4027.116 \| 107.780 \| 37.364 M: 8192, N: 768, K: 768, trans_a: False, trans_b: False \| 28685868.488 \| 90663.008 \| 316.401 56 cores: Input \| Naïve impl / ms \| oneDNN / ms \| Speed up -- \| -- \| -- \| -- M: 128, N: 128, K: 128, trans_a: False, trans_b: False \| 5.091 \| 0.24 \| 211.30 M: 128, N: 128, K: 128, trans_a: False, trans_b: True \| 5.224 \| 0.23 \| 220.09 M: 128, N: 256, K: 128, trans_a: False, trans_b: False \| 10.006 \| 0.30 \| 330.31 M: 8192, N: 768, K: 768, trans_a: False, trans_b: False \| 29435.372 \| 1.770 \| 1662.80 M: 8192, N: 768, K: 768, trans_a: False, trans_b: True \| 31464.961 \| 1.728 \| 18204.76 M: 8192, N: 768, K: 3072, trans_a: False, trans_b: False \| 115035.849 \| 7.990 \| 14396.90 M: 8192, N: 768, K: 3072, trans_a: False, trans_b: True \| 122981.023 \| 7.725 \| 15918.34 Batch: 768, M: 128, N: 64, K: 128 \| 2032.523 \| 0.705 \| 2882.23 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99498 Approved by: https://github.com/jgong5, https://github.com/malfet	2023-09-28 01:03:50 +00:00
Edward Z. Yang	869226bf94	Avoid passing generator to parametrize (#110104 ) Fixes ``` ValueError: <function TestMeta.test_layer_norm_backward at 0x7f555f56e440>: An empty arg_values was passed to @parametrize. Note that this may result from reuse of a generator. ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/110104 Approved by: https://github.com/malfet, https://github.com/jbschlosser, https://github.com/voznesenskym	2023-09-27 02:52:48 +00:00
Mwiza Kunda	5c4b5baf21	Fix python decomps for OpOverloadPackets and add tests (#107707 ) - Extend `test_torch_dispatch_meta_outplace` to test torch ops that do not have an out parameter but have aten op overloads that have out parameters. Additionally, Python decompositions may register `OpOverloadPacket`'s so decompositions need to be tested to ensure all `OpOverloads` still function for the `Meta` key (e.g. if a python decomposition is registered for an aten op `aten.foo` with overloads `[default, out]`, the python function needs to support receiving out arguments) - Add out parameter wrappers to python decomps for aten ops that have out overloads CC. @ezyang @albanD @lezcano Fixes #107713 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107707 Approved by: https://github.com/lezcano	2023-09-25 20:53:30 +00:00
Mwiza Kunda	8dedc9dd9b	Add meta tests for layer/group/batch norm backward (#109591 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/109591 Approved by: https://github.com/ezyang	2023-09-21 18:58:51 +00:00
CaoE	54c28c564f	add Half support for BatchNorm on CPU (#102070 ) Fixes #106543 ### Testing Single core: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.7116 \| 0.1427 \| 0.1744 \| 0.2638 \| 0.2002 \| 0.2556 (1, 32, 100, 100) \| 0.8579 \| 0.1725 \| 0.2077 \| 0.3023 \| 0.2399 \| 0.2995 (32, 16, 200, 200) \| 57.3466 \| 12.2179 \| 13.1320 \| 45.9524 \| 24.1526 \| 24.9882 28 cores: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.2571 \| 0.0713 \| 0.0846 \| 0.1140 \| 0.0883 \| 0.1043 (1, 32, 100, 100) \| 0.1077 \| 0.0510 \| 0.0548 \| 0.0700 \| 0.0645 \| 0.0713 (32, 16, 200, 200) \| 5.5060 \| 1.4195 \| 1.4663 \| 6.773 \| 3.0886 \| 3.1343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102070 Approved by: https://github.com/jgong5, https://github.com/mikaylagawarecki, https://github.com/mingfeima	2023-09-19 10:43:33 +00:00
Jez Ng	7f3885137f	Add meta function for _segment_reduce (#109359 ) This fixes numerous tests which were xfailing. For instance, the `_segment_reduce.lengths` OpInfo test, which was previously relying on the fallback kernel to determine the shape of the meta tensor. The fallback kernel would fail with segment_reduce(): Expected all rows of lengths along axis to sum to data.size(lengths.dim()-1) when !unsafe. as it was trying to read the values of a meta tensor. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109359 Approved by: https://github.com/ezyang	2023-09-16 13:31:03 +00:00
PyTorch MergeBot	b226373d16	Revert "add Half support for BatchNorm on CPU (#102070 )" This reverts commit b6a1d3fb97ca8eeccf15a4c495fdd1af4b197f88. Reverted https://github.com/pytorch/pytorch/pull/102070 on behalf of https://github.com/clee2000 due to I'm very sorry but it looks like #106543 was not fixed, I still see it failing on main `b6a1d3fb97` https://github.com/pytorch/pytorch/actions/runs/6185704949/job/16793975677 ([comment](https://github.com/pytorch/pytorch/pull/102070#issuecomment-1719747065))	2023-09-14 16:13:34 +00:00
CaoE	b6a1d3fb97	add Half support for BatchNorm on CPU (#102070 ) Fixes #106543 ### Testing Single core: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.7116 \| 0.1427 \| 0.1744 \| 0.2638 \| 0.2002 \| 0.2556 (1, 32, 100, 100) \| 0.8579 \| 0.1725 \| 0.2077 \| 0.3023 \| 0.2399 \| 0.2995 (32, 16, 200, 200) \| 57.3466 \| 12.2179 \| 13.1320 \| 45.9524 \| 24.1526 \| 24.9882 28 cores: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.2571 \| 0.0713 \| 0.0846 \| 0.1140 \| 0.0883 \| 0.1043 (1, 32, 100, 100) \| 0.1077 \| 0.0510 \| 0.0548 \| 0.0700 \| 0.0645 \| 0.0713 (32, 16, 200, 200) \| 5.5060 \| 1.4195 \| 1.4663 \| 6.773 \| 3.0886 \| 3.1343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102070 Approved by: https://github.com/jgong5, https://github.com/mikaylagawarecki	2023-09-14 12:23:59 +00:00
PyTorch MergeBot	04a765f95d	Revert "add Half support for BatchNorm on CPU (#102070 )" This reverts commit 6065e7a97cfad4c2ae2b8722969648a53265fa13. Reverted https://github.com/pytorch/pytorch/pull/102070 on behalf of https://github.com/clee2000 due to sorry it looks like this is causing an unexpected success for `test_jit_fuser_te.py::TestNNCOpInfoCPU::test_nnc_correctness_nn_functional_batch_norm_cpu_float16` `6065e7a97c` https://github.com/pytorch/pytorch/actions/runs/6178069462/job/16770849782 ([comment](https://github.com/pytorch/pytorch/pull/102070#issuecomment-1718402208))	2023-09-13 22:38:42 +00:00
CaoE	6065e7a97c	add Half support for BatchNorm on CPU (#102070 ) Fixes #106543 ### Testing Single core: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.7116 \| 0.1427 \| 0.1744 \| 0.2638 \| 0.2002 \| 0.2556 (1, 32, 100, 100) \| 0.8579 \| 0.1725 \| 0.2077 \| 0.3023 \| 0.2399 \| 0.2995 (32, 16, 200, 200) \| 57.3466 \| 12.2179 \| 13.1320 \| 45.9524 \| 24.1526 \| 24.9882 28 cores: shape \| fp32 forward / ms \| fp16 forward / ms \| bf16 forward / ms \| fp32 backward / ms \| fp16 backward / ms \| bf16 backward / ms -- \| -- \| -- \| -- \| -- \| -- \| -- (1, 4, 256, 256) \| 0.2571 \| 0.0713 \| 0.0846 \| 0.1140 \| 0.0883 \| 0.1043 (1, 32, 100, 100) \| 0.1077 \| 0.0510 \| 0.0548 \| 0.0700 \| 0.0645 \| 0.0713 (32, 16, 200, 200) \| 5.5060 \| 1.4195 \| 1.4663 \| 6.773 \| 3.0886 \| 3.1343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102070 Approved by: https://github.com/jgong5, https://github.com/mikaylagawarecki	2023-09-13 17:30:16 +00:00
Huamin Li	9fa5283401	[dynamo+aten] Enable embedding_bag_byte_unpack + meta kernel impl (#107937 ) Summary: ``` torch._dynamo.exc.Unsupported: unsupported operator: quantized.embedding_bag_byte_unpack.default ``` Differential Revision: D48652953 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107937 Approved by: https://github.com/houseroad	2023-08-26 08:52:42 +00:00
Jordan Fix	df16b1ed53	[dynamo+aten] Enable embedding_bag_byte_rowwise_offsets + meta kernel impl (#106105 ) Differential Revision: D47007550 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106105 Approved by: https://github.com/gmagogsfm	2023-08-21 16:33:42 +00:00
Sam Larsen	e165938853	Implement decomposition for aten.rrelu_with_noise (#106812 ) Test Plan: * Primarily, added new test in test/test_decomp.py * Updated existing tests, e.g., to NOT expect failure Pull Request resolved: https://github.com/pytorch/pytorch/pull/106812 Approved by: https://github.com/eellison	2023-08-11 19:18:29 +00:00
Nikita Karetnikov	12041d8e1f	Use default dispatch table for `tensordot.out` (#106669 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106669 Approved by: https://github.com/ezyang	2023-08-07 00:58:17 +00:00
Nikita Karetnikov	05e1a50723	[pt2] remove meta skips for `aminmax`, decomp exists (#106670 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106670 Approved by: https://github.com/ezyang	2023-08-07 00:55:25 +00:00
Nikita Karetnikov	19621a73c0	[pt2] add metas for `grid_sampler_3d` ops (#106261 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106261 Approved by: https://github.com/ezyang	2023-08-05 14:48:11 +00:00
Kshiteej K	a899333ffc	fix: nll_loss batch rule with negative ignore_idx (#106118 ) We use python decompositions instead of writing our own for batching rules. Fixes https://github.com/pytorch/pytorch/issues/105736 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106118 Approved by: https://github.com/lezcano, https://github.com/zou3519	2023-08-04 07:43:02 +00:00
Nikita Karetnikov	1f734e03df	[pt2] add metas for `mode` ops (#106273 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106273 Approved by: https://github.com/ezyang ghstack dependencies: #106272	2023-08-03 13:11:10 +00:00
Nikita Karetnikov	70469e6f04	[pt2] add metas for `median` ops (#106272 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106272 Approved by: https://github.com/ezyang	2023-08-03 13:11:10 +00:00
Nikita Karetnikov	f23d755e1f	[pt2] add meta for `ormqr` (#106278 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106278 Approved by: https://github.com/ezyang	2023-08-01 06:47:48 +00:00
Nikita Karetnikov	0ee3b84021	[pt2] add meta for `cholesky_inverse` (#106120 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106120 Approved by: https://github.com/ezyang	2023-07-29 17:16:20 +00:00
Nikita Karetnikov	80755884be	[pt2] add meta for `cholesky` (#106115 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106115 Approved by: https://github.com/Skylion007, https://github.com/ezyang	2023-07-29 17:16:20 +00:00
Nikita Karetnikov	a4cffaae67	[pt2] add metas for `_cholesky_solve_helper` and `cholesky_solve` (#105867 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105867 Approved by: https://github.com/ezyang	2023-07-25 20:21:47 +00:00
Justin Chu	73e1455327	[BE] Enable ruff's UP rules and autoformat test/ (#105434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105434 Approved by: https://github.com/albanD	2023-07-19 20:36:06 +00:00
Nikita Karetnikov	c00dd43e43	[pt2] add metas for `multilabel_margin_loss` ops (#104388 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104388 Approved by: https://github.com/ezyang	2023-07-05 13:42:22 +00:00
Nikita Karetnikov	a3aa4da154	[pt2] add metas for `multi_margin_loss` ops (#104236 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104236 Approved by: https://github.com/ezyang	2023-07-05 13:40:05 +00:00
Nikita Karetnikov	b1c31b1d26	[pt2] metas and `SymInt` support for `max_pool` ops (#103951 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103951 Approved by: https://github.com/Chillee, https://github.com/kulinseth	2023-07-01 01:33:35 +00:00
Nikita Karetnikov	c4a6f86062	[pt2] add metas for `max_unpool2d` and `max_unpool3d` (#103821 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103821 Approved by: https://github.com/Skylion007, https://github.com/Chillee	2023-07-01 01:33:35 +00:00
Nikita Karetnikov	e9705c52ac	[pt2] add metas for `_pdist_forward` and `_pdist_backward` (#103817 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103817 Approved by: https://github.com/ezyang	2023-06-22 11:18:05 +00:00
Aleksandar Samardžić	09fdea8564	Fix autograd issue with identity conversions (#92022 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/92022 Approved by: https://github.com/pearu, https://github.com/mtaaooby, https://github.com/amjames, https://github.com/cpuhrsch	2023-06-21 21:23:03 +00:00
Nikita Karetnikov	2b3d955ffd	[pt2] add meta and `SymInt` support for `linalg_matrix_exp` (#102945 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102945 Approved by: https://github.com/lezcano	2023-06-09 22:45:16 +00:00
Edward Z. Yang	96fd283640	Preserve CreationMeta when metafying views. (#103152 ) This helps us avoid erroring / generate more accurate error messages in Dynamo when doing mutations on views. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103152 Approved by: https://github.com/albanD	2023-06-09 12:34:54 +00:00
Nikita Karetnikov	757791d1e3	[pt2] add `SymInt` support for `linalg.vander` (#102469 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102469 Approved by: https://github.com/Skylion007, https://github.com/lezcano	2023-06-04 09:58:02 +00:00
PyTorch MergeBot	463df86ce8	Revert "[pt2] add `SymInt` support for `linalg.vander` (#102469 )" This reverts commit 05717895aaab826bfd0e59567729e0d979e27897. Reverted https://github.com/pytorch/pytorch/pull/102469 on behalf of https://github.com/clee2000 due to broke test_aotdispatch on linux ex `05717895aa` https://github.com/pytorch/pytorch/actions/runs/5125654882/jobs/9219389448, shows up as green on pr due to bug with keep-going flag and reruns ([comment](https://github.com/pytorch/pytorch/pull/102469#issuecomment-1569041604))	2023-05-30 20:24:26 +00:00
Nikita Karetnikov	05717895aa	[pt2] add `SymInt` support for `linalg.vander` (#102469 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102469 Approved by: https://github.com/Skylion007, https://github.com/lezcano	2023-05-30 19:50:16 +00:00
Nikita Karetnikov	995ac703cd	[pt2] add `SymInt` support for `linalg.pinv` (#102367 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102367 Approved by: https://github.com/lezcano	2023-05-27 11:10:47 +00:00
PyTorch MergeBot	da3aba1e46	Revert "[pt2] add `SymInt` support for `linalg.pinv` (#102367 )" This reverts commit 0d5b74da0cab798fbfdb9caa53fad816999c8386. Reverted https://github.com/pytorch/pytorch/pull/102367 on behalf of https://github.com/kit1980 due to Broke slow tests https://github.com/pytorch/pytorch/actions/runs/5095190248/jobs/9160028124 ([comment](https://github.com/pytorch/pytorch/pull/102367#issuecomment-1565104562))	2023-05-27 00:33:42 +00:00
Nikita Karetnikov	0d5b74da0c	[pt2] add `SymInt` support for `linalg.pinv` (#102367 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102367 Approved by: https://github.com/lezcano	2023-05-26 15:20:34 +00:00
Nikita Karetnikov	e79d9b9938	[pt2] add `SymInt` support for `linalg.matrix_power` (#101940 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101940 Approved by: https://github.com/lezcano, https://github.com/ezyang	2023-05-24 00:21:52 +00:00
Nikita Karetnikov	42b974e8f7	[pt2] add meta for `linalg_lu_solve` (#101836 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101836 Approved by: https://github.com/lezcano	2023-05-24 00:21:50 +00:00
drisspg	6f13d6892a	Add meta support for multinomial (#101324 ) # Summary Found this when trying to compile the text gen loop of nanogpt here: `b33289942b/torchbenchmark/models/nanogpt_generate/model.py (L322)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101324 Approved by: https://github.com/ngimel	2023-05-19 00:04:26 +00:00
Angela Yi	72a73ef67b	Add aten.searchsorted.Tensor meta kernel (#101637 ) Test Plan: CI Differential Revision: D45933187 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101637 Approved by: https://github.com/ezyang	2023-05-18 06:55:11 +00:00
kshitij12345	afea1a9fe9	[meta] error checking for inplace ops (#101532 ) Fixes #100753 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101532 Approved by: https://github.com/lezcano	2023-05-16 17:26:59 +00:00
Nikita Karetnikov	a8964d6377	[pt2] add meta and `SymInt` support for `linalg_householder_product` (#101315 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101315 Approved by: https://github.com/lezcano	2023-05-15 02:56:49 +00:00
Sun, Jiayi	d56e1b2f67	add Half support for unary ops on CPU (#98493 ) Add Half support for log_sigmoid and some unary ops on CPU, including sinc, acosh, asinh, atanh, digamma, trigamma, rsqrt, acos, asin, atan, ceil, cos, erf, erfc, erfinv, exp, expml, floor, log, log10, log1p, log2, i0, round, sin, sqrt, tan, tanh, trunc, lgamma. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98493 Approved by: https://github.com/jgong5, https://github.com/mingfeima, https://github.com/ngimel	2023-05-12 04:52:34 +00:00
Khushi	51fe53e619	[opinfo] item (#100313 ) Follows #100223 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100313 Approved by: https://github.com/ezyang	2023-05-10 11:32:45 +00:00
Nikita Karetnikov	1e591a8b64	[pt2] add meta function for `solve_triangular` (#100829 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100829 Approved by: https://github.com/ezyang	2023-05-08 13:48:15 +00:00
Nikita Karetnikov	e87ed2a88d	[primTorch] add ref for `polar` (#100345 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100345 Approved by: https://github.com/ezyang	2023-05-04 01:37:02 +00:00
Richard Zou	4135295a76	Excise yaml dependency in torchgen.model (#100203 ) The problem: - The new CustomOp API depends on torchgen.model - torchgen.model imports `yaml` - `yaml` is not a PyTorch runtime dependency To unblock myself, because I'm not sure how long it'll take to convince people yaml should be a PyTorch runtime dependency (unless one of you wants to approve #100166), this PR removes the yaml dependency from torchgen.model. It does so by splitting torchgen.utils (the offender) into torchgen.utils (no yaml) and torchgen.yaml (which uses yaml). Test Plan: - CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/100203 Approved by: https://github.com/ezyang, https://github.com/Skylion007	2023-04-28 13:45:39 +00:00
Jiong Gong	e5c9a0fcf5	[dynamo] avoid graph break on repeat_interleave.self_int (#99528 ) Address convit_base failure: https://github.com/pytorch/torchdynamo/issues/1886 mentioned in https://github.com/pytorch/pytorch/issues/93777 Also for models like EleutherAI/gpt-j-6B. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99528 Approved by: https://github.com/ezyang	2023-04-25 04:47:39 +00:00

1 2 3 4 5 ...

275 Commits