pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-11-11 22:34:53 +08:00

Author	SHA1	Message	Date
Edward Yang	73a97387c1	Replace AT_CHECK with TORCH_CHECK [shard 9/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20435 Reviewed By: jerryzh168 Differential Revision: D15318877 fbshipit-source-id: 4d83571187ea14a604fef83ac355d328b46d93e1	2019-05-15 08:05:59 -07:00
Edward Yang	365fc26571	Replace AT_CHECK with TORCH_CHECK [shard 8/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20434 Reviewed By: jerryzh168 Differential Revision: D15318396 fbshipit-source-id: dcd0f51be2d64b9440bb95ce8f40acb12545c2f4	2019-05-15 08:05:56 -07:00
Edward Yang	d1623f4cc9	Replace AT_CHECK with TORCH_CHECK [shard 3/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20428 Reviewed By: jerryzh168 Differential Revision: D15318209 fbshipit-source-id: e492aaa79146cfce9489bdb354cc539d7c4220a7	2019-05-15 07:40:50 -07:00
Edward Yang	9d09f5df6c	Replace AT_CHECK with TORCH_CHECK [shard 7/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20432 Reviewed By: jerryzh168 Differential Revision: D15318289 fbshipit-source-id: 6c443ac848fe28a1e3e8d7f33a12cd50f80b3e40	2019-05-15 07:40:47 -07:00
peter	101067703e	Fix strtod for MSVC (#20490 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/20408. Tested locally by Jonas1312. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20490 Differential Revision: D15353137 Pulled By: ezyang fbshipit-source-id: 0c0aefe54b11d50f703171700838af51f7666418	2019-05-15 07:40:44 -07:00
Edward Yang	97e1f07ffc	Replace AT_CHECK with TORCH_CHECK [shard 10/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20436 Reviewed By: jerryzh168 Differential Revision: D15318926 fbshipit-source-id: 71a43070cc50cc174f703ebc595f1d87c6fc1e91	2019-05-15 07:35:37 -07:00
Bram Wasti	8e26759f14	Back out "[pytorch][PR] Manually set _GLIBCXX_USE_CXX11_ABI in devtoolset7 binary builds" Summary: Original commit changeset: 571bba8a93ea Reviewed By: pjh5 Differential Revision: D15349783 fbshipit-source-id: 75c3e2b9b97e0ac0e8bcdef93e53b0d475c6fa38	2019-05-15 00:02:55 -07:00
Cheng Cheng	fd18b89c98	shape inference for learning rate op (#20020 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20020 Add shape inference for LearningRate op. The output (lr) should have similar shape with input (iteration), but not the same type (float vs int). Reviewed By: un-disclosed Differential Revision: D15112300 fbshipit-source-id: 09969aefa15172a6f3c70cd9b2548e3020da5d7a	2019-05-14 23:34:32 -07:00
Jiyan Yang	33f421027c	Allow recency weight pooling for fp16 (#20506 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20506 as titled Reviewed By: alex1o1o7cloud Differential Revision: D15342758 fbshipit-source-id: 89e7cb6d7b9511ef6c70611359736328571d7fc0	2019-05-14 20:13:38 -07:00
svcscm	ea13b53856	Updating submodules Reviewed By: cdelahousse fbshipit-source-id: 63e9b4a8cf5b15a6ba20d1946aac36c1604d8079	2019-05-14 19:02:43 -07:00
Kedar Pujara	254de9e8ec	Removing cyclic dependency (#20511 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20511 Removed cyclic dependency of caffe2/core/net.h and workspace.h Differential Revision: D15303412 fbshipit-source-id: 6e772e372cd0cf2af05d7815f1df8ae20bc2a65e	2019-05-14 18:55:19 -07:00
Sebastian Messmer	ace506fb38	Dict (#20372 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20372 Implement a Dict type that allows us to abstract away from the concrete implementation used. The API is similar to std::unordered_map, but behind the scenes we can switch to any map implementation we like. ska::flat_hash_map, google dense map, or any future map implementation with better performance. Switching such an implementation choice does not have to break backwards compatibility of kernel code using the Dict type. Reviewed By: zdevito Differential Revision: D15298234 fbshipit-source-id: b5ad368a9e9516030805cd8f5f1b02e3986933c0	2019-05-14 18:37:02 -07:00
Jerry Zhang	56fb5e03b5	refactor registerStoragePyTypeObject (#20467 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20467 for upcoming changes in Storage for QInt8 Reviewed By: ezyang Differential Revision: D15330865 fbshipit-source-id: 2840e59c0bf088983f792fd724de41b3bb3dec55	2019-05-14 18:22:33 -07:00
Jesse Hellemn	ea38fbfc5c	Manually set _GLIBCXX_USE_CXX11_ABI in devtoolset7 binary builds (#20243 ) Summary: Fix for https://github.com/pytorch/pytorch/issues/17492 Pull Request resolved: https://github.com/pytorch/pytorch/pull/20243 Differential Revision: D15348101 Pulled By: pjh5 fbshipit-source-id: 571bba8a93eaa9806db3f3d38697c26b5285da7a	2019-05-14 18:02:42 -07:00
Edward Yang	358fb51e77	Replace AT_CHECK with TORCH_CHECK [shard 6/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20430 Reviewed By: jerryzh168 Differential Revision: D15318250 fbshipit-source-id: eaee93447d757124a0c9fb5dcde503ae6a065912	2019-05-14 16:00:59 -07:00
Edward Yang	5b45355431	Replace AT_CHECK with TORCH_CHECK [shard 2/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20427 Reviewed By: jerryzh168 Differential Revision: D15318190 fbshipit-source-id: 15518a683d7b662ef00f255134aaf9dbd183f099	2019-05-14 16:00:56 -07:00
Edward Yang	71af7c46bb	Replace AT_CHECK with TORCH_CHECK [shard 4/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20429 Reviewed By: jerryzh168 Differential Revision: D15318222 fbshipit-source-id: daf693c34b4ee92e302eee679ed76a862715d1bb	2019-05-14 15:50:16 -07:00
Zachary DeVito	9610f150d7	stop build spew on development (#20508 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20508 ghimport-source-id: 26a16e2918fb93058c7740afb85070e0d29b4d1b Differential Revision: D15343207 Pulled By: zdevito fbshipit-source-id: b6d8858024cc440d59cf88d69e0fbc0e67dc85ce	2019-05-14 15:30:52 -07:00
Michael Suo	24cd0e08cf	identify important circleci builds (#20498 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20498 ghimport-source-id: b62b5bcf73ce87b1054cad053fd1cc118a586cf6 Differential Revision: D15342506 Pulled By: suo fbshipit-source-id: 9889103d23affe0d7eea0abfd801bae46d5238a2	2019-05-14 15:16:06 -07:00
Sebastian Messmer	9e7f22b223	Remove dependencies from Caffe2Go on PyTorch JIT (#20463 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20463 Source file changes mostly involve ifdef'ing-out references to JIT code from files that are part of Caffe2Go. Update Internal build scripts to remove those files from our globs. After this, changes to most of the JIT files should not trigger mobile CI. Reviewed By: dzhulgakov Differential Revision: D15329407 fbshipit-source-id: 48f614c6b028eef0a03ce5161d083a3e078b0412	2019-05-14 14:36:08 -07:00
Ivan Ogasawara	3479777519	UpSample GPU Porting (#19630 ) Summary: resolves #16158 Pull Request resolved: https://github.com/pytorch/pytorch/pull/19630 Differential Revision: D15335765 Pulled By: ezyang fbshipit-source-id: 03dd590c715a65c20ac99674a5d77179cd4a50fc	2019-05-14 11:58:21 -07:00
Cheng Cheng	7ffc37e022	Add ShapeInference for AtomicIter Op (#20021 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20021 Add shape inference for AtomicIter operator. The operator takes two blobs iteration and iter_mutex as input and outputs iteration, which should have the same type and shape as the input. Reviewed By: un-disclosed Differential Revision: D15111643 fbshipit-source-id: 0d06413305cc4c6257c0cfabf62fb874970803bc	2019-05-14 11:43:21 -07:00
Jason Lian	6e82b1c77d	Split nn.MultiHeadAttention into Module + functional (#20415 ) Summary: Moving functions from torch/nn/modules/activation.py to torch/nn/functional.py. For functions not implemented (_get_input_buffer and _set_input_buffer), a TODO is added. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20415 Differential Revision: D15318078 Pulled By: jamarshon fbshipit-source-id: 5ca698e2913821442cf8609cc61ac8190496a3c6	2019-05-14 08:41:28 -07:00
Sam Gross	b46a630836	Update Sleef to include fix for FMA4 detection (#20450 ) Summary: FMA4 support is in bit 16 of register ECX, not EDX of the "extended processor info" (0x80000001). Once we verify that this change fixes https://github.com/pytorch/pytorch/issues/12112, I'll make a PR for upstream Sleef. The mapping of registers to reg is: ``` reg[0] = eax reg[1] = ebx reg[2] = ecx <--- reg[3] = edx ``` Bit 16 of EDX is PAT (Page Attribute Table) on AMD CPUs, which is widely supported. Intel CPUs do not set this bit. This causes "Illegal instruction" errors on AMD CPUs that do not support FMA4. See https://github.com/pytorch/pytorch/issues/12112 See https://github.com/shibatch/sleef/issues/261 http://developer.amd.com/wordpress/media/2012/10/254811.pdf (Page 20) Pull Request resolved: https://github.com/pytorch/pytorch/pull/20450 Differential Revision: D15324405 Pulled By: colesbury fbshipit-source-id: 96fb344c646998ff5da19e4cdbf493f5a4e9892a	2019-05-14 08:33:18 -07:00
Jongsoo Park	101176870e	eliminate FE_INVALID exceptions related to fp16 conversion (#20390 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20390 duc0 Ngo implemented observing floating point exceptions but there were a couple of places where we have "benign" floating point exceptions leading to false positives. This diff eliminates one source of such false positives, namely using _mm256_cvtph_ps and _mm256_cvtps_ph for partially uninitialized array for the remainder loop. Reviewed By: hx89 Differential Revision: D15307358 fbshipit-source-id: 38f57dfdd90c70bc693292d2f9c33c7ba558e2c9	2019-05-13 23:42:01 -07:00
Du Tran	8e9692df27	codemode change missing [from D13586737] Summary: as title Reviewed By: jerryzh168 Differential Revision: D15327669 fbshipit-source-id: e262dacb097e91475b1925ec40b375ec6722ad5a	2019-05-13 20:44:04 -07:00
davidriazati	e8fb5f35f0	Bump torch proto version (#20444 ) Summary: Tagging along to changes in #20191 which added more support for types in the pickler Pull Request resolved: https://github.com/pytorch/pytorch/pull/20444 Pulled By: driazati Differential Revision: D15321463 fbshipit-source-id: 985061bf5070a7d7bad58ea8db11d531f3d13e74	2019-05-13 18:32:16 -07:00
Ansha Yu	a9aaf698a4	add c2 benchmark runs in cpp (#20108 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20108 Add cpp runs for c2, hooked up via pybinds. Print output to terminal. This is not hooked up with the pep output yet because I'd like to verify the numbers first. Note that this isn't quite the same mechanism as the pytorch cpp hookup, which uses cpp_python_extensions. If I can use the same mechanism to pull all the inputs for c2 through cpp and do FeedBlobs in cpp, then I'll switch to that. Reviewed By: zheng-xq Differential Revision: D15155976 fbshipit-source-id: 708079dacd3e19aacfe43d70c5e5bc54da2cf9e3	2019-05-13 17:01:08 -07:00
Wanchao Liang	d2da3ee601	temporarily disbale layernorm AD (#20442 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20442 ghimport-source-id: c246ade4ee9ee31b2e3413efff3ea6a246e1837e Differential Revision: D15321524 Pulled By: wanchaol fbshipit-source-id: 22c77d08c91af2d83dfd2c4a84cafc56e9240033	2019-05-13 16:35:50 -07:00
Edward Yang	f0829f37c8	Rename AT_ASSERT to TORCH_INTERNAL_ASSERT; other macro updates (#20321 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20321 First part of https://github.com/pytorch/pytorch/issues/20287 - Rename `AT_ASSERT` to `TORCH_INTERNAL_ASSERT` - Make `TORCH_INTERNAL_ASSERT` work with variadic inputs - Deprecated `AT_ASSERT` and `AT_ASSERTM` - Rename `AT_CHECK` to `TORCH_CHECK` - Make `TORCH_CHECK` give a better error message when no arguments are provided - Deprecate `AT_ERROR` in favor of `TORCH_CHECK(false, ...)` - Deprecate `AT_INDEX_ERROR` in favor of `TORCH_CHECK_INDEX(false, ...)` - Rename `AT_WARN` to `TORCH_WARN` No use sites are changed; I'll work on that in follow up patches (or disable the deprecation, if necessary.) Differential Revision: D15278439 fbshipit-source-id: 7e0ed489d4e89e5f56b8ad7eafa72cb9a06065ee	2019-05-13 16:16:42 -07:00
Will Feng	1364104054	Fix version counter sharing in set_data() (#20391 ) Summary: In https://github.com/pytorch/pytorch/pull/18223/files#diff-77a6f3462f2233b921d3042412fed6d3R178, we used `auto saved_version_ = data_.unsafeGetTensorImpl()->version_counter().current_version()` and then `new_data_impl_copy->set_version_counter(saved_version_)`, which actually doesn't preserve the original semantics that `var.set_data(tensor)` should keep `var`'s version counter object intact. This PR fixes the bug and adds test to make sure it doesn't happen again. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20391 Differential Revision: D15323430 Pulled By: yf225 fbshipit-source-id: e3ba49b51ec8ccecd51c80cb182387f74cfd2b2b	2019-05-13 16:03:42 -07:00
Will Feng	3a0b27b73d	Move at::NonVariableTypeMode to TensorImpl, and check it in is_variable() (#20392 ) Summary: As part of the Variable/Tensor merge, we allow passing Tensor with AutogradMeta into ATen ops, but we want to make sure they are not treated as Variables (i.e. their `is_variable()` is false). This PR makes the necessary change to make this work. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20392 Differential Revision: D15321899 Pulled By: yf225 fbshipit-source-id: c2ab09db73c63bd71ba2d8391095f4d6b4240a9a	2019-05-13 15:49:23 -07:00
Lu Fang	2dc9152dbe	Automatic update of fbcode/onnx to e08efaa35ed54362dfa283240506c003175889b7 (#20443 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20443 Previous import was 5bde6371620b76302864bce90f521d72eda95d0e Included changes: - [e08efaa3](https://github.com/onnx/onnx/commit/e08efaa3): Fix shape inference logic for TopK operator (#2005) <Hariharan Seshadri> - [d80ea947](https://github.com/onnx/onnx/commit/d80ea947): Nullary variadic (#1889) <G. Ramalingam> - [50dc186b](https://github.com/onnx/onnx/commit/50dc186b): Removed setting MD/MDd flags manually through cmake. The MTd/MT part is still necessary. Looks like CI fails without it. (#1995) <Alexander Yermolovich> - [e7f81c5e](https://github.com/onnx/onnx/commit/e7f81c5e): Move NonMaxSupression to object_detection folder (#2001) <Hector Li> - [86ab4517](https://github.com/onnx/onnx/commit/86ab4517): Prevent using invalid iterator, fix arithmetics. (#2004) <Dmitri Smirnov> Reviewed By: zrphercule Differential Revision: D15302141 fbshipit-source-id: 146c346c188934e5125371b261ecfde93b4aa166	2019-05-13 14:47:11 -07:00
Jesse Hellemn	824d4f9957	Needed fixes for binaries Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20385 Differential Revision: D15321396 Pulled By: pjh5 fbshipit-source-id: de7ca1ac928bdea3bcf6c78e84c7e9b786bcff52	2019-05-13 11:58:50 -07:00
Jiyan Yang	6c3b8a24ff	Make sure reducer=None is not used when fp16 embedding is enabled Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20349 Reviewed By: hyuen Differential Revision: D15291545 fbshipit-source-id: fa5fd0b97aeca6e5f45866908f3f205b701c931b	2019-05-13 11:53:14 -07:00
davidriazati	63c05bffcb	Fix lint Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20440 Pulled By: driazati Differential Revision: D15320614 fbshipit-source-id: dc650c478e39d0c3e6b660c2d9ef93b3479df1ac	2019-05-13 11:37:27 -07:00
Masaki Kozuki	7799ea5eb3	Port adaptive_avg_pool3d to ATen (#19898 ) Summary: Resolves #18065. Pull Request resolved: https://github.com/pytorch/pytorch/pull/19898 Differential Revision: D15240607 Pulled By: ezyang fbshipit-source-id: 00cf23ed20c1757d5eef71fd8c6a2f53d372e341	2019-05-13 11:29:22 -07:00
Syed Tousif Ahmed	5268b7dfaf	Remove support for CUDA 8 (#20298 ) Summary: 1.1.0 stopped support for CUDA 8 Pull Request resolved: https://github.com/pytorch/pytorch/pull/20298 Differential Revision: D15294639 Pulled By: ezyang fbshipit-source-id: b9411bfe456f93f1529b745dc83b7d6310df684d	2019-05-13 11:24:22 -07:00
sebftw	62957ab0a1	Tiny spelling mistake fix. (#20425 ) Summary: "then the output would also has k tensors" -> "then the output would also have k tensors" Pull Request resolved: https://github.com/pytorch/pytorch/pull/20425 Differential Revision: D15320152 Pulled By: zou3519 fbshipit-source-id: b04e2ccd29c6a3e33ad1040d0ea975a01a7bd9b5	2019-05-13 11:18:53 -07:00
Syed Tousif Ahmed	67414714e5	Move THCTensor_(uniform) to ATen (#20292 ) Summary: As a first step for this plan: https://github.com/pytorch/pytorch/issues/19508#issuecomment-485178192, this PR moves `THCTensor_(uniform)` to ATen. Major changes are: - `uniform_` cuda kernel now utilizes a philox generator. - the kernel also utilizes TensorIterator - the kernel uses a grid-stride loop to achieve peak effective bandwidth - Since the engine has changed from `curandStateMTGP32` to `curandStatePhilox4_32_10`, the randoms generated now will be different. - Here is the diff showing codegen changes: https://gist.github.com/syed-ahmed/4af9ae0d42b6c7dbaa13b9dd0d1dd1e8 (BC breaking change if any) - Philox4_32_10 is known to pass the standard TestU01 Big Crush test (https://www.thesalmons.org/john/random123/papers/random123sc11.pdf) and hence the quality of random numbers generated isn't an issue when compared to the previously used `curandStateMTGP32`. - I have added a test case in `aten/src/ATen/test/cuda_distributions_test.cu` which verifies that philox offset is incremented properly The benchmark was done on a DGX station with 4 V100s. I modified the script from jcjohnson 's [multinomial benchmark](https://github.com/jcjohnson/pytorch-multinomial-benchmark) to produce this notebook which shows that there is a general speedup with this PR and a regression hasn't been introduced: https://gist.github.com/syed-ahmed/9d26d4e96308aed274d0f2c7be5218ef To reproduce the notebook: - Run https://gist.github.com/syed-ahmed/4208c22c541f1d30ad6a9b1efc1d728f in a container with the current pytorch top of tree with the command: `python uniform_benchmark.py --stats_json before.json` - Apply this diff to the current pytorch top of tree and run the same script in a container with the command: `python uniform_benchmark.py --stats_json after.json` - Run the notebook attached above with the `after.json` and `before.json` in the same directory The effected bandwidth was calculated using the script (thanks to ngimel ): https://gist.github.com/syed-ahmed/f8b7384d642f4bce484228b508b4bc68 Following are the numbers before and after. ``` uniform, size, elements 65536 forward 5.168914794921875e-06 bandwidth (GB/s) 50.71548098597786 uniform, size, elements 131072 forward 5.056858062744141e-06 bandwidth (GB/s) 103.67860705101367 uniform, size, elements 262144 forward 7.164478302001953e-06 bandwidth (GB/s) 146.357621001797 uniform, size, elements 524288 forward 1.1217594146728515e-05 bandwidth (GB/s) 186.9520302275877 uniform, size, elements 1048576 forward 1.923084259033203e-05 bandwidth (GB/s) 218.10297600317384 uniform, size, elements 2097152 forward 3.640890121459961e-05 bandwidth (GB/s) 230.39992200138826 uniform, size, elements 4194304 forward 6.778717041015625e-05 bandwidth (GB/s) 247.49839679819922 uniform, size, elements 8388608 forward 0.00012810707092285157 bandwidth (GB/s) 261.92490202361347 uniform, size, elements 16777216 forward 0.00025241613388061524 bandwidth (GB/s) 265.86598474620627 uniform, size, elements 33554432 forward 0.000497891902923584 bandwidth (GB/s) 269.5720239913193 ``` ``` uniform, size, elements 65536 forward 5.550384521484375e-06 bandwidth (GB/s) 47.22988091821306 uniform, size, elements 131072 forward 5.581378936767578e-06 bandwidth (GB/s) 93.93520954942333 uniform, size, elements 262144 forward 6.165504455566406e-06 bandwidth (GB/s) 170.071404141686 uniform, size, elements 524288 forward 6.3276290893554685e-06 bandwidth (GB/s) 331.4277702414469 uniform, size, elements 1048576 forward 8.509159088134765e-06 bandwidth (GB/s) 492.91639239047356 uniform, size, elements 2097152 forward 1.2989044189453124e-05 bandwidth (GB/s) 645.8218077979443 uniform, size, elements 4194304 forward 2.347707748413086e-05 bandwidth (GB/s) 714.6211452997259 uniform, size, elements 8388608 forward 4.4286251068115234e-05 bandwidth (GB/s) 757.6715389250498 uniform, size, elements 16777216 forward 8.672237396240235e-05 bandwidth (GB/s) 773.8356427961071 uniform, size, elements 33554432 forward 0.00016920566558837892 bandwidth (GB/s) 793.2224227438523 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/20292 Differential Revision: D15277761 Pulled By: ezyang fbshipit-source-id: 8bfe31a01eeed77f0ed6e7ec4d2dda4c6472ecaa	2019-05-13 09:38:28 -07:00
Yauheni Koran	5f7ef09f57	math module support: gcd, copysign, erf, erfc, expm1, fabs, gamma, lgamma (#19707 ) Summary: eellison driazati Refer to issue #19026 Pull Request resolved: https://github.com/pytorch/pytorch/pull/19707 Differential Revision: D15302632 Pulled By: eellison fbshipit-source-id: 68ff13b478b93cc33703ef3276b5fa727c8ff31a	2019-05-13 08:55:23 -07:00
Guanheng Zhang	41673d477c	Disable incremental_state function in MultiheadAttention module. (#20177 ) Summary: To fully support incremental_state function, it requires several additional utils available in fairseq. However, we lack a problem for the unit test. Therefore, the incremental_state function will be disable for now. If it is needed in the future, a feature request could be created. Fixed #20132 Add some unit tests to cover the arguments of MultiheadAttention module, including bias, add_bias_kv, add_zero_attn, key_padding_mask, need_weights, attn_mask. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20177 Differential Revision: D15304575 Pulled By: cpuhrsch fbshipit-source-id: ebd8cc0f11a4da0c0998bf0c7e4e341585e5685a	2019-05-13 08:21:15 -07:00
Clément Pinard	f8aa6a8f44	Make a deep copy of extra_compile_flag dictionnary (#20221 ) Summary: See issue #20169 Pull Request resolved: https://github.com/pytorch/pytorch/pull/20221 Differential Revision: D15317126 Pulled By: ezyang fbshipit-source-id: 0a12932db4f6ba15ea1d558fa329ce23fe2baef6	2019-05-13 08:11:39 -07:00
peter	30bdb8c0d7	Hotfix for caffe2 windows build (#20417 ) Summary: We don't need to overlay vc env when not using ninja. CMake will deal with it automatically. Overlaying is a no-op when the env is the same with the generator specified but will generate the error "Cannot find CMAKE_CXX_COMPILER" when they are different. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20417 Differential Revision: D15317081 Pulled By: ezyang fbshipit-source-id: 5d9100321ecd593e810c31158f22c67d3e34973b	2019-05-13 08:03:45 -07:00
Tongzhou Wang	f496ea36b2	DataLoader: add error detection for worker_init_fn (#20150 ) Summary: This is an attempt to isolate unrelated changes from #19228 for easier review. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20150 Differential Revision: D15314891 Pulled By: ezyang fbshipit-source-id: 8c429747ba83ad5aca4cdd8f8086bcf65a326921	2019-05-12 18:28:56 -07:00
Roy Li	163f0e182c	Fix bug in non_blocking copy (#20305 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20305 ghimport-source-id: eb3dacb10fd93bbb5a6bbe078ed1ec842163d0e6 Differential Revision: D15276094 Pulled By: li-roy fbshipit-source-id: 4728f419aa050e6c94a4f62231fa1a86caa556a7	2019-05-11 15:20:19 -07:00
Nishant Pandit	6a8f55796a	Add quant-dequant nodes for weights Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20041 Differential Revision: D15178086 fbshipit-source-id: 8cb060d72b68e44bf042338924f203ae62d74f6a	2019-05-11 14:03:10 -07:00
Nikolay Korovaiko	9499c7b7ee	Profiling GraphExecutor Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19994 Differential Revision: D15307752 Pulled By: Krovatkin fbshipit-source-id: 7b35191042199ef16823487e15fe639968cbdc89	2019-05-10 23:05:47 -07:00
Lara Haidar	f4d9bfaa4d	Support Exports to Multiple ONNX Opset (#19294 ) Summary: Support exporting multiple ONNX opsets (more specifically opset 10 for now), following the proposal in https://gist.github.com/spandantiwari/99700e60919c43bd167838038d20f353. And add support for custom ops (merge with https://github.com/pytorch/pytorch/pull/18297). This PR will be followed by another PR containing the changes related to testing the ops for different opsets. Pull Request resolved: https://github.com/pytorch/pytorch/pull/19294 Reviewed By: zrphercule Differential Revision: D15043951 Pulled By: houseroad fbshipit-source-id: d336fc35b8827145639137bc348ae07e3c14bb1c	2019-05-10 18:37:12 -07:00
Xue Feng	1129b3344a	move DistillBatchLRLoss Layer from open source to fb Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20291 Reviewed By: chocjy Differential Revision: D15272181 fbshipit-source-id: 2e0964fa1b1031607134548bb87c4e103c5b1383	2019-05-10 17:46:04 -07:00

1 2 3 4 5 ...

17935 Commits