pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-11-01 13:34:57 +08:00

Author	SHA1	Message	Date
Can Balioglu	80b19c4c8c	Enable Python bindings for UntypedStorage (#68945 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68945 This PR enables the Python conversion functions for `Storage` (specifically `UntypedStorage`) and also cleans up some remnants of the deprecated typed storages from `DynamicTypes.cpp`. ghstack-source-id: 147245110 Test Plan: Run the existing unit and integration tests. Reviewed By: albanD Differential Revision: D32676505 fbshipit-source-id: 3a3f6db4fb0da5c78dd406c96ab70bdc37015521 (cherry picked from commit d6427b94cf88b078bd228d43cd2afbabf0773b39)	2022-01-20 02:11:34 +00:00
Peter Bell	b2e79ed5ec	Remove WindowsTorchApiMacro.h in favor of Export.h (#69585 ) Summary: Follow up to https://github.com/pytorch/pytorch/issues/68095 This also changes the files from the ATen folder to include c10's `Export.h` instead since they can't ever be exporting `TORCH_PYTHON_API`. cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse SciPioneer H-Huang Pull Request resolved: https://github.com/pytorch/pytorch/pull/69585 Reviewed By: mrshenli Differential Revision: D32958594 Pulled By: albanD fbshipit-source-id: 1ec7ef63764573fa2b486928955e3a1172150061	2021-12-09 17:30:09 -08:00
Scott Wolchok	82f7f8d471	[PyTorch] Adopt IValue::toTupleRef() where obvious (#65505 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65505 Generated with `fastmod -m 'toTuple(\s)->' 'toTupleRef()${1}.'` , followed by `fastmod '(std::move$.)toTupleRef\($.' '${1}toTuple()->'` to unbreak 2 callsites. ghstack-source-id: 142065835 Test Plan: CI Reviewed By: gchanan Differential Revision: D31131025 fbshipit-source-id: 54457ae5bbeb38db9c7f196d469b98521c3d3f34	2021-11-02 10:22:18 -07:00
Zhengxu Chen	f20614af21	[jit] Allow custom class functions to be traced in invokeScriptMethodFromPython(). (#67380 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67380 Test Plan: eyes Reviewed By: tugsbayasgalan Differential Revision: D31975656 fbshipit-source-id: 47c8c9854899e9fed5a635f88470711dc4c95970	2021-10-27 16:38:50 -07:00
Zhengxu Chen	b55a2500d2	[jit] Remove graph() call from abstract Function interface. (#65967 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65967 Graph is an implementation detail. If user wants to get access to the underlying graph, they should be able to explicitly dynamic cast instead. ghstack-source-id: 141659819 Test Plan: no behavior change. Reviewed By: gmagogsfm Differential Revision: D31326153 fbshipit-source-id: a0e984f57c6013494b92a7095bf5bb660035eb84	2021-10-27 11:54:26 -07:00
Scott Wolchok	e88d1c4f10	[PyTorch] Add tuple inline storage (#64066 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64066 I noticed a bunch of time being spent heap-allocating Tuples in the unpickler. 1-, 2-, and 3-element Tuples are apparently common enough that they get their own bytecode instructions, so I decided to try also giving them their own representation. We store up to 3 IValues inline in `Tuple` rather than doing a second heap allocation for a `std::vector<IValue>`. ghstack-source-id: 140695395 Test Plan: Added automated tests for TupleElements. Pixel 3 before: https://www.internalfb.com/intern/aibench/details/761596366576284 Pixel 3 after: https://www.internalfb.com/intern/aibench/details/591414145082422 We went from 347 ms to 302 ms. Reviewed By: dhruvbird Differential Revision: D30592622 fbshipit-source-id: 93625c54c9dca5f765ef6d5c191944179cb281a8	2021-10-15 12:16:51 -07:00
Scott Wolchok	2d885ab73d	[jit] Reduce refcounting of Types (#65345 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65345 FooType::get() can return a const reference. Inconveniently, converting shared_ptr<FooType> to shared_ptr<Type> requires a copy & refcount bump, so to properly take advantage of this in unshapedType() we need to take a const Type& in isSubtypeOf(), which is good practice anyway -- don't require a shared_ptr if you don't need to take ownership. ghstack-source-id: 140044165 Test Plan: CI perf says c10::unshapedType time decreased from 2.8% to 2.2% during static runtime startup, though I expect this to be generally beneficial. Reviewed By: hlu1 Differential Revision: D31027361 fbshipit-source-id: 676feb81db9f74ad7b8651d8774f4ecb4cfa6ab8	2021-10-08 09:03:04 -07:00
Zhengxu Chen	ac99d63f83	[jit] Make operation call accept Stack& instead Stack* (#63414 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63414 Misuse of raw pointer in here where stack is never nullable. ghstack-source-id: 136938318 Test Plan: compiles. Imported from OSS Reviewed By: ejguan Differential Revision: D30375410 fbshipit-source-id: 9d65b620bb76d90d886c800f54308520095d58ee	2021-08-30 11:49:20 -07:00
Jiewen Tan	04caef8e1d	Improve IMethod::getArgumentNames to deal with empty argument names list (#62947 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62947 This diff improved IMethod::getArgumentNames to deal with empty argument names list. Test Plan: buck test mode/dev //caffe2/caffe2/fb/predictor:pytorch_predictor_test -- PyTorchDeployPredictor.GetEmptyArgumentNamesValidationMode buck test mode/dev //caffe2/caffe2/fb/predictor:pytorch_predictor_test -- PyTorchDeployPredictor.GetEmptyArgumentNamesRealMode Reviewed By: wconstab Differential Revision: D30179974 fbshipit-source-id: c7aec35c360a73318867c5b77ebfec3affee47e3	2021-08-11 16:44:00 -07:00
Edward Yang	cdf702b60c	Reject kwonly arguments passed positionally in torch.ops (#62981 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62981 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: Chillee Differential Revision: D30211030 Pulled By: ezyang fbshipit-source-id: aae426592e92bf3a50076f470e153a4ae7d6f101	2021-08-10 07:16:00 -07:00
Natalia Gimelshein	e3944ab00e	Revert D30038175: Improve IMethod::getArgumentNames to deal with empty argument names list Test Plan: revert-hammer Differential Revision: D30038175 (`64b3ab6407`) Original commit changeset: 46f08dda9418 fbshipit-source-id: 604735d2300487a0b75890b330d7ba5b3e7145b2	2021-08-06 14:58:43 -07:00
Jiewen Tan	64b3ab6407	Improve IMethod::getArgumentNames to deal with empty argument names list (#62782 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62782 This diff improved IMethod::getArgumentNames to deal with empty argument names list. Test Plan: buck test mode/dev caffe2/caffe2/fb/predictor:pytorch_predictor_test -- PyTorchDeployPredictor.GetEmptyArgumentNamesValidationMode buck test mode/dev caffe2/caffe2/fb/predictor:pytorch_predictor_test -- PyTorchDeployPredictor.GetEmptyArgumentNamesRealMode Reviewed By: wconstab Differential Revision: D30038175 fbshipit-source-id: 46f08dda94187160b4d6ee87600d1b46fe934222	2021-08-05 01:32:00 -07:00
Richard Barnes	9e77113e85	irange-ify 11 (#62121 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62121 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D29879701 fbshipit-source-id: 5c51879c88fa6a5790db241c8b33ec0dc4b177ca	2021-07-28 13:32:09 -07:00
Meghan Lele	4a2e8b53bb	[JIT] Add `torch._C.ScriptList`` (#52832 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52832 Summary This commit adds `torch._C.ScriptList`, a list type that has reference semantics across the Python/TorchScript boundary. That is, modifications made in TorchScript to instances of `torch._C.ScriptList` are visible in Python even when it is not returned from the function. `torch._C.ScriptList` is implemented using a modified version of pybind's `stl_bind.h`-style bindings attached to `ScriptList` and `ScriptListIterator`, wrapper classes around `c10::impl::GenericList` and `c10::impl::GenericList::iterator`. These bindings allow instances of `torch._C.ScriptList` to be used as if it were a regular `list` in Python. Reference semantics are achieved by simply retrieving the `IValue` contained in `ScriptList` in `toIValue` (invoked when converting Python arguments to `IValues` before calling TorchScript code). Test Plan This commit adds `TestScriptList` to `test_list_dict.py`, a set of tests that check that all of the common list operations are supported and that instances have reference semantics across the Python/TorchScript boundary. Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D29478121 Pulled By: SplitInfinity fbshipit-source-id: 652cc25cfa37debe28db9527504846f22abd8b54	2021-07-01 20:28:13 -07:00
Ansley Ussery	0fbc471d10	Support default values on NamedTuple fields (#54682 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54682 Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D27327241 Pulled By: ansley fbshipit-source-id: 76546f1770d50ebc3435bba3b74540e3c6be8a1c	2021-06-26 15:18:21 -07:00
Richard Barnes	b162d95e46	Fix a number of lint perf and safety issues in torch (#59897 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59897 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D29037012 fbshipit-source-id: 7c16286d5fc2b67964fb65f8374dfff4d1a7aefb	2021-06-15 13:14:51 -07:00
Meghan Lele	b14c3205fd	[JIT] Add torch._C.ScriptDict (#52659 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52659 Summary This commit adds `torch._C.ScriptDict`, a dictionary type that has reference semantics across the Python/TorchScript boundary. That is, modifications made to instances of `torch._C.ScriptDict` in TorchScript are visible in Python even when it is not returned from the function. Instances can be constructed by passing an instance of a Python dictionary to `torch.jit.script`. In the case of an empty dictionary, its type is assumed to be `Dict[str, Tensor]` to be consistent with the handling of empty dictionaries in TorchScript source code. `torch._C.ScriptDict` is implemented using a modified version of pybind's `stl_bind.h`-style bindings attached to `ScriptDict`, `ScriptDictIterator` and `ScriptDictKeyIterator`, wrapper classes around `c10::impl::GenericDict` and `c10::impl::GenericDict::iterator`. These bindings allow instances of `torch._C.ScriptDict` to be used as if it were a regular `dict` Python. Reference semantics are achieved by simply retrieving the `IValue` contained in `ScriptDict` in `toIValue` (invoked when converting Python arguments to `IValues` before calling TorchScript code). Test Plan This commit adds `TestScriptDict` to `test_list_dict.py`, a set of tests that check that all of the common dictionary operations are supported and that instances have reference semantics across the Python/TorchScript boundary. Differential Revision: D27211605 D27211605 Test Plan: Imported from OSS Reviewed By: gmagogsfm Pulled By: SplitInfinity fbshipit-source-id: 446d4e5328375791aa73eb9e8b04dfe3465af960	2021-05-27 10:25:30 -07:00
Luca Wehrstedt	36e47af58b	Pass reference to parent future in callbacks (#57635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57635 Note: this PR looks massive, but it's just one simple change, codemodded many times. In many cases, a callback needs to access the value/error produced by the parent future. In Python this was easy because the callback was invoked with the parent future as argument, and could thus inspect it. In C++ the callbacks didn't take any arguments, thus in many cases we worked around this by capturing the future in its own callback. This is risky (leads to reference cycle and thus memory leak) and must be done carefully (spoiler: sometimes we weren't). ghstack-source-id: 128296580 Test Plan: CI Reviewed By: wanchaol Differential Revision: D28178783 fbshipit-source-id: 6de02c4568be42123372edc008f630d5ddae0081	2021-05-07 03:59:18 -07:00
Yi Huang (Symphony)	ba78bf1363	[standaloneRunner] fix another GIL mutithreading issue exposed by torch::jit::toIValue() (#57688 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57688 P412982836 says that `torch::jit::toIValue()` will also touch GIL through `torch::jit::createGenericDict()` (P412848640) So we have to move `torch::jit::toIValue()` out of multithreading execution Reviewed By: hyuen Differential Revision: D28236527 fbshipit-source-id: 43a33dbcfc828cc42c5e1230c8f5cb415bf7bde4	2021-05-05 21:41:04 -07:00
Scott Wolchok	b87d3fa432	[PyTorch][jit] Don't allow create() on singleton types (#56807 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56807 If I understand correctly, there's no reason to create your own instance of these global singleton types. ghstack-source-id: 127312270 Test Plan: CI Reviewed By: SplitInfinity Differential Revision: D27973447 fbshipit-source-id: f12df69d185f1baaa45f2ac6eac70570a7a65912	2021-04-30 10:28:50 -07:00
James Reed	71a5314591	Fix ScriptMethod dispatch on __torch_function__ (#56103 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56103 Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D27784142 Pulled By: jamesr66a fbshipit-source-id: 555dcb7c3a98b8fb9e9ca9b499cafad54e819aa7	2021-04-15 08:46:43 -07:00
Nikita Shulga	6a39613f35	[BE] Make torch/csrc/jit/tensorexpr/ clang-tidy clean (#55628 ) Summary: Mostly auto-generated changes using ``` python3 tools/clang_tidy.py -c build -x torch/csrc/jit/tensorexpr/eval.cpp -s ``` With following common patterns manually fixed - Use ` = default` instead of `{}` - deleted methods should be public - Use pass-by-value + std::move instead of pass-by-reference+copy Pull Request resolved: https://github.com/pytorch/pytorch/pull/55628 Reviewed By: walterddr Differential Revision: D27655378 Pulled By: malfet fbshipit-source-id: 92be87a08113435d820711103ea9b0364182c71a	2021-04-08 19:44:14 -07:00
Meghan Lele	6866c033d5	[JIT] Add recursive scripting for class type module attributes (#55124 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55124 Summary This commit modifies type inference (used by the module scripting code) so that it tries to script the type of any class instances that it encounters. This enables recursive, automatic scripting of class type module attributes. Test Plan This commit adds a test case for this to `TestClassType`. Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D23971883 Pulled By: SplitInfinity fbshipit-source-id: 7a5a2e7c12ee68cbdeb0a07e6aaf98734a79cb06	2021-04-02 12:16:21 -07:00
Rohan Varma	a37fbf9b45	[Futures] Bump log verbosity when ignoring cb errors in python future. (#54476 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54476 Per title. For `add_done_callback`, we log but swallow exceptions in order to keep consistent with what concurrent.futures python library does, see discussion in https://github.com/pytorch/pytorch/pull/45675. Although, it would be good to improve the verbosity here as this can be a source of confusion if users are setting a different future via `add_done_callback`, and an error is hit resulting in an unexpected hang (see https://github.com/pytorch/pytorch/issues/52132 for more details on how this can happen). ghstack-source-id: 125300389 Test Plan: CI Reviewed By: lw Differential Revision: D27253004 fbshipit-source-id: 72ed21c8fb6d27de5797c17fc46b762f893e6fea	2021-03-31 15:17:06 -07:00
Michael Suo	8a170fbacd	[package] fix mangling issues with TorchScript (#54915 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54915 TorchScript and torch.package have different mangling schemes. To avoid them interfering with each other, we should undo the torch.package mangling before processing anything with TorchScript (since TS independently makes sure that no names collide). Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D27410472 Pulled By: suo fbshipit-source-id: d1cc013c532d9abb7fb9615122bc465ded4785bb	2021-03-31 00:58:05 -07:00
Christian Puhrsch	2668149b8c	Export torch::jit::toIValue (#54449 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54448 Pull Request resolved: https://github.com/pytorch/pytorch/pull/54449 Reviewed By: SplitInfinity Differential Revision: D27243154 Pulled By: cpuhrsch fbshipit-source-id: fc21d6ce251b868356ad8ea13ae891fb56e311ce	2021-03-22 17:17:18 -07:00
James Reed	1fe6a6507e	[WIP][FX] Fix tracing support for torchbind (#52884 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52884 Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D26675801 Pulled By: jamesr66a fbshipit-source-id: 8e5100bcea17589a53163abf6ab991658e11fa3a	2021-03-05 23:40:16 -08:00
Rohan Varma	c941730b96	[JIT/Futures] support set_exception api (#50983 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50983 There is currently no way to handle/propagate errors with the python-based futures API (they are raised correctly if set with an error, but this is only possible from C++). This diff allows the Future's `unwrap_func` to be set in python optionally, so users can set futures completed with an exception and the error will throw as expected. This is mostly to support the following use case in the next diff: ``` ret_fut = torch.futures.Future(unwrap_func = lambda python_result: { # throw exception if needed if isinstance(python_result, Exception): throw python_result }) rpc_fut = rpc.rpc_async(...) # RPC future that times out # Goal is to propagate RPC error to this future rpc_fut.add_done_callback( res => { # Note that ret_fut.set_result(res.wait()) won't propagate the error try: ret_fut.set_result(res.wait()) except Exception as e: ret_fut.set_result(e) } ) ``` ghstack-source-id: 121021434 Test Plan: unittest ``` buck test mode/dev-nosan mode/no-gpu //caffe2/test:futures -- te st_unwrap --print-passing-details ``` Reviewed By: mrshenli Differential Revision: D25950304 fbshipit-source-id: 7ee61e98fcd783b3f515706fa141d538e6d2174d	2021-02-04 20:22:19 -08:00
Thomas Viehmann	86861095fa	Graceful invalidation of Python Node/Value/Block when C++ object is deleted (#50326 ) Summary: Previously we might have gotten segfaults and all, now it raises an exception. Thread safety hasn't been an objective. I have a followup to expand the Python interface for the API. Fixes https://github.com/pytorch/pytorch/issues/49969. wanchaol Pull Request resolved: https://github.com/pytorch/pytorch/pull/50326 Reviewed By: pbelevich Differential Revision: D26096234 Pulled By: gmagogsfm fbshipit-source-id: 5425772002eb4deb3830ed51eaa3964f22505840	2021-02-04 01:34:46 -08:00
anjali411	18a7ec7d7d	Update the JIT complex type name to be consistent with Python (#51476 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51476 Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D26179237 Pulled By: anjali411 fbshipit-source-id: 6a5c60c8545eb42416583836b8038ceffd3f3244	2021-02-03 09:59:08 -08:00
anjali411	f9f22c8b5c	Add serialization logic for complex numbers (#51287 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51287 This reverts commit dfdb1547b9c1934904bfd137b4007d6a46a6f597. Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D26131165 Pulled By: anjali411 fbshipit-source-id: 047167fac594ddb670c5e169446e90e74991679a	2021-01-28 17:25:35 -08:00
Meghan Lele	88baf470d1	[JIT] Provide more info when attribute fails to convert (#50870 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50870 Summary Module attributes whose types cannot be determined based on annotations or inference based on their values at script time are added to the concrete type of the corresponding module as "failed attributes". Any attempt to access them in scripted code produces an error with a message explaining that the attribute could not be contributed to a corresponding attribute on the TorchScript module. However, this error is not more specific than that. This commit modifies `infer_type` in `_recursive.py` so that it returns `c10::InferredType` instead, which allows more information about typing failures to be communicated to the caller through the `reason()` method on this class. This information is appended to the hint added to the module concrete type for failed attributes. Testing This commit adds a unit test to `test_module_containers.py` that checks that extra information is provided about the reason for the failure when a module attribute consisting of a list of `torch.nn.Module` fails to convert. Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D26091472 Pulled By: SplitInfinity fbshipit-source-id: fcad6588b937520f250587f3d9e005662eb9af0d	2021-01-27 20:37:10 -08:00
Mike Ruberry	dfdb1547b9	Revert D26094906: Add serialization logic for complex numbers Test Plan: revert-hammer Differential Revision: D26094906 (`2de4ecd4eb`) Original commit changeset: 7b2614f3ee4a fbshipit-source-id: 6f32a9fc6bb2a904ca1a282bbc6b2df0aee50068	2021-01-27 19:44:26 -08:00
anjali411	2de4ecd4eb	Add serialization logic for complex numbers (#50885 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50885 Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D26094906 Pulled By: anjali411 fbshipit-source-id: 7b2614f3ee4a30c4b4cf04aaa3432988b38a0721	2021-01-27 15:19:36 -08:00
Anjali Chourdia	b6eaca9f1f	Add type annotation logic for complex numbers (#50884 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50884 Test Plan: Imported from OSS Reviewed By: heitorschueroff Differential Revision: D26086963 fbshipit-source-id: f103f7f529d63d701c4f17862e30eafbab7d0c68	2021-01-26 19:39:35 -08:00
Shen Li	c480eebf95	Completely remove FutureMessage type (#50029 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50029 Test Plan: buck run mode/opt -c=python.package_style=inplace //caffe2/torch/fb/training_toolkit/examples:ctr_mbl_feed_april_2020 -- local-preset --flow-entitlement pytorch_ftw_gpu --secure-group oncall_pytorch_distributed Before: ``` ... I0107 11:03:10.434000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|total_examples 14000.0 I0107 11:03:10.434000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|window_qps 74.60101318359375 I0107 11:03:10.434000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|lifetime_qps 74.60101318359375 ... I0107 11:05:12.132000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|total_examples 20000.0 I0107 11:05:12.132000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|window_qps 64.0 I0107 11:05:12.132000 3831111 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|lifetime_qps 64.64917755126953 ... ``` After: ``` ... I0107 11:53:03.858000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|total_examples 14000.0 I0107 11:53:03.858000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|window_qps 72.56404876708984 I0107 11:53:03.858000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|lifetime_qps 72.56404876708984 ... I0107 11:54:24.612000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|total_examples 20000.0 I0107 11:54:24.612000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|window_qps 73.07617950439453 I0107 11:54:24.612000 53693 print_publisher.py:23 master ] Publishing batch metrics: qps-qps\|lifetime_qps 73.07617950439453 ... ``` Reviewed By: lw Differential Revision: D25774915 Pulled By: mrshenli fbshipit-source-id: 1128c3c2df9d76e36beaf171557da86e82043eb9	2021-01-07 19:50:57 -08:00
Luca Wehrstedt	1ac05cfe01	Remove DataPtr extractor from CUDAFuture (#48840 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48840 The CUDAFuture class needs to inspect the values it contains in order to extract its tensors (in fact, the DataPtrs backing those). These are needed first to determine what CUDA devices back those tensors, so that an event for each such device can be recorded; and later to record these DataPtrs with the CUDA caching allocator if they are used in other streams. This became complicated when Python was added to the mix, because to inspect a Python object we need to acquire the GIL, but we couldn't do so from code that was supposed to also work in C++-only mode. The solution was for users to provide a custom way to extract DataPtrs, so that the PythonFutureWrapper could install such a custom Python-aware one. This was the DataPtr extractor. In https://github.com/pytorch/pytorch/pull/48502 a different suggestion was proposed. At its root, it consists in adding support for IValues of type PyObject to the visit() and getSubValues() methods. In order to deal with the GIL, we do this through a virtual method: PyObjectHolder, which is the base class, is available also in C++-only mode, and thus defines this method but leaves it unimplemented; ConcretePyObjectHolder, which is the subclass, is only included in Python mode, and thus it can implement that method, acquire the GIL, and do what it's supposed to. In my opinion, this approach is just brilliant! Thank wanchaol for proposing it! It hides the complexity of dealing with Python inside getSubValues(), where it can be done properly, thus simplifying enormously the CUDAFuture and the PythonFutureWrapper classes. ghstack-source-id: 118704935 Test Plan: Unit tests Reviewed By: wanchaol Differential Revision: D25334355 fbshipit-source-id: 3f1d3bf6e6e8505a114c877fb9a6fcc3f68d91d3	2020-12-19 11:03:45 -08:00
Rohan Varma	a727bf2851	Refactor RPC matchBuiltInOp to get rid of exception swallowing (#49009 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49009 As per the title, we should generally not have exception swalling and this commit makes it so that if there is a true error in JIT operator resolution, it is propagated back to the RPC callee and we don't silently swallow any other exceptions that may happen. Swallowing the exceptions previously resulted in hard to debug issues such as unexpected ops showing up in profiler, and flaky tests which were fixed by https://github.com/pytorch/pytorch/pull/41287 Added a unittest that validates the error that comes from `jit/pybind_utils.h`. ghstack-source-id: 118794661 Test Plan: CI Reviewed By: mrshenli Differential Revision: D25392905 fbshipit-source-id: 6f93251635740bcf902824548b2bc6f9249be5f0	2020-12-17 11:37:21 -08:00
Sebastian Messmer	4431731c68	Making ops c10-full: Storage arguments (#49146 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49146 Add support for Storage arguments to IValue and the JIT typing system, and make ops that were blocked on that c10-full. ghstack-source-id: 118710665 (Note: this ignores all push blocking failures!) Test Plan: waitforsandcastle Reviewed By: ezyang Differential Revision: D25456799 fbshipit-source-id: da14f125af352de5fcf05a83a69ad5a69d5a3b45	2020-12-16 14:00:34 -08:00
James Reed	76d41c801e	[JIT] Fix toIValue handling of AttributeError when casting ClassType (#49188 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49188 Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D25476573 Pulled By: jamesr66a fbshipit-source-id: cec296fae71cc0cdf36bde60417d7d3b1aa84198	2020-12-11 17:54:16 -08:00
Luca Wehrstedt	a6778989d1	Support wider range of types in FutureNCCL (#48502 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48502 This commit is part of a stack that reworks FutureNCCL in order to extract a generic CUDA-aware Future subclass. The stack deliberately breaks up this transition into elementary changes, to make it easier to verify that the behavior is preserved (or to highlight how it gets changed). --- FutureNCCL restricted the values to be tensors, or (singleton) lists of tensors, or Python object that could be converted to either of those types. We need a CUDA future that can handle more generic types though. The main challenge is extracting all DataPtrs from an arbitrary object. I think I found some ways of doing so, but I'd like some JIT experts to look into this and tell me if there are better ways. I'll add inline comments for where their input would be appreciated. ghstack-source-id: 118180026 Test Plan: Unit tests (I should probably add new ones) Reviewed By: wanchaol Differential Revision: D25177562 fbshipit-source-id: 1ef18e67bf44543c70abb4ca152f1610dea4e533	2020-12-10 03:54:15 -08:00
Luca Wehrstedt	b7f5aa9890	Remove NCCL dependency from PythonFutureWrapper (#48495 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48495 This commit is part of a stack that reworks FutureNCCL in order to extract a generic CUDA-aware Future subclass. The stack deliberately breaks up this transition into elementary changes, to make it easier to verify that the behavior is preserved (or to highlight how it gets changed). --- PythonFutureWrapper needs to provide a GIL-aware way to extract tensors from an IValue of type PyObject. Since this was only used by FutureNCCL it was guarded by #ifdef USE_C10D_NCCL. However, we will need to use it with CUDA-aware futures other than the NCCL one. This might have been achieved simply by replacing USE_C10D_NCCL with USE_CUDA, but I wanted to clean this up better. We're dealing with two independent dimensions: C++-vs-Python and CPU-vs-CUDA. To make the code more modular, the two dimensions should be dealt with by orthogonal solutions: the user setting a custom callback to handle Python, and the subclass being CUDA-aware. Mixing these two axes makes it more complicated. Another reason for changing how this works is that later on, when we'll introduce multi-device support, we'll need to extract dataptrs for other reasons too (rather than just recording streams with the caching allocator), namely to inspect the value to determine which devices it resides on. ghstack-source-id: 118180038 Test Plan: Unit tests Reviewed By: mrshenli Differential Revision: D25177560 fbshipit-source-id: 3a424610c1ea191e8371ffee0a26d62639895884	2020-12-10 03:53:44 -08:00
Meghan Lele	18eccfbe42	[JIT] Fix clang-tidy warnings in jit/python (#47985 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/47985 Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D25258644 Pulled By: SplitInfinity fbshipit-source-id: dfc15dc62c148f79f4e99fd058a6bf2d071ccbb5	2020-12-02 12:35:36 -08:00
Elias Ellison	4380934b9b	[JIT] Dont use specialized tensor type (#46130 ) Summary: Fix for https://github.com/pytorch/pytorch/issues/46122 For `Any`, we infer the type of the ivalue to set the ivalue's type tag. When we saw a Tensor, we would use a specialized Tensor type, so when `Dict[str, Tensor]` was passed in as any `Any` arg it would be inferred as `Dict[str, Float(2, 2, 2, 2)]` which breaks runtime `isinstance` checking. Pull Request resolved: https://github.com/pytorch/pytorch/pull/46130 Reviewed By: glaringlee Differential Revision: D24261447 Pulled By: eellison fbshipit-source-id: 8a2bb26ce5b6c56c8dcd8db79e420f4b5ed83ed5	2020-11-13 18:34:40 -08:00
Wanchao Liang	fa560ceb9c	[reland] make intrusive_ptr as a pybind holder type (#47586 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/47586 relanding PR of https://github.com/pytorch/pytorch/pull/44492, and add additional Capsule related wrapping to ensure we still have the correct type in pybind11 to resolve Capsule as torch._C.CapsuleType Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D24822519 Pulled By: wanchaol fbshipit-source-id: eaaea446fb54b56ed3b0d04c31481c64096e9459	2020-11-10 10:09:08 -08:00
Yi Wang	98aad933b6	[pytorch][PR] Record FutureNCCL callback stream on CUDA caching allocator (#45318 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45318 When calling `then()` from WorkNCCL, record the input data pointers in futureNCCLCallbackStream_ before the execution of the input callback. Note that the recording cannot be directly added to the lambda used by addCallback in ProcessGroupNCCL.hpp. This is because the type of future value in that context is pyobject rather than TensorList, but a type casting will require pybind and introduce Python dependency, which should not be allowed in c10d library. I have considered creating a util function in a separate file to support this type casting, and then placing it under torch/csrc directory where python dependency is allowed. However, torch/csrc has a dependency on c10d, so this will create a circular dependency. Finally, a `record_stream_cb_` member is added to FutureNCCL, and the default value is nullptr. A default `record_stream_cb_` implementation is added to `PythonFutureWrapper,` where Python dependency is allowed. In addition, a few lines are reformatted by lint. caffe2/torch/csrc/distributed/c10d/init.cpp is only reformatted. #Closes: https://github.com/pytorch/pytorch/issues/44203 Test Plan: buck test mode/dev-nosan caffe2/test/distributed:c10d -- ProcessGroupNCCLTest buck test mode/dev-nosan caffe2/test/distributed:c10d -- test_accumulate_gradients_no_sync_allreduce_with_then_hook buck test mode/dev-nosan caffe2/test/distributed:c10d -- test_ddp_comm_hook_allreduce_with_then_hook_nccl Reviewed By: pritamdamania87 Differential Revision: D23910257 fbshipit-source-id: 66920746c41f3a27a3689f22e2a2d9709d0faa15	2020-10-22 01:49:47 -07:00
chengjun	5741de883a	Define the record_stream method in native_functions.yaml (#44301 ) Summary: The record_stream method was hard coded for CUDA device. Define the record_stream in the native_functions.yaml to enable the dynamic dispatch to different end device. Fixes https://github.com/pytorch/pytorch/issues/36556 Pull Request resolved: https://github.com/pytorch/pytorch/pull/44301 Reviewed By: glaringlee Differential Revision: D23763954 Pulled By: ezyang fbshipit-source-id: e6d24f5e7892b56101fa858a6cad2abc5cdc4293	2020-10-13 09:15:22 -07:00
Brian Hirsh	a3caa719af	fix #45552 - adding add_done_callback(fn) to torch.futures.Future (#45675 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45675 Test Plan: Imported from OSS Reviewed By: glaringlee Differential Revision: D24055353 Pulled By: bdhirsh fbshipit-source-id: 9233c8e17acc878f0fecbe740a4397fb55cf722f	2020-10-13 07:47:36 -07:00
gunandrose4u	f07ac6a004	Fix Windows build failure after DDP PR merged (#45335 ) Summary: Fixes #{issue number} This is resubmit for PR https://github.com/pytorch/pytorch/issues/42897 . Together with fix for Windows build issue introduced by PR https://github.com/pytorch/pytorch/issues/44344 . Pull Request resolved: https://github.com/pytorch/pytorch/pull/45335 Reviewed By: zou3519 Differential Revision: D23931471 Pulled By: mrshenli fbshipit-source-id: f49b5a114944c1450b32934b3292170be064f494	2020-09-25 12:37:50 -07:00
Mike Ruberry	103fa3894a	Revert D23841786: [pytorch][PR] Enable distributed package on windows, Gloo backend supported only Test Plan: revert-hammer Differential Revision: D23841786 (`0122299f9b`) Original commit changeset: 334ba1ed73ef fbshipit-source-id: ec95432f9957df56a5a04e52661f5db920b7f57f	2020-09-24 22:44:33 -07:00

1 2

83 Commits