pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 13:44:15 +08:00

Author	SHA1	Message	Date
Aaron Orenstein	e95e8eed0a	mypy 1.16.0 (#155821 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155821 Approved by: https://github.com/ezyang, https://github.com/zou3519	2025-06-14 18:18:43 +00:00
cyy	d87aad6877	[5/N] Apply Ruff fixes and pyupgrade to Python 3.9 (#144205 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/144205 Approved by: https://github.com/albanD	2025-01-15 04:00:47 +00:00
Aaron Gokaslan	91dbd7b75c	[BE]: Improve typing inference with TypeIs (#144682 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144682 Approved by: https://github.com/albanD Co-authored-by: Aaron Orenstein <aorenste@meta.com>	2025-01-13 21:14:31 +00:00
Xuehai Pan	b5c006acac	[BE][Easy] enable UFMT for `torch/nn/` (#128865 ) Part of #123062 - #123062 Pull Request resolved: https://github.com/pytorch/pytorch/pull/128865 Approved by: https://github.com/ezyang	2024-07-25 02:48:42 +00:00
Xuehai Pan	dff6342a0b	[BE][Easy] enable UFMT for `torch/nn/parallel` (#128596 ) Part of #123062 - #123062 Pull Request resolved: https://github.com/pytorch/pytorch/pull/128596 Approved by: https://github.com/mikaylagawarecki	2024-06-17 16:29:22 +00:00
Aaron Gokaslan	2d47385f0f	[BE]: Enable ruff TCH rules and autofixes for better imports (#127688 ) Automated fixes to put imports that are only used in type hints into TYPE_CHECKING imports. This also enables the RUFF TCH rules which will automatically apply autofixes to move imports in and out of TYPE_CHECKING blocks as needed in the future, this will make the initial PyTorch import faster and will reduce cyclic dependencies. Co-authored-by: Xuehai Pan <XuehaiPan@pku.edu.cn> Pull Request resolved: https://github.com/pytorch/pytorch/pull/127688 Approved by: https://github.com/XuehaiPan, https://github.com/ezyang, https://github.com/malfet	2024-06-06 16:55:58 +00:00
Matthew Hoffman	2491aa53a8	Make DataParallel generic (#102455 ) Fixes #102441 improves type hinting of the module attribute, since it can easily be bound in `DataParallel.__init__` ```python from torch.nn import DataParallel class MyModule(Module): ... my_data_parallel = DataParallel(MyModule(), device_ids=[0, 1, 2]) reveal_type(my_data_parallel) # Type of "my_data_parallel" is "DataParallel[MyModule]" reveal_type(my_data_parallel.module) # Type of "my_data_parallel.module" is "MyModule" ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/102455 Approved by: https://github.com/Skylion007	2023-06-03 00:33:01 +00:00
Matthew Hoffman	0ed22fce97	Merge type stubs torch nn parallel (#102194 ) Fixes merge issue for #101528 In the above PR, `torch.nn.parallel.parallel_apply.get_a_var` was marked private to appease the [public interface linter](https://github.com/pytorch/pytorch/actions/runs/4999216467/jobs/8955582204#step:14:21666): `ceeb242bc7` This broke CI pipelines running external dependencies that expected `get_a_var`'s name to not change. In this PR, we change the name back to `get_a_var` and include it in the `__all__` instead. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102194 Approved by: https://github.com/ezyang	2023-05-26 20:10:47 +00:00
PyTorch MergeBot	023bc30b17	Revert "Merge type stubs for torch.nn.parallel (#101528 )" This reverts commit 6cabc105bb7c9cce4e23bdcc4a921613caae0f9a. Reverted https://github.com/pytorch/pytorch/pull/101528 on behalf of https://github.com/kit1980 due to Broke inductor tests https://github.com/pytorch/pytorch/actions/runs/5071348299/jobs/9107880424 ImportError: cannot import name 'get_a_var' from 'torch.nn.parallel.parallel_apply' ([comment](https://github.com/pytorch/pytorch/pull/101528#issuecomment-1561732862))	2023-05-24 18:23:52 +00:00
Matthew Hoffman	6cabc105bb	Merge type stubs for torch.nn.parallel (#101528 ) Fixes #91648 As explained in the tracking issue, the incomplete type stubs in `torch/nn/parallel` mask `DataParallel` methods relevant for subclassing and also mask type issues present in the code as well. One notable change here is the addition of [`allow_redefinition = True`](https://mypy.readthedocs.io/en/stable/config_file.html#confval-allow_redefinition) in `mypy.ini`, which allows for a common pattern: > Allows variables to be redefined with an arbitrary type, as long as the redefinition is in the same block and nesting level as the original definition. This is added specifically to allow for the type narrowing of `device_ids` in `torch.nn.parallel.data_parallel.data_parallel` from `Sequence[Union[int, torch.device]]` to `Sequence[int]`. Other than this, there are various renamings and `type: ignore` comments added to bypass errors that arose from the merging. @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/101528 Approved by: https://github.com/ezyang	2023-05-24 16:52:13 +00:00
Alex Henrie	5f2ec6293d	Unused variables in neural net classes and functions (#50100 ) Summary: These unused variables were identified by [pyflakes](https://pypi.org/project/pyflakes/). They can be safely removed to simplify the code and possibly improve performance. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50100 Reviewed By: ezyang Differential Revision: D25797764 Pulled By: smessmer fbshipit-source-id: ced341aee692f429d2dcc3a4ef5c46c8ee99cabb	2021-01-06 08:16:57 -08:00
Ansley Ussery	c619892482	Fix errata (#49903 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49903 Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D25718411 Pulled By: ansley fbshipit-source-id: 0cc365c5a53077752dc1c5a5c4a65b873baa3604	2020-12-28 20:40:41 -08:00
lixinyu	870a5a0d6d	Enable DataParallel to run zero input Module (#46565 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/46565 Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D24405275 Pulled By: glaringlee fbshipit-source-id: a8baaf4cf227f7f21fc3b080a446f92f0effe18e	2020-10-22 18:04:33 -07:00
Alexander Grund	5b0f400488	Replace list(map(...)) constructs by list comprehensions (#46461 ) Summary: As discussed in https://github.com/pytorch/pytorch/issues/46392 this makes the code more readable and possibly more performant. It also fixes a bug detected by this where the argument order of `map` was confused: `030a24906e (diff-5bb26bd3a23ee3bb540aeadcc0385df2a4e48de39f87ed9ea76b21990738fe98L1537-R1537)` Fixes https://github.com/pytorch/pytorch/issues/46392 Pull Request resolved: https://github.com/pytorch/pytorch/pull/46461 Reviewed By: ailzhang Differential Revision: D24367015 Pulled By: ezyang fbshipit-source-id: d55a67933cc22346b00544c9671f09982ad920e7	2020-10-19 18:42:49 -07:00
Michael Suo	c93e96fbd9	[jit] move script-related implementation out of torch/jit/__init__.py (#40902 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40902 See the bottom of this stack for context. Test Plan: Imported from OSS Reviewed By: eellison Differential Revision: D22360210 Pulled By: suo fbshipit-source-id: 4275127173a36982ce9ad357aa344435b98e1faf	2020-07-08 11:38:34 -07:00
chengjun	8d570bc708	Decouple DataParallel/DistributedDataParallel from CUDA (#38454 ) Summary: Decouple DataParallel/DistributedDataParallel from CUDA to support more device types. - Move torch/cuda/comm.py to torch/nn/parallel/comm.py with minor changes for common devices support. Torch.cuda.comm is kept as is for backward compatibility - Provide common APIs to arbitrary device types without changing existing CUDA APIs in torch.cuda space. - Replace the torch.cuda calls in DataParellel/DistributedDataParallel with the new APIs. Related RFC: [https://github.com/pytorch/pytorch/issues/36160](https://github.com/pytorch/pytorch/issues/36160) Pull Request resolved: https://github.com/pytorch/pytorch/pull/38454 Differential Revision: D22051557 Pulled By: mrshenli fbshipit-source-id: 7842dad0e5d3ca0f6fb760bda49182dcf6653af8	2020-07-07 12:48:16 -07:00
Shen Li	8d6a8d2b3f	Fix DDP bug in single process multiple device use cases (#36503 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36503 Test Plan: Imported from OSS Differential Revision: D21179274 Pulled By: mrshenli fbshipit-source-id: 0afce30ae0ddda753d1e240584a0f80df9aec4c2	2020-04-22 15:06:28 -07:00
Zachary DeVito	967cdc2baf	Simplify replicate logic (#36174 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36174 Test Plan: Imported from OSS Differential Revision: D20903301 Pulled By: zdevito fbshipit-source-id: 714a32fe417b7d1615886936c41505d1ba538f47	2020-04-13 11:21:43 -07:00
Natalia Gimelshein	cd9d9a2235	fix handling of replica parameters in DataParallel (#33907 ) Summary: In DataParallel, replica parameters are not leaves (because they are computed via broadcast from master parameters), and should be treated as such. Fixes https://github.com/pytorch/pytorch/issues/33552 Pull Request resolved: https://github.com/pytorch/pytorch/pull/33907 Differential Revision: D20150199 Pulled By: ngimel fbshipit-source-id: 5965d4115b6b3a8433063126ff6269567872fbeb	2020-03-10 10:35:44 -07:00
Jithun Nair	824e649d40	Specify requires_grad for Parameter replica so it's not always set to True by default (#32356 ) Summary: This is the proposed fix for issue https://github.com/pytorch/pytorch/issues/32018 Pull Request resolved: https://github.com/pytorch/pytorch/pull/32356 Differential Revision: D19450648 Pulled By: mrshenli fbshipit-source-id: c63eeb6e9f5a87ebe613dd7013907559f295a7ea	2020-01-17 17:41:10 -08:00
Zachary DeVito	627f2823e0	remove _register_* bindings from python (#29499 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29499 This changes how DataParallel and trace module creation works so that we no longer need to mutate Module class after it has been created. The only remaining usage of register_* functions are now inside C++ tests. Test Plan: Imported from OSS Differential Revision: D18413652 Pulled By: zdevito fbshipit-source-id: f039e5400cd016632768be4547892f6a69645c20	2019-11-11 13:52:46 -08:00
Zachary DeVito	796363147f	Implement more of of the nn.Module API (#28828 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28828 This updates torch::script::Module to more closely match the behavior of nn.Module. In particular, it implements the (optionally recurisive) iterators that retrieve submodules, parameters, and buffers and makes their names match the python versions. This also removes the individual accessors for Parameter, Module, Buffer, etc. and replaces them with a single `attr` function which is equivalent to writing `a.foo` in Python (`setattr` emulates `a.foo = v`). As we build out the user-facing API for TorchScript values this will end up matching how an attribute is accessed on general objects. This PR preservers the python bindings for script::Module by emulating the old API at the binding level. A followup will clean up the usage to more directly match the C++ API. Test Plan: Imported from OSS Differential Revision: D18197611 Pulled By: zdevito fbshipit-source-id: 7ee4dcbb258605d1c988314b05d938423f1ccee5	2019-11-06 22:58:25 -08:00
Michael Suo	341262754f	module dedupe (#26666 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26666 Changes: - Introduce a `ConcreteModuleType` concept. This acts both as the key into the type cache, and as the source of truth for `ModuleValue::attr` queries. It needs to do both jobs because that's how we ensure correctness (if the types are different, it's because `ModuleValue::attr` would return different things). - Now `recursive_script` will first construct a `ConcreteModuleType` and search for a pre-existing type before starting compilation. - All previous paths to creating a `ScriptModule` (including inheriting from `ScriptModule`) are now rewritten to go through `create_script_module`, so that we have only a single place where construction happens. Behavioral changes: - Big change to `torch.jit.ScriptModule` inheritance: all attributes are now recursively scripted if possible, matching recursive scripting semantics. This makes it hard to keep something from being scripted (for example, a Python submodule). Possibly we'll need an `ignore()` type thing for attributes. In particular, this adds `self.training` to every ScriptModule, since it's present on every `nn.Module`. - I believe this change to be transparent to existing users of the inheritance API, since if you had an attribute that is unscriptable that you never used, there is no error. In some cases, we will create new attributes (even if they are unused), which will increase serialized model size from before. Test Plan: Imported from OSS Differential Revision: D17551196 Pulled By: suo fbshipit-source-id: b476d1c9feb3ddfd63406d90989aaf9dfe890591	2019-10-12 09:51:57 -07:00
Michael Suo	ffa422a8b3	kill _parameter_list (#27399 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27399 This was devised in a time when we didn't have module attributes. They are essentially just tensor lists, so represent them that way. This has the additional benefit of making the RNN forward pass faster because we effectively cache the flattened weights. The only complication part is that someone may come along and do: ``` my_rnn_mod.w_ih_l0 = torch.nn.Parameter(...) ``` This means we need to override setattr to keep the flattened weights cache up to date. Test Plan: Imported from OSS Differential Revision: D17785658 Pulled By: suo fbshipit-source-id: 7789cd1d0d4922bfd5eba1716976442fbf150766	2019-10-12 09:51:53 -07:00
David Riazati	10c4b98ade	Remove weak script (#22212 ) Summary: * Deletes all weak script decorators / associated data structures / methods * In order to keep supporting the standard library in script, this enables recursive script on any function defined in `torch.nn` * Most changes in `torch/nn` are the result of `ag -Q "weak" torch/nn/ -l \| xargs sed -i '/weak/d'`, only `rnn.py` needed manual editing to use the `ignore` and `export` to continue supporting the overloaded `forward` methods * `Sequential`/`ModuleList` no longer need to be added to constants since they are compiled on demand This should also fix https://github.com/pytorch/pytorch/issues/22212 Pull Request resolved: https://github.com/pytorch/pytorch/pull/22212 Differential Revision: D15988346 Pulled By: driazati fbshipit-source-id: af223e3ad0580be895377312949997a70e988e4f	2019-07-03 17:28:25 -07:00
davidriazati	61cc03fb8d	Make ScriptModule.training an attribute instead of a parameter (#21078 ) Summary: Redo of #19587 ](https://our.intern.facebook.com/intern/diff/15560540/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/21078 Pulled By: driazati Differential Revision: D15560540 fbshipit-source-id: f415775d87c163f93b3bbdd5f87c9ff73f58b049	2019-06-06 12:06:49 -07:00
David Riazati	fa8c132e24	Revert D15502768: [pytorch][PR] [jit] Make ScriptModule.training an attribute instead of a parameter Differential Revision: D15502768 Original commit changeset: 3022f2d57ec6 fbshipit-source-id: 5cd08d3c3a75e38e3aa9b75a0c0059a2c6c85a1e	2019-05-29 12:18:18 -07:00
David Riazati	28079c3906	Make ScriptModule.training an attribute instead of a parameter (#19587 ) Summary: Stack from [ghstack](https://github.com/ezyang/ghstack): * #19587 [jit] Make ScriptModule.training an attribute instead of a parameter Remove the hack we had previously where `training` was a buffer Pull Request resolved: https://github.com/pytorch/pytorch/pull/19587 Differential Revision: D15502768 Pulled By: driazati fbshipit-source-id: 3022f2d57ec6849868f9225d9bc2bfb7828cb318	2019-05-28 16:06:46 -07:00
Zachary DeVito	6cb1b994d8	Trace directly into first-class module form. (#19722 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19722 ghimport-source-id: b024666feccb324f5ba9aae4a6301723e04d9846 Reviewed By: jamesr66a Differential Revision: D15078535 Pulled By: zdevito fbshipit-source-id: b866b31c1864a090c545560cbecee81e34ad2d16	2019-04-25 15:53:03 -07:00
Zachary DeVito	87a6974193	Make it possible for self.forward to return a ScriptMethod (#19217 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19217 ghimport-source-id: 6fdd7f5ac041dae950b47ca316f30682ede0b083 Reviewed By: suo Differential Revision: D14922120 Pulled By: zdevito fbshipit-source-id: 5e82e5d7ee72df6f401146d2519c80ea336ff40e	2019-04-24 11:14:34 -07:00
Shen Li	344acaa0ca	Revert replicate.py to disallow replicating multi-device modules (#19278 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19278 Based on discussion in https://github.com/pytorch/pytorch/pull/19278 and https://github.com/pytorch/pytorch/pull/18687, changes to replicate.py will be reverted to disallow replicating multi-device modules. Reviewed By: pietern Differential Revision: D14940018 fbshipit-source-id: 7504c0f4325c2639264c52dcbb499e61c9ad2c26	2019-04-16 10:03:38 -07:00
Shen Li	7ae0263e1b	Support replicating multi-GPU modules (#18687 ) Summary: If the input `network` resides on multiple GPUs, `devices` must be a 2D list with `devices[0]` matching `network`'s devices. See #18591 Pull Request resolved: https://github.com/pytorch/pytorch/pull/18687 Differential Revision: D14706162 Pulled By: mrshenli fbshipit-source-id: dca630d3308f2dbcf8b75629c452d7a64092ba42	2019-04-03 14:43:07 -07:00
David Riazati	a2381fa346	Add module attributes (#17309 ) Summary: Similar to `nn.Parameter`s, this PR lets you store any `IValue` on a module as an attribute on a `ScriptModule` (only from the Python front-end currently). To mark something as an attribute, it should wrapped in `jit.Attribute(value, type)` (ex. `self.table = torch.jit.Attribute(table, Dict[str, torch.Tensor])`) Followup Work: * (de)serializing for use in C++ * change `self.training` to be a `bool` attribute instead of a buffer * mutable attributes * string frontend support * documentation Pull Request resolved: https://github.com/pytorch/pytorch/pull/17309 Differential Revision: D14354316 Pulled By: driazati fbshipit-source-id: 67e08ab5229366b67fbc837e67b58831a4fb3318	2019-03-07 10:44:10 -08:00
Tongzhou Wang	44a607b90c	Fix autograd with buffers requiring grad in DataParallel (#13352 ) Summary: Causing a problem with spectral norm, although SN won't use that anymore after #13350 . Pull Request resolved: https://github.com/pytorch/pytorch/pull/13352 Differential Revision: D14209562 Pulled By: ezyang fbshipit-source-id: f5e3183e1e7050ac5a66d203de6f8cf56e775134	2019-02-26 20:53:19 -08:00
Guoqiang Jerry Chen	678a472ee5	Script module data parallel (#16891 ) Summary: support data parallel for ScriptModule. see unit tests for testing done for this PR. I also tried traced version of resnet18 from torchvision. I'm yet to try a complete end-to-end data parallel training. This will be next steps. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16891 Differential Revision: D14002222 Pulled By: gqchen fbshipit-source-id: fce3598169113215599815c6978e66d3c3a8c282	2019-02-14 22:52:19 -08:00
Wei Yang	54107ae8cf	convert output_device at data_parallel from torch.device to index (#10189 ) Summary: - fixes #9984 Pull Request resolved: https://github.com/pytorch/pytorch/pull/10189 Differential Revision: D9545390 Pulled By: weiyangfb fbshipit-source-id: 3a6a705437553ba319e9fd4b7f676ff73857a27e	2018-09-11 20:27:07 -07:00
Jerry Ma	afd7477eaa	Add ``buffers(),` `named_buffers()`` methods. (#10554 ) Summary: This commit adds the ``buffers()`` and ``named_buffers()`` methods as analogues of ``parameters()`` and ``named_parameters()``. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10554 Reviewed By: SsnL Differential Revision: D9367762 Pulled By: jma127 fbshipit-source-id: f2042e46a7e833dce40cb41681dbd80d7885c74e	2018-08-16 16:26:48 -07:00
Ailing	f5aa8d55ad	fix detach in place error in DDP (#5829 ) * fix detach in DDP * fix typo * make lint happy	2018-03-16 09:22:04 -04:00
gchanan	22ec5f37ca	Support double backwards with parallel nn autograd functions. (#2508 )	2017-08-22 03:57:45 -04:00
Sam Gross	b4414c0dc3	Handle None in modules list. It's often useful to add None to an nn.ModuleList to keep the indexing of the module list to match some other property.	2017-07-03 18:53:21 -04:00
Adam Paszke	23ab9d481a	Add Module._all_buffers	2017-06-12 21:58:38 -04:00
Sam Gross	65b66264d4	Improve broadcast/reduce performance by coalescing tensors	2017-03-06 12:47:53 -08:00
Adam Paszke	d6fa3b3fd5	Deprecate nn.Container in favor of nn.Module	2017-01-16 19:07:37 -05:00
Adam Paszke	8d60e39fdc	Rename torch.nn.functions to torch.nn._functions	2016-12-30 23:02:57 +01:00
Adam Paszke	aa8916e7c6	Don't unpack single element tuples returned by functions	2016-11-23 18:48:41 +01:00
Adam Paszke	80a827d3da	Fix data_parallel bugs	2016-11-23 18:48:41 +01:00
Sam Gross	15377ac391	Copy Module._buffers in nn.parallel.replicate (#180 )	2016-10-31 12:12:29 -04:00
Sam Gross	f4ebc65a12	Add Module.modules() and Module.children() (#90 ) modules(): returns an iterator over all modules in the network children(): returns an iterator over immediate children Also fix __getitem__ in Sequential	2016-10-01 21:18:53 -04:00
Sam Gross	e8a5f00866	Auto GPU for CUNN (#71 )	2016-09-30 14:04:53 -04:00
Soumith Chintala	412019dbe4	fixing CPU builds by making cuda imports optional	2016-09-28 11:56:18 -04:00

1 2

51 Commits