Options to address the "undocumented python objects":
1. Reference the functions in the .rst via the torch.nn.modules namespace. Note that this changes the generated doc filenames / locations for most of these functions!
2. [Not an option] Monkeypatch `__module__` for these objects; this broke several tests in CI because `inspect.findsource` failed after the change (see the sketch after this list).
3. Update the .rst files to also document the torch.nn.modules forms of these functions, duplicating docs.
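For context, the monkeypatch in option 2 would have looked something like this hedged sketch (illustrative, not the exact code tried in CI):
```python
import torch.nn as nn

# Re-attribute a few representative classes to the public namespace so
# doc tooling stops seeing them as torch.nn.modules.* objects.
for obj in (nn.Linear, nn.Conv2d):
    obj.__module__ = "torch.nn"  # this is what broke inspect.findsource in CI
```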
#### [this is the docs page added](https://docs-preview.pytorch.org/pytorch/pytorch/158491/nn.aliases.html)
This PR takes option 3 by adding an .rst page, nn.aliases, that documents the aliases in the nested namespaces, removing all the torch.nn.modules.* entries from the coverage skiplist except:
- NLLLoss2d (deprecated)
- Container (deprecated)
- CrossMapLRN2d (what is this?)
- NonDynamicallyQuantizableLinear
This mostly required adding docstrings to `forward`, `extra_repr`, and `reset_parameters`. Since forward arguments are already documented in the module docstrings, I just added a very basic docstring.
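A minimal sketch of the kind of one-line docstring added (the exact wording may differ per module):
```python
import torch
from torch import Tensor


class ExampleModule(torch.nn.Module):  # illustrative stand-in
    def forward(self, input: Tensor) -> Tensor:
        """Run the forward pass."""
        return input
```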
Pull Request resolved: https://github.com/pytorch/pytorch/pull/158491
Approved by: https://github.com/janeyx99
I found the same issue as #147490 (@jibril-b-coulibaly).
There's an equivalent in the [doc-string](https://docs.pytorch.org/docs/stable/generated/torch.nn.RNN.html#rnn) of `torch.nn.RNN`:
```python
# Efficient implementation equivalent to the following with bidirectional=False
def forward(x, hx=None):
    if batch_first:
        x = x.transpose(0, 1)
    seq_len, batch_size, _ = x.size()
    if hx is None:
        hx = torch.zeros(num_layers, batch_size, hidden_size)
    h_t_minus_1 = hx
    h_t = hx
    output = []
    for t in range(seq_len):
        for layer in range(num_layers):
            h_t[layer] = torch.tanh(
                x[t] @ weight_ih[layer].T
                + bias_ih[layer]
                + h_t_minus_1[layer] @ weight_hh[layer].T
                + bias_hh[layer]
            )
        output.append(h_t[-1])
        h_t_minus_1 = h_t
    output = torch.stack(output)
    if batch_first:
        output = output.transpose(0, 1)
    return output, h_t
```
However, there are several problems.
1. As mentioned in #147490, line 499 is wrong:
fb55bac3de/torch/nn/modules/rnn.py (L499)
The **input to the RNN cell should be different** for different layers.
2. The code contains several hidden **reference-related issues** that may result in unintended modifications to tensors. For example, line 504 causes all elements in the final output list to point to the same tensor (see the short demonstration after this list).
fb55bac3de/torch/nn/modules/rnn.py (L504)
3. Some variables are not **defined**. Although this is a relatively minor issue in an annotation, it can cause significant confusion for those who are new to the concept. For example, `weight_ih` in line 499:
fb55bac3de/torch/nn/modules/rnn.py (L499)
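A minimal standalone demonstration of the aliasing problem from point 2 (not taken from the docstring itself):
```python
import torch

# Appending a reference to a tensor that is later mutated in place makes
# every list entry reflect the final value.
h = torch.zeros(3)
out = []
for t in range(3):
    h += 1            # in-place update
    out.append(h)     # appends a reference to h, not a copy
print([o.tolist() for o in out])  # all three entries are [3.0, 3.0, 3.0]
```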
So I wrote a runnable version to make it clearer:
```python
# Efficient implementation equivalent to the following with bidirectional=False
rnn = nn.RNN(input_size, hidden_size, num_layers)
params = dict(rnn.named_parameters())
def forward(x, hx=None, batch_first=False):
    if batch_first:
        x = x.transpose(0, 1)
    seq_len, batch_size, _ = x.size()
    if hx is None:
        hx = torch.zeros(rnn.num_layers, batch_size, rnn.hidden_size)
    h_t_minus_1 = hx.clone()
    h_t = hx.clone()
    output = []
    for t in range(seq_len):
        for layer in range(rnn.num_layers):
            input_t = x[t] if layer == 0 else h_t[layer - 1]
            h_t[layer] = torch.tanh(
                input_t @ params[f"weight_ih_l{layer}"].T
                + h_t_minus_1[layer] @ params[f"weight_hh_l{layer}"].T
                + params[f"bias_hh_l{layer}"]
                + params[f"bias_ih_l{layer}"]
            )
        output.append(h_t[-1].clone())
        h_t_minus_1 = h_t.clone()
    output = torch.stack(output)
    if batch_first:
        output = output.transpose(0, 1)
    return output, h_t
```
This code reproduces the computation of `torch.nn.RNN`.
For example:
```python
import torch
import torch.nn as nn
torch.manual_seed(0)
input_size, hidden_size, num_layers = 3, 5, 2
rnn = nn.RNN(input_size, hidden_size, num_layers)
params = dict(rnn.named_parameters())
x = torch.randn(10, 4, 3)
official_imp = rnn(x)
my_imp = forward(x)
assert torch.allclose(official_imp[0], my_imp[0])
assert torch.allclose(official_imp[1], my_imp[1])
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/153620
Approved by: https://github.com/mikaylagawarecki
Use `typing_extensions.deprecated` for deprecation annotation if possible. Otherwise, add `category=FutureWarning` to `warnings.warn("message")` if the category is missing.
Note that only warnings whose messages contain `[Dd]eprecat(ed|ion)` are updated in this PR.
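A hedged sketch of the two patterns (illustrative function names, not code from the PR):
```python
import warnings

from typing_extensions import deprecated


# Preferred: the decorator marks the API for both static checkers and runtime.
@deprecated("`old_fn` is deprecated, use `new_fn` instead.", category=FutureWarning)
def old_fn() -> None: ...


# Fallback for plain warnings: make sure an explicit category is passed.
def legacy_fn() -> None:
    warnings.warn(
        "`legacy_fn` is deprecated, use `new_fn` instead.",
        category=FutureWarning,
        stacklevel=2,
    )
```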
Resolves #126888
- #126888
This PR is split from PR #126898.
- #126898
------
Pull Request resolved: https://github.com/pytorch/pytorch/pull/127689
Approved by: https://github.com/Skylion007
Use `typing_extensions.deprecated` for deprecation annotation if possible. Otherwise, add `category=FutureWarning` to `warnings.warn("message")` if the category is missing.
Note that only warnings whose messages contain `[Dd]eprecat(ed|ion)` are updated in this PR.
UPDATE: Use `FutureWarning` instead of `DeprecationWarning`.
Resolves #126888
- #126888
Pull Request resolved: https://github.com/pytorch/pytorch/pull/126898
Approved by: https://github.com/albanD
Fixes https://github.com/pytorch/pytorch/issues/118129
Suppressions automatically added with
```python
import re

# Parse the mypy error log into {file_path: {line_number: error_type}}.
with open("error_file.txt", "r") as f:
    errors = f.readlines()

error_lines = {}
for error in errors:
    match = re.match(r"(.*):(\d+):\d+: error:.*\[(.*)\]", error)
    if match:
        file_path, line_number, error_type = match.groups()
        if file_path not in error_lines:
            error_lines[file_path] = {}
        error_lines[file_path][int(line_number)] = error_type

# Append a `# type: ignore[...]` suppression to each reported line
# (processed bottom-up; appends don't shift line numbers, so order is harmless).
for file_path, lines in error_lines.items():
    with open(file_path, "r") as f:
        code = f.readlines()
    for line_number, error_type in sorted(lines.items(), key=lambda x: x[0], reverse=True):
        code[line_number - 1] = code[line_number - 1].rstrip() + f" # type: ignore[{error_type}]\n"
    with open(file_path, "w") as f:
        f.writelines(code)
```
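For example, given a hypothetical report line `torch/foo.py:42:5: error: Incompatible types in assignment [assignment]`, the script rewrites line 42 of `torch/foo.py` like this:
```python
# before (hypothetical offending line)
self.mode = mode
# after the script runs
self.mode = mode # type: ignore[assignment]
```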
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Co-authored-by: Catherine Lee <csl@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/118533
Approved by: https://github.com/Skylion007, https://github.com/zou3519
Fixes #112599
Fixed pydocstyle errors in the following files. The remaining errors are related to missing docstrings at the module level and in methods within each module, such as `forward()`, `reset_parameters`, and `__init__`.
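As a hedged illustration of the kind of error fixed (not a literal diff from this PR), pydocstyle's D401 requires the first line of a docstring to be in the imperative mood:
```python
# Hypothetical before/after for pydocstyle's D401; illustrative only.

def extra_repr_before() -> str:
    """Returns the extra representation of the module."""  # D401 error
    return ""


def extra_repr_after() -> str:
    """Return the extra representation of the module."""  # passes
    return ""
```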
`pydocstyle torch/nn/modules/pooling.py --count`
before: 49
after: 29
**remaining errors:**
```
torch/nn/modules/pooling.py:1 at module level:
D100: Missing docstring in public module
torch/nn/modules/pooling.py:90 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:163 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:240 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:315 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:321 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:402 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:408 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:472 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:478 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:541 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:550 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:620 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:630 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:706 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:716 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:720 in public method `__setstate__`:
D105: Missing docstring in magic method
torch/nn/modules/pooling.py:774 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:792 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:845 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pooling.py:863 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:925 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:979 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1026 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1068 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1111 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1150 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1189 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pooling.py:1228 in public method `forward`:
D102: Missing docstring in public method
```
`pydocstyle torch/nn/modules/upsampling.py --count`
before: 14
after: 7
**remaining:**
```
torch/nn/modules/upsampling.py:1 at module level:
D100: Missing docstring in public module
torch/nn/modules/upsampling.py:142 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/upsampling.py:156 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/upsampling.py:160 in public method `__setstate__`:
D105: Missing docstring in magic method
torch/nn/modules/upsampling.py:166 in public method `extra_repr`:
D102: Missing docstring in public method
torch/nn/modules/upsampling.py:216 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/upsampling.py:263 in public method `__init__`:
D107: Missing docstring in __init__
```
`pydocstyle torch/nn/modules/rnn.py --count`
before: 47
after: 40
**remaining:**
```
torch/nn/modules/rnn.py:1 at module level:
D100: Missing docstring in public module
torch/nn/modules/rnn.py:59 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:160 in public method `__setattr__`:
D105: Missing docstring in magic method
torch/nn/modules/rnn.py:225 in public method `reset_parameters`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:230 in public method `check_input`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:242 in public method `get_expected_hidden_size`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:256 in public method `check_hidden_size`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:272 in public method `check_forward_args`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:278 in public method `permute_hidden`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:284 in public method `extra_repr`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:305 in public method `__getstate__`:
D105: Missing docstring in magic method
torch/nn/modules/rnn.py:313 in public method `__setstate__`:
D105: Missing docstring in magic method
torch/nn/modules/rnn.py:355 in public method `all_weights`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:471 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:478 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:481 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:503 in public method `forward` (skipping F811):
D102: Missing docstring in public method
torch/nn/modules/rnn.py:762 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:768 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:771 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:774 in public method `get_expected_cell_size`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:786 in public method `check_forward_args`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:798 in public method `permute_hidden`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:809 in public method `forward` (skipping F811):
D102: Missing docstring in public method
torch/nn/modules/rnn.py:820 in public method `forward` (skipping F811):
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1030 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1036 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1039 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1046 in public method `forward` (skipping F811):
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1054 in public method `forward` (skipping F811):
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1123 in public class `RNNCellBase`:
D101: Missing docstring in public class
torch/nn/modules/rnn.py:1134 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1152 in public method `extra_repr`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1160 in public method `reset_parameters`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1224 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1230 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1327 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1332 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/rnn.py:1422 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1427 in public method `forward`:
D102: Missing docstring in public method
```
`pydocstyle torch/nn/modules/pixelshuffle.py --count`
before: 13
after: 8
**remaining:**
```
torch/nn/modules/pixelshuffle.py:1 at module level:
D100: Missing docstring in public module
torch/nn/modules/pixelshuffle.py:52 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pixelshuffle.py:56 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:59 in public method `extra_repr`:
D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:105 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/pixelshuffle.py:109 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:112 in public method `extra_repr`:
D102: Missing docstring in public method
```
`pydocstyle torch/nn/modules/sparse.py --count`
before: 14
after: 8
**remaining errors:**
```
torch/nn/modules/sparse.py:1 at module level:
D100: Missing docstring in public module
torch/nn/modules/sparse.py:124 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/sparse.py:153 in public method `reset_parameters`:
D102: Missing docstring in public method
torch/nn/modules/sparse.py:162 in public method `forward`:
D102: Missing docstring in public method
torch/nn/modules/sparse.py:167 in public method `extra_repr`:
D102: Missing docstring in public method
torch/nn/modules/sparse.py:320 in public method `__init__`:
D107: Missing docstring in __init__
torch/nn/modules/sparse.py:350 in public method `reset_parameters`:
D102: Missing docstring in public method
torch/nn/modules/sparse.py:396 in public method `extra_repr`:
D102: Missing docstring in public method
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113177
Approved by: https://github.com/ezyang
This pull request addresses an inconsistency in the representation of the Hadamard product across PyTorch documentation. Currently, the notation varies among different modules:
- In the `torch.nn.LSTM` documentation, the Hadamard product is represented with $\odot$
- In the `torch.nn.GRU` documentation, the Hadamard product is represented with $*$
- In the `torch.nn.LSTMCell` documentation, the Hadamard product is represented with $*$
- In the `torch.nn.GRUCell` documentation, the Hadamard product is represented with $*$
- In the `torch.ao.nn.quantized.dynamic.GRU` documentation, the Hadamard product is represented with $*$
This PR proposes consistently representing the Hadamard product throughout the documentation to enhance clarity and align with established standards.
The notation $\odot$ will be uniformly adopted, following the convention in the [Deep Learning Book](https://www.deeplearningbook.org/contents/linear_algebra.html).
**Changes Made:**
- Modified `torch.nn.GRU` documentation to represent the Hadamard product with $\odot$
- Modified `torch.nn.LSTMCell` documentation to represent the Hadamard product with $\odot$
- Modified `torch.nn.GRUCell` documentation to represent the Hadamard product with $\odot$
- Modified `torch.ao.nn.quantized.dynamic.GRU` documentation to represent the Hadamard product with $\odot$
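For example, the GRU hidden-state update now reads

$$h_t = (1 - z_t) \odot n_t + z_t \odot h_{(t-1)}$$

with $\odot$ in place of the previous $*$.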
Pull Request resolved: https://github.com/pytorch/pytorch/pull/111763
Approved by: https://github.com/albanD
Addresses [issue #106085](https://github.com/pytorch/pytorch/issues/106085).
In `torch/nn/modules/rnn.py`:
- Adds documentation string to RNNBase class.
- Adds parameter documentation to the `__init__` methods of the RNN, LSTM, and GRU classes.
- Adds type annotations to the `__init__` methods of RNN, LSTM, and GRU.
In `torch/ao/nn/quantized/dynamic/modules/rnn.py`:
- Adds type specifications to `_FLOAT_MODULE` attributes in RNNBase, RNN, LSTM, and GRU classes.
> This resolves a `mypy` assignment error `Incompatible types in assignment (expression has type "Type[LSTM]", base class "RNNBase" defined the type as "Type[RNNBase]")` that seemed to be a result of the fully specified type annotations in `torch/nn/modules/rnn.py`.
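A hedged sketch of the annotation pattern (illustrative names, not the literal diff): explicitly annotating the attribute keeps the declared type wide enough for subclass assignments.
```python
from typing import Type


class FloatBase: ...
class FloatLSTM(FloatBase): ...


class QuantRNNBase:
    _FLOAT_MODULE: Type[FloatBase] = FloatBase


class QuantLSTM(QuantRNNBase):
    # OK under mypy: the declared type matches the base class exactly,
    # and FloatLSTM is a valid Type[FloatBase] value.
    _FLOAT_MODULE: Type[FloatBase] = FloatLSTM
```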
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106222
Approved by: https://github.com/mikaylagawarecki
This updates ruff to 0.285, which is faster, better, and fixes a bunch of false negatives with regard to f-strings.
I also enabled RUF017, which looks for accidental quadratic list summation. Luckily, it seems there are no instances of it in our codebase, so I'm enabling the rule so that it stays that way. :)
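For reference, a small example of what RUF017 flags, with the linear alternative ruff suggests (hedged illustration, not code from the codebase):
```python
import functools
import operator

lists = [[1, 2], [3], [4, 5]]

flat_slow = sum(lists, [])  # RUF017: each + copies the accumulator, O(n^2)
flat_fast = functools.reduce(operator.iadd, lists, [])  # in-place extend, linear
assert flat_slow == flat_fast == [1, 2, 3, 4, 5]
```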
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519
Approved by: https://github.com/ezyang
- Enabled LSTM weight prepack in inductor.
- Added a mkldnn decomposition for LSTM that won't change for different `seq_lens`. With the previous decomposition, in the dynamic-shapes use case where `seq_lens` changes, the graph would be different.
- Extended several inductor utility functions to support `List[Tensor]` as input. Previously those functions only supported `Tensor` input.
**Update 2023-07-26:**
- https://github.com/pytorch/pytorch/pull/103851 has moved CPU weight packing to after AOTAutograd. Updated this PR to follow the same approach (mainly in 3b207f7f1c (diff-6dffed1ade0ba3e887f9a4eafa3bfcec267ab2365b8adcb91bd391f49b3fd2e3)).
LSTM is decomposed into `aten.mkldnn_rnn_layer` by layer and by direction. The weight prepack is done at the `mkldnn_rnn_layer` level.
- Added a fix in the rnn `__getstate__` function in case we need to recompile an `LSTM` module.
When compiling the module, the weight tensors, which are the `named_parameters` of the module, are converted to `functional_tensor` here:
76fb72e24a/torch/nn/utils/stateless.py (L125-L128)
The forward function of LSTM will be called:
76fb72e24a/torch/_functorch/aot_autograd.py (L3379-L3381)
In the forward function, the `_flat_weights` are updated to be the same as the weights, thus becoming `functional_tensor`:
76fb72e24a/torch/nn/modules/rnn.py (L775-L778)
The weights tensors are converted back to the original tensors (which are not `functional_tensor` anymore) before exiting the `_reparametrize_module` context here:
76fb72e24a/torch/nn/utils/stateless.py (L130-L142)
But since `_flat_weights` is not in the `named_parameters` of the module, it's still `functional_tensor` ([link of the parameters that will be converted to functional and reverted back](76fb72e24a/torch/_functorch/aot_autograd.py (L3695-L3698))).
At this moment, if we need to recompile the model, `deepcopy` will be called:
76fb72e24a/torch/_dynamo/utils.py (L915-L917)
And it will report `UnImplemented` since we have a `functional_tensor` (`_flat_weights`), triggering a graph break, which is not what we expect:
76fb72e24a/torch/_subclasses/meta_utils.py (L514)
Added a fix in `__getstate__` to update the `_flat_weights` if the weights have ever changed, fixing this issue. The fix is covered by the `test_lstm_packed` UT.
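A hedged sketch of the shape of that fix (names and structure illustrative, not the literal diff):
```python
class RNNSketch:
    """Illustrative stand-in for the real module, not PyTorch code."""

    def __init__(self):
        self.weight_ih_l0 = object()  # stand-in for a weight tensor
        self._flat_weights = [self.weight_ih_l0]

    def _refresh_flat_weights(self):
        # Hypothetical helper: rebuild _flat_weights from the current
        # attributes so stale (e.g. functionalized) tensors are replaced.
        self._flat_weights = [self.weight_ih_l0]

    def __getstate__(self):
        self._refresh_flat_weights()  # the fix: re-sync before copying state
        return self.__dict__.copy()
```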
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103071
Approved by: https://github.com/jgong5, https://github.com/jansel