pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Author	SHA1	Message	Date
cyy	03e35efbf7	replace torch::make_unique with std::make_unique (#108866 ) It should be safe to remove the old torch::make_unique functions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108866 Approved by: https://github.com/albanD	2023-09-14 20:52:26 +00:00
Aaron Gokaslan	0247ed27cc	Apply Clang-Tidy readability-container-size-empty (#93236 ) Not only is this change usually shorter and more readable, it also can yield better performance. size() is not always a constant time operation (such as on LinkedLists), but empty() always is. Pull Request resolved: https://github.com/pytorch/pytorch/pull/93236 Approved by: https://github.com/malfet	2023-01-29 23:28:19 +00:00
Nikita Shulga	3a66a1cb99	[clang-tidy] Exclude cppcoreguidelines-avoid-magic-numbers (#57841 ) Summary: Add cppcoreguidelines-avoid-magic-numbers exclusion to clang-tidy Remove existing nolint warnings using following script: ``` for file in `git ls-files \| grep -v \.py`; do gsed '/^ *\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-magic-numbers)/d' -i $file; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/57841 Reviewed By: samestep Differential Revision: D28295045 Pulled By: malfet fbshipit-source-id: 7c6e8d1213c9593f169ed3df6a916498f1a97163	2021-05-07 20:02:33 -07:00
Nikita Shulga	4cb534f92e	Make PyTorch code-base clang-tidy compliant (#56892 ) Summary: This is an automatic change generated by the following script: ``` #!/usr/bin/env python3 from subprocess import check_output, check_call import os def get_compiled_files_list(): import json with open("build/compile_commands.json") as f: data = json.load(f) files = [os.path.relpath(node['file']) for node in data] for idx, fname in enumerate(files): if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'): files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')] return files def run_clang_tidy(fname): check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"]) changes = check_output(["git", "ls-files", "-m"]) if len(changes) == 0: return check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"]) def main(): git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n") compiled_files = get_compiled_files_list() for idx, fname in enumerate(git_files): if fname not in compiled_files: continue if fname.startswith("caffe2/contrib/aten/"): continue print(f"[{idx}/{len(git_files)}] Processing {fname}") run_clang_tidy(fname) if __name__ == "__main__": main() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892 Reviewed By: H-Huang Differential Revision: D27991944 Pulled By: malfet fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179	2021-04-28 14:10:25 -07:00
Andres Suarez	8530c65e25	[codemod][fbcode/caffe2] Apply clang-format update fixes Test Plan: Sandcastle and visual inspection. Reviewed By: igorsugak Differential Revision: D25849205 fbshipit-source-id: ef664c1ad4b3ee92d5c020a5511b4ef9837a09a0	2021-01-09 14:37:36 -08:00
Elias Ellison	3f32332ee6	[JIT][Easy]move remove mutation to own file (#41137 ) Summary: This should be in its own file... Pull Request resolved: https://github.com/pytorch/pytorch/pull/41137 Reviewed By: jamesr66a Differential Revision: D22437922 Pulled By: eellison fbshipit-source-id: 1b62dde1a4ebac673b5c60aea4f398f734d62501	2020-07-08 17:00:35 -07:00
Elias Ellison	f90dc741eb	[JIT] Normalize op aliases (#38735 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38735 Follow up to my comment https://github.com/pytorch/pytorch/pull/36597/#issuecomment-613674329 This adds a pass to convert op aliases into a normalized form. Having two ops generated in our IR that do the same thing makes the IR harder for downstream consumers of the IR, such as TorchScript passes but also ONNX, glow, etc. Another solution would have been to fix our code generation to only emit `aten::abs` from the start. This seems trickier, and doesn't really buy us much if we still have to expose `aten::absolute` in C++, as glaringlee of the C++ API thinks we should. Bike shedding: maybe this should be `CanonicalizeOps` instead Test Plan: Imported from OSS Differential Revision: D21673108 Pulled By: eellison fbshipit-source-id: c328618907de1af22e07f57fd27fa619978c2817	2020-05-21 21:47:17 -07:00
Elias Ellison	a3042ca89d	[JIT] Rewrite unaliased if output mutation (#37694 ) Summary: In a case like below, if x0 and x1 are both unaliased an only have a single use, than we can rewite the mutation to x2 without breaking observable semantics. This PR makes torchvision.models.alexnet functionalizable. ``` if cond: x0 = op() else: x1 = op() x2.add_(1) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/37694 Differential Revision: D21428275 Pulled By: eellison fbshipit-source-id: 1e2a39a8fb3819f1f225b7c345e986b3a3db253f	2020-05-06 15:26:31 -07:00
Michael Suo	9e32a1f5cd	[wip] update graph fuser aliasdb in-place (#37106 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37106 Recomputing the aliasdb on every fusion iteration + in every subblock is hugely expensive. Instead, update it in-place when doing fusion. The graph fuser pass operates by pushing nodes into a fusion group. So we start with ``` x, y = f(a, b, c) ``` and end with: ``` x_out, y_out = prim::fusionGroup(a, b, c) x_in, y_in = f(a_in, b_in, c_in) -> x_in, y_in ``` We destroy the `x` and `y` `Value*`s in the process. This operation is easy to express as an update to the aliasDb--`x_out` just takes on all the aliasing information `x` used to have. In particular, since we know `f` and `prim::fusionGroup` are purely functional, we don't have to mess with any write information. This PR is the bare minimum to get this working, in the interest of unscrewing the compilation times ASAP. Followups I want to do: - We don't have a way of expressing deletion of values in AliasDb. In `graph_fuser.cpp` we sometimes construct nodes that we end up throwing away, and we are littering `MemoryDAG` with references to dangling pointers. Because of the way the pass works, it's fine, but this is fragile so I want to fix it. - We should decouple alias analysis from write tracking, to simplify the job of keeping the write caches consistent as we mutate the aliasing information. - the tensorexpr fuser doesn't do this and thus is incorrect today, we need to update it to work. Test Plan: Imported from OSS Differential Revision: D21219179 Pulled By: suo fbshipit-source-id: 8ae5397b3a0ad90edec2fbc555647091f1ad5284	2020-04-30 22:21:35 -07:00
Michael Suo	5efd10518f	[jit] speed up alias analysis (#36345 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36345 During compilation, we spend a huge amount of time in alias analyis. This PR does a few things to speed it up. 1. Separate the analysis into two phases: one where we build up the necessary data structures, and the other where we service aliasing queries. This allows us to defer building indices/maintaining index consistency until after the "buildup" phase is done. 2. Properly memoize/dynamic program the memory locations lookups. 3. Done naively, setting wildcards invalidates the above memoization, trigger costly recomputation. So I added a cache-aware `setWildcards`. Sadly that means you need alias analysis to reach into the guts of memorydag, but the speedup is worth it. Sadly, these changes are kind of coupled for correctness reasons, so they're all here at once. I used this model (thanks IlyaOvodov) as a provisional benchmark. You can get it here: https://www.dropbox.com/s/jlyygn6yygj1jkx/yolov3.zip. Unzip at run `python test_timing.py`. Baseline: (752.076s) right before 6bc8ffe82462c77ac4f9b27452046cb1f8f07d92 After optimizing before inlining: (699.593s) After deferring cache construction: (426.180s) After cache-aware `setWildcards`: (193.678s) So a nice 75% speedup to overall compilation. There's a lot more to do in other places of the compilation pipeline though. Followup to this PR specifically: Everything that fans out from the `analyze` call is the "buildup" phase of AliasDB construction. This should be factored into a separate analysis pass to statically distinguish the two phases (right now we just null out stuff to accomplish the same thing dynamically). Test Plan: Imported from OSS Differential Revision: D20952727 Pulled By: suo fbshipit-source-id: 099f797222d7e71e5c04991584adc2c7eab5a70f	2020-04-30 18:27:41 -07:00
Elias Ellison	c516f84525	[JIT] Add Lower Tuples Call & Run remove mutation after list unrolling (#36829 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36829 This changes the IR complexity from the previous PR for the following tests: ``` ('Name', 'Ifs/Loops', 'non-tensor ops') Before: ('max_unpool1d', 0, 3) After: ('max_unpool1d', 0, 0) Before: ('max_unpool2d', 0, 3) After: ('max_unpool2d', 0, 0) Before: ('max_unpool3d', 0, 4) After: ('max_unpool3d', 0, 0) Before: ('adaptive_max_pool2d', 0, 3) After: ('adaptive_max_pool2d', 0, 0) Before: ('adaptive_max_pool3d', 0, 4) After: ('adaptive_max_pool3d', 0, 0) Before: ('adaptive_avg_pool2d', 0, 3) After: ('adaptive_avg_pool2d', 0, 0) Before: ('adaptive_avg_pool3d', 0, 4) After: ('adaptive_avg_pool3d', 0, 0) Before: ('upsample', 13, 68) After: ('upsample', 4, 28) Before: ('upsample', 13, 68) After: ('upsample', 0, 5) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 13, 67) After: ('interpolate', 4, 27) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 13, 67) After: ('interpolate', 4, 27) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 13, 67) After: ('interpolate', 4, 27) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 13, 67) After: ('interpolate', 4, 27) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 13, 57) After: ('interpolate', 4, 21) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 13, 57) After: ('interpolate', 4, 21) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 13, 57) After: ('interpolate', 4, 21) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 13, 77) After: ('interpolate', 4, 33) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 13, 77) After: ('interpolate', 4, 33) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 13, 77) After: ('interpolate', 4, 33) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 14, 68) After: ('interpolate', 0, 4) Before: ('interpolate', 15, 103) After: ('interpolate', 1, 23) Before: ('interpolate', 14, 70) After: ('interpolate', 0, 6) Before: ('interpolate', 15, 103) After: ('interpolate', 1, 21) Before: ('interpolate', 14, 70) After: ('interpolate', 0, 6) Before: ('interpolate', 15, 91) After: ('interpolate', 1, 13) Before: ('interpolate', 14, 59) After: ('interpolate', 0, 3) Before: ('interpolate', 15, 93) After: ('interpolate', 1, 16) Before: ('interpolate', 14, 61) After: ('interpolate', 0, 5) Before: ('interpolate', 15, 111) After: ('interpolate', 1, 28) Before: ('interpolate', 14, 77) After: ('interpolate', 0, 5) Before: ('interpolate', 15, 113) After: ('interpolate', 1, 27) Before: ('interpolate', 14, 79) After: ('interpolate', 0, 7) Before: ('test_nn_AdaptiveMaxPool2d_single', 0, 3) After: ('test_nn_AdaptiveMaxPool2d_single', 0, 0) Before: ('test_nn_AdaptiveMaxPool2d_tuple', 0, 3) After: ('test_nn_AdaptiveMaxPool2d_tuple', 0, 0) Before: ('test_nn_AdaptiveMaxPool3d_single', 0, 4) After: ('test_nn_AdaptiveMaxPool3d_single', 0, 0) Before: ('test_nn_AdaptiveMaxPool3d_tuple', 0, 4) After: ('test_nn_AdaptiveMaxPool3d_tuple', 0, 0) Before: ('test_nn_AdaptiveMaxPool3d_single_nonatomic', 0, 4) After: ('test_nn_AdaptiveMaxPool3d_single_nonatomic', 0, 0) Before: ('test_nn_AdaptiveMaxPool3d_tuple_nonatomic', 0, 4) After: ('test_nn_AdaptiveMaxPool3d_tuple_nonatomic', 0, 0) Before: ('test_nn_AdaptiveAvgPool2d_single', 0, 3) After: ('test_nn_AdaptiveAvgPool2d_single', 0, 0) Before: ('test_nn_AdaptiveAvgPool2d_single_1x1output', 0, 3) After: ('test_nn_AdaptiveAvgPool2d_single_1x1output', 0, 0) Before: ('test_nn_AdaptiveAvgPool2d_tuple', 0, 3) After: ('test_nn_AdaptiveAvgPool2d_tuple', 0, 0) Before: ('test_nn_AdaptiveAvgPool3d_single', 0, 4) After: ('test_nn_AdaptiveAvgPool3d_single', 0, 0) Before: ('test_nn_AdaptiveAvgPool3d_tuple', 0, 4) After: ('test_nn_AdaptiveAvgPool3d_tuple', 0, 0) ``` Test Plan: Imported from OSS Differential Revision: D21160758 Pulled By: eellison fbshipit-source-id: 68ccbf3af74398e8dbad7e6bedb639635dafdb2e	2020-04-28 23:28:02 -07:00
Elias Ellison	4f3af09162	[JIT] Incremental updates to Alias Db in Mutation Remover pass (#35421 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35421 This PR makes it so that we don't have to rebuild the entire alias db each time we remove a node in alias analysis. Test Plan: Imported from OSS Differential Revision: D20922470 Pulled By: eellison fbshipit-source-id: 9f43ed6dc743bf8a6b84a4aa38cff7059d46741d	2020-04-08 15:00:44 -07:00
Elias Ellison	4db87f4f97	[JIT] Allow mutated values as functional graph inputs (#33297 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33297 Allowing mutated values as inputs but not outputs has the effect of buffering up all mutated values as inputs to the graph. Just as we values which escape scope as graph inputs but not graph outputs - we should also allow values that get mutated. In both cases, the contract is that that the functional graph cannot write to graph inputs. Without this patch, if there is a single write to the Tensor wildcard set it would disable all optimization. Test Plan: Imported from OSS Differential Revision: D20607175 Pulled By: eellison fbshipit-source-id: c698e7cf3374e501cd5d835663991026a113ec6b	2020-04-08 14:59:26 -07:00
Meghan Lele	6384c2d81b	[JIT] clang-format JIT code (#35115 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35115 This commit runs the newly added tools/clang_format.py on the JIT codebase and includes all of the formatting changes thus produced. Testing: Ran the script, CI. Test Plan: Imported from OSS Reviewed By: eellison Differential Revision: D20568523 Pulled By: SplitInfinity fbshipit-source-id: e09bdb982ccf090eecfb7c7b461b8d0681eef82b	2020-03-26 11:24:51 -07:00
Elias Ellison	f090031e69	[JIT] remove list appends (#33199 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33199 Remove list appends when we can match them with a list construction. This helps create a larger functional graph Test Plan: Imported from OSS Differential Revision: D20603187 Pulled By: eellison fbshipit-source-id: a60e933b457479d40960994d8ffdf39ef49eaf6e	2020-03-24 23:46:03 -07:00
Elias Ellison	aab4beb87f	[JIT] Pass To Safely Remove Aten Inplace Ops (#33186 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33186 This helps create larger functional graphs. It has the potential to increase memory use, so in order to land this on by default we would probably also do a reuse of buffers pass. This is currently O(n * \| Removed Nodes \| ) because we have to rebuild the alias Db each time we make a change. This pass is critical to creating functional graphs, so this might be a compelling use case to build incremental updates to alias Db. Test Plan: Imported from OSS Differential Revision: D20603189 Pulled By: eellison fbshipit-source-id: 105db52bf38e02188ca6df6d36294466d3309a0a	2020-03-24 23:45:58 -07:00
Elias Ellison	5b2f8cef08	[JIT] Functional Graph Pass (#33020 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33020 This is a pass to create functional blocks. The other PRs in the stack help avoid some of the limitations that are are often found in graphs. It's possible that this would work well with a graph that is frozen. Follow up work items that will help this pass: - We don't currently have any capacity in alias analysis to tell whether a Value that came from the wildcard set "re-escapes" back into the wildcard set. - More comments on the semantics of the graph and correctness conditions - We could consider using dynamic dag if the perf of this is a limitation. - potential make Functional Graphs Functional Blocks instead, so that we do not repeatedly copy constants, also to make IR read easier. Test Plan: Imported from OSS Differential Revision: D20603188 Pulled By: eellison fbshipit-source-id: 6822a6e65f4cc2676f8f6445fe8aa1cb858ebeeb	2020-03-24 23:44:18 -07:00

17 Commits