Summary:
This is an automatic change generated by the following script:
```
#!/usr/bin/env python3
import json
import os
from subprocess import check_call, check_output

def get_compiled_files_list():
    # Recover source paths from compile_commands.json, stripping the
    # build/<name>.DEFAULT.cpp indirection used for generated sources.
    with open("build/compile_commands.json") as f:
        data = json.load(f)
    files = [os.path.relpath(node['file']) for node in data]
    for idx, fname in enumerate(files):
        if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'):
            files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')]
    return files

def run_clang_tidy(fname):
    # Run clang-tidy on a single file and commit any resulting NOLINT stubs.
    check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname, "-s"])
    changes = check_output(["git", "ls-files", "-m"])
    if len(changes) == 0:
        return
    check_call(["git", "commit", "--all", "-m", f"NOLINT stubs for {fname}"])

def main():
    git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n")
    compiled_files = get_compiled_files_list()
    for idx, fname in enumerate(git_files):
        if fname not in compiled_files:
            continue
        if fname.startswith("caffe2/contrib/aten/"):
            continue
        print(f"[{idx}/{len(git_files)}] Processing {fname}")
        run_clang_tidy(fname)

if __name__ == "__main__":
    main()
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892
Reviewed By: H-Huang
Differential Revision: D27991944
Pulled By: malfet
fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55725
We were previously checking m_last_error on the miniz struct directly,
which fails to preserve internal invariants and can leave the reader
broken in specific situations (reading a non-existent file).
Using the provided error-checking API fixes this.
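For illustration, a minimal sketch of the distinction using miniz's public calls (the PyTorchStreamReader plumbing around it is omitted):
```
#include <cstdio>
#include "miniz.h"

// Look up a record by name, reporting failure through the error-checking
// API instead of poking at archive->m_last_error directly.
bool has_record(mz_zip_archive* archive, const char* name) {
  int index = mz_zip_reader_locate_file(archive, name, /*pComment=*/nullptr, /*flags=*/0);
  if (index < 0) {
    mz_zip_error err = mz_zip_get_last_error(archive);
    std::fprintf(stderr, "lookup failed: %s\n", mz_zip_get_error_string(err));
    return false;
  }
  return true;
}
```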
Differential Revision: D27693105
Test Plan: Imported from OSS
Reviewed By: SplitInfinity
Pulled By: suo
fbshipit-source-id: 20c520bb1d590fb75751bca1e970df4f2b7eb043
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51754
This API allows you to manage multiple Python interpreters in a single
process to deploy PyTorch models packaged with torch.package.
torch/csrc/deploy/deploy.h contains the API definition;
torch/csrc/deploy/test_deploy.cpp has some examples.
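A usage sketch, with names taken from the description above and from test_deploy.cpp; treat the exact signatures as assumptions:
```
#include <torch/csrc/deploy/deploy.h>
#include <torch/torch.h>

int main() {
  // Pool of independent Python interpreters living in this process.
  torch::deploy::InterpreterManager manager(/*n_interpreters=*/2);
  // Load a model packaged with torch.package; any interpreter in the
  // pool can service calls on it.
  torch::deploy::Package package = manager.load_package("resnet.pt");
  auto model = package.load_pickle("model", "model.pkl");
  at::Tensor input = torch::ones({1, 3, 224, 224});
  at::Tensor output = model({input}).toTensor();
  return output.numel() == 0;  // trivial use of the result
}
```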
Notes:
* A mutex is added to PyTorchStreamReader to make it safe to use from multiple threads at once.
* USE_DEPLOY is only true for the special libtorch_deployinterpreter.so library; when enabled,
  we use a hash table to maintain the PyObject <-> at::Tensor mapping rather than the internal
  pointer in Tensor, since more than one interpreter may have a reference to the tensor.
* serialization.py has some additional functions for creating pickle objects
  while keeping storages in memory, for use in transferring tensors between interpreters.
Test Plan: Imported from OSS
Reviewed By: wconstab
Differential Revision: D26329468
Pulled By: zdevito
fbshipit-source-id: d75f4ebb9a27f1d911179d9996041bcb3ca04a07
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51826
Looks like this:
```
resnet.pt
├── .data  # Data folder named so it can't clash with torch.package code modules.
│   │      # Names/extensions automatically added to avoid naming conflicts.
│   ├── 94286146172688.storage  # tensor data
│   ├── 94286146172784.storage
│   ├── extern_modules  # torch.package metadata
│   ├── version  # version metadata
│   └── ...
├── model  # package pickled model created w/
│   │      # exporter.save_pickle('model', 'model.pkl', resnet_model)
│   └── model.pkl
└── torchvision  # all code dependencies for the packaged pickled
    └── models   # models are captured as source files
        ├── resnet.py
        └── utils.py
```
Since `version` is hardcoded in our zip reader/writer implementation,
add it as an option that defaults to "version" but accepts other
locations for putting the version metadata.
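A minimal sketch of the shape of that option (the class and parameter names here are hypothetical, not the real writer API):
```
#include <string>
#include <utility>

// Hypothetical writer: the record that holds version metadata defaults to
// "version" but can be placed elsewhere, e.g. ".data/version" above.
class ArchiveWriter {
 public:
  explicit ArchiveWriter(std::string version_record = "version")
      : version_record_(std::move(version_record)) {}
  const std::string& versionRecord() const { return version_record_; }

 private:
  std::string version_record_;
};
```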
Test Plan: Imported from OSS
Reviewed By: zdevito
Differential Revision: D26295649
Pulled By: suo
fbshipit-source-id: 2d75feeb7de0f78196b4d0b6e2b814a7d58bd1dd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48802
The current torch::jit::load API only supports a unique_ptr ReadAdapterInterface input, but in some cases torch::jit::load may not be the only consumer of the reader adapter. This diff adds an overload of torch::jit::load that can load from a shared_ptr PyTorchStreamReader.
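A usage sketch under the assumption that the new overload accepts the reader by shared_ptr, as described:
```
#include <caffe2/serialize/inline_container.h>
#include <torch/script.h>
#include <memory>
#include <string>

void load_and_keep_reader(const std::string& path) {
  auto reader =
      std::make_shared<caffe2::serialize::PyTorchStreamReader>(path);
  // torch::jit::load no longer consumes the reader outright...
  torch::jit::Module module = torch::jit::load(reader);
  // ...so other consumers can keep using the same reader afterwards.
  bool has_version = reader->hasRecord("version");
  (void)module;
  (void)has_version;
}
```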
Reviewed By: malfet, houseroad
Differential Revision: D25241904
fbshipit-source-id: aa403bac9ed820cc0e94342aebfe524a1d5bf913
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44266
If PyTorchStreamWriter is writing to a file in a non-existent path, it throws an exception. During unwinding, the destructor calls writeEndOfFile() and throws again. To avoid this double exception, a check-and-throw is added in the constructor. In that case the destructor is never called, and the exception can propagate through the unwinding.
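A simplified sketch of the pattern (not the real writer, just the constructor/destructor interaction described above):
```
#include <fstream>
#include <stdexcept>
#include <string>

class Writer {
 public:
  explicit Writer(const std::string& path) : out_(path, std::ios::binary) {
    // Throwing here means the object never finishes construction, so the
    // destructor below cannot run (and throw again) during unwinding.
    if (!out_) {
      throw std::runtime_error("failed to open file: " + path);
    }
  }
  ~Writer() {
    writeEndOfFile();  // only reached for successfully constructed writers
  }

 private:
  void writeEndOfFile() { out_.flush(); }
  std::ofstream out_;
};
```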
Test Plan: python test/test_jit.py TestSaveLoad.test_save_nonexit_file
Reviewed By: dreiss
Differential Revision: D23560770
Pulled By: iseeyuan
fbshipit-source-id: 51b24403500bdab3578c7fd5e017780467a5d06a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43531
It's useful for building some tooling out of tree to manipulate zip files in a PyTorch-y way.
Test Plan: contbuild
Reviewed By: houseroad
Differential Revision: D23277361
fbshipit-source-id: e15fad20e792d1e41018d32fd48295cfe74bea8c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43317
The previous version returned the path with a prefix, so subsequent `getRecord` calls would fail.
There's only one place in the PyTorch codebase that uses this function (introduced in https://github.com/pytorch/pytorch/pull/29339) and it's unlikely that anyone else is using it - it's not a public API anyway.
Test Plan: unittest
Reviewed By: houseroad
Differential Revision: D23235241
fbshipit-source-id: 6f7363e6981623aa96320f5e39c54e65d716240b
Summary:
This code path is used to read tensor bodies, so we need it to respect
alignment and padding requirements.
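For reference, a sketch of the alignment arithmetic involved (PyTorch's archives pad record data to 64-byte boundaries, per the container-format notes further down):
```
#include <cstdint>

constexpr uint64_t kFieldAlignment = 64;

// Round an archive offset up to the next aligned boundary, so tensor
// bodies can be mmap'd and read with their padding respected.
uint64_t alignedOffset(uint64_t offset) {
  return (offset + kFieldAlignment - 1) / kFieldAlignment * kFieldAlignment;
}
```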
Test Plan: Ran an internal test that was failing.
Reviewed By: zdevito
Differential Revision: D22225622
fbshipit-source-id: f2126727f96616366850642045ab9704f3885824
Summary:
BC NOTE:
This change makes it so modules saved with torch.jit.save in PyTorch 1.6 can be loaded by previous versions of PyTorch unless they use torch.div or (soon) torch.full. It also lets tensors saved using torch.save be loaded by previous versions. So this is the opposite of BC-breaking, but I'm using that label to highlight this issue since we don't have a "BC-improving" label.
PR NOTE:
When an operator's semantics change in PyTorch we want to do two things:
1) Preserve the semantics of older serialized Torchscript programs that use the operator
2) Ensure the new semantics are respected
Historically, this meant writing a Versioned Symbol that would remap older versions of the operator into current PyTorch code (1), and bumping the produced file format version (2). Unfortunately, bumping the produced file format version is a nuclear option for ensuring semantics are respected, since it also prevents older versions of PyTorch from loading anything (even tensors!) from newer versions.
Dynamic versioning addresses the nuclear consequences of bumping the produced file format version by only bumping it when necessary. That is, when an operator with changed semantics is detected in the serialized Torchscript. This will prevent Torchscript programs that use the changed operator from loading on earlier versions of PyTorch, as desired, but will have no impact on programs that don't use the changed operator.
Note that this change is only applicable when using torch.jit.save and torch.jit.load. torch.save pickles the given object using pickle (by default), which saves a function's Python directly.
No new tests for this behavior are added since the existing tests for versioned division in test_save_load already validate that models with div are loaded correctly at version 4.
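A conceptual sketch of the dynamic-versioning decision (not the real implementation; the constants mirror the div-at-version-4 example above):
```
#include <algorithm>
#include <cstdint>
#include <set>
#include <string>

// Emit the lowest file-format version compatible with the program:
// only programs that use an operator with changed semantics get bumped.
uint64_t pickFileFormatVersion(const std::set<std::string>& ops_used) {
  uint64_t version = 3;  // baseline produced-file-format version
  if (ops_used.count("aten::div") != 0) {
    version = std::max<uint64_t>(version, 4);  // div semantics changed at v4
  }
  return version;
}
```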
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40279
Reviewed By: dzhulgakov
Differential Revision: D22168291
Pulled By: mruberry
fbshipit-source-id: e71d6380e727e25123c7eedf6d80e5d7f1fe9f95
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33243
If a file does not exist in an archive, PyTorchStreamReader throws an exception. However, when PyTorchStreamReader is destructed, another exception is thrown while processing the first exception. This double exception results in SIGABRT.
Thanks dreiss for catching this bug and suggesting the fix. It happened when he used _load_for_mobile to load a TorchScript file without a bytecode section. A unit test is added to cover this case.
Test Plan: Imported from OSS
Differential Revision: D19859205
Pulled By: iseeyuan
fbshipit-source-id: 8f96b6256f1a1f933fce1c256d64604c7e9269e4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30915
Since we now have C++14, we don't need these c10::guts helpers anymore
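For illustration, the kind of substitution this enables (the specific helpers removed are in the diff):
```
#include <memory>

// With C++14 available, the standard library provides what c10::guts
// previously had to polyfill, e.g.:
//   c10::guts::make_unique<T>(args...)  ->  std::make_unique<T>(args...)
int main() {
  auto buffer = std::make_unique<char[]>(64);
  buffer[0] = 'x';
  return 0;
}
```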
ghstack-source-id: 95777609
Test Plan: waitforsandcastle
Differential Revision: D18869639
fbshipit-source-id: 97716f932297c64c6e814410ac47b444c33d4e2e
Summary:
This PR looks for a `constants.pkl` file at the top level in a zip file
in `torch.load`. If found, it calls `torch.jit.load` instead and issues
a warning to call `torch.jit.load` directly.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29339
Differential Revision: D18611095
Pulled By: driazati
fbshipit-source-id: f070a02f6b5509054fc3876b3e8356bbbcc183e1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29425
This change saves roughly 5-6% in the TorchSaveSmallTensor benchmark
(torch::save() on a tensor with 64 random floats) by reusing the
padding string across records.
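A sketch of the idea, assuming the writer keeps the pad buffer as a member (names here are hypothetical):
```
#include <cstddef>
#include <string>

class RecordWriter {
 public:
  void appendPadding(std::string& out, size_t pad_len) {
    // Reuse one buffer across records: its capacity grows once to the
    // largest pad ever needed instead of allocating a fresh string per record.
    padding_.assign(pad_len, '\0');
    out.append(padding_);
  }

 private:
  std::string padding_;
};
```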
ghstack-source-id: 93517961
Test Plan:
Correctness: buck test mode/dev-nosan caffe2/test/...
Benchmark: buck build mode/opt experimental/jeremyl/c2/...
buck-out/opt/gen/experimental/jeremy/c2/SerializationBench
Differential Revision: D18385731
fbshipit-source-id: 20bcbe1efd2fb7e3012dd68080542f2a74a7d4f2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29351
When torch::save()ing a smallish tensor, we still spend ~5% of the time
in std::stringstream constructors.
This removes the last couple of cases. The benchmark shows ~5% improvement:
TorchSaveSmallTensor Pre: 13.12us
TorchSaveSmallTensor Post: 12.48us
ghstack-source-id: 93517928
Test Plan:
buck build mode/opt experimental/jeremyl/c2:
buck-out/opt/gen/experimental/jeremyl/c2/SerializationBench --bm_regex=TorchSaveSmallTensor
Differential Revision: D18365066
fbshipit-source-id: a3284bec004751cedae1cdadf27f969422faff8e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28230
This change improves the pickling small data benchmark by roughly 30%
(25.8usec -> 18.05usec).
One of the main issues was that we were spending 25%+ of the CPU profile
time in std::[o]stringstream constructors alone.
Two main parts:
- Change some std::stringstream to std::ostringstream when they
  showed up on hot-ish paths and it was trivial to convert them.
  Roughly 27% of the std::stringstream constructor time is spent
  building the constituent std::basic_istream. If the istream isn't
  needed, don't construct it.
- For a couple of very hot paths (e.g. Pickler::pushGlobal), just
  convert to traditional string::append(). std::ostringstream is
  convenient, but not particularly efficient.
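For illustration, the two conversions in miniature:
```
#include <sstream>
#include <string>

// (1) Write-only paths: std::ostringstream avoids constructing the unused
//     std::basic_istream half that std::stringstream carries.
std::string formatValue(int x) {
  std::ostringstream ss;
  ss << "value=" << x;
  return ss.str();
}

// (2) Very hot paths (e.g. Pickler::pushGlobal): plain string::append
//     skips stream machinery entirely.
std::string joinGlobal(const std::string& module, const std::string& name) {
  std::string out;
  out.append(module).append("\n").append(name).append("\n");
  return out;
}
```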
ghstack-source-id: 92153103
Test Plan:
Benchmarking: buck build mode/opt experimental/jeremyl/c2:SerializationBench
Correctness: buck test mode/dev-nosan caffe2/test/...
Differential Revision: D17982181
fbshipit-source-id: 7fd4d267293231244c10c1e5b8f4951a7a3d852f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28263
When looking at profiles of deserializing small data from torch::load(),
we found some straightforward string-related changes that in aggregate
improve the base time by 25%.
One of the main problems was over-use of std::stringstream - the
constructors alone were 18%+ of the time spent. This change improves
unpickling/deserializing by converting a handful of the hottest
use cases from the profiles:
- unpickler's readString() goes from 10.3% of time to mostly out of the picture
- the QualifiedName constructor (particularly the Join call) was 8.9% of time,
  but afterwards disappears from the profiles.
- getRecordID/hasRecord were ~5% each, but also get somewhat smaller.
ghstack-source-id: 92158727
Test Plan:
Benchmark in buck build mode/opt experimental/jeremyl/c2:SerializationBench
Correctness in buck test mode/dev-nosan caffe2/test/...
Differential Revision: D17997056
fbshipit-source-id: fc6d6c7da7557ff23c8e8c7dbe4c060abf860018
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28129
The previous PR in the stack removed the need to order classes/functions
or have correct import statements. This resolved circular dependency issues
that can arise when class constructors like ModuleList put new instances
of themselves in a common namespace.
This PR changes our export format to no longer produce this information.
By doing so we can make the logic significantly simpler, since we just
keep track of an individual PythonPrint object per file.
Notes:
* PythonPrint was changed to manage its own stream/list of ranges. It
  was doing this anyway internally; this just makes the API more clear.
* Since we are changing the serialization format, I also removed op_version_set.
  It is now replaced with the VERSION number that is written in the zip archive.
  This further simplifies the code emission process.
* A test of op_version_set was removed since there is no longer any behavior
  to test.
Test Plan: Imported from OSS
Differential Revision: D17961610
Pulled By: zdevito
fbshipit-source-id: ada362c4ca34d05393a1a7e799c94785ab9d9825
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28039
Right now, torch::save() uses std::ostream, which results in unnecessary
data copies in practice. Similar for torch::load().
Adding a std::function<size_t(const void*, size_t)> as an output option,
parallel to the existing filename and std::ostream APIs, gives users the
flexibility to emit directly to a backing store.
For a simple case of appending the output to a std::string, we observe
significant benchmark savings (on the order of -50%), even with the
minor std::function<> dispatch overhead. The main reason is that
std::ostringstream effectively requires 2 extra copies of the data
beyond a simple string.append lambda.
We also provide a parallel API for load(), though this one is
slightly more complex due to the need to do arbitrary-position reads.
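A usage sketch of the writer-function option, assuming the overload shape described above:
```
#include <torch/torch.h>
#include <string>

int main() {
  torch::Tensor t = torch::rand({64});
  std::string buffer;
  // Emit directly into a backing store via the std::function sink,
  // bypassing std::ostringstream's extra copies.
  torch::save(t, [&buffer](const void* data, size_t size) -> size_t {
    buffer.append(static_cast<const char*>(data), size);
    return size;
  });
  return buffer.empty();
}
```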
Test Plan:
buck test mode/dev-nosan caffe2/test/...
(Basic serialization test in caffe2/test/cpp/api/serialize.cpp)
Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443
(1M time goes from 90ms -> 40ms, albeit with crc patch applied)
Differential Revision: D17939034
fbshipit-source-id: 344cce46f74b6438cb638a8cfbeccf4e1aa882d7
Summary:
Right now, torch::save() uses std::ostream, which results in unnecessary
data copies in practice. Similar for torch::load().
Adding a std::function<size_t(const void*, size_t)> as an output option,
parallel to the existing filename and std::ostream APIs, gives users the
flexibility to emit directly to a backing store.
For a simple case of appending the output to a std::string, we observe
significant benchmark savings (on the order of -50%), even with the
minor std::function<> dispatch overhead. The main reason is that
std::ostringstream effectively requires 2 extra copies of the data
beyond a simple string.append lambda.
We also provide a parallel API for load(), though this one is
slightly more complex due to the need to do arbitrary-position reads.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27586
Test Plan:
buck test mode/dev-nosan caffe2/test/...
(Basic serialization test in caffe2/test/cpp/api/serialize.cpp)
Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443
(1M time goes from 90ms -> 40ms, albeit with crc patch applied)
Differential Revision: D17822962
Pulled By: jjlilley
fbshipit-source-id: d344a7e59707f3b30d42280fbab78f87399e4d10
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27006
Prompted by the lack of filename info in the error message for
torch.jit.save() to an unwritable location... looks like most
serialization code paths have upstream checks that include more
info, but adding the extra info here as well seemed simple and
cheap enough, so I went ahead and did it more generally.
Test Plan: Imported from OSS
Differential Revision: D17642686
Pulled By: bhosmer
fbshipit-source-id: 435ae4c8b86bd730157b3dde19882e031c6415cd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20040
Add end-to-end support for the feature store example in the fblearner PyTorch predictor.
Reviewed By: dzhulgakov
Differential Revision: D15177897
fbshipit-source-id: 0f6df8b064eb9844fc9ddae61e978d6574c22916
Summary:
Implementing a stream is very annoying, since it is tightly coupled to the underlying storage stream buffer.
So in this PR we add ReadAdapterInterface, and PyTorchStreamReader now uses it. We implement IStreamAdapter as a wrapper of std::istream and keep the user interface unchanged.
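A sketch of the seam this creates (simplified; the real interface lives in caffe2/serialize and may differ in detail):
```
#include <cstddef>
#include <cstdint>
#include <istream>

// The reader only needs a total size and random-access reads, so any
// storage can be plugged in behind this interface.
class ReadAdapterInterface {
 public:
  virtual size_t size() const = 0;
  virtual size_t read(uint64_t pos, void* buf, size_t n) const = 0;
  virtual ~ReadAdapterInterface() = default;
};

// IStreamAdapter-style wrapper keeping the std::istream path working.
class IStreamAdapter final : public ReadAdapterInterface {
 public:
  explicit IStreamAdapter(std::istream* in) : in_(in) {}
  size_t size() const override {
    auto pos = in_->tellg();
    in_->seekg(0, std::ios::end);
    size_t n = static_cast<size_t>(in_->tellg());
    in_->seekg(pos);
    return n;
  }
  size_t read(uint64_t pos, void* buf, size_t n) const override {
    in_->seekg(static_cast<std::streamoff>(pos));
    in_->read(static_cast<char*>(buf), static_cast<std::streamsize>(n));
    return static_cast<size_t>(in_->gcount());
  }

 private:
  std::istream* in_;
};
```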
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15551
Reviewed By: zrphercule
Differential Revision: D13568907
Pulled By: houseroad
fbshipit-source-id: 93708cb801248a6c101f35cb14d1631029365c3c
Summary:
After consulting with Owen, who pointed out the existence of the miniz library, I decided to take one last shot at using zip as our container format.
miniz makes this surprisingly feasible and I think the benefits of using zip are large enough that we should do it.
This replaces our custom container format with a zip archive, preserving all of the
desirable features of our custom format, such as append-oriented writing and
mmap'able tensor data, while adding a bunch of debugging advantages:
1. You can unzip and explore the container to debug what is going on with a model.
2. You can edit the model using a text editor (e.g. change the definition of a method,
or edit the json-serialized metadata), re-zip the file using OSX's native 'Compress'
option, and re-load the result into PyTorch. Note: this enables you to, e.g., print-debug
serialized models.
3. We can easily enable features like compression in the future.
4. Stock Python, without PyTorch installed, and other programming languages
can reasonably consume this format using the json and zipfile packages, which enables
people to build tools like visualizers without those visualizers depending on PyTorch.
This will be especially useful if you want to, for instance, write a visualizer in JavaScript.
Notes:
* This adds miniz (https://github.com/richgel999/miniz) as a dependency. miniz is a self-contained
library for reading/writing zipfiles that, unlike other zip libraries, also includes libz-compatible
compress/decompress support. It is a single header and a single C file without
any other dependencies. Note that the instructions for miniz explicitly state:
> Please use the files from the releases page in your projects. Do not use the git checkout directly!
So we have checked in the 'release' source. miniz supports zip64, and its API is amenable
to doing zip-align style things to align data.
* Removes 'size' from RecordRef. This allows you to edit files in the zip archive without
editing the meta-data file. Very important if you want to print-debug serialized models.
* PyTorchStreamReader/PyTorchStreamWriter keep mostly the same API (though keys become
strings). However, their implementation is completely swapped out to use miniz.
* Code exists to check for the old magic number to give a decent warning to our preview users
after we change the format.
* Container version information is now put in a stand-alone 'version' file in the archive
and serves a similar purpose to the other container version info.
* All files in the zip archive start at 64-byte boundaries, using an approach similar to
zip-align. Tests check that this property remains true. While the writer does this,
the reader doesn't depend on it, allowing user-created archives that can use compression,
and do not have to align data.
* Added test to check for > 4GB files and archives. Disabled by default because it takes
almost 2 minutes to run.
* torchscript files are now optional: if a submodule does not have methods, it will
not be written.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14521
Reviewed By: jamesr66a
Differential Revision: D13252945
Pulled By: zdevito
fbshipit-source-id: 01209294c0f6543d0fd716f85a38532249c52f8c