Summary:
…done once
This allows the no-op build to work correctly even when BUILD_CAFFE2_OPS is on.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14982
Differential Revision: D13413960
Pulled By: zdevito
fbshipit-source-id: 6e5412a8c375af8a47c76f548cdd31cff15f3853
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14269
Removes the reference to Context proper and instead adds a bool argument for async copy (the same as `copy_`).
For CopyFrom I haven't tweaked all call sites yet. Instead I rely on a terrible hack: a pointer to a context is implicitly converted to bool when passed, haha :) It's not good code, and I propose to fix it in a follow-up diff (maybe using clangr tooling).
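As a rough illustration, here is a minimal standalone sketch of the new call shape and of the implicit conversion the hack relies on (the types below are stand-ins, not the real Caffe2 classes):
```
#include <iostream>

// Stand-in types; not the real Caffe2 Tensor/Context.
struct Context {};

struct Tensor {
  // New-style CopyFrom: an explicit flag selects an async copy,
  // following the same convention as `copy_`.
  void CopyFrom(const Tensor& /*src*/, bool async = false) {
    std::cout << "CopyFrom(async=" << std::boolalpha << async << ")\n";
  }
};

int main() {
  Tensor src, dst;
  Context ctx;
  dst.CopyFrom(src, /*async=*/true);   // intended new usage
  // An un-migrated call site that still passes a context pointer compiles
  // anyway, because the non-null pointer implicitly converts to `true`;
  // this is the temporary hack mentioned above.
  dst.CopyFrom(src, &ctx);
  return 0;
}
```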
Reviewed By: ezyang
Differential Revision: D13117981
fbshipit-source-id: 7cb1dc2ba6a4c50ac26614f45ab8318ea96e3138
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12729
This may have a dependency on D10380678 if size_from_dim(0)
was required because numel() used to return -1 in some cases.
This is no longer true.
Reviewed By: li-roy, dzhulgakov
Differential Revision: D10415069
fbshipit-source-id: 39f46f56249ecaf3533f62a0205b3a45d519d789
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10261
1. Reserve
Currently, Reserve allocates new memory and also preserves the old data in the tensor,
and Resize relies on this behavior at some call sites, e.g. https://github.com/pytorch/pytorch/blob/master/caffe2/operators/reservoir_sampling.cc#L103, where we should be using Extend.
We want to bring the semantics of Reserve more in line with std::vector, i.e. make it purely a memory-allocation optimization and drop the semantics of preserving the data. We'll remove the guarantee that data is preserved after Reserve, and Extend will be the only API that preserves old data when we extend memory in place (see the std::vector sketch after the usage example below). This also helps with the later refactoring that splits Storage from Tensor.
Also, we'll only pass the outer dimension to Reserve, which means the later dimensions should be set before we call Reserve.
2. Extend/Shrink
Previously, Extend actually meant ExtendBy and Shrink meant ShrinkTo. I would like to add an ExtendTo for convenience, and change Shrink to ShrinkTo.
The old Extend function is still there; although it actually means ExtendBy, I think it still makes sense to keep it.
3. Usage Patterns
The expected usage pattern right now is:
```
t->Resize({0, 32, 32, 32});
t->template mutable_data<T>(); // set meta_
t->Reserve(100);
auto* t_data = t->template mutable_data<T>();
// feed data to tensor using t_data
for (int i = 0; i < 100; ++i) {
  t->Extend(1, 50, &context_);
  // you can continue to use t_data if you have reserved enough space
  // otherwise, you should call t->template mutable_data<T> again to
  // get the new data pointer since Extend will allocate new memory even
  // though the original data is preserved.
}
```
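For comparison, here is the std::vector pattern that the new Reserve/Extend split is modeled on (an analogy only, not Caffe2 code; note that, unlike the new Reserve, std::vector::reserve does keep existing elements):
```
#include <cassert>
#include <vector>

int main() {
  std::vector<float> v;
  v.reserve(100);               // allocation-only optimization, like Reserve
  assert(v.size() == 0);        // nothing materialized yet
  for (int i = 0; i < 100; ++i) {
    v.push_back(float(i));      // growth that keeps old data, like Extend
  }
  assert(v.size() == 100);
  assert(v.capacity() >= 100);  // no reallocation happened inside the loop
  return 0;
}
```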
Reviewed By: ezyang
Differential Revision: D9128147
fbshipit-source-id: e765f6566d73deafe2abeef0b2cc0ebcbfebd096
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10173
With D9024330, the `Extend` function is no longer a template, which makes
the `template` keyword here invalid. For some reason the current version of LLVM
doesn't catch this, but the latest one does.
Reviewed By: jerryzh168
Differential Revision: D9133462
fbshipit-source-id: 54ac9aad01f81b9b4e7b6e2864b8961478d2d860
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9939
Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13
Pull Request resolved: https://github.com/pytorch/translate/pull/166
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125
Closes https://github.com/pytorch/pytorch/pull/9125
Use inheritance for polymorphism, and remove the template parameter.
This changes the templating at call sites; the core implementations will change later.
Previously, the Caffe2 Tensor class was fixed at compile time to bind to a particular device/context. With this change, we make the device a runtime property (stored inside the tensor) while preserving the same semantics. For example, one still has to specify a device type in order to create a Tensor - there are no uninitialized tensors. More specifically, the changes are:
1. We added an extra *DeviceType* argument to most of the Tensor constructors, e.g. Tensor(DeviceType type) (see the sketch below).
2. The semantics of the constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context) have changed: the second context is passed in so that we can call the templated Copy function. Previously it could be in a different context than the source and target; now we enforce that the context, if provided, has the same device type as src.
3. To preserve the 'get-or-construct' semantics of Blob, we added a specialized getter, Blob::GetMutableTensor, that verifies both that the Blob contains a Tensor and that it is of the correct type.
4. The Tensor type is no longer default-constructible (as we don't have unknown-device tensors), so some of the code handling STL containers needs to change.
Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s.
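A minimal mock of the new shape (an illustration only; the enum values and method names are simplified stand-ins for the Caffe2 ones):
```
#include <cassert>

// Simplified stand-ins, not the real Caffe2 types.
enum class DeviceType { CPU, CUDA };

class Tensor {
 public:
  Tensor() = delete;  // no default constructor: there are no unknown-device tensors
  explicit Tensor(DeviceType type) : type_(type) {}
  DeviceType GetDeviceType() const { return type_; }

 private:
  DeviceType type_;  // the device is now a runtime property stored in the tensor
};

int main() {
  Tensor cpu_tensor(DeviceType::CPU);   // device type is required at construction
  Tensor gpu_tensor(DeviceType::CUDA);
  assert(cpu_tensor.GetDeviceType() == DeviceType::CPU);
  assert(gpu_tensor.GetDeviceType() == DeviceType::CUDA);
  // Tensor t;  // would not compile: deleted default constructor (item 4)
  return 0;
}
```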
Reviewed By: ezyang, houseroad
Differential Revision: D9024330
fbshipit-source-id: e0b8295d2dc6ebe2963383ded5af799ad17164ba
Summary:
Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13
Pull Request resolved: https://github.com/pytorch/translate/pull/166
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125
Closes https://github.com/pytorch/pytorch/pull/9125
Use inheritance for polymorphism, and remove the template parameter.
This changes the templating at call sites; the core implementations will change later.
Previously, the Caffe2 Tensor class was fixed at compile time to bind to a particular device/context. With this change, we make the device a runtime property (stored inside the tensor) while preserving the same semantics. For example, one still has to specify a device type in order to create a Tensor - there are no uninitialized tensors. More specifically, the changes are:
1. We added an extra *DeviceType* argument to most of the Tensor constructors, e.g. Tensor(DeviceType type).
2. The semantics of the constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context) have changed: the second context is passed in so that we can call the templated Copy function. Previously it could be in a different context than the source and target; now we enforce that the context, if provided, has the same device type as src.
3. To preserve the 'get-or-construct' semantics of Blob, we added a specialized getter, Blob::GetMutableTensor, that verifies both that the Blob contains a Tensor and that it is of the correct type.
4. The Tensor type is no longer default-constructible (as we don't have unknown-device tensors), so some of the code handling STL containers needs to change.
Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s.
Reviewed By: xw285cornell
Differential Revision: D8121878
fbshipit-source-id: 4a5e9a677ba4ac82095df959851a054c81eccf81
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9520
Add a random data filler to the predictor benchmark to support production nets.
Reviewed By: salexspb
Differential Revision: D8712757
fbshipit-source-id: 2c732b2ba71ab210f9222adf94d08442ca71dc03
Summary: Add a check that, every time we register a Caffe2 operator for CPU or GPU, documentation is added for that operator.
Reviewed By: dzhulgakov
Differential Revision: D5443110
fbshipit-source-id: 3793c3d29bea1228078cb30bdf8243ac0ab90664
Summary: This uses `clang-tidy` to comment out unused parameters (in functions, methods and lambdas) in fbcode. Cases that the tool failed to handle are fixed manually.
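The rewrite the tool applies looks roughly like this (an illustrative example, not code from the diff):
```
#include <iostream>

// Before the cleanup this would read
//   void logSize(int size, int verbose) { ... }
// with `verbose` unused and triggering -Wunused-parameter. The tool keeps
// the signature but comments out the unused parameter name:
void logSize(int size, int /*verbose*/) {
  std::cout << "size = " << size << "\n";
}

int main() {
  logSize(42, 1);
  return 0;
}
```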
Reviewed By: igorsugak
Differential Revision: D5454343
fbshipit-source-id: 5dee339b4334e25e963891b519a5aa81fbf627b2
Summary: The net construct bench was using an old version of the data_parallel_model API.
Reviewed By: bddppq
Differential Revision: D5453281
Tags: easy
fbshipit-source-id: 93e1ba58511c7b25235ee50d9862fd0614b344c9
Summary: Add a lint rule to check that, every time we register a Caffe2 operator for CPU or GPU, documentation is added for that operator.
Reviewed By: dzhulgakov
Differential Revision: D5348078
fbshipit-source-id: c3fa22fc7ca8066d5fc8fa780b23d7867fd3380e
Summary: Based on the benchmark script at `caffe2/experiments/python/device_reduce_sum_bench.py`, device reduce sum is slower for N <= 10000, so we only switch to device reduce for large N in SumElements. This diff applies the same scheme to SumSqrElements.
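The resulting dispatch is roughly the following (a host-side sketch with made-up helper names, not the actual CUDA kernels):
```
#include <cstddef>
#include <iostream>
#include <vector>

// Host-side stand-ins for the two reduction paths; in the real op these are
// a block-level kernel (small N) and a cub::DeviceReduce-based path (large N).
float BlockReduceSumSqr(const std::vector<float>& x) {
  float sum = 0.0f;
  for (float v : x) sum += v * v;
  return sum;
}

float DeviceReduceSumSqr(const std::vector<float>& x) {
  float sum = 0.0f;
  for (float v : x) sum += v * v;
  return sum;
}

constexpr std::size_t kDeviceReduceThreshold = 10000;  // cutoff from the benchmark

float SumSqrElements(const std::vector<float>& x) {
  // Only switch to device reduce when N is large enough for it to win.
  return x.size() > kDeviceReduceThreshold ? DeviceReduceSumSqr(x)
                                           : BlockReduceSumSqr(x);
}

int main() {
  std::vector<float> x(100, 2.0f);
  std::cout << SumSqrElements(x) << "\n";  // prints 400
  return 0;
}
```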
Reviewed By: jamesr66a
Differential Revision: D5369868
fbshipit-source-id: ae13a611aff9d3464d1c4950ee155c740a2da339
Summary: Port SumElements and softmax_ops.cu to use device reduce sum
Reviewed By: akyrola
Differential Revision: D5351881
fbshipit-source-id: ca9604186c261ffcb1480da2a17baab8a4809372
Summary:
Migrate the experiments folder to the fb/sparse folder. Keep FunHashOp and SparseFunHashOp because they are now assumed to be default Ops in depr. What I did:
# Migrate FunHashOp and SparseFunHashOp and their unit tests to core caffe2, and make sure the tests pass.
# Migrate the other Ops in the experiments folder to the fb/sparse folder, write new TARGETS files for them, and make sure the tests pass.
# Make sure all related tests pass.
# Also fix the MKL definition, and make sure that FC_Sparse is not compiled when there is no MKL support.
Reviewed By: salexspb
Differential Revision: D4952993
fbshipit-source-id: 86c03676ab4e47f04d2d0dd438a4a1c849bbbff0
Summary:
Currently, build_sgd is in a Facebook-specific directory. We need to move it to the python directory so that
the open source world can use it.
Reviewed By: salexspb
Differential Revision: D4547016
fbshipit-source-id: d699b7b1ab8051afdeadedb4d247ec2a04a7a3e7
Summary:
I have noticed that constructing the Xray model takes quite a while. To measure this, I wrote a benchmark script that creates a ResNet-50 model on 8 GPUs. This takes about 95 secs -- which is kind of annoying when you want to quickly debug stuff.
Profiling (using Python's cProfile), I was able to see that most of the time is spent in net.BlobIsDefined(), which does a linear search over external inputs and operator outputs, so it gets slower and slower with large nets. This can be fully optimized by keeping a separate lookup table of operator inputs and outputs (and external inputs and outputs). It is a bit annoying to maintain this separate data structure, but I set up the unit tests to ensure things work correctly across Clones.
After the optimization, the net construction drops from 95 secs to 8.2 secs!
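The underlying fix is just trading the linear scan for a lookup table that is kept in sync as operators are added. A generic sketch of the idea (the actual change is in the Python Net class, so all names here are illustrative):
```
#include <cassert>
#include <string>
#include <unordered_set>
#include <vector>

// Generic sketch: keep a lookup table alongside the operator list so that
// "is this blob defined?" becomes an O(1) set lookup instead of a linear
// search over all operator outputs and external inputs.
class NetSketch {
 public:
  void AddExternalInput(const std::string& name) { defined_blobs_.insert(name); }

  void AddOp(const std::vector<std::string>& inputs,
             const std::vector<std::string>& outputs) {
    ops_.push_back({inputs, outputs});
    // The table must be maintained incrementally and kept consistent across
    // operations such as Clone, which is what the unit tests guard against.
    for (const auto& out : outputs) defined_blobs_.insert(out);
  }

  bool BlobIsDefined(const std::string& name) const {
    return defined_blobs_.count(name) > 0;
  }

 private:
  struct Op {
    std::vector<std::string> inputs;
    std::vector<std::string> outputs;
  };
  std::vector<Op> ops_;
  std::unordered_set<std::string> defined_blobs_;
};

int main() {
  NetSketch net;
  net.AddExternalInput("data");
  net.AddOp({"data", "w"}, {"fc1"});
  assert(net.BlobIsDefined("data"));
  assert(net.BlobIsDefined("fc1"));
  assert(!net.BlobIsDefined("fc2"));
  return 0;
}
```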
Reviewed By: azzolini
Differential Revision: D4288307
fbshipit-source-id: 0bb82c8bde9d86a2702b298f4aa706cba509346e