Commit Graph

15 Commits

Author SHA1 Message Date
444039c47b Bool tensor. Part 0: Boolean storage implementation (#16810)
Summary:
This is the first commit in a series of planned changes to add boolean tensors to PyTorch. The whole plan looks like this:

0. Storage Implementation (this change)
1. Tensor Creation.
2. Tensor Conversions.
3. Tensor Indexing.
4. Tensor Operations.
5. Back compatibility related changes.

This feature was requested by the community:
https://github.com/pytorch/pytorch/issues/4764
https://github.com/pytorch/pytorch/issues/4219
https://github.com/pytorch/pytorch/issues/4288

**Change**:
Added boolean type to the Storage class for CPU and CUDA backends.

**Tested via**:
1. unit tests
2. running this:
>>> import torch
>>> torch.BoolStorage
<class 'torch.BoolStorage'>
>>> torch.cuda.BoolStorage
<class 'torch.cuda.BoolStorage'>
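
As a slightly fuller sketch of what the new storage type supports (a hedged example, assuming a build that includes this change and that indexing and size() behave as on the other Storage types):

>>> s = torch.BoolStorage(4)  # uninitialized storage holding 4 booleans
>>> s[0] = True
>>> s.size()
4
>>> s[0]
True
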
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16810

Reviewed By: gchanan

Differential Revision: D14087246

Pulled By: izdeby

fbshipit-source-id: 042642ced1cb0fd1bb6bff05f9ca871a5c54ee5e
2019-02-19 08:22:13 -08:00
2d958b7f77 Storage.clone maintains original device (#14751)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/14673

As pointed out by vishwakftw, the root cause of the `deepcopy` issue was that `storage.clone()` would create a new storage on the default device.
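
A minimal repro sketch of the behavior being fixed (assuming a CUDA-enabled build; get_device() is the usual accessor on CUDA storages):

import copy
import torch

if torch.cuda.is_available():
    s = torch.randn(4, device="cuda:0").storage()
    # deepcopy goes through storage.clone(); before this fix the clone
    # landed on the default device rather than the original one.
    c = copy.deepcopy(s)
    assert c.get_device() == s.get_device()
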
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14751

Reviewed By: soumith

Differential Revision: D13323061

Pulled By: fmassa

fbshipit-source-id: bfe46ebd78f0b6cd9518c11d09de7849282ed2a2
2018-12-05 08:33:56 -08:00
eadc5071e8 Use torch.save in _StorageBase.__reduce__ (#9184)
Summary:
Previously this used the ``.tolist()`` method, which converted the
storage object into a list of Python objects, and then sent those to
pickle. For storage objects of non-trivial size, this was very slow.

Now we reuse the logic of the ``torch.save`` function to efficiently
turn the Storage object into bytes, and send those instead.  This
reduces the semantic information (it's harder to interpret the bytes)
but should be orders of magnitude more efficient when serializing data
with the pickle protocol or with ``copy``.
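
An illustrative sketch of the fast path this enables (the size is arbitrary; nothing here is specific to this PR beyond pickling a Storage):

import pickle
import torch

# Pickling a Storage now serializes its raw bytes via the torch.save
# machinery instead of materializing one Python object per element.
s = torch.FloatStorage(1000)
payload = pickle.dumps(s)
restored = pickle.loads(payload)
assert len(restored) == len(s)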

For future work it would be nice to develop a mechanism to get a buffer
of bytes out of a Storage object, and use that alongside the current
``from_buffer`` method.

See #9168 for context
Closes https://github.com/pytorch/pytorch/pull/9184

Differential Revision: D8747794

Pulled By: soumith

fbshipit-source-id: ac598e660c043788ed1ffab3d0303812886edf79
2018-07-06 07:24:53 -07:00
720c7b1e2c Move repeat to torch/_utils.py (#4712)
This moves the implementation of repeat to _utils so that the autograd
function can call it directly instead of relying on forward being called
on tensors.

This also removes _range, which was previously necessary because we
shadowed the built-in range() function.
2018-01-17 17:30:43 -05:00
9b31280ccf Have __sizeof__ account for size of stored elements (#3821)
* Have __sizeof__ account for size of stored elements

* Conform to sizeof specification
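
A minimal sketch of the intended behavior (assuming a FloatStorage, whose elements are 4 bytes each):

import sys
import torch

# sys.getsizeof dispatches to __sizeof__, which now includes the payload:
# 256 floats at 4 bytes each, on top of the Python object overhead.
s = torch.FloatStorage(256)
assert sys.getsizeof(s) >= 256 * 4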
2017-11-27 11:22:34 -05:00
24d92b5d9f Concatenate directly into shared memory when constructing batches (#1323)
This saves an extra memory copy, which speeds up data loading a bit
(5-10% with accimage).

As part of this change:

 * torch.cat accepts keyword argument out
 * specifying out=None is treated like not specifying out (see the sketch below)
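
A short sketch of the new keyword form (shapes are arbitrary):

import torch

chunks = [torch.ones(2, 3), torch.zeros(2, 3)]
out = torch.empty(4, 3)
torch.cat(chunks, 0, out=out)   # concatenates directly into `out`
torch.cat(chunks, 0, out=None)  # same as calling without `out`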
2017-04-22 03:40:30 -04:00
f17cfe4293 sparse tensor operations (#735) 2017-03-03 18:37:03 +01:00
6464e69e21 Docs for torch.Storage (#475) 2017-01-18 03:22:30 -05:00
24af02154c Use ForkingPickler for sharing tensor/storages across processes (#344)
This hooks into the (internal) ForkingPickler class in multiprocessing
to reduce tensors, storages, and CUDA events, instead of the queue we
adapted from joblib. This makes it easier to use the standard
multiprocessing classes in later versions of Python.

This also exposes:

 - Tensor/Storage.share_memory_()
 - Module.share_memory()

These methods move the CPU tensors and storages to shared memory. If
you're using the "fork" method of multiprocessing, these objects can be
directly inherited instead of serialized through a queue.
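
A minimal sketch of that fork-based sharing (the worker function and tensor size are made up for illustration):

import torch
import torch.multiprocessing as mp

def add_one(t):
    t += 1  # mutates shared memory, so the parent sees the change

if __name__ == "__main__":
    t = torch.zeros(3)
    t.share_memory_()  # move the underlying storage to shared memory
    p = mp.Process(target=add_one, args=(t,))
    p.start()
    p.join()
    print(t)  # tensor([1., 1., 1.])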
2016-12-28 20:34:23 -05:00
1af9a9637f Refactor copy and release GIL during copy (#286) 2016-12-11 21:54:58 +01:00
2fd78112ab Add half copy/conversions 2016-11-17 14:34:33 -08:00
ee14cf9438 Add support for pinned memory: (#127)
 * torch.Storage/Tensor.pin_memory()
 * torch.Storage/Tensor.is_pinned()
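
A minimal usage sketch (pinning allocates page-locked host memory, so this assumes a CUDA-capable build):

import torch

t = torch.randn(4)
assert not t.is_pinned()
p = t.pin_memory()  # returns a copy in page-locked (pinned) host memory
assert p.is_pinned()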
2016-10-15 18:38:26 -04:00
cb5d4e836f Lazy load CUDA and THNN modules (#64) 2016-09-28 19:29:53 -04:00
1828e7c42f Add async CUDA copy 2016-09-27 15:12:48 -07:00
8fdec15a55 Codemod to remove camel case method naming 2016-09-20 08:40:28 -07:00