pytorch/torch/csrc/autograd
Will Feng 8cde4c4d22 Remove Variable::Impl and DifferentiableViewImpl (#17072)
Summary:
As part of the Variable/Tensor merge work: https://github.com/pytorch/pytorch/issues/13638, we make the following changes in this PR:
1. Remove the `Variable::Impl` class and the `DifferentiableViewImpl` class
2. Change all `Variable.data()` call sites to either use `Variable` directly, or use `Variable.tensor_data()`
3. Remove the `Variable.data()` API
4. Add `Variable.variable_data()`, which matches `tensor.data` in the Python API: it creates a new `Variable` that shares the same storage and tensor metadata as the original `Variable`, but with a completely new autograd history (see the sketch after this list).
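
To make the new semantics concrete, here is a minimal Python-level sketch using `tensor.data`, which the summary says `variable_data()` mirrors: the result shares storage with the original but carries a completely new autograd history.

```python
import torch

x = torch.ones(2, requires_grad=True)
y = x.data  # Python-side counterpart of Variable.variable_data()

# y shares storage with x: writes through y are visible in x...
y[0] = 42.0
assert x[0].item() == 42.0

# ...but y has a completely new (empty) autograd history.
assert not y.requires_grad
assert y.grad_fn is None
```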

After this PR, Variable no longer wraps a Tensor internally, and both Variable and Tensor use the same TensorImpl class as their `impl_`. The only difference is that a Variable always has AutogradMeta in its TensorImpl, while a Tensor doesn't.
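
A Python-level illustration of this merge (a hypothetical check, not taken from the PR itself): since there is no separate wrapper type, tensors with and without gradient tracking have the same type.

```python
import torch

t = torch.ones(2)                      # plain tensor
v = torch.ones(2, requires_grad=True)  # "variable": its TensorImpl carries AutogradMeta

# Both are the same Python type, backed by the same C++ TensorImpl class.
assert type(t) is type(v) is torch.Tensor
```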

**Note that this PR is BC-breaking in the following use cases:**

**Use Case 1:**
Previously, `x.data = y` worked even if `x` and `y` were of different TensorImpl types (e.g. `x` is a CPU dense tensor whose impl is of type TensorImpl, while `y` is a CPU sparse tensor whose impl is of type SparseTensorImpl). After this PR, `x.data = y` no longer works if `x` and `y` have different TensorImpl types, because the underlying implementation `variable.set_data(tensor)` requires `variable` and `tensor` to have the same TensorImpl type.
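
A minimal sketch of the now-broken pattern, assuming a dense/sparse pair as in the description above:

```python
import torch

x = torch.zeros(3)              # dense CPU tensor, impl is TensorImpl
y = torch.zeros(3).to_sparse()  # sparse CPU tensor, impl is SparseTensorImpl

# Worked before this PR; fails afterwards, because the underlying
# variable.set_data(tensor) requires matching TensorImpl types.
x.data = y
```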

**Use Case 2:**
If a tensor `x`'s `grad` is sparse, accumulating dense gradients to `x` will change the tensor that `x.grad` is pointing to. This is better illustrated with the following example:
```python
import torch

params = torch.tensor([1.5, 1.5]).requires_grad_()
with torch.no_grad():
    # Change gradient to a sparse tensor
    params.grad = torch.sparse_coo_tensor(torch.tensor([[1, 1]]).long(), torch.tensor([1., 1.]))

grad_saved = params.grad
params.backward(torch.tensor([1.5, 1.5]))
assert id(grad_saved) == id(params.grad)  # This will fail after this PR
```
The assertion in the last line will fail after this PR, because adding dense gradients to sparse gradients will change the `params.grad` tensor reference.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17072

Differential Revision: D14075257

Pulled By: yf225

fbshipit-source-id: 0e681df641270dea586042dd26db59f2e76b5957

Autograd

Autograd is a hotspot for PyTorch performance, so most of the heavy lifting is implemented in C++. This means that we have to do some shuffling between Python and C++; in general, we want data to be in a form that is convenient to manipulate from C++.

Our general model is that for any key data type that autograd manipulates, there are two implementations: a C++ type and a Python object type. For example, consider variables in autograd: we have both Variable in variable.h (the C++ type) and THPVariable in python_variable.h (the Python type). (By the way, THP stands for TorcH Python, not to be confused with THPP, TorcH C++.) Variable contains the payload of a variable, while THPVariable just contains a shared_ptr reference to Variable, as well as references to other Python objects which the Python runtime needs to know about. A lot of data accessor implementations in python_variable.cpp simply reach through to the underlying Variable and return the appropriate value.

The most complicated application of this principle is Function, which also supports users implementing custom behavior in Python. We have the following classes:

  • Function in function.h, the C++ type.
  • THPFunction in python_function.h, the Python object type. In python_function.cpp, you can see the boilerplate that tells the Python interpreter about this object.
  • PyFunction in python_function.h, a subclass of Function which forwards apply to a Python THPFunction. (NOT a Python object, despite its name!)
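
To see how these pieces fit together, consider a user-defined Function written in Python. This is a minimal sketch (the `Square` class is illustrative, not from this directory): calling `apply` goes through THPFunction on the Python side, and during the backward pass the engine calls back into Python via PyFunction.

```python
import torch

class Square(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return x * x

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        return 2 * x * grad_output

x = torch.tensor([3.0], requires_grad=True)
y = Square.apply(x)
y.backward()  # the engine invokes Square.backward through the C++/Python boundary
assert x.grad.item() == 6.0
```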

Outside of PyFunction, the C++ objects largely avoid referencing Python objects. There are a few exceptions: pyobj in Variable and pyobj in Function, which ensure uniqueness of the associated Python wrapper (if it exists), and PyFunction itself, whose whole point is to let C++ call into Python.
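
A quick Python-level way to observe the wrapper-uniqueness behavior, assuming the pyobj cache works as described above:

```python
import torch

x = torch.ones(2, requires_grad=True)
x.sum().backward()

# Fetching .grad twice returns the same cached Python wrapper for the
# underlying C++ object, not two distinct wrapper objects.
assert x.grad is x.grad
```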