Summary: The planned e2e flow for quantization in PyTorch 2.0 export is the following: float_model -> prepare_pt2e -> calibration -> convert_pt2e -> ...

Inside convert_pt2e, we will first produce a q/dq representation of the quantized model, similar to the previous output of convert_to_reference_fx in FX graph mode quantization:

```
torch.ops.quantized_decomposed.dequantize_per_tensor -> torch.ops.aten.add -> torch.ops.quantized_decomposed.quantize_per_tensor
torch.ops.quantized_decomposed.dequantize_per_tensor /
```

Then we'll rewrite the above into a representation that expresses the intent more precisely: we actually want to do int8 addition here, instead of simulating the int8 addition with fp32 operations. The representation for quantized add is:

```
def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point, out_scale, out_zero_point):
    x = (x_scale / out_scale) * x_i8
    y = (y_scale / out_scale) * y_i8
    out = x + y
    out -= (x_zero_point * x_scale + y_zero_point * y_scale) / out_scale
    out += out_zero_point
    return out
```

Test Plan:
```
buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_add (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)'
```

Reviewed By: kimishpatel

Differential Revision: D45628032

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104130
Approved by: https://github.com/kimishpatel
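For context, here is a minimal sketch of the planned e2e flow above. The exact entry points used here (`torch.export.export`, `torch.ao.quantization.quantize_pt2e.prepare_pt2e`/`convert_pt2e`, and the `XNNPACKQuantizer` example quantizer) are assumptions based on later releases and have moved around between versions, so treat the module paths as illustrative rather than the exact API at the time of this PR:

```python
import torch
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
# Assumed quantizer location; it has moved between releases.
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

class M(torch.nn.Module):
    def forward(self, x, y):
        return x + y

example_inputs = (torch.randn(1, 4), torch.randn(1, 4))

# float_model -> export -> prepare_pt2e -> calibration -> convert_pt2e
exported = torch.export.export(M(), example_inputs).module()
quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
prepared = prepare_pt2e(exported, quantizer)   # insert observers
prepared(*example_inputs)                      # calibration pass
quantized = convert_pt2e(prepared)             # q/dq repr, then the rewritten repr
```

And a quick standalone check (not from the PR) that the quantized_add formula above agrees with the fp32 dequantize -> add -> requantize reference, ignoring the final rounding/clamping to int8; the explicit float cast is added only so the snippet runs on its own:

```python
import torch

def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point,
                  out_scale, out_zero_point):
    x = (x_scale / out_scale) * x_i8.to(torch.float32)
    y = (y_scale / out_scale) * y_i8.to(torch.float32)
    out = x + y
    out -= (x_zero_point * x_scale + y_zero_point * y_scale) / out_scale
    out += out_zero_point
    return out  # the real representation would still round and clamp back to int8

x_i8 = torch.randint(-128, 128, (8,), dtype=torch.int8)
y_i8 = torch.randint(-128, 128, (8,), dtype=torch.int8)
x_scale, x_zp = 0.05, 3
y_scale, y_zp = 0.08, -2
out_scale, out_zp = 0.1, 1

# Reference: dequantize to fp32, add, then map back into the output quantized domain.
x_fp = (x_i8.to(torch.float32) - x_zp) * x_scale
y_fp = (y_i8.to(torch.float32) - y_zp) * y_scale
ref = (x_fp + y_fp) / out_scale + out_zp

torch.testing.assert_close(
    quantized_add(x_i8, x_scale, x_zp, y_i8, y_scale, y_zp, out_scale, out_zp), ref
)
```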
Note [TH abstraction violation]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
TH/THC provide some hpp headers, which are proper C++ headers rather than C headers. These headers serve double duty as *internal implementation detail* headers, whose contents should largely not be used by external clients. Ideally, we would not install these headers at all; instead, you should use public functions (in headers like `THTensor.h`, NOT `THTensor.hpp`) to manipulate these structs. However, there are a few places in torch/csrc where we violate this abstraction. They are marked with a pointer to this note. Each of those sites will have to be refactored when we refactor the guts of THTensor and related structures.