Jerry Zhang c98896b76f [quant][pt2e] Add more precise representation for quantized add (#104130)
Summary:
The planned end-to-end (e2e) flow for quantization in PyTorch 2.0 export is the following:

float_model -> prepare_pt2e -> calibration -> convert_pt2e -> ...
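
As a rough sketch of this flow: the prepare_pt2e/convert_pt2e entry points are named above, but the export step, the quantizer setup, and the import paths below are assumptions that have moved between PyTorch releases:

```
# A hedged sketch of the planned pt2e flow; only prepare_pt2e and
# convert_pt2e are named in this summary, the rest is assumed and the
# import paths below may differ by PyTorch version.
import torch
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

def quantize(float_model, example_inputs, calibration_data):
    # export the float model to an ATen graph (the exact export API varies)
    exported = torch.export.export(float_model, example_inputs).module()

    quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
    prepared = prepare_pt2e(exported, quantizer)  # insert observers

    # calibration: run representative inputs through the observed model
    for inputs in calibration_data:
        prepared(*inputs)

    return convert_pt2e(prepared)  # emit the q/dq quantized model
```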

Inside convert_pt2e, we will first produce a q/dq representation of the quantized model, similar to the previous output of
convert_to_reference_fx in FX graph mode quantization:

```
torch.ops.quantized_decomposed.dequantize_per_tensor -> torch.ops.aten.add -> torch.ops.quantized_decomposed.quantize_per_tensor
torch.ops.quantized_decomposed.dequantize_per_tensor   /
```
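
Spelled out as code, that pattern might look like the following sketch; the (quant_min, quant_max) = (-128, 127) bounds and the exact decomposed-op argument order are assumptions, not part of this summary:

```
# A sketch of the q/dq reference pattern for int8 add: the int8 addition
# is simulated in fp32 via dequantize -> aten.add -> quantize. The quant
# bounds and operator signatures below are assumptions.
import torch

def qdq_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point,
            out_scale, out_zero_point):
    x_fp32 = torch.ops.quantized_decomposed.dequantize_per_tensor(
        x_i8, x_scale, x_zero_point, -128, 127, torch.int8)
    y_fp32 = torch.ops.quantized_decomposed.dequantize_per_tensor(
        y_i8, y_scale, y_zero_point, -128, 127, torch.int8)
    out_fp32 = torch.ops.aten.add.Tensor(x_fp32, y_fp32)
    return torch.ops.quantized_decomposed.quantize_per_tensor(
        out_fp32, out_scale, out_zero_point, -128, 127, torch.int8)
```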

Then we'll rewrite the above into a representation that expresses the intent more precisely: here we actually want to do
int8 addition, not simulate it with fp32 operations. Substituting x_fp32 = (x_i8 - x_zero_point) * x_scale (and likewise for y)
into out_i8 = (x_fp32 + y_fp32) / out_scale + out_zero_point and regrouping terms gives the representation for quantized add:

```
def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point, out_scale, out_zero_point):
    # rescale each input from its own quantization domain to the output's
    x = (x_scale / out_scale) * x_i8
    y = (y_scale / out_scale) * y_i8
    out = x + y
    # subtract the rescaled contribution of both input zero points
    out -= (x_zero_point * x_scale + y_zero_point * y_scale) / out_scale
    # shift by the output zero point
    out += out_zero_point
    return out
```
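
As a quick sanity check (not part of the original summary), the hypothetical snippet below compares quantized_add against the dequantize -> add -> quantize reference on random int8 inputs; the scales and zero points are made-up example values:

```
# Hypothetical check that quantized_add matches the fp32 reference
# (dequantize -> aten.add -> quantize); all constants here are made up.
import torch

torch.manual_seed(0)
x_i8 = torch.randint(-128, 128, (4,), dtype=torch.int8)
y_i8 = torch.randint(-128, 128, (4,), dtype=torch.int8)
x_scale, x_zero_point = 0.05, 3
y_scale, y_zero_point = 0.02, -7
out_scale, out_zero_point = 0.1, 5

out = quantized_add(x_i8.float(), x_scale, x_zero_point,
                    y_i8.float(), y_scale, y_zero_point,
                    out_scale, out_zero_point)

# reference: dequantize both inputs, add in fp32, requantize
x_fp32 = (x_i8.float() - x_zero_point) * x_scale
y_fp32 = (y_i8.float() - y_zero_point) * y_scale
ref = (x_fp32 + y_fp32) / out_scale + out_zero_point

# the two are algebraically identical; an integer kernel would further
# round and clamp the result to [-128, 127] before casting to int8
assert torch.allclose(out, ref)
```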

Test Plan:
```
buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_add (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)'
```

Reviewed By: kimishpatel

Differential Revision: D45628032

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104130
Approved by: https://github.com/kimishpatel
2023-06-27 20:11:30 +00:00