pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-21 05:34:18 +08:00


Jerry Zhang 7ddf212f33 [quant][fx] Fully align convert with the reference model design and simplify the implementation (#73863)

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73863

This PR fully aligns the convert function with the design in RFC-0019: https://github.com/pytorch/rfcs/blob/master/RFC-0019-Extending-PyTorch-Quantization-to-Custom-Backends.md
It also simplifies the implementation of the convert function by always producing a reference quantized model (with reference patterns) first,
and then lowering that model to a quantized model that is runnable with the PyTorch native backends (fbgemm/qnnpack).

This PR makes convert.py much easier to understand than the previous implementation, and it lets us remove the majority of the code
in quantization_patterns.py as well (in follow-up PRs).
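
The two-step flow described above (always build a reference model first, then lower it) can be sketched with a toy example. This is not the code in convert.py and does not use torch.fx: ops are plain strings, and the names `to_reference`, `lower`, `convert`, `REFERENCE_PATTERN`, and `quantized_linear` are hypothetical, chosen only for illustration. It assumes a single lowerable pattern, dequantize -> linear -> quantize.

```python
# Toy illustration only: ops are plain strings, not torch.fx nodes.
# The one reference pattern this sketch knows how to lower (an assumption).
REFERENCE_PATTERN = ("dequantize", "linear", "quantize")


def to_reference(float_ops):
    """Step 1: wrap every float op as dequantize -> op -> quantize."""
    ref = []
    for op in float_ops:
        ref.extend(["dequantize", op, "quantize"])
    return ref


def lower(ref_ops, pattern=REFERENCE_PATTERN, fused="quantized_linear"):
    """Step 2: replace each occurrence of the pattern with one fused op."""
    out = []
    i = 0
    n = len(pattern)
    while i < len(ref_ops):
        if tuple(ref_ops[i:i + n]) == pattern:
            out.append(fused)
            i += n
        else:
            # Ops without a known lowering stay in reference form.
            out.append(ref_ops[i])
            i += 1
    return out


def convert(float_ops):
    """Always produce the reference form first, then lower it."""
    return lower(to_reference(float_ops))
```

In this sketch, an op with no matching pattern simply keeps its dequantize/quantize wrappers, which is the sense in which the reference form acts as a common intermediate step before backend-specific lowering.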

Test Plan:
```
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
python test/test_quantization.py TestFXNumericSuiteCoreAPIs
python test/test_quantization.py TestFXNumericSuiteCoreAPIsModels
```
and other internal/OSS regression tests

Imported from OSS

Reviewed By: andrewor14

Differential Revision: D34778506

fbshipit-source-id: 0678b66addf736039a8749b352f6f569caca962b
(cherry picked from commit 33ec9caf23f3ab373d827117efbd9db0668b2437)

2022-03-11 17:11:30 +00:00

_reference

[quant][fx] Fully align convert with the reference model design and simplify the implementation (#73863)

2022-03-11 17:11:30 +00:00

dynamic

[quant][fx] Fully align convert with the reference model design and simplify the implementation (#73863)

2022-03-11 17:11:30 +00:00

modules

[quant][fx] Fully align convert with the reference model design and simplify the implementation (#73863)

2022-03-11 17:11:30 +00:00

__init__.py

Un-ignore F403 in .flake8 (#55838)

2021-04-13 09:24:07 -07:00

functional.py

Fix functional.max_poolNd warning spam in the CI

2022-03-04 18:42:23 +00:00