pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Files

Supriya Rao 6f63126b5c [quant][fx] Add pass in convert to fold quant-dequant sequence (#54860 )

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54860

Currently we insert a quantize_per_tensor op when we encounter the quantizable input,
so if it has multiple uses and not all are quantizable then we need to add a dequantize op
before these ops.

In this pass - For a sequence of quantize_per_tensor - dequantize, we combine them
since it is a no-op.

[internal only][pyper]

Before this change we had redundant dequantize nodes in the graph
Example 1x inline_cvr graph https://www.internalfb.com/intern/everpaste/?handle=GODBxAlUMzGHD6 (98143776f5)MSACpHKKu9qjorbsIXAAAz
 FC layers -> 37
 quantize_per_tensor -> 30
 dequantize -> 49

After this change
https://www.internalfb.com/intern/everpaste/?handle=GAl0uQnOlDNmpLoSAB-GZqRxu9wMbsIXAAAz
 FC layers -> 37
 quantize_per_tensor -> 30
 dequantize -> 39

We remove extra 10 dequantize nodes in the graph.

Test Plan:
python test/test_quantization.py test_fold_quant_dequant

Imported from OSS

Reviewed By: vkuzo

Differential Revision: D27390506

fbshipit-source-id: 56e6fb8496171246eccf4bd45eb8bebd87fcb740

2021-03-30 08:40:24 -07:00

[quant][fx] Add pass in convert to fold quant-dequant sequence (#54860 )

2021-03-30 08:40:24 -07:00

ns for fx: add weight matching for linear fp16 emulation (#54257 )

2021-03-25 22:35:38 -07:00

__init__.py

[quant][fix] Fix quant type classification for float_qparam qconfig (#48069 )

2020-11-18 18:22:08 -08:00

_correct_bias.py

Remove py2 compatible future imports (#44735 )

2020-09-16 12:55:57 -07:00

_equalize.py