Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64919
The AO team is migrating the existing `torch.quantization` into `torch.ao.quantization`. We are doing it one file at a time to make sure that the internal callsites are updated properly. This diff migrates the quantization utilities.
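Both import paths keep working because the file at the old location becomes a thin wrapper around the new one. A minimal sketch of that pattern (the actual shim may re-export names explicitly rather than via a star import):

```python
# torch/quantization/utils.py -- hedged sketch of the forwarding shim;
# the public names now live in torch.ao.quantization.utils.
from torch.ao.quantization.utils import *  # noqa: F401,F403
```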
ghstack-source-id: 138303325
Test Plan: `buck test mode/dev //caffe2/test:quantization`
Reviewed By: jerryzh168
Differential Revision: D30899082
fbshipit-source-id: 85eb38c419e417147e71758b682cd095308dd0c9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64916
The AO team is migrating the existing `torch.quantization` into `torch.ao.quantization`. We are doing it one file at a time to make sure that the internal callsites are updated properly.
This diff migrates `quant_type.py` from `torch.quantization` to `torch.ao.quantization`.
For now both locations are supported; eventually `torch.quantization` will be deprecated.
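A minimal sketch of the kind of check the migration test performs (simplified; the actual `TestAOMigrationQuantization` suite compares the public API of both locations):

```python
import torch.quantization.quant_type as old
import torch.ao.quantization.quant_type as new

# While both locations are supported, the old path must resolve to the
# exact same objects as the new canonical one.
assert old.QuantType is new.QuantType
```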
Test Plan: `buck test mode/dev //caffe2/test:quantization -- TestAOMigrationQuantization`
Reviewed By: vkuzo
Differential Revision: D30898422
fbshipit-source-id: 3e6126b49f0565a4136d6928cea9eb25368927ff
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63107
Moving this function because its functionality would be useful outside of Numeric Suite (ns).
ghstack-source-id: 135727260
Test Plan: `buck test //caffe2/test:quantization_fx mode/dev-nosan --keep-going --config client.id=nuclide --show-full-output -- suite`
Reviewed By: supriyar
Differential Revision: D30260735
fbshipit-source-id: 58deabdd0f3b03b0ee7ee92be0548a0945084d65
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62863
To make this consistent with the other observers, add a `reduce_range` option that can be used to update `quant_min`/`quant_max`.
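For context, `reduce_range` conventionally shrinks the quantization range by one bit (e.g. so fbgemm kernels can avoid overflow). A minimal sketch of that adjustment, with a hypothetical helper name:

```python
def adjust_qrange(quant_min: int, quant_max: int, reduce_range: bool):
    # Halve the representable range (drop one bit) when reduce_range is set.
    if reduce_range:
        quant_min, quant_max = quant_min // 2, quant_max // 2
    return quant_min, quant_max

# e.g. for quint8: (0, 255) -> (0, 127) when reduce_range=True
assert adjust_qrange(0, 255, True) == (0, 127)
```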
Test Plan:
python test/test_quantization.py test_fused_mod_reduce_range
Imported from OSS
Reviewed By: raghuramank100
Differential Revision: D30146602
fbshipit-source-id: a2015f095766f9c884611e9ab6942528bc9bc972
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62488
Instead of attaching a weight observer/fake_quant to the float linear and conv modules, we can
compute the quantization parameters and attach them as a dictionary to these modules, which
reduces the model size and makes the reference module clearer.
TODO: the numerics for linear and conv in the reference quantized model are still not correct,
since we do not quantize the weight; we may explore things like parameterization to implement this support.
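A minimal sketch of the idea (the attribute and key names are illustrative, not necessarily the PR's exact API): instead of keeping an observer module on the float module, we store only the computed parameters.

```python
import torch

float_linear = torch.nn.Linear(8, 8)
# Values would come from the weight observer; shown here as constants.
float_linear.weight_qparams = {
    "qscheme": torch.per_tensor_affine,
    "dtype": torch.qint8,
    "scale": 0.05,
    "zero_point": 0,
}
```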
Test Plan:
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D30053979
fbshipit-source-id: b5f8497cf6cf65eec924df2d8fb10a9e154b8cab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59953
The following modifications were made to the equalization observers due to design changes:
- [InputEqualizationObserver] Replaced `calculate_qparams()` with `calculate_scaled_minmax()`, since we need to return the scaled min/max values to update the following input quantization observer (see the sketch below).
- [WeightEqualizationObserver] We no longer need a row observer, since this is taken care of by the following weight quantization observer.
- [WeightEqualizationObserver] Following the previous point, we no longer need to calculate the scaled qparam values. Instead, we use the equalization scale to later scale the weights; the qparams are taken care of by the weight quantization observer.
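A minimal sketch of what `calculate_scaled_minmax()` conceptually computes (simplified; the real observer handles per-channel shapes and edge cases):

```python
import torch

def calculate_scaled_minmax(min_inputs, max_inputs, equalization_scale):
    # Scale the recorded input min/max by the equalization scale; the
    # following input quantization observer is updated with these values.
    min_scaled = torch.min(min_inputs * equalization_scale)
    max_scaled = torch.max(max_inputs * equalization_scale)
    return min_scaled, max_scaled
```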
Test Plan:
`python test/test_quantization.py TestEqualizeFx.test_input_weight_eq_observer`
Imported from OSS
Reviewed By: supriyar
Differential Revision: D29135332
fbshipit-source-id: be7e468273c8b62fc183b1e1ec50f6bd6d8cf831
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53585
Previously, a fp16_static CopyNode would be marked as unquantized because of
an incorrect check of whether a Node is statically quantized.
This PR fixes that check.
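A hedged sketch of the kind of predicate involved (simplified; not the PR's exact code): statically quantized activations include fp16 as well as the int8 dtypes, so a fp16_static node must not fall through as unquantized.

```python
import torch

def activation_is_statically_quantized(dtype, is_dynamic: bool) -> bool:
    # fp16 counts as statically quantized alongside the int8 dtypes.
    return dtype in (torch.quint8, torch.qint8, torch.float16) and not is_dynamic
```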
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D26912677
fbshipit-source-id: 4ddb538714c5ba2db28430de5e1cf2931baf1993
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52534
Currently `linear_dynamic_fp16` has a signature that is tied to fbgemm/qnnpack.
We need to produce a pattern equivalent to `linear_dynamic_fp16` to support extensions
to other backends (see the sketch below).
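A minimal sketch of a backend-agnostic equivalent (illustrative; not the exact graph the pass emits): keep the weight in fp16 and upcast it right before the linear call, instead of invoking the backend-specific fused op.

```python
import torch
import torch.nn.functional as F

def linear_dynamic_fp16_pattern(x, weight, bias=None):
    w_fp16 = weight.to(torch.float16)   # weight stored/serialized in fp16
    w_fp32 = w_fp16.to(torch.float32)   # upcast dynamically at runtime
    return F.linear(x, w_fp32, bias)
```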
Test Plan:
python test/test_quantization.py TestQuantizeFxOps.test_linear_dynamic_fp16
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D26557726
fbshipit-source-id: 270c9f781f73c79416a092b7831294cabca84b0c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50304
Does not include any functional changes; purely fixes minor typos in `fuser_method_mappings.py`.
Test Plan: Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D25857248
Pulled By: z-a-f
fbshipit-source-id: 3f9b864b18bda8096e7cd52922dc21be64278887