Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/69959
GraphModule is an implementation detail; we don't want to expose it in the quantization APIs
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_quantized_model_type
Imported from OSS
Reviewed By: supriyar
Differential Revision: D33119103
fbshipit-source-id: d8736ff08b42ee009d6cfd74dcb3f6150f71f3d2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65033
1. Move the file:
```
hg mv caffe2/torch/quantization/fx caffe2/torch/ao/quantization/fx
hg mv caffe2/torch/quantization/quantize_fx.py caffe2/torch/ao/quantization/quantize_fx.py
```
2. Create new files
```
touch caffe2/torch/quantization/quantize_fx.py
touch caffe2/torch/quantization/fx/__init__.py
```
3. Import the moved code in the new files (see the forwarding-shim sketch after this list)
4. Add tests in test/quantization/ao_migration/test_quantization_fx.py, since we have some fx imports in quantize_fx and fx/*.py
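A minimal sketch of the step-3 forwarding shim, assuming the new files simply re-export the public API from its new torch.ao location (the exact set of re-exported names is illustrative):
```python
# caffe2/torch/quantization/quantize_fx.py (backward-compatibility shim)
# Keep existing `torch.quantization.quantize_fx` imports working by
# re-exporting from the new torch.ao location.
from torch.ao.quantization.quantize_fx import (  # noqa: F401
    fuse_fx,
    prepare_fx,
    prepare_qat_fx,
    convert_fx,
)
```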
Test Plan: buck test mode/dev //caffe2/test:quantization
Reviewed By: vkuzo, z-a-f
Differential Revision: D30949749
fbshipit-source-id: 9e5d4d039c8a0a0820bc9040e224f0d2c26886d3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64623
The config API will change, but we'll add configs for TensorRT gradually to unblock experimentation
Test Plan:
python torch/fx/experimental/fx2trt/example/unittests.py
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D30800474
fbshipit-source-id: 3c4640de1205a0f19b62943ab84f386d80394ec2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64135
We want to start aligning the API with the design in https://github.com/pytorch/pytorch/wiki/Extending-PyTorch-Quantization-to-Custom-Backends
We plan to gradually move things from `prepare_custom_config_dict` and `convert_custom_config_dict`
to `backend_config_dict` and allow custom backend developers to define their own ways of quantizing operators.
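A hedged sketch of how a backend config might be threaded through `prepare_fx` at this stage, assuming the keyword is `backend_config_dict` per the name used in this PR (the schema of its contents was still in flux, and later releases renamed these arguments):
```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import prepare_fx

model = torch.nn.Sequential(torch.nn.Linear(4, 4), torch.nn.ReLU()).eval()
qconfig_dict = {"": get_default_qconfig("fbgemm")}
# None falls back to the default (native) backend config; a backend such
# as TensorRT would supply its own operator configs here.
prepared = prepare_fx(model, qconfig_dict, backend_config_dict=None)
```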
Test Plan:
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
Imported from OSS
Reviewed By: zou3519
Differential Revision: D30699456
fbshipit-source-id: e3c068da8d3da2270f57719f7159cc71cafa8598
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59605
Enables targeting of individual function invocations by execution order.
For example, given a module such as
```
class M1(torch.nn.Module):
    def forward(self, x):
        x = torch.add(x, x)
        x = torch.add(x, x)
        return x

class M2(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.m1 = M1()

    def forward(self, x):
        x = self.m1(x)
        return x
```
We can now target the first add of `m1` with
```
qconfig_dict = {
"module_name_function_order": ("m1", torch.add, 0, custom_qconfig),
}
```
Test Plan:
```
python test/test_quantization.py TestQuantizeFx.test_qconfig_module_name_function_order
```
Imported from OSS
Reviewed By: hx89
Differential Revision: D28951077
fbshipit-source-id: 311d423724a31193d4fa4bbf3a712b46464b5a29
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58566
Validates the keys of the qconfig_dict, prepare_custom_config_dict, convert_custom_config_dict, and
fuse_custom_config_dict. If the user passes in an invalid key or makes a typo, we throw an error and let the user know which keys are supported.
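A minimal sketch of this style of key validation (the helper name and exact message are hypothetical, not the PR's exact code):
```python
def check_is_valid_config_dict(config_dict, allowed_keys, dict_name):
    """Raise if `config_dict` contains a key outside `allowed_keys`."""
    for key in config_dict:
        if key not in allowed_keys:
            raise ValueError(
                f"Expected {dict_name} to have one of the following keys: "
                f"{sorted(allowed_keys)}. But it contains key '{key}'."
            )

try:
    check_is_valid_config_dict(
        {"module_nam": []},  # typo for "module_name"
        allowed_keys={"", "object_type", "module_name", "module_name_regex"},
        dict_name="qconfig_dict",
    )
except ValueError as e:
    print(e)
```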
Test Plan:
python test/test_quantization.py
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D28540923
fbshipit-source-id: 5958c32017b7d16abd219aefc8e92c42543897c2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56550
Add support for preserving a list of attributes on observed/quantized GraphModule
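A short sketch against the dict-based API of this era (the key "preserved_attributes" is the one this PR describes; the module is illustrative):
```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import prepare_fx

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 4)
        self.preserved_attr = 3

    def forward(self, x):
        return self.linear(x)

qconfig_dict = {"": get_default_qconfig("fbgemm")}
# "preserved_attributes" lists attribute names to carry over onto the
# observed (and later quantized) GraphModule.
prepared = prepare_fx(
    M().eval(), qconfig_dict,
    prepare_custom_config_dict={"preserved_attributes": ["preserved_attr"]},
)
assert hasattr(prepared, "preserved_attr")
```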
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_deepcopy_preserve_attributes
Imported from OSS
Reviewed By: vkuzo, kazhang
Differential Revision: D27899317
fbshipit-source-id: ebf21334715e5ab764aaa27eed534cc0cdf9f2b5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53166
Context: for FX modules that contain ScriptModules, calling
delattr(module, 'qconfig') throws an AttributeError. We will follow up
with a separate issue/repro to fix this problem.
This PR adds a temporary flag to the convert_fx API to preserve the qconfig attributes on the converted model.
We will remove this flag once we reach a conclusion on calling delattr on ScriptModules.
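A hedged sketch of the resulting flow, assuming the temporary flag surfaces as the private `_remove_qconfig` argument of convert_fx:
```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import prepare_fx, convert_fx

model = torch.nn.Sequential(torch.nn.Linear(4, 4)).eval()
prepared = prepare_fx(model, {"": get_default_qconfig("fbgemm")})
prepared(torch.randn(1, 4))  # calibrate
# _remove_qconfig=False keeps the qconfig attributes on the converted
# model (temporary, private flag; name as we read it from this PR).
quantized = convert_fx(prepared, _remove_qconfig=False)
```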
Test Plan:
python test/test_quantization.py test_preserve_qconfig
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D26771518
fbshipit-source-id: 9fd72816576856ffb4aa11f8fde08303d1df10a2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52179
Rename debug to reference. We'll use this to produce a reference quantized model
that can serve as a common interface between PyTorch quantized models and backends.
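A minimal sketch, assuming the renamed flag surfaces as `is_reference` on convert_fx (paths per the pre-migration torch.quantization layout):
```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import prepare_fx, convert_fx

m = torch.nn.Sequential(torch.nn.Conv2d(3, 3, 1)).eval()
prepared = prepare_fx(m, {"": get_default_qconfig("fbgemm")})
prepared(torch.randn(1, 3, 4, 4))  # calibrate
# Previously spelled debug=True; produces a reference quantized model
# (explicit quantize/dequantize ops) rather than fused backend kernels.
reference = convert_fx(prepared, is_reference=True)
```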
Test Plan:
python test/test_quantization.py TestQuantizeFx
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D26424656
fbshipit-source-id: a0299b023f6ba7d98f5750724c517b0ecb987b35
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51086
Previously we only supported getting the scope for call_module nodes, and custom qconfigs only for call_module.
This PR extends the Scope class to record the scope for all node types.
For call_function nodes, if a module_name qconfig is specified it takes precedence over the function (object_type) qconfig.
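A short sketch of the precedence rule using the standard qconfig_dict keys (the choice of qconfigs is illustrative):
```python
import torch
from torch.quantization import get_default_qconfig

global_qconfig = get_default_qconfig("fbgemm")
custom_qconfig = get_default_qconfig("qnnpack")
# For a torch.add called inside submodule "sub", the "module_name" entry
# wins over the function-level "object_type" entry.
qconfig_dict = {
    "": global_qconfig,
    "object_type": [(torch.add, global_qconfig)],
    "module_name": [("sub", custom_qconfig)],
}
```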
Test Plan:
python test/test_quantization.py test_qconfig_for_call_func
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D26077602
fbshipit-source-id: 99cdcdedde2280e51812db300e17d4e6d8f477d2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50173
Previously we did not set the qconfig for call_method nodes correctly, since doing so requires knowing
the scope (the module path of the module whose forward graph contains the node) of each node. This
PR modifies the QuantizationTracer to record the scope information and build a map from call_method
Node to module path, which is used when we construct qconfig_map.
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_qconfig_for_call_method
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D25818132
fbshipit-source-id: ee9c5830f324d24d7cf67e5cd2bf1f6e0e46add8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50058
This PR adds support for {input/output}_quantized_idxs for standalone modules.
If input_quantized_idxs = [] and output_quantized_idxs = [], the standalone module expects float
input and produces float output, quantizing the input and dequantizing the output internally.
If input_quantized_idxs = [0] and output_quantized_idxs = [0], the standalone module expects quantized
input and produces quantized output; the input is quantized and the output dequantized
in the parent module. This is similar to existing quantized modules like nn.quantized.Conv2d.
For more details, please see the test case
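A hedged config sketch, combining the idxs described here with the nested standalone-module config format from the #49719 entry further down (the submodule name is a placeholder):
```python
# The standalone module's own prepare config marks its 0th input and
# 0th output as quantized; quant/dequant then happen in the parent.
standalone_prepare_config_dict = {
    "input_quantized_idxs": [0],
    "output_quantized_idxs": [0],
}
prepare_custom_config_dict = {
    "standalone_module_name": [
        # None -> fall back to the parent qconfig_dict
        ("submodule", None, standalone_prepare_config_dict),
    ],
}
```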
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_standalone_module
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D25768910
fbshipit-source-id: 96c21a3456cf192c8f1400afa4e86273ee69197b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49754
This PR adds support for {input/output}_quantized_idxs for standalone modules.
If input_quantized_idxs = [] and output_quantized_idxs = [], the standalone module expects float
input and produces float output, quantizing the input and dequantizing the output internally.
If input_quantized_idxs = [0] and output_quantized_idxs = [0], the standalone module expects quantized
input and produces quantized output; the input is quantized and the output dequantized
in the parent module. This is similar to existing quantized modules like nn.quantized.Conv2d.
For more details, please see the test case
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_standalone_module
Imported from OSS
Reviewed By: raghuramank100
Differential Revision: D25684692
fbshipit-source-id: 900360e01c0e35b26fe85f4a887dc1fd6f7bfb66
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49719
We find there are multiple use cases for standalone modules: one use case requires the standalone module
to produce a module that takes a float Tensor as input and outputs a float Tensor; the other needs to
produce a module that takes a quantized Tensor as input and outputs a quantized Tensor.
This is similar to `quantized_input_idxs` and `quantized_output_idxs`, so we want to nest
prepare_custom_config_dict in the standalone module configuration. For maximum flexibility we also
include a qconfig_dict for the standalone module, in case the user needs a special qconfig_dict for
the standalone module in the future.
Changed from
```python
prepare_custom_config_dict = {
    "standalone_module_name": ["standalone_module"],
    "standalone_module_class": [StandaloneModule]
}
```
to
```python
prepare_custom_config_dict = {
    "standalone_module_name": [("standalone_module", qconfig_dict1, prepare_custom_config_dict1)],
    "standalone_module_class": [(StandaloneModule, qconfig_dict2, prepare_custom_config_dict2)]
}
```
The entries in the config are:
1. name / module_class
2. optional qconfig_dict; when it is None, we'll use {"": qconfig}, where qconfig is the one from the parent qconfig_dict
3. optional prepare_custom_config_dict; when it is None, we'll use the default value of prepare_custom_config_dict for the prepare API (None)
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_standalone_module
Imported from OSS
Reviewed By: raghuramank100
Differential Revision: D25675704
fbshipit-source-id: 0889f519a3e55a7a677f0e2db4db9a18d87a93d4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49238
Moves the `input_quantized_idxs` and `output_quantized_idxs` options
from the convert config to the prepare config. This is done because
these options are related to placing observers, which is numerics-changing
during QAT.
The next PR will adjust the behavior of `input_quantized_idxs` in
prepare in QAT to prevent placing a fake_quant at the input if the
input is marked quantized. Placing a fake_quant there can lead to
numerical inaccuracies during calibration, as it would start with
scale=1 and zp=0, which may be different from the quantization
parameters of the incoming quantized input.
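A before/after sketch of where the options live (values illustrative):
```python
# Before this PR: specified at convert time
convert_custom_config_dict = {
    "input_quantized_idxs": [0],
    "output_quantized_idxs": [0],
}
# After this PR: specified at prepare time, where observers are placed
prepare_custom_config_dict = {
    "input_quantized_idxs": [0],
    "output_quantized_idxs": [0],
}
```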
Test Plan:
```
python test/test_quantization.py TestQuantizeFx
```
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D25498762
fbshipit-source-id: 17ace8f803542155652b310e5539e1882ebaadc6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48671
A standalone module might be called separately, so it's better to use float
Tensors as its interface.
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D25256184
fbshipit-source-id: e209492a180ce1f81f31c8d6057956a74bad20b1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48624
Before this PR, there was an assumption that all graph inputs
and outputs are in floating point, with some exceptions for
`standalone_module`.
This PR adds an option to specify either inputs or outputs
as being quantized.
This is useful for incremental migrations of models using Eager mode.
Test Plan: Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D25231833
fbshipit-source-id: 9f9da17be72b614c4c334f5c588458b3e726ed17
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48331
Enables mypy to not ignore type errors in FX quantization files. Fixes the easy
typing errors inline, and comments out the harder errors to be fixed at a later time.
After this PR, mypy runs without errors on `torch/quantization`.
Test Plan:
```
> mypy torch/quantization/
Success: no issues found in 25 source files
```
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D25133348
fbshipit-source-id: 0568ef9405b292b80b3857eae300450108843e80
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48038
nn.ReLU works for both float and quantized input, so we don't want to define an nn.quantized.ReLU
that does the same thing as nn.ReLU; similarly for nn.quantized.functional.relu.
This also removes the numerical inconsistency for models that quantize nn.ReLU independently in QAT mode.
Test Plan:
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D25000462
fbshipit-source-id: e3609a3ae4a3476a42f61276619033054194a0d2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47415
nn.ReLU works for both float and quantized input, so we don't want to define an nn.quantized.ReLU
that does the same thing as nn.ReLU; similarly for nn.quantized.functional.relu.
This also removes the numerical inconsistency for models that quantize nn.ReLU independently in QAT mode.
Test Plan: Imported from OSS
Reviewed By: z-a-f
Differential Revision: D24747035
fbshipit-source-id: b8fdf13e513a0d5f0c4c6c9835635bdf9fdc2769
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47032
These are not top-level APIs and are not supposed to be called directly by users.
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D24610602
fbshipit-source-id: c5510f06b05499387d70f23508470b676aea582c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46955
Initially we were thinking of adding an `invalidate_quantized_float_parameters` option to free the memory
of the quantized floating-point parameters, but it turns out we do module swapping, just like in eager mode, for the modules
that are quantized, so the old floating-point module is no longer referenced after quantization. Therefore this feature
is only needed for functionals; since most people use quantization with modules, we may not need it.
We'll revisit this if we find there is a need for it.
Test Plan: Imported from OSS
Reviewed By: supriyar
Differential Revision: D24579400
fbshipit-source-id: fbb0e567405dc0604a2089fc001573affdade986
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46786
Previously we only supported static quantization; this PR adds support for the other quantization types.
Note that QAT is actually orthogonal to these quant types; this refers to the convert step, where we
convert the observed module to a quantized module.
For QAT, the user will provide a CustomModule -> FakeQuantizedCustomModule mapping in prepare_custom_config_dict
and a FakeQuantizedCustomModule -> static/dynamic/weight_only quantized CustomModule mapping in convert_custom_config_dict.
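A hedged sketch of the resulting config shape, with a quant-type level added to the custom-module mappings (the classes are empty placeholders):
```python
import torch

class CustomModule(torch.nn.Module): ...
class FakeQuantizedCustomModule(torch.nn.Module): ...
class StaticQuantCustomModule(torch.nn.Module): ...

# prepare: float custom module -> observed/fake-quantized custom module
prepare_custom_config_dict = {
    "float_to_observed_custom_module_class": {
        "static": {CustomModule: FakeQuantizedCustomModule},
    },
}
# convert: observed custom module -> quantized custom module, per quant type
convert_custom_config_dict = {
    "observed_to_quantized_custom_module_class": {
        "static": {FakeQuantizedCustomModule: StaticQuantCustomModule},
    },
}
```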
Test Plan: Imported from OSS
Reviewed By: raghuramank100
Differential Revision: D24514701
fbshipit-source-id: 2918be422dd76093d67a6df560aaaf949b7f338c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46346
Allow users to provide additional fusion/quant patterns for FX graph mode
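A thin, heavily hedged sketch of the registration shape (the key name and handler class reflect our reading of the PR, not a confirmed schema):
```python
import torch

class MyOp(torch.nn.Module):  # placeholder user op
    def forward(self, x):
        return x + 1

class MyOpQuantizeHandler: ...  # placeholder handler implementation

prepare_custom_config_dict = {
    # maps a pattern (here a single module type) to its quantize handler
    "additional_quant_pattern": {MyOp: MyOpQuantizeHandler},
}
```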
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D24317437
fbshipit-source-id: 719927cce50c74dffa4f848bd5c98995c944a26a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46345
Allow users to add more fusion mappings
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D24317439
fbshipit-source-id: 3b144bbc305e41efbdf3e9fb25dbbeaad9e86c6a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46298
Allow users to specify a list of qualified names or classes for non-traceable submodules.
See quantize_fx.py for the API.
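A short sketch using the keys described in quantize_fx.py of this era (the submodule name and class are placeholders):
```python
import torch

class NonTraceableModule(torch.nn.Module):  # placeholder
    def forward(self, x):
        return x

prepare_custom_config_dict = {
    # skip symbolic tracing for these submodules, by qualified name or class
    "non_traceable_module_name": ["submodule.blocklisted"],
    "non_traceable_module_class": [NonTraceableModule],
}
```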
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D24294210
fbshipit-source-id: eb1e309065e3dfbf31e63507aaed73587f0dae29
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45920
See the docs for the new way of defining custom modules
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D24145856
fbshipit-source-id: 488673fba503e39e8e303ed5a776fe36899ea4e3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46095
Adds logging on usage of public quantization APIs. This only works in the FB codebase
and is a no-op in OSS.
Test Plan: The test plan is fb-only
Reviewed By: raghuramank100
Differential Revision: D24220817
fbshipit-source-id: a2cc957b5a077a70c318242f4a245426e48f75e5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45919
As discussed with the JIT team, we'll run symbolic trace inside the quantization functions.
prepare_fx now takes the original PyTorch model (torch.nn.Module) instead of a `GraphModule` as input
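A minimal sketch of the resulting user-facing flow (paths per the torch.quantization layout of this era):
```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import prepare_fx

# No manual torch.fx.symbolic_trace needed: prepare_fx traces internally
# and accepts a plain torch.nn.Module.
model = torch.nn.Sequential(torch.nn.Linear(4, 4)).eval()
prepared = prepare_fx(model, {"": get_default_qconfig("fbgemm")})
```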
Test Plan: Imported from OSS
Reviewed By: supriyar
Differential Revision: D24145857
fbshipit-source-id: 2b7a4ca525a7a8c23a26af54ef594c6a951e4024