Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30445
Create distributed and rpc directories under caffe2/test for better management
of unit tests.
Differential Revision: D18702786
fbshipit-source-id: e9daeed0cfb846ef68806f6decfcb57c0e0e3606
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31970
Now that a ClassType can be shared among different module instances, we preserve
that sharing in clone as well: if the original module has a ClassType that is
shared, we clone the ClassType once and share it among the cloned module
instances.
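To illustrate the intended behavior, here is a minimal Python sketch of "clone each shared type once" (a toy memoized clone over plain dicts, not the actual C++ `Module::clone` implementation):
```
import copy

def clone_module_tree(module, type_cache=None):
    """Toy illustration: clone a tree of {'type', 'children'} nodes while
    cloning each shared 'ClassType' object only once, so instances that
    shared a type before cloning still share a single (cloned) type after.
    """
    if type_cache is None:
        type_cache = {}                 # original type id -> cloned type
    orig_type = module["type"]
    if id(orig_type) not in type_cache:
        type_cache[id(orig_type)] = copy.copy(orig_type)
    return {
        "type": type_cache[id(orig_type)],   # shared clone
        "children": [clone_module_tree(c, type_cache) for c in module["children"]],
    }

# Two submodules sharing one type keep sharing it after the clone.
shared_type = {"name": "MyClassType"}
root = {"type": {"name": "Root"},
        "children": [{"type": shared_type, "children": []},
                     {"type": shared_type, "children": []}]}
cloned = clone_module_tree(root)
assert cloned["children"][0]["type"] is cloned["children"][1]["type"]
```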
Test Plan:
build/test/test_jit
Imported from OSS
Differential Revision: D19406251
fbshipit-source-id: 2881c695f6e718e5432040a3817cf187a62017bf
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30892
Fixes all outstanding lints and actually installs a properly configured
flake8
Test Plan: Imported from OSS
Differential Revision: D18862825
Pulled By: suo
fbshipit-source-id: 08e9083338a7309272e17bb803feaa42e348aa85
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30890
We've received far too many complaints about this functionality making tests flaky, and it's not providing value to us anyway. Let's remove deadline testing.
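For reference, a minimal sketch of how deadline checking can be disabled globally, assuming the flaky tests in question are hypothesis property tests (the profile name here is arbitrary):
```
from hypothesis import given, settings, strategies as st

# Register and activate a profile with deadline checking turned off.
settings.register_profile("no_deadline", deadline=None)
settings.load_profile("no_deadline")

@given(st.integers())
def test_roundtrip(x):
    # Without a deadline, a slow first invocation no longer fails the test.
    assert int(str(x)) == x
```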
Test Plan: Imported from OSS
Differential Revision: D18857597
Pulled By: jamesr66a
fbshipit-source-id: 67e3412795ef2fb7b7ee896169651084e434d2f6
Summary:
In this PR, we enhance graph-mode quantization for aten::_convolution, which can be generated from the tracing path.
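A quick way to see where `aten::_convolution` comes from is to trace a small conv module and print its graph; this sketch is illustrative, and the exact op name printed can vary by PyTorch version:
```
import torch
import torch.nn as nn

class ConvModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3)

    def forward(self, x):
        return self.conv(x)

# Tracing records the dispatcher-level convolution op (historically
# aten::_convolution), which is what graph-mode quantization must match.
traced = torch.jit.trace(ConvModel(), torch.randn(1, 3, 16, 16))
print(traced.graph)
```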
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30245
Differential Revision: D18671597
Pulled By: lly-zero-one
fbshipit-source-id: 78a2470fbb0fe0def55d63c6bda7cbb5c89f7848
Summary:
This PR enables per-channel (row-wise) dynamic quantization for the linear operator. Given that we have seen some accuracy drop due to per-tensor quantization, we expect per-channel quantization to help improve accuracy.
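A minimal sketch of what per-channel (row-wise) weight quantization computes, using plain tensor ops rather than the actual quantized linear kernel: one (scale, zero_point) pair per output row instead of a single pair for the whole weight matrix.
```
import torch

def rowwise_qparams(weight, qmin=-128, qmax=127):
    # One scale/zero_point per output channel (row of the weight matrix).
    w_min = weight.min(dim=1).values.clamp(max=0.0)
    w_max = weight.max(dim=1).values.clamp(min=0.0)
    scale = (w_max - w_min) / float(qmax - qmin)
    scale = torch.where(scale > 0, scale, torch.ones_like(scale))
    zero_point = (qmin - torch.round(w_min / scale)).clamp(qmin, qmax)
    return scale, zero_point

weight = torch.randn(4, 16)                     # (out_features, in_features)
scale, zp = rowwise_qparams(weight)
q = torch.clamp(torch.round(weight / scale[:, None]) + zp[:, None], -128, 127)
dq = (q - zp[:, None]) * scale[:, None]         # dequantized approximation
print((weight - dq).abs().max())                # per-row error stays small
```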
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30122
Differential Revision: D18630541
Pulled By: lly-zero-one
fbshipit-source-id: d52685deec5e7de46cd686ae649a8c8765b9cacf
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331
Closes #27954
This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance:
1) Each variadic parameter on the module gets its own Module. That Module defines the `__getstate__` and `__setstate__` methods s.t. packed weights are properly re-packed on model load (a sketch of this pattern follows the traced graphs below).
2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (9c43b16df9/torch/csrc/jit/tracer.cpp (L285)) can see these parameters and create a `Value*` for them in the traced graph.
3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` collects the parameter values.
The `forward` of the traced LSTM before and after this change are as follows:
Before
```
def forward(self,
input: Tensor,
argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]:
hx, hx0, = argument_2
_0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True)
return (_0, (_1, _2))
```
After
```
def forward(self,
input: Tensor,
argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]:
_0 = self.cell._all_weight_values
_1 = getattr(_0, "0").param
_2 = getattr(_0, "1").param
hx, hx0, = argument_2
_3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True)
return (_3, (_4, _5))
```
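A hedged Python sketch of the wrapper pattern from steps 1-3 above; the `PackedWeightHolder` / `ToyDynamicRNNCell` names and the fake packing step are illustrative only, not the actual dynamic quantized LSTM code:
```
import torch
import torch.nn as nn

class PackedWeightHolder(nn.Module):
    """Holds one 'packed' parameter and re-packs it on load (step 1)."""
    def __init__(self, raw_weight):
        super().__init__()
        self.raw_weight = raw_weight
        self.param = self._pack(raw_weight)      # stand-in for real packing

    def _pack(self, w):
        return w.t().contiguous()                # toy "packing"

    def __getstate__(self):
        return self.raw_weight                   # serialize only the raw form

    def __setstate__(self, state):
        super().__init__()
        self.raw_weight = state
        self.param = self._pack(state)           # re-pack on model load

class ToyDynamicRNNCell(nn.Module):
    def __init__(self, weights):
        super().__init__()
        # Step 2: a ModuleList makes each holder visible as an attribute
        # in the module hierarchy, so the tracer can see its parameter.
        self._all_weight_values = nn.ModuleList(
            [PackedWeightHolder(w) for w in weights])

    def forward(self, x):
        # Step 3: collect the packed params into a plain list with a loop.
        packed = [m.param for m in self._all_weight_values]
        return x, packed

cell = ToyDynamicRNNCell([torch.randn(4, 4), torch.randn(4, 4)])
out, packed = cell(torch.randn(2, 4))
```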
Test Plan: Imported from OSS
Differential Revision: D18374904
Pulled By: jamesr66a
fbshipit-source-id: f1a9b58998bc365b9baad38c21fd4bb510dd639c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331
Closes #27954
This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance:
1) Each variadic parameter on the module gets its own Module. That Module defines the `__getstate__` and `__setstate__` methods s.t. packed weights are properly re-packed on model load.
2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (9c43b16df9/torch/csrc/jit/tracer.cpp (L285)) can see these parameters and create a `Value*` for them in the traced graph.
3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` collects the parameter values.
The `forward` of the traced LSTM before and after this change are as follows:
Before
```
def forward(self,
input: Tensor,
argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]:
hx, hx0, = argument_2
_0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True)
return (_0, (_1, _2))
```
After
```
def forward(self,
input: Tensor,
argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]:
_0 = self.cell._all_weight_values
_1 = getattr(_0, "0").param
_2 = getattr(_0, "1").param
hx, hx0, = argument_2
_3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True)
return (_3, (_4, _5))
```
Test Plan: Imported from OSS
Differential Revision: D18359880
Pulled By: jamesr66a
fbshipit-source-id: 0ff2cad294a1871123015dfc704eaf73a7ac1d9e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27347
It's already done in the op; we don't need to permute again.
Test Plan:
test_jit.py
we'll test in e2e tests
Imported from OSS
Differential Revision: D18182919
fbshipit-source-id: 04dd2a19a719828fbc7b62e451b81752187e0fcb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27269
Remove `test_quantizer.py`, and add a rewritten version of one of the tests from
`test_quantizer` to `test_quantization.py`.
The conv test is removed for now since the conv pattern is still broken; we'll add
another test later.
ghstack-source-id: 92869823
Test Plan:
python test/test_quantization.py
Imported from OSS
Differential Revision: D18182916
fbshipit-source-id: 325b5d8e877228d6a513e3ddf52c974479250d42
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27396
Add an observer that estimates moving averages of the min and max values per batch. It is better suited for quantization-aware training than the MinMax observers, which track extremal values across all batches.
ghstack-source-id: 91369018
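A minimal sketch of the moving-average idea (an exponential moving average of per-batch min/max; the class name and averaging constant are illustrative, not the actual observer implementation):
```
import torch

class MovingAverageMinMax:
    """Tracks an EMA of per-batch min/max instead of all-time extremes."""
    def __init__(self, averaging_constant=0.01):
        self.c = averaging_constant
        self.min_val = None
        self.max_val = None

    def observe(self, x):
        batch_min, batch_max = x.min().item(), x.max().item()
        if self.min_val is None:
            self.min_val, self.max_val = batch_min, batch_max
        else:
            # EMA: an outlier from a single batch decays instead of sticking.
            self.min_val += self.c * (batch_min - self.min_val)
            self.max_val += self.c * (batch_max - self.max_val)
        return self.min_val, self.max_val

obs = MovingAverageMinMax()
for _ in range(10):
    obs.observe(torch.randn(128))
print(obs.min_val, obs.max_val)
```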
Test Plan:
buck test caffe2/test:quantization -- 'test_per_tensor_observers \(test_quantization\.ObserverTest\)' --print-passing-details
buck test caffe2/test:quantization -- 'test_per_channel_observers \(test_quantization\.ObserverTest\)' --print-passing-details
Differential Revision: D17727213
fbshipit-source-id: 024a890bf3dd0bf269d8bfe61f19871d027326f0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27151
We need to be able to handle observers with no min/max data correctly, as models sometimes have modules that do not receive any data.
ghstack-source-id: 91113403
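A hedged sketch of the fallback behavior: when an observer never received data, return benign default qparams with a warning instead of failing (the default values below are illustrative assumptions):
```
import warnings
import torch

def calculate_qparams(min_val, max_val, qmin=0, qmax=255):
    """Return (scale, zero_point); fall back to defaults if unobserved."""
    if min_val is None or max_val is None:
        warnings.warn("Observer received no data; using default qparams.")
        return torch.tensor([1.0]), torch.tensor([0])
    min_val, max_val = min(min_val, 0.0), max(max_val, 0.0)  # include zero
    scale = (max_val - min_val) / float(qmax - qmin) or 1.0
    zero_point = int(round(qmin - min_val / scale))
    return torch.tensor([scale]), torch.tensor([zero_point])

print(calculate_qparams(None, None))   # module never received data
print(calculate_qparams(-1.0, 3.0))
```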
Test Plan:
buck test caffe2/test:quantization -- test_minmax_observer
buck test caffe2/test:quantization -- test_per_channel_minmax_observer
buck test caffe2/test:quantization -- test_histogram_observer
Reviewed By: csummersea
Differential Revision: D17690828
fbshipit-source-id: e95709333ea0f66d79ddb8141b7cba5a83347dbd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26457
Enhance the fuse module to support Sequentials; the fuse list can now be specified just like the state dict.
Also add support for Conv-ReLU and Linear-ReLU fusion.
Also support both in-place and out-of-place fusion of models.
ghstack-source-id: 91076386
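A usage sketch of the described fusion, assuming the `torch.quantization.fuse_modules` API with dotted, state-dict-style names (the toy model and its module names are illustrative):
```
import torch
import torch.nn as nn
import torch.quantization

class SubBlock(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 3, 1)
        self.relu = nn.ReLU()

    def forward(self, x):
        return self.relu(self.conv(x))

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.block = SubBlock()
        self.linear = nn.Linear(3, 3)
        self.relu = nn.ReLU()

    def forward(self, x):
        x = self.block(x).mean(dim=(2, 3))
        return self.relu(self.linear(x))

model = Model().eval()
# Names use the same dotted paths as the state dict; inplace=False returns
# a fused copy, while inplace=True mutates `model` directly.
fused = torch.quantization.fuse_modules(
    model, [["block.conv", "block.relu"], ["linear", "relu"]], inplace=False)
print(fused)
```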
Test Plan:
buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train \(test_quantization\.FusionTest\)' --print-passing-details
buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval \(test_quantization\.FusionTest\)' --print-passing-details
Differential Revision: D17466382
fbshipit-source-id: 0a548f8f4c366f3ecc59db693bac725ccd62328e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26612
Add support for an add-relu functional module; this allows fusion of the quantized add and relu operations.
ghstack-source-id: 91055976
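A short usage sketch of the functional-module pattern, assuming the `FloatFunctional` wrapper in `torch.nn.quantized` with an `add_relu` method; it gives the add + relu operation a module in the hierarchy that the quantization passes can attach observers to:
```
import torch
import torch.nn as nn
import torch.nn.quantized as nnq

class AddReluBlock(nn.Module):
    def __init__(self):
        super().__init__()
        # Replaces a bare torch.add + torch.relu so the operation has a
        # module (and hence an observer slot) in the module hierarchy.
        self.skip_add = nnq.FloatFunctional()

    def forward(self, x, y):
        return self.skip_add.add_relu(x, y)

block = AddReluBlock()
out = block(torch.randn(2, 3), torch.randn(2, 3))
print(out.min())   # >= 0 because of the fused relu
```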
Test Plan: buck test caffe2/test:quantization -- 'test_functional_module \(test_quantization\.FunctionalModuleTest\)' --print-passing-details
Differential Revision: D17518268
fbshipit-source-id: e1e8b4655d6b32405863ab9d1c7da111fb4343cc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26516
ghstack-source-id: 90982010
Test Plan:
Integrate per-channel support into conv and linear modules.
The following tests pass:
buck test caffe2/test:quantized -- 'test_linear_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
buck test caffe2/test:quantized -- 'test_conv_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
buck test caffe2/test:quantized -- 'test_float_quant_compare_per_channel \(test_quantized_models\.ModelNumerics\)' --print-passing-details
Differential Revision: D17342622
fbshipit-source-id: f0d618928e3d9348672c589a6b7a47049c372a2e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26390
`quantize_script`: top level API for graph mode quantization
Test Plan:
There are some known issues; we can enable the test after all of them are fixed.
Imported from OSS
Differential Revision: D17645132
fbshipit-source-id: 61f261d5607409d493b39a2f4e05ebd017279f6b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26782
At least we should be consistent across the top-level APIs and prepare/convert/etc.
The logic defaults to inplace=False, but the top-level APIs take care of doing fewer copies.
Also renames always-inplace methods like add_observer to end with an underscore.
One fix for MinMaxObserver was triggered by deepcopy surfacing that we were accidentally keeping autograd state around.
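A minimal sketch of the out-of-place flow implied above (eager-mode post-training static quantization; the toy model and calibration loop are illustrative):
```
import torch
import torch.nn as nn
import torch.quantization

model = nn.Sequential(nn.Linear(4, 4), nn.ReLU()).eval()
model.qconfig = torch.quantization.default_qconfig

# inplace=False (the default) returns new modules and leaves `model` alone.
prepared = torch.quantization.prepare(model, inplace=False)
for _ in range(8):                       # calibration feeds the observers
    prepared(torch.randn(2, 4))
quantized = torch.quantization.convert(prepared, inplace=False)

print(type(model[0]))                    # still nn.Linear
print(type(quantized[0]))                # quantized Linear module
```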
Test Plan: Imported from OSS
Differential Revision: D17595956
Pulled By: dzhulgakov
fbshipit-source-id: 801f9f5536b553f24c7a660064dd6fce685edd65
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26709
Polishes the implementation from #25975. Primarily, we use NoopObserver to communicate that weights need to be quantized to float16. The very top-level API (quantize_dynamic) stays the same with the `dtype` argument, but the implementation follows the common flow.
One can argue that dynamic fp16 quantization doesn't really fit into the 'observer' mechanism. It's in fact not ideal, but it's better to have the same flow than to branch on both dtype and qconfig.
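A usage sketch of the unchanged top-level API with the `dtype` argument, assuming an FBGEMM-capable CPU build (the toy LSTM is illustrative):
```
import torch
import torch.nn as nn
import torch.quantization

model = nn.LSTM(input_size=8, hidden_size=8, num_layers=1)

# Same entry point as int8 dynamic quantization; only the dtype changes.
fp16_model = torch.quantization.quantize_dynamic(
    model, {nn.LSTM}, dtype=torch.float16)

x = torch.randn(5, 3, 8)                 # (seq_len, batch, input_size)
out, (h, c) = fp16_model(x)
print(out.shape)
```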
Test Plan: Imported from OSS
Differential Revision: D17544103
Pulled By: dzhulgakov
fbshipit-source-id: 6af3f18c35929a1a53ea734079c005f656e4925f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26492
The previous definition of observers was quite clumsy, with things like `default_observer()()`. This PR strips away a lot of cruft and allows passing class names directly. To override default arguments, either `functools.partial` can be used or the convenient wrapper `MyObserver.with_args(x=1)` is provided.
Also rename `QConfig_dynamic` to `QConfigDynamic`, because the old name violates the naming convention.
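A short sketch of the two equivalent ways to override observer arguments described above (the observer and dtype choices are illustrative):
```
import functools
import torch
from torch.quantization import QConfig, MinMaxObserver

# Option 1: the convenience wrapper.
my_qconfig = QConfig(
    activation=MinMaxObserver.with_args(dtype=torch.quint8),
    weight=MinMaxObserver.with_args(dtype=torch.qint8,
                                    qscheme=torch.per_tensor_symmetric))

# Option 2: plain functools.partial works just as well.
my_qconfig_partial = QConfig(
    activation=functools.partial(MinMaxObserver, dtype=torch.quint8),
    weight=functools.partial(MinMaxObserver, dtype=torch.qint8,
                             qscheme=torch.per_tensor_symmetric))

# Either way, the entries are callables that construct observer instances.
print(my_qconfig.activation(), my_qconfig_partial.weight())
```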
Test Plan: Imported from OSS
Differential Revision: D17521265
Pulled By: dzhulgakov
fbshipit-source-id: ba9df19b368641acf4093c43df9990796284fd9e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26574
Since we also have `quantized::linear`, the name `quantize_linear` sounds
confusing, so we plan to rename it before the branch cut.
Test Plan:
ci
Imported from OSS
Differential Revision: D17514876
fbshipit-source-id: 01d9005e6ec8cb9950b9d8bba122109c389641d3
Summary:
Mainly want to resolve the comments from https://github.com/pytorch/pytorch/pull/25830.
Overall, we want to provide a recording observer that records the runtime tensor values along the activation path, in order to debug numerical accuracy loss offline.
According to the feedback in https://github.com/pytorch/pytorch/issues/25830, it might be better to record all the observed values in a dict and query the dict to get the corresponding tensor values. hx89 is working on how to insert the recording observers into the model under debug.
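A hedged sketch of what such a recording observer could look like; the `RecordingObserver` class and the tensor dict below are illustrative, not the code from this PR:
```
import torch
import torch.nn as nn

class RecordingObserver(nn.Module):
    """Identity module that stores every tensor it sees, keyed by call index."""
    def __init__(self):
        super().__init__()
        self.tensor_log = {}               # call index -> recorded tensor

    def forward(self, x):
        self.tensor_log[len(self.tensor_log)] = x.detach().clone()
        return x                            # pass-through, numerics unchanged

debug_model = nn.Sequential(nn.Linear(4, 4), RecordingObserver(), nn.ReLU())
for _ in range(3):
    debug_model(torch.randn(2, 4))
# Query the dict offline to compare against the quantized model's activations.
print({k: v.shape for k, v in debug_model[1].tensor_log.items()})
```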
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26413
Differential Revision: D17506502
Pulled By: llyfacebook
fbshipit-source-id: 3ab90dc78920e7ec3fa572c2a07327a9991c530a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25975
We would like to add the FP16 weight support for the dynamic quantized LSTM.
Test Plan:
buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn \(test_quantization\.PostTrainingDynamicQuantTest\)' --print-passing-details
```
[jianyuhuang@devvm794.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization
-- 'test_quantized_rnn \(test_quantization\.PostTrainingDynamicQuantTest\)' --print-passing-details
Building: finished in 13.4 sec (100%) 8134/8134 jobs, 81 updated
Total time: 13.9 sec
Trace available for this run at /tmp/testpilot.20190910-210241.2092790.log
TestPilot test runner for Facebook. See https://fburl.com/testpilot for details.
Testpilot build revision c86e65add357582accb6ec0be23b92c8a2c510bd fbpkg ca46e8f5b26c451a8b0b2462c11bb61d at Mon Sep 9
22:16:37 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/696/t.par
Discovering tests
Running 1 tests
Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971
✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 0.183 1/1 (passed)
Test output:
> test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok
>
> ----------------------------------------------------------------------
> Ran 1 test in 0.184s
>
> OK
Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971
Summary (total time 4.35s):
PASS: 1
FAIL: 0
SKIP: 0
FATAL: 0
TIMEOUT: 0
OMIT: 0
```
Differential Revision: D17299116
fbshipit-source-id: 7fe91ece25867f2c0496f1b63fb1041e6b815166
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25887
ghstack-source-id: 90383258
Add a per-channel observer to compute the qparams for each channel.
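A minimal sketch of the per-channel qparams computation (affine quantization per channel along dim 0; the quint8 qmin/qmax and the clamping of ranges to include zero are assumptions):
```
import torch

def per_channel_qparams(x, ch_axis=0, qmin=0, qmax=255):
    """Compute one (scale, zero_point) pair per channel along ch_axis."""
    dims = [d for d in range(x.dim()) if d != ch_axis]
    min_vals = x.amin(dim=dims).clamp(max=0.0)   # ranges must include zero
    max_vals = x.amax(dim=dims).clamp(min=0.0)
    scales = (max_vals - min_vals) / float(qmax - qmin)
    scales = torch.where(scales > 0, scales, torch.ones_like(scales))
    zero_points = (qmin - torch.round(min_vals / scales)).clamp(qmin, qmax).to(torch.int64)
    return scales, zero_points

w = torch.randn(4, 3, 3, 3)              # e.g. a conv weight with 4 output channels
scales, zps = per_channel_qparams(w)
q = torch.quantize_per_channel(w, scales, zps, axis=0, dtype=torch.quint8)
print(q.q_per_channel_scales())
```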
Test Plan:
buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer'
buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer_scriptable'
Differential Revision: D17137226
fbshipit-source-id: 0b1c93e3cbcda86f5c4e30f7cd94c670f2665063
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24022
In the histogram observer, add an approximation of L2 error minimization for selecting min/max.
By selecting a new min/max, we filter out outliers in the input distribution.
This follows the implementation of NormMinimization::NonlinearQuantizationParamsSearch in caffe2/quantization/server/norm_minimization.cc
ghstack-source-id: 90298789
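A simplified sketch of the idea: shrink the clipping range step by step and keep the candidate with the smallest approximate L2 quantization error over the histogram (the real NonlinearQuantizationParamsSearch uses a more refined search):
```
import numpy as np

def search_clip_range(hist, bin_edges, nbins_quant=256, steps=20):
    """Pick (min, max) that minimizes approximate L2 quantization error."""
    centers = (bin_edges[:-1] + bin_edges[1:]) / 2.0
    best = (float("inf"), bin_edges[0], bin_edges[-1])
    full_range = bin_edges[-1] - bin_edges[0]
    for i in range(steps):
        lo = bin_edges[0] + i * full_range / (2.0 * steps)
        hi = bin_edges[-1] - i * full_range / (2.0 * steps)
        scale = (hi - lo) / nbins_quant
        # Clipped values land on lo/hi; inside the range the rounding error
        # is bounded by scale / 2, so approximate it by scale / 2.
        err_per_bin = np.where(centers < lo, lo - centers,
                               np.where(centers > hi, centers - hi, scale / 2.0))
        l2 = float(np.sum(hist * err_per_bin ** 2))
        if l2 < best[0]:
            best = (l2, lo, hi)
    return best[1], best[2]

data = np.concatenate([np.random.randn(10000), np.array([40.0, -35.0])])
hist, edges = np.histogram(data, bins=512)
print(search_clip_range(hist, edges))   # the outliers get clipped away
```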
Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_histogram_observer'
Differential Revision: D16713239
fbshipit-source-id: 82631ba47974e25689c9c66bc3088117090e26d4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23959
Add a histogram observer that records the running histogram of tensor values along with min/max values.
ghstack-source-id: 90076996
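A minimal sketch of maintaining a running histogram together with running min/max; the fixed range and bin count are simplifying assumptions, since the real observer rescales its histogram as the observed range grows:
```
import torch

class RunningHistogram:
    def __init__(self, bins=256, lo=-10.0, hi=10.0):
        self.bins, self.lo, self.hi = bins, lo, hi
        self.hist = torch.zeros(bins)
        self.min_val = float("inf")
        self.max_val = float("-inf")

    def observe(self, x):
        self.min_val = min(self.min_val, x.min().item())
        self.max_val = max(self.max_val, x.max().item())
        # Accumulate counts over a fixed range (values outside are clamped).
        self.hist += torch.histc(x.clamp(self.lo, self.hi),
                                 bins=self.bins, min=self.lo, max=self.hi)

h = RunningHistogram()
for _ in range(5):
    h.observe(torch.randn(1024))
print(h.min_val, h.max_val, h.hist.sum())
```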
Test Plan:
Added a test test_histogram_observer
buck test mode/dev caffe2/test:quantization -- 'test_histogram_observer'
buck test mode/dev caffe2/test:quantization -- 'test_observer_scriptable'
Differential Revision: D16692835
fbshipit-source-id: 0f047d3349cb9770fad4a2b6cb346c51d9e99cd4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25976
As recommended in https://github.com/pytorch/pytorch/pull/25877/files#r322956051:
> We should move more of these toward using BytesIO. Using files in tests is generally considered bad practice because it introduces syscalls and dependencies on the execution environment, and thus can cause test flakiness/instability.
ghstack-source-id: 89929947
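The recommended pattern, as a short sketch: save and load through an in-memory buffer instead of a temporary file.
```
import io
import torch
import torch.nn as nn

model = nn.Linear(4, 2)

buf = io.BytesIO()                       # in-memory buffer, no temp file
torch.save(model.state_dict(), buf)
buf.seek(0)                              # rewind before reading back
state = torch.load(buf)

reloaded = nn.Linear(4, 2)
reloaded.load_state_dict(state)
assert torch.equal(reloaded.weight, model.weight)
```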
Test Plan: CI
Differential Revision: D17310441
fbshipit-source-id: ba97cce4224225df45ff44062f1bc8ebefb25922