Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46112
### Summary
This PR adds support for running TorchScript models on iOS GPUs via Metal (inference only). The feature is currently in a prototype state, and API changes are expected. A tutorial and documentation will be added once it moves to beta.
- User API
```
// Load the TorchScript model and put it in inference mode.
auto module = torch::jit::load(model);
module.eval();
// Copy the input tensor to the Metal (GPU) backend.
at::Tensor input = at::ones({1, 3, 224, 224}, at::ScalarType::Float).metal();
// Run inference, then copy the output back to the CPU.
auto output = module.forward({input}).toTensor().cpu();
```
- Supported Models
- Person Segmentation v106 (FB Internal)
- MobileNetV2
- Supported Operators
- aten::conv2d
- aten::addmm
- aten::add.Tensor
- aten::sub.Tensor
- aten::mul.Tensor
- aten::relu
- aten::hardtanh
- aten::hardtanh_
- aten::sigmoid
- aten::max_pool2d
- aten::adaptive_avg_pool2d
- aten::reshape
- aten::t
- aten::view
- aten::log_softmax.int
- aten::upsample_nearest2d.vec
- Supported Devices
- Apple A9 and above
- iOS 10.2 and above
- CMake scripts
- `IOS_ARCH=arm64 ./scripts/build_ios.sh -DUSE_METAL=ON`
### Test Plan
- Circle CI
ghstack-source-id: 114155638
Test Plan:
1. Sandcastle CI
2. Circle CI
Reviewed By: dreiss
Differential Revision: D23236555
fbshipit-source-id: 98ffc48b837e308bc678c37a9a5fd8ae72d11625
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43721
We can combine the optimization pass and save_for_mobile to reduce friction. Since a lite-interpreter model can also be used in the full JIT, I don't think we need the option to save it as a full-JIT model. A sketch of the combined flow follows the list below.
Also:
- improved the usage message
- print the op list before and after the optimization pass
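For reference, a minimal C++ sketch of the combined flow the binary performs, assuming the `torch::jit::optimizeForMobile` entry point from the XNNPACK rewrite pass and the `Module::_save_for_mobile` API; file paths follow the example below:
```
#include <torch/csrc/jit/passes/xnnpack_rewrite.h>
#include <torch/script.h>

int main() {
  // Load the full-JIT TorchScript model and switch to inference mode.
  auto module = torch::jit::load("/home/linbin/sparkspot.pt");
  module.eval();

  // Run the mobile optimization passes (prepacking, fusion, etc.).
  auto optimized = torch::jit::optimizeForMobile(module);

  // Save in the lite-interpreter (bytecode) format. The resulting file
  // can still be used in the full JIT as well.
  optimized._save_for_mobile("/home/linbin/sparkspot_mobile_optimized.bc");
  return 0;
}
```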
Test Plan:
```
buck run //xplat/caffe2:optimize_for_mobile -- --model=/home/linbin/sparkspot.pt
Building: finished in 12.4 sec (100%) 2597/2597 jobs, 2 updated
Total time: 12.5 sec
pt_operator_library(
name = "old_op_library",
ops = [
"aten::_convolution",
"aten::adaptive_avg_pool2d",
"aten::add_.Tensor",
"aten::batch_norm",
"aten::mul.Tensor",
"aten::relu_",
"aten::softplus",
"aten::sub.Tensor",
],
)
pt_operator_library(
name = "new_op_library",
ops = [
"aten::adaptive_avg_pool2d",
"aten::add_.Tensor",
"aten::batch_norm",
"aten::mul.Tensor",
"aten::relu_",
"aten::softplus",
"aten::sub.Tensor",
"prepacked::conv2d_clamp_run",
],
)
The optimized model for lite interpreter was saved to /home/linbin/sparkspot_mobile_optimized.bc
```
```
buck run //xplat/caffe2:optimize_for_mobile -- --model=/home/linbin/sparkspot.pt --backend=vulkan
```
Reviewed By: kimishpatel
Differential Revision: D23363533
fbshipit-source-id: f7fd61aaeda5944de5bf198e7f93cacf8368babd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37462
Instead of running every optimization pass in the optimizeForMobile method,
this introduces an optimizer whitelist dictionary as the method's second
parameter. When the dictionary is not passed, the method runs all the
optimization passes; otherwise it reads the dict and runs only the passes
whose value is True.
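A minimal sketch of the calling pattern described above, assuming the dictionary maps pass names to booleans; the pass-name keys and the model path here are illustrative, not the actual API strings:
```
#include <torch/csrc/jit/passes/xnnpack_rewrite.h>
#include <torch/script.h>

#include <string>
#include <unordered_map>

void optimize_example() {
  auto module = torch::jit::load("/tmp/model.pt");

  // No dictionary passed: every optimization pass runs.
  auto all_passes = torch::jit::optimizeForMobile(module);

  // With a dictionary, only the passes whose value is true run.
  // (Key names are illustrative.)
  std::unordered_map<std::string, bool> whitelist = {
      {"insert_fold_prepack_ops", true},
      {"remove_dropout", false},
  };
  auto selected = torch::jit::optimizeForMobile(module, whitelist);
}
```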
ghstack-source-id: 106104503
Test Plan:
python test/test_mobile_optimizer.py
Imported from OSS
Differential Revision: D22096029
fbshipit-source-id: daa9370c0510930f4c032328b225df0bcf97880f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35904
Currently this optimization transforms conv2d and linear ops into their
prepacked (XNNPACK) equivalents.
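A minimal sketch of how one might observe the rewrite, assuming the `torch::jit::optimizeForMobile` pass is used directly; in the optimized module, the `aten` conv/linear calls show up as `prepacked::conv2d_clamp_run` / `prepacked::linear_clamp_run`:
```
#include <torch/csrc/jit/passes/xnnpack_rewrite.h>
#include <torch/script.h>

#include <iostream>

int main() {
  auto module = torch::jit::load("/tmp/inpainting_fbnet.pt");

  // Rewrite conv2d/linear into their prepacked (XNNPACK) equivalents.
  auto optimized = torch::jit::optimizeForMobile(module);

  // The printed graph now contains prepacked::conv2d_clamp_run /
  // prepacked::linear_clamp_run in place of the aten originals.
  std::cout << *optimized.get_method("forward").graph();
  return 0;
}
```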
Test Plan: buck run fbsource//xplat/caffe2:optimize_for_mobile -- --model="/tmp/inpainting_fbnet.pt"
Reviewed By: AshkanAliabadi
Differential Revision: D20824433
fbshipit-source-id: 88d5c0d21b77911f95f018b03398b0df758ab0d7