Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25439
This introduces a type() method on IValue that returns the tagged type
of the IValue. The intention is that this value is always present/accurate,
making it possible for clients to recover the Type from an IValue.
Currently our APIs here are incomplete: they can sometimes recover a type but not always.
This PR adds the function, and cleans up remaining cases where Lists/Dicts are not
tagged. However, this information does not survive serialization unchanged.
A second PR will use the type information in the ClassType being serialized
to fix up the serialized IValues so they have the correct types again.
After this patch it will be safe to remove our incomplete APIs for recovering types.
Test Plan: Imported from OSS
Differential Revision: D17125595
Pulled By: zdevito
fbshipit-source-id: 71c8c1a0e44762647e8f15f45d8ed73af8e6cb92
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25151
The prim::GetAttr operator depends on the Node. However, in the lite interpreter there will be no Node dependency, so the operator is promoted to a first-class instruction.
Test Plan: Imported from OSS
Differential Revision: D17076412
fbshipit-source-id: 8de20978445bb598634c5462e66e4459dcd567be
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25148
Instructions will be used in the lite interpreter as well. Pull them out of interpreter.cpp so that the lite interpreter doesn't have to be compiled against interpreter.cpp.
Test Plan: Imported from OSS
Differential Revision: D17076413
fbshipit-source-id: 99b3d8d27a96823a4a4dde6b2337ee44635e34cb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25258
This is the first commit in a series to add interfaces to the JIT.
Interfaces allow an abstract interface to be specified through a blank Python
class; that interface can then be used in type annotations for script functions.
If a TorchScript class implements all the methods in the interface with
the appropriate types, then it is implicitly considered to implement
that interface.
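A hedged sketch of what this enables, assuming the `torch.jit.interface` decorator as the frontend spelling (the exact spelling in this initial commit may differ):
```
import torch

@torch.jit.interface
class Scorer(object):
    def score(self, x: torch.Tensor) -> torch.Tensor:
        pass

@torch.jit.script
class DoubleScorer(object):
    def __init__(self):
        self.scale = 2.0

    def score(self, x: torch.Tensor) -> torch.Tensor:
        # Matching method names and signatures is enough; DoubleScorer never
        # names Scorer explicitly.
        return x * self.scale

@torch.jit.script
def run(s: Scorer, x: torch.Tensor) -> torch.Tensor:
    # Any TorchScript class whose methods satisfy Scorer can be passed here.
    return s.score(x)
```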
Follow-ups required:
* implementation of serialization
* implementation in the parser frontend
* better error reporting for explaining why a class does not meet an
interface specification.
Test Plan: Imported from OSS
Differential Revision: D17079963
Pulled By: zdevito
fbshipit-source-id: a9986eeba2d4fdedd0064ce7d459c0251480a5a0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24284
This PR finishes the unification of all Tensor types into a single object.
ProfiledTensorType is renamed to TensorType and the old TensorType is
deleted.
Notes:
* Fixes a bug in the merge logic for VaryingShape by changing its representation to an
optional list of optional ints.
* Removes ProfiledTensorType::create(type) invocations that can now
simply be expect calls on TensorType.
Test Plan: Imported from OSS
Differential Revision: D16794034
Pulled By: zdevito
fbshipit-source-id: 10362398d0bb166d0d385d74801e95d9b87d9dfc
Summary:
Resolves https://github.com/pytorch/lockdown/issues/18
This implements NamedTuple by taking advantage of the existing `names` field in `TupleType`.
TODO: This currently doesn't retain the NamedTuple-ness through serialization. Discussed with suo offline: we can probably add a way to define an anonymous NamedTuple in script (e.g. `NamedTuple('Foo', [('a', int), ('b', float), ('c', List[float])])`) and serialize that.
TODO: implement support for calling the constructor with kwargs
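A minimal sketch of the usage this enables (hedged; the exact supported spellings at this point may differ), using the anonymous-NamedTuple form from the summary and positional construction per the kwargs TODO:
```
from typing import List, NamedTuple

import torch

Foo = NamedTuple('Foo', [('a', int), ('b', float), ('c', List[float])])

@torch.jit.script
def use_foo(x: float) -> float:
    foo = Foo(1, x, [x, x])  # positional construction; kwargs not yet supported
    return foo.b + foo.c[0]  # fields are accessible by name inside script
```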
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21428
Differential Revision: D15741564
Pulled By: jamesr66a
fbshipit-source-id: c077cbcea1880675ca6deb340a9ec78f824a136c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21177
- Integrate c10::ListPtr into IValue and the c10 dispatcher.
- Streamline conversion to/from IValue. Before, we had IValue::to<>, and kernel_functor.h had its own ivalue_to_arg_type and return_type_to_ivalue. They are now unified. This also means that nested types like Dicts of Lists of Optional of Dict of ... now work as expected.
Differential Revision: D15476433
fbshipit-source-id: bde9df80df20091aa8e6ae17ba7e90abd149b954
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18833
ghimport-source-id: 6f2be25fcc5e6be3ffe20582e604bd2c1fbab66b
Stack from [ghstack](https://github.com/ezyang/ghstack):
* **#18833 [STACK] Cache device on TensorImpl; clean up TensorImpl constructors.**
* #18832 [STACK] Disallow changing the device of a tensor via set_.
* #18831 [STACK] Stop swapping in Storages of the wrong device for Tensors.
1) We cache the device on TensorImpl. This means we can access the device without a virtual function call, and it allows us to extend TensorImpls more easily (because they don't need to figure out how to store the Device themselves).
2) Clean up TensorImpl APIs. We had a constructor that took a TensorTypeId and an allocator and would allocate a Storage based on which TensorTypeIds it recognized. Instead, we just have two different constructors: one for types with a storage, one without.
Reviewed By: dzhulgakov
Differential Revision: D14766230
fbshipit-source-id: 745b8db84dcd6cb58f1a8675ad3ff8d033bc50df
Summary:
This defines a generic counters API that users can use to provide monitoring functionality in, e.g., a production service. We expose both counters for runtime internals and a TorchScript API to create user-defined counters. Synopsis of the API:
- `torch/csrc/jit/script/logging.h` specifies the externally-facing API in C++
- `torch/jit/_logging.py` specifies the Python API
We use an interface, `LoggerBase`, to define the interactions between users and a logging backend. Implementing a subclass of `LoggerBase` allows the user to handle these events in a custom way, such as logging into a DB or calling into an infra-specific counters API.
From the frontend perspective, we can create log events in two ways:
1. We provide an `add_stat_value(name, val)` function. This calls into the Logger backend with a key/value pair. For example, we might call `add_stat_value('foo', 1)` to bump an event counter.
2. We provide a `time_point()` function to record a timestamp in nanoseconds. This can be used in conjunction with `add_stat_value` to record runtime wall clock durations.
Examples of frontend usage can be found in `test_jit.py TestLogging`.
We provide a trivial `LockingLogger` implementation as an example and for testing purposes. It is likely not ready for production usage. It demonstrates that a backend implementing the API can do things like specify aggregation types and report these aggregate stats via the `get_counters()` API.
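A rough sketch of how this might be driven from Python. The names `add_stat_value`, `time_point`, `LockingLogger`, and `get_counters` come from the summary; `set_logger` is an assumed name for the hook that installs a backend and may be spelled differently:
```
import torch
import torch.jit._logging as jit_logging

# Install the sample backend so counters are recorded somewhere inspectable.
logger = jit_logging.LockingLogger()
jit_logging.set_logger(logger)  # assumed installer hook

start = jit_logging.time_point()           # wall-clock timestamp in nanoseconds
jit_logging.add_stat_value("foo", 1)       # bump the 'foo' event counter
jit_logging.add_stat_value("elapsed_ns", jit_logging.time_point() - start)

print(logger.get_counters())               # aggregated stats from the backend
```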
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18235
Differential Revision: D14545060
Pulled By: jamesr66a
fbshipit-source-id: 04099543a1898cfdd411511e46e03d5dce9b4881
Summary:
1. Move ATen threadpool & open registration mechanism to C10
2. Move the `global_work_queue` to use this open registration mechanism, to allow users to substitute in their own
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17788
Reviewed By: zdevito
Differential Revision: D14379707
Pulled By: jamesr66a
fbshipit-source-id: 949662d0024875abf09907d97db927f160c54d45
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16751
This was made more complicated by the fact that ivalue::IntList
is a thing. So I had to fix up, after the fact, all of the sites referring
to the IValue variant.
The following codemods were run, in this order:
```
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in IntList IntArrayRef
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in IntArrayRef::create IntList::create
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in ivalue::IntArrayRef ivalue::IntList
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in Tag::IntArrayRef Tag::IntList
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in isIntArrayRef isIntList
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in toIntArrayRef toIntList
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in 'Shared<IntArrayRef>' 'Shared<IntList>'
codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in 'intrusive_ptr<IntArrayRef>' 'intrusive_ptr<IntList>'
```
Some manual fixups were done afterwards; they can be reviewed separately
at https://github.com/pytorch/pytorch/pull/16752
Reviewed By: dzhulgakov
Differential Revision: D13954363
fbshipit-source-id: b5c40aacba042402155a2f5a229fa6db7992ac64
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15855
This is preparation work for moving IValue to c10.
Reviewed By: ezyang
Differential Revision: D13605259
fbshipit-source-id: cc545f582ab8607bb02aaf71273cb2710200b295
Summary:
Respect the grad guard for torch.jit._fork and torch.jit._wait.
Verified that the test failed without the fix and passes with the fix.
Ideally I would like to enable and disable grad inside the forked function;
that doesn't seem to be supported at the moment. This code handles that
case as well.
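A minimal sketch of the behavior being fixed (hedged; the actual tests may drive this differently):
```
import torch

def twice(x):
    return x * 2

@torch.jit.script
def fork_twice(x):
    fut = torch.jit._fork(twice, x)
    return torch.jit._wait(fut)

x = torch.ones(2, requires_grad=True)

with torch.no_grad():
    # The grad guard active when the fork runs is now respected by the forked work.
    y = fork_twice(x)

print(y.requires_grad)  # False with the fix
```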
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16101
Differential Revision: D13708374
Pulled By: gqchen
fbshipit-source-id: 0533f080c4d0253fb4c61d2a0d3cc22de5721a09
Summary:
The PR clang-formats everything in `torch/csrc/jit/` and adds it to the pre-commit hook.
Here is a list of non-mechanical changes:
- I went over each file and fixed up whenever I could tell that clang-format was clobbering comment formatting.
- Made the macros in register_prim_ops a little more clang-format friendly by omitting trailing commas
- Refactored autodiff.cpp to use a helper class with explicit state rather than a bunch of capturing lambdas
- Small improvements to the pre-commit clang-format hook
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15524
Differential Revision: D13547989
Pulled By: suo
fbshipit-source-id: 3ff1541bb06433ccfe6de6e33f29227a2b5bb493
Summary:
Save error info in the future for the parent thread to pick up. Throw the error
when the thread is the root thread.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14523
Differential Revision: D13251756
Pulled By: highker
fbshipit-source-id: b40f9a45665e1a934743f131ec5e8bad5622ce67
Summary:
Removing the deprecated functions in `torch/csrc/variable_tensor_functions.h` (like `torch::CPU`) and corresponding implementations from `torch/csrc/torch.cpp` from master after the release.
ezyang gchanan soumith
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15003
Differential Revision: D13418086
Pulled By: goldsborough
fbshipit-source-id: a0accdf6f7b0efa1ec07ac7b74b86ff2da37543f
Summary:
Anywhere we used #include "foo.h", we now say #include <foo.h>
Paths are adjusted to be rooted out of aten/src, torch/lib, or
the root level directory.
I modified CMakeLists.txt by hand to remove TH and THC from
the include paths.
I used the following script to do the canonicalization:
```
import subprocess
import re
import os.path
files = subprocess.check_output(['git', 'ls-files']).decode('utf-8').rstrip().split('\n')
for fn in files:
    if not any(fn.endswith(suff) for suff in ['.cu', '.cpp', '.in', '.h', '.hpp', '.cu', '.cuh', '.cc']):
        continue
    if not any(fn.startswith(pref) for pref in ["aten/", "torch/"]):
        continue
    with open(fn, 'r') as f:
        c = f.read()
    def fmt(p):
        return "#include <{}>".format(p)
    def repl(m):
        p = m.group(1)
        if p in ["dlfcn.h", "unistd.h", "nvrtc.h", "cuda.h", "cuda_runtime.h", "cstdint", "cudnn.h", "Python.h", "cusparse.h", "cuda_runtime_api.h", "cuda_fp16.h", "cublas_v2.h", "stdint.h", "curand_kernel.h"]:
            return fmt(p)
        if any(p.startswith(pref) for pref in ["torch/csrc", "c10/", "ATen/", "caffe2/", "TH/", "THC/", "Eigen/", "gtest/", "zdl/", "gloo/", "onnx/", "miopen/"]):
            return fmt(p)
        for root in ["aten/src", "torch/lib", ""]:
            for bad_root in [os.path.dirname(fn), "aten/src/TH", "aten/src/THC", "torch/csrc"]:
                new_p = os.path.relpath(os.path.join(bad_root, p), root)
                if not new_p.startswith("../") and (os.path.exists(os.path.join(root, new_p)) or os.path.exists(os.path.join(root, new_p + ".in"))):
                    return fmt(new_p)
        print("ERROR: ", fn, p)
        return m.group(0)
    new_c = re.sub(r'#include "([^"]+)"', repl, c)
    if new_c != c:
        print(fn)
        with open(fn, 'w') as f:
            f.write(new_c)
```
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14849
Reviewed By: dzhulgakov
Differential Revision: D13363445
Pulled By: ezyang
fbshipit-source-id: 52361f878a672785f9306c9e9ab2513128092b68
Summary:
(1) Move Caffe2 thread pool to aten
(2) Use the same thread pool definition for PyTorch interpreter
(3) Make ivalue::Future thread-safe
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14114
Reviewed By: ilia-cher
Differential Revision: D13110451
Pulled By: highker
fbshipit-source-id: a83acb6a4bafb7f674e3fe3d58f7a74c68064fac
Summary:
InterpreterStateImpl can continue its lifecycle by incrementing the ref
count itself. This patch also removes the InterpreterState::clone()
interface, which conflicts with intrusive_ptr_target, which disallows copying.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13784
Differential Revision: D13015451
Pulled By: highker
fbshipit-source-id: a05f1ea6549d52ec693ccffefaa4d520b2474b8c
Summary:
Upon calling wait(), save the forked thread and the current thread to a
task queue. A idling thread (which currently is single threaded) should
pick a ready task and run till there is nothing in the task queue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13212
Differential Revision: D12884522
Pulled By: highker
fbshipit-source-id: b3942a0ee63c148e05f5f41bdc73007fa3c3368e
Summary:
Enables almost all `modernize-*` checks in clang-tidy. This warns against things such as:
- Use of `const std::string&` instead of new-style `std::string` + move,
- Using old-style loops instead of range-for loops,
- Use of raw `new`
- Use of `push_back` instead of `emplace_back`
- Use of `virtual` together with `override` (`override` is sufficient)
ezyang
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13196
Differential Revision: D12891837
Pulled By: goldsborough
fbshipit-source-id: 4d0f782a09eb391ee718d3d66f74c095ee121c09
Summary:
This PR principally redesigns the fuser's logical flow to be hierarchical, with device-independent logic directing (relatively little) device-specific logic. This design is based on reviews of XLA, TVM, internal design review at NVIDIA and discussions with fuser owners at Facebook. To further vet the design I have begun developing the next significant PR (extended fusion logic) on top of this architecture and it has made the work significantly easier. This PR also improves fuser modularity, which should make it easier for others to contribute to. Unfortunately, this PR is large and its nature has made breaking it into smaller pieces challenging. Future PRs should be smaller.
The fusion flow is now:
- Fusions are "registered" and "upfront compilation" occurs. The fusion specifications, which include the graph, go into a thread-safe device-independent cache. Upfront compilation generates some information used later during shape inference.
- Fusions are run, which passes them to an executor that performs shape inference, requests an instantiated fusion from the specification's thread-safe store, and launches them. Launch logic eventually defers to device-specific logic.
- Fusions not previously instantiated are compiled. Compilation is device-specific and arg-specific. Compilation logic eventually defers to device-specific logic.
- If the fusion cannot be run because fusion on the requested device is disabled or shape inference fails, a fallback is invoked.
This flow can be thought of as PyTorch IR -> Device-Independent Fusion Logic -> Device-Specific Fusion Logic. The current upstream logic is, by contrast, PyTorch IR -> Device-Specific Logic -> Device-Independent Logic, which results in needless code duplication and lack of conceptual clarity. That was my mistake when splitting the fuser off from the rest of the jit and our reviews since then have been incredibly helpful in understanding why the approach in this PR is better.
This PR does not only move code around. It also fixes a couple of bugs and makes some logical/code changes.
Bug fixes:
- thread safety is improved, with the caches now guarding against concurrent access
- the nvrtc version is now reviewed to determine the appropriate compute architecture to compile for, fixing a bug that would cause runtime errors if a user's nvrtc didn't support the compute architecture their gpu reported
- an issue with DeviceGuard not setting the device properly and failing silently is worked around (ezyang mentioned he was reviewing the dynamic registration DeviceGuard uses, which may resolve the issue)
Code/Logical changes:
- "const" now appears many more places (note: I cast const away in operator.h because of some obscure build issues -- I think we should be able to fix this and will take a look while this goes through testing)
- The new flow allowed some redundant code to be removed (AnnotatedGraph is gone, for example, and the more straightforward flow eliminated duplication of effort elsewhere)
- Fallback logic is now also invoked if a fusion is requested on a device that cannot handle fusions
- Use of macros to determine which files are compiled is reduced (though they may come back if the Windows build is unhappy)
- There is no more "common" code or folder, the device-independent logic being at the forefront of the fuser replaces and improves upon the goal of sharing code
apaszke who I promised naming rights to
zdevito who correctly pointed out that the device-independent logic should be the bulk of what the fuser is doing
ngimel who contributed to the design of this architecture
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13108
Reviewed By: gchanan, fmassa
Differential Revision: D12850608
Pulled By: soumith
fbshipit-source-id: 24e2df6dfa97591ee36aeca8944519678c301fa3
Summary:
This is a first step towards adding exceptions. We need minimal support in order to begin converting the torch library to weak script mode (which is the main goal here).
Some limitations (that are documented in the tests & compiler):
1. Cannot assign exceptions to variables
2. Any name after `raise` is treated as a valid Exception
3. No control-flow analysis yet. Below, `a` will be undefined:
   if True:
       a = 1
   else:
       raise Exception("Hi")
   return a
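A minimal sketch of what this first step supports (hedged against the limitations above):
```
import torch

@torch.jit.script
def guarded(x):
    # Exceptions cannot be bound to a variable, and any name after `raise`
    # is accepted as an exception type.
    if x.size(0) == 0:
        raise Exception("expected a non-empty tensor")
    return x * 2
```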
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12789
Differential Revision: D12848936
Pulled By: eellison
fbshipit-source-id: 1f60ceef2381040486123ec797e97d65b074862d