- Cleaned up THNN and THCUNN code and kernels
- Improved THCUNN kernel performance 5x, making it match cuDNN performance
- Added support for computing softmax over arbitrary dims
NOTE: The default dim for 3D inputs is now 1 (used to be 0)
- Both functions (softmax and log_softmax) now accept inputs with arbitrarily many dimensions
- Autograd functions no longer save the input (it's unnecessary)
- Added cuDNN bindings for softmax, but they are unused as THCUNN
matches or even exceeds cuDNN performance
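A minimal usage sketch of the new dim argument (the shapes here are just for illustration):

    import torch
    import torch.nn.functional as F
    from torch.autograd import Variable

    x = Variable(torch.randn(4, 5, 6))   # 3D input
    y = F.softmax(x, dim=2)              # softmax over an arbitrary dimension
    print(y.sum(dim=2))                  # each slice along dim=2 sums to 1
    # Omitting dim on a 3D input now defaults to dim=1 (it used to be 0)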
* skeleton commit for building and linking nnpack library in PyTorch
* first stab at conv forward binding + integration
* bind NNPACK gradient kernels
* move nnpack forward, input gradient calls deeper
* nnpack conv api mimics nn
* fix symbol error; use memory across calls
* clean up warnings, add shape checking, thread safety, configurable thread specification
* add batch size threshold; also bind the single-element batch case for future use (dispatch sketched below)
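A hypothetical sketch of the dispatch policy implied by the commits above: route small batches through the NNPACK kernels and leave everything else on the existing THNN path. The threshold value and the predicate name are illustrative assumptions, not the actual binding.

    import torch

    # Assumed cutoff, not the real value used by the binding
    NNPACK_BATCH_THRESHOLD = 16

    def should_use_nnpack(input):
        # 4D (N, C, H, W) inputs with a small batch, including the
        # single-element case, are the ones the NNPACK path targets
        return input.dim() == 4 and input.size(0) <= NNPACK_BATCH_THRESHOLD

    x = torch.randn(1, 3, 224, 224)      # single-element batch
    print(should_use_nnpack(x))          # True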
This adds some generated autograd functions implemented in C++, which
are generated from derivatives.yaml. It also generates Python bindings
for the Variable methods. The generated files are:
- Functions.cpp/h: subclasses of torch::autograd::Function
- VariableType.cpp/h: the at::Type for autograd Variables
- python_variable_methods.cpp: Python bindings to torch::autograd::Variable
- python_variable_methods_dispatch.h: wrapper which releases the GIL and sets the CUDA device
- python_functions.cpp/h: exposes generated autograd functions as Python objects
The generated functions are mostly shadowed by the definitions in
variable.py. We'll remove the Python implementations in favor of the
generated C++ implementations in a subsequent commit.
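For context, a small Python-level sketch of the surface these bindings cover; the observable behavior is unchanged, only the implementation moves from variable.py into the generated C++:

    import torch
    from torch.autograd import Variable

    x = Variable(torch.randn(3), requires_grad=True)
    y = x.add(x)       # a Variable method covered by the generated bindings
    print(y.grad_fn)   # the autograd Function recorded for the op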
Variable is now a subclass of at::Tensor backed by a VariableImpl* pImpl. The implementation of the ATen functions is defined in the auto-generated VariableType.h/cpp file.
Currently, only functions which fall through to the base type, such as sizes() and isCuda(), are implemented. Differentiable ops like add() and mul() will be added in a subsequent PR.
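A small Python-side illustration of the fall-through behavior (the Python method names differ slightly from the C++ ones mentioned above):

    import torch
    from torch.autograd import Variable

    v = Variable(torch.randn(2, 3))
    print(v.size())    # forwarded to the wrapped tensor's size information
    print(v.is_cuda)   # likewise falls through to the base implementation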
- Reduce setup.py diff.
- Expunge WITH_TOFFEE from codebase.
- Elaborate on a comment.
- Move gen_toffee.sh to tools.
- Delete densenet test.
- Use 'using' to inherit a constructor.
- Delete outdated comment.
- Comment about why primspecs can return fewer outputs.
- Remove dead, commented out includes.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
If it isn't set explicitly, CMAKE_DEBUG_POSTFIX gets set to 'd', which means
the static library gets named something different when built in debug mode.
This is annoying because it means if you build in debug mode, the
library is in a different place. Rather than teach the build system
to find the correct name, just set this POSTFIX so names don't change.
Also, update setup.py to look for the non-debug archive.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
General strategy:
- nanopb is statically linked into PyTorch. It must be built
with -fPIC.
- Generated nanopb files for toffee.proto are checked into
our repo.
- Because nanopb-generated protobufs are C only, we wrote a
wrapper around them to give a Google C++-style interface.
More on this shortly.
How does the wrapper work?
- It's called "micropb" because it is less small than nanopb :)
- nanopb requires all variable-length fields to be written out
using a "callbacks" mechanism.
- We wrote pre-canned callbacks for all of the types ToffeeIR writes
out, and for lists of them: these are micropb_callback and
micropb_callback_list. They operate simply by dynamically
allocating and storing the data to be written out
(this defeats the purpose of the callback mechanism,
but it's easy to implement)
- Finally some boilerplate to actually implement the wrapper
classes and have owning pointers to the actual data.
Testing strategy:
- Take the serialized protobuf from nanopb, parse it again
with ToffeeIR and print it. Worked with all of test_jit.py!
These tests don't run without 'toffee' being installed.
TODO:
- Update CI to install ToffeeIR, so we can run the Toffee tests
in CI
- Update E2E with Caffe2 tests so that they work with new stuff.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
The general strategy:
- We put all the toffee files in torch/csrc/toffee; they will only be
added when toffee is enabled
- Toffee is enabled if torch/lib/ToffeeIR is present (since we
don't have a submodule/subtree thing going on)
- The most prevalent place you will need to use WITH_TOFFEE is for
primspec definitions on C++ autograd functions. There is a
macro HAS_PRIMSPEC to make it easier to optionally define primspec()
virtual overrides on Function classes. HasPrimspec is always
available, but it is a zero-field class when Toffee is disabled.
NB: We might revert this commit in the future if we figure out a way
to unconditionally enable Toffee that everyone likes.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
We want all the conversion code to live in one place. Away it goes!
This means that the alexnet protobuf export no longer works. It will start
working again when we port the changes.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
This commit adds a new exporter pass which takes a graph and returns
a string of the human-readable protobuf representation of a model.
We have two strategies for how conversions are implemented:
- If a Python autograd function has a primspec static method, we invoke
it to get the Toffee conversion (a sketch follows this list). Use
torch.toffee.op to generate the format expected to be returned; the
particular data representation is opaque and subject to change in the
future.
- Otherwise, there's a giant if statement in the exporter, which manually
uses the JIT IR C++ API and Toffee IR C++ protobuf API to convert.
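A hedged sketch of the first strategy. Only the existence of a primspec static method and the use of torch.toffee.op come from this commit; the "Tanh" op name and the exact call signature are assumptions for illustration, not the real API.

    import torch
    from torch.autograd import Function

    class MyTanh(Function):
        # Hypothetical example: primspec returns whatever torch.toffee.op
        # produces, and the exporter treats that value as opaque.
        @staticmethod
        def primspec(input):
            return torch.toffee.op("Tanh", input)

        @staticmethod
        def forward(ctx, input):
            output = input.tanh()
            ctx.save_for_backward(output)
            return output

        @staticmethod
        def backward(ctx, grad_output):
            output, = ctx.saved_tensors
            return grad_output * (1 - output * output)

Ops whose Function does not define primspec fall into the C++ if statement in the exporter instead.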
You must check out a copy of the ToffeeIR repo
https://github.com/ProjectToffee/ToffeeIR at torch/lib; at the moment
we don't have a subtree/submodule set up.
Technical debt in this commit:
- To get protobuf headers in scope, we unconditionally add $CONDA_PREFIX/include
to the include path. This needs to be replaced with a more robust mechanism.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>