eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Benoit Steiner	264f8141f8	Shared the tensor reduction test	2016-02-01 07:44:31 -08:00
Benoit Steiner	11bb71c8fc	Sharded the tensor device test	2016-02-01 07:34:59 -08:00
Benoit Steiner	e80ed948e1	Fixed a number of compilation warnings generated by the cuda tests	2016-01-31 20:09:41 -08:00
Benoit Steiner	6720b38fbf	Fixed a few compilation warnings	2016-01-31 16:48:50 -08:00
Benoit Steiner	4a2ddfb81d	Sharded the CUDA argmax tensor test	2016-01-31 10:44:15 -08:00
Benoit Steiner	483082ef6e	Fixed a few memory leaks in the cuda tests	2016-01-30 11:59:22 -08:00
Benoit Steiner	bd21aba181	Sharded the cxx11_tensor_cuda test and fixed a memory leak	2016-01-30 11:47:09 -08:00
Benoit Steiner	9de155d153	Added a test to cover threaded tensor shuffling	2016-01-30 10:56:47 -08:00
Benoit Steiner	32088c06a1	Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.	2016-01-30 10:51:14 -08:00
Benoit Steiner	2053478c56	Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test	2016-01-30 10:46:36 -08:00
Benoit Steiner	d0db95f730	Sharded the tensor thread pool test	2016-01-30 10:43:57 -08:00
Benoit Steiner	ba27c8a7de	Made the CUDA contract test more robust to numerical noise.	2016-01-30 10:28:43 -08:00
Benoit Steiner	963f2d2a8f	Marked several methods EIGEN_DEVICE_FUNC	2016-01-28 23:37:48 -08:00
Benoit Steiner	c5d25bf1d0	Fixed a couple of compilation warnings.	2016-01-28 23:15:45 -08:00
Benoit Steiner	7b3044d086	Made sure to call nvcc with the relaxed-constexpr flag.	2016-01-28 15:36:34 -08:00
Gael Guennebaud	ddf64babde	merge	2016-01-28 13:21:48 +01:00
Gael Guennebaud	7802a6bb1c	Fix unit test filename.	2016-01-28 09:35:37 +01:00
Benoit Steiner	4bf9eaf77a	Deleted an invalid assertion that prevented the assignment of empty tensors.	2016-01-27 17:09:30 -08:00
Benoit Steiner	291069e885	Fixed some compilation problems with nvcc + clang	2016-01-27 15:37:03 -08:00
Benoit Steiner	47ca9dc809	Fixed the tensor_cuda test	2016-01-27 14:58:48 -08:00
Benoit Steiner	55a5204319	Fixed the flags passed to nvcc to compile the tensor code.	2016-01-27 14:46:34 -08:00
Benoit Steiner	9dfbd4fe8d	Made the cuda tests compile using make check	2016-01-27 12:22:17 -08:00
Benoit Steiner	5973bcf939	Properly specify the namespace when calling cout/endl	2016-01-27 12:04:42 -08:00
Gael Guennebaud	9c8f7dfe94	bug #1156 : fix several function declarations whose arguments were passed by value instead of being passed by reference	2016-01-27 18:34:42 +01:00
Ville Kallioniemi	02db1228ed	Add constructor for long types.	2016-01-26 23:41:01 -07:00
Hauke Heibel	5eb2790be0	Fixed minor typo in SplineFitting.	2016-01-25 22:17:52 +01:00
Benoit Steiner	e3a15a03a4	Don't explicitely evaluate the subexpression from TensorForcedEval::evalSubExprIfNeeded, as it will be done when executing the EvalTo subexpression	2016-01-24 23:04:50 -08:00
Benoit Steiner	bd207ce11e	Added missing EIGEN_DEVICE_FUNC qualifier	2016-01-24 20:36:05 -08:00
Benoit Steiner	cb4e53ff7f	Merged in ville-k/eigen/tensorflow_fix (pull request PR-153) Add ctor for long	2016-01-22 19:11:31 -08:00
Ville Kallioniemi	9f94e030c1	Re-add executable flags to minimize changeset.	2016-01-22 20:08:45 -07:00
Benoit Steiner	3aeeca32af	Leverage the new blocking code in the tensor contraction code.	2016-01-22 16:36:30 -08:00
Benoit Steiner	4beb447e27	Created a mechanism to enable contraction mappers to determine the best blocking strategy.	2016-01-22 14:37:26 -08:00
Gael Guennebaud	6a44ccb58b	Backout changeset `690bc950f7`	2016-01-22 15:03:53 +01:00
Ville Kallioniemi	9b6c72958a	Update to latest default branch	2016-01-21 23:08:54 -07:00
Benoit Steiner	c33479324c	Fixed a constness bug	2016-01-21 17:08:11 -08:00
Jan Prach	690bc950f7	fix clang warnings "braces around scalar initializer"	2016-01-20 19:35:59 -08:00
Benoit Steiner	7ce932edd3	Small cleanup and small fix to the contraction of row major tensors	2016-01-20 18:12:08 -08:00
Benoit Steiner	47076bf00e	Reduce the register pressure exerted by the tensor mappers whenever possible. This improves the performance of the contraction of a matrix with a vector by about 35%.	2016-01-20 14:51:48 -08:00
Ville Kallioniemi	915e7667cd	Remove executable bit from header files	2016-01-19 21:17:29 -07:00
Ville Kallioniemi	2832175a68	Use explicitly 32 bit integer types in constructors.	2016-01-19 20:12:17 -07:00
Benoit Steiner	df79c00901	Improved the formatting of the code	2016-01-19 17:24:08 -08:00
Benoit Steiner	6d472d8375	Moved the contraction mapping code to its own file to make the code more manageable.	2016-01-19 17:22:05 -08:00
Benoit Steiner	b3b722905f	Improved code indentation	2016-01-19 17:09:47 -08:00
Benoit Steiner	5b7713dd33	Record whether the underlying tensor storage can be accessed directly during the evaluation of an expression.	2016-01-19 17:05:10 -08:00
Ville Kallioniemi	63fb66f53a	Add ctor for long	2016-01-17 21:25:36 -07:00
Benoit Steiner	34057cff23	Fixed a race condition that could affect some reductions on CUDA devices.	2016-01-15 15:11:56 -08:00
Benoit Steiner	0461f0153e	Made it possible to compare tensor dimensions inside a CUDA kernel.	2016-01-15 11:22:16 -08:00
Benoit Steiner	aed4cb1269	Use warp shuffles instead of shared memory access to speedup the inner reduction kernel.	2016-01-14 21:45:14 -08:00
Benoit Steiner	8fe2532e70	Fixed a boundary condition bug in the outer reduction kernel	2016-01-14 09:29:48 -08:00
Benoit Steiner	9f013a9d86	Properly record the rank of reduced tensors in the tensor traits.	2016-01-13 14:24:37 -08:00
Benoit Steiner	79b69b7444	Trigger the optimized matrix vector path more conservatively.	2016-01-12 15:21:09 -08:00
Benoit Steiner	d920d57f38	Improved the performance of the contraction of a 2d tensor with a 1d tensor by a factor of 3 or more. This helps speedup LSTM neural networks.	2016-01-12 11:32:27 -08:00
Benoit Steiner	bd7d901da9	Reverted a previous change that tripped nvcc when compiling in debug mode.	2016-01-11 17:49:44 -08:00
Benoit Steiner	c5e6900400	Silenced a few compilation warnings.	2016-01-11 17:06:39 -08:00
Benoit Steiner	f894736d61	Updated the tensor traits: the alignment is not part of the Flags enum anymore	2016-01-11 16:42:18 -08:00
Benoit Steiner	4f7714d72c	Enabled the use of fixed dimensions from within a cuda kernel.	2016-01-11 16:01:00 -08:00
Benoit Steiner	01c55d37e6	Deleted unused variable.	2016-01-11 15:53:19 -08:00
Benoit Steiner	0504c56ea7	Silenced a nvcc compilation warning	2016-01-11 15:49:21 -08:00
Benoit Steiner	b523771a24	Silenced several compilation warnings triggered by nvcc.	2016-01-11 14:25:43 -08:00
Benoit Steiner	2c3b13eded	Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152) Alternative way of forcing instantiation of device kernels without causing warnings or requiring device to device kernel invocations.	2016-01-11 11:43:37 -08:00
Benoit Steiner	2ccb1c8634	Fixed a bug in the dispatch of optimized reduction kernels.	2016-01-11 10:36:37 -08:00
Benoit Steiner	780623261e	Re-enabled the optimized reduction CUDA code.	2016-01-11 09:07:14 -08:00
Jeremy Barnes	91678f489a	Cleaned up double-defined macro from last commit	2016-01-10 22:44:45 -05:00
Jeremy Barnes	403a7cb6c3	Alternative way of forcing instantiation of device kernels without causing warnings or requiring device to device kernel invocations. This allows Tensorflow to work on SM 3.0 (ie, Amazon EC2) machines.	2016-01-10 22:39:13 -05:00
Benoit Steiner	e76904af1b	Simplified the dispatch code.	2016-01-08 16:50:57 -08:00
Benoit Steiner	d726e864ac	Made it possible to use array of size 0 on CUDA devices	2016-01-08 16:38:14 -08:00
Benoit Steiner	3358dfd5dd	Reworked the dispatch of optimized cuda reduction kernels to workaround a nvcc bug that prevented the code from compiling in optimized mode in some cases	2016-01-08 16:28:53 -08:00
Benoit Steiner	53749ff415	Prevent nvcc from miscompiling the cuda metakernel. Unfortunately this reintroduces some compulation warnings but it's much better than having to deal with random assertion failures.	2016-01-08 13:53:40 -08:00
Benoit Steiner	6639b7d6e8	Removed a couple of partial specialization that confuse nvcc and result in errors such as this: error: more than one partial specialization matches the template argument list of class "Eigen::internal::get<3, Eigen::internal::numeric_list<std::size_t, 1UL, 1UL, 1UL, 1UL>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, a, as...>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, as...>>"	2016-01-07 18:45:19 -08:00
Benoit Steiner	0cb2ca5de2	Fixed a typo.	2016-01-06 18:50:28 -08:00
Benoit Steiner	213459d818	Optimized the performance of broadcasting of scalars.	2016-01-06 18:47:45 -08:00
Benoit Steiner	cfff40b1d4	Improved the performance of reductions on CUDA devices	2016-01-04 17:25:00 -08:00
Benoit Steiner	515dee0baf	Added a 'divup' util to compute the floor of the quotient of two integers	2016-01-04 16:29:26 -08:00
Gael Guennebaud	8b0d1eb0f7	Fix numerous doxygen shortcomings, and workaround some clang -Wdocumentation warnings	2016-01-01 21:45:06 +01:00
Gael Guennebaud	978c379ed7	Add missing ctor from uint	2015-12-30 12:52:38 +01:00
Eugene Brevdo	f7362772e3	Add digamma for CPU + CUDA. Includes tests.	2015-12-24 21:15:38 -08:00
Benoit Steiner	bdcbc66a5c	Don't attempt to vectorize mean reductions of integers since we can't use SSE or AVX instructions to divide 2 integers.	2015-12-22 17:51:55 -08:00
Benoit Steiner	a1e08fb2a5	Optimized the configuration of the outer reduction cuda kernel	2015-12-22 16:30:10 -08:00
Benoit Steiner	9c7d96697b	Added missing define	2015-12-22 16:11:07 -08:00
Benoit Steiner	e7e6d01810	Made sure the optimized gpu reduction code is actually compiled.	2015-12-22 15:07:33 -08:00
Benoit Steiner	b5d2078c4a	Optimized outer reduction on GPUs.	2015-12-22 15:06:17 -08:00
Benoit Steiner	1c3e78319d	Added missing const	2015-12-21 15:05:01 -08:00
Benoit Steiner	1b82969559	Add alignment requirement for local buffer used by the slicing op.	2015-12-18 14:36:35 -08:00
Benoit Steiner	75a7fa1919	Doubled the speed of full reductions on GPUs.	2015-12-18 14:07:31 -08:00
Benoit Steiner	8dd17cbe80	Fixed a clang compilation warning triggered by the use of arrays of size 0.	2015-12-17 14:00:33 -08:00
Benoit Steiner	4aac55f684	Silenced some compilation warnings triggered by nvcc	2015-12-17 13:39:01 -08:00
Benoit Steiner	40e6250fc3	Made it possible to run tensor chipping operations on CUDA devices	2015-12-17 13:29:08 -08:00
Benoit Steiner	2ca55a3ae4	Fixed some compilation error triggered by the tensor code with msvc 2008	2015-12-16 20:45:58 -08:00
Gael Guennebaud	35d8725c73	Disable AutoDiffScalar generic copy ctor for non compatible scalar types (fix ambiguous template instantiation)	2015-12-16 10:14:24 +01:00
Christoph Hertzberg	92655e7215	bug #1136 : Protect isinf for Intel compilers. Also don't distinguish GCC from ICC and don't rely on EIGEN_NOT_A_MACRO, which might not be defined when including this.	2015-12-15 11:34:52 +01:00
Benoit Steiner	17352e2792	Made the entire TensorFixedSize api callable from a CUDA kernel.	2015-12-14 15:20:31 -08:00
Benoit Steiner	75e19fc7ca	Marked the tensor constructors as EIGEN_DEVICE_FUNC: This makes it possible to call them from a CUDA kernel.	2015-12-14 15:12:55 -08:00
Gael Guennebaud	ca39b1546e	Merged in ebrevdo/eigen (pull request PR-148) Add special functions to eigen: lgamma, erf, erfc.	2015-12-11 11:52:09 +01:00
Benoit Steiner	6af52a1227	Fixed a typo in the constructor of tensors of rank 5.	2015-12-10 23:31:12 -08:00
Benoit Steiner	2d8f2e4042	Made 2 tests compile without cxx11. HdG: --	2015-12-10 23:20:04 -08:00
Benoit Steiner	8d28a161b2	Use the proper accessor to refer to the value of a scalar tensor	2015-12-10 22:53:56 -08:00
Benoit Steiner	8e00ea9a92	Fixed the coefficient accessors use for the 2d and 3d case when compiling without cxx11 support.	2015-12-10 22:45:10 -08:00
Benoit Steiner	9db8316c93	Updated the cxx11_tensor_custom_op to not require cxx11.	2015-12-10 20:53:44 -08:00
Benoit Steiner	4e324ca6ae	Updated the cxx11_tensor_assign test to make it compile without support for cxx11	2015-12-10 20:47:25 -08:00
Eugene Brevdo	fa4f933c0f	Add special functions to Eigen: lgamma, erf, erfc. Includes CUDA support and unit tests.	2015-12-07 15:24:49 -08:00
Benoit Steiner	7dfe75f445	Fixed compilation warnings	2015-12-07 08:12:30 -08:00
Gael Guennebaud	ad3d68400e	Add matrix-free solver example	2015-12-07 12:33:38 +01:00
Gael Guennebaud	b37036afce	Implement wrapper for matrix-free iterative solvers	2015-12-07 12:23:22 +01:00
Benoit Steiner	f4ca8ad917	Use signed integers instead of unsigned ones more consistently in the codebase.	2015-12-04 18:14:16 -08:00
Benoit Steiner	490d26e4c1	Use integers instead of std::size_t to encode the number of dimensions in the Tensor class since most of the code currently already use integers.	2015-12-04 10:15:11 -08:00
Benoit Steiner	d20efc974d	Made it possible to use the sigmoid functor within a CUDA kernel.	2015-12-04 09:38:15 -08:00
Benoit Steiner	029052d276	Deleted redundant code	2015-12-03 17:08:47 -08:00
Gael Guennebaud	fd727249ad	Update ADOL-C support.	2015-11-30 16:00:22 +01:00
Gael Guennebaud	da46b1ed54	bug #1112 : fix compilation on exotic architectures	2015-11-27 15:57:18 +01:00
Mark Borgerding	7ddcf97da7	added scalar_sign_op (both real,complex)	2015-11-24 17:15:07 -05:00
Benoit Steiner	44848ac39b	Fixed a bug in TensorArgMax.h	2015-11-23 15:58:47 -08:00
Benoit Steiner	547a8608e5	Fixed the implementation of Eigen::internal::count_leading_zeros for MSVC. Also updated the code to silence bogux warnings generated by nvcc when compilining this function.	2015-11-23 12:17:45 -08:00
Benoit Steiner	562078780a	Don't create more cuda blocks than necessary	2015-11-23 11:00:10 -08:00
Benoit Steiner	df31ca3b9e	Made it possible to refer t oa GPUDevice from code compile with a regular C++ compiler	2015-11-23 10:03:53 -08:00
Benoit Steiner	1e04059012	Deleted unused variable.	2015-11-23 08:36:54 -08:00
Benoit Steiner	9fa65d3838	Split TensorDeviceType.h in 3 files to make it more manageable	2015-11-20 17:42:50 -08:00
Benoit Steiner	a367804856	Added option to force the usage of the Eigen array class instead of the std::array class.	2015-11-20 12:41:40 -08:00
Benoit Steiner	86486eee2d	Pulled latest updates from trunk	2015-11-20 11:10:37 -08:00
Benoit Steiner	383d1cc2ed	Added proper support for fast 64bit integer division on CUDA	2015-11-20 11:09:46 -08:00
Benoit Steiner	0ad7c7b1ad	Fixed another clang compilation warning	2015-11-19 15:52:51 -08:00
Benoit Steiner	66ff9b2c6c	Fixed compilation warning generated by clang	2015-11-19 15:40:32 -08:00
Benoit Steiner	f37a5f1c53	Fixed compilation error triggered by nvcc	2015-11-19 14:34:26 -08:00
Benoit Steiner	04f1284f9a	Shard the uint128 test	2015-11-19 14:08:08 -08:00
Benoit Steiner	e2859c6b71	Cleanup the integer division test	2015-11-19 14:07:50 -08:00
Benoit Steiner	f8df393165	Added support for 128bit integers on CUDA devices.	2015-11-19 13:57:27 -08:00
Benoit Steiner	1dd444ea71	Avoid using the version of TensorIntDiv optimized for 32-bit integers when the divisor can be equal to one since it isn't supported.	2015-11-18 11:37:58 -08:00
Benoit Steiner	f1fbd74db9	Added sanity check	2015-11-13 09:07:27 -08:00
Benoit Steiner	7815b84be4	Fixed a compilation warning	2015-11-12 20:16:59 -08:00
Benoit Steiner	10a91930cc	Fixed a compilation warning triggered by nvcc	2015-11-12 20:10:52 -08:00
Benoit Steiner	ed4b37de02	Fixed a few compilation warnings	2015-11-12 20:08:01 -08:00
Benoit Steiner	b69248fa2a	Added a couple of missing EIGEN_DEVICE_FUNC	2015-11-12 20:01:50 -08:00
Benoit Steiner	0aaa5941df	Silenced some compilation warnings triggered by nvcc	2015-11-12 19:11:43 -08:00
Benoit Steiner	2c73633b28	Fixed a few more typos	2015-11-12 18:39:19 -08:00
Benoit Steiner	be08e82953	Fixed typos	2015-11-12 18:37:40 -08:00
Benoit Steiner	150c12e138	Completed the IndexList rewrite	2015-11-12 18:11:56 -08:00
Benoit Steiner	8037826367	Simplified more of the IndexList code.	2015-11-12 17:19:45 -08:00
Benoit Steiner	e9ecfad796	Started to make the IndexList code compile by more compilers	2015-11-12 16:41:14 -08:00
Benoit Steiner	7a1316fcc5	Fixed compilation error with xcode.	2015-11-12 11:05:54 -08:00
Benoit Steiner	737d237722	Made it possible to run some of the CXXMeta functions on a CUDA device.	2015-11-12 09:02:59 -08:00
Benoit Steiner	1e072424e8	Moved the array code into it's own file.	2015-11-12 08:57:04 -08:00
Benoit Steiner	aa5f1ca714	gen_numeric_list takes a size_t, not a int	2015-11-12 08:30:10 -08:00
Benoit Steiner	9fa10fe52d	Don't use std::array when compiling with nvcc since nvidia doesn't support the use of STL containers on GPU.	2015-11-11 15:38:30 -08:00
Benoit Steiner	c587293e48	Fixed a compilation warning	2015-11-11 15:35:12 -08:00
Benoit Steiner	7f1c29fb0c	Make it possible for a vectorized tensor expression to be executed in a CUDA kernel.	2015-11-11 15:22:50 -08:00
Benoit Steiner	99f4778506	Disable SFINAE when compiling with nvcc	2015-11-11 15:04:58 -08:00
Benoit Steiner	5cb18e5b5e	Fixed CUDA compilation errors	2015-11-11 14:36:33 -08:00
Benoit Steiner	228edfe616	Use Eigen::NumTraits instead of std::numeric_limits	2015-11-11 09:26:23 -08:00
Benoit Steiner	20e2ab1121	Fixed another compilation warning	2015-12-07 16:17:57 -08:00
Benoit Steiner	d573efe303	Code cleanup	2015-11-06 14:54:28 -08:00
Benoit Steiner	9fa283339f	Silenced a compilation warning	2015-11-06 11:44:22 -08:00
Benoit Steiner	53432a17b2	Added static assertions to avoid misuses of padding, broadcasting and concatenation ops.	2015-11-06 10:26:19 -08:00
Benoit Steiner	6857a35a11	Fixed typos	2015-11-06 09:42:05 -08:00
Benoit Steiner	33cbdc2d15	Added more missing EIGEN_DEVICE_FUNC	2015-11-06 09:29:59 -08:00
Benoit Steiner	ed1962b464	Reimplement the tensor comparison operators by using the scalar_cmp_op functors. This makes them more cuda friendly.	2015-11-06 09:18:43 -08:00
Benoit Steiner	29038b982d	Added support for modulo operation	2015-11-05 19:39:48 -08:00
Benoit Steiner	fbcf8cc8c1	Pulled latest updates from trunk	2015-11-05 14:30:02 -08:00
Benoit Steiner	0d15ad8019	Updated the regressions tests that cover full reductions	2015-11-05 14:22:30 -08:00
Benoit Steiner	c75a19f815	Misc fixes to full reductions	2015-11-05 14:21:20 -08:00
Benoit Steiner	ec5a81b45a	Fixed a bug in the extraction of sizes of fixed sized tensors of rank 0	2015-11-05 13:39:48 -08:00
Gael Guennebaud	589b839ad0	Add unit test for Hessian via AutoDiffScalar	2015-11-05 14:54:05 +01:00
Gael Guennebaud	9ceaa8e445	bug #1063 : nest AutoDiffScalar by value to avoid dead references	2015-11-05 13:54:26 +01:00
Benoit Steiner	beedd9630d	Updated the reduction code so that full reductions now return a tensor of rank 0.	2015-11-04 13:57:36 -08:00
Benoit Steiner	6a02c2a85d	Fixed a compilation warning	2015-10-29 20:21:29 -07:00
Benoit Steiner	ca12d4c3b3	Pulled latest updates from trunk	2015-10-29 17:57:48 -07:00
Benoit Steiner	31bdafac67	Added a few tests to cover rank-0 tensors	2015-10-29 17:56:48 -07:00
Benoit Steiner	ce19e38c1f	Added support for tensor maps of rank 0.	2015-10-29 17:49:04 -07:00
Benoit Steiner	3785c69294	Added support for fixed sized tensors of rank 0	2015-10-29 17:31:03 -07:00
Benoit Steiner	0d7a23d34e	Extended the reduction code so that reducing an empty set returns the neural element for the operation	2015-10-29 17:29:49 -07:00
Benoit Steiner	1b0685d09a	Added support for rank-0 tensors	2015-10-29 17:27:38 -07:00
Benoit Steiner	c444a0a8c3	Consistently use the same index type in the fft codebase.	2015-10-29 16:39:47 -07:00
Benoit Steiner	09ea3a7acd	Silenced a few more compilation warnings	2015-10-29 16:22:52 -07:00
Benoit Steiner	0974a57910	Silenced compiler warning	2015-10-29 15:00:06 -07:00
Gael Guennebaud	77ff3386b7	Refactoring of the cost model: - Dynamic is now an invalid value - introduce a HugeCost constant to be used for runtime-cost values or arbitrarily huge cost - add sanity checks for cost values: must be >=0 and not too large This change provides several benefits: - it fixes shortcoming is some cost computation where the Dynamic case was not properly handled. - it simplifies cost computation logic, and should avoid future similar shortcomings. - it allows to distinguish between different level of dynamic/huge/infinite cost - it should enable further simplifications in the computation of costs (save compilation time)	2015-10-28 11:42:14 +01:00
Gael Guennebaud	d4cf436cb1	Enable mpreal unit test for C++11 compiler only	2015-10-27 17:35:54 +01:00
Benoit Steiner	1c8312c811	Started to add support for tensors of rank 0	2015-10-26 14:29:26 -07:00
Benoit Steiner	1f4c98abb1	Fixed compilation warning	2015-10-26 12:42:55 -07:00
Benoit Steiner	9dc236bc83	Fixed compilation warning	2015-10-26 12:41:48 -07:00
Benoit Steiner	9f721384e0	Added support for empty dimensions	2015-10-26 11:21:27 -07:00
Benoit Steiner	a3e144727c	Fixed compilation warning	2015-10-26 10:48:11 -07:00
Benoit Steiner	f8e7b9590d	Fixed compilation error triggered by gcc 4.7	2015-10-26 10:47:37 -07:00
Gael Guennebaud	a5324a131f	bug #1092 : fix iterative solver ctors for expressions as input	2015-10-26 16:16:24 +01:00
Gael Guennebaud	af2e25d482	Merged in infinitei/eigen (pull request PR-140) bug #1097 Added ArpackSupport to cmake install target	2015-10-26 15:31:39 +01:00
Abhijit Kundu	0ed41bdefa	ArpackSupport was missing here also.	2015-10-16 18:21:02 -07:00
Abhijit Kundu	1127ca8586	Added ArpackSupport to cmake install target	2015-10-16 16:41:33 -07:00
Benoit Steiner	de1e9f29f4	Updated the custom indexing code: we can now use any container that provides the [] operator to index a tensor. Added unit tests to validate the use of std::map and a few more types as valid custom index containers	2015-10-15 14:58:49 -07:00
Benoit Steiner	6585efc553	Tightened the definition of isOfNormalIndex to take into account integer types in addition to arrays of indices Only compile the custom index code when EIGEN_HAS_SFINAE is defined. For the time beeing, EIGEN_HAS_SFINAE is a synonym for EIGEN_HAS_VARIADIC_TEMPLATES, but this might evolve in the future. Moved some code around.	2015-10-14 09:31:37 -07:00
Gabriel Nützi	fc7478c04d	name changes 2 user: Gabriel Nützi <gnuetzi@gmx.ch> branch 'default' changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h changed unsupported/Eigen/CXX11/src/Tensor/TensorMeta.h	2015-10-09 19:10:08 +02:00
Gabriel Nützi	7b34834f64	name changes user: Gabriel Nützi <gnuetzi@gmx.ch> branch 'default' changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h	2015-10-09 19:08:14 +02:00
Gabriel Nützi	6edae2d30d	added CustomIndex capability only to Tensor and not yet to TensorBase. using Sfinae and is_base_of to select correct template which converts to array<Index,NumIndices> user: Gabriel Nützi <gnuetzi@gmx.ch> branch 'default' added unsupported/Eigen/CXX11/src/Tensor/TensorMetaMacros.h added unsupported/test/cxx11_tensor_customIndex.cpp changed unsupported/Eigen/CXX11/Tensor changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h changed unsupported/Eigen/CXX11/src/Tensor/TensorMeta.h changed unsupported/test/CMakeLists.txt	2015-10-09 18:52:48 +02:00
Gael Guennebaud	ac22b66f1c	Fix macro issues	2015-10-13 10:18:09 +02:00
Gael Guennebaud	3e32f6b554	update mpreal.h	2015-10-13 09:58:54 +02:00
Gael Guennebaud	186ec1437c	Cleanup EIGEN_SPARSE_PUBLIC_INTERFACE, it is now a simple alias to EIGEN_GENERIC_PUBLIC_INTERFACE	2015-10-08 22:06:49 +02:00
Gael Guennebaud	1b148d9e2e	Move IncompleteCholesky to official modules	2015-10-08 11:32:46 +02:00
Gael Guennebaud	632e7705b1	Improve doc of IncompleteCholesky	2015-10-08 10:54:36 +02:00
Gael Guennebaud	131db3c552	Fix return by value versus ref typo in IncompleteCholesky	2015-10-07 16:37:46 +02:00
Gael Guennebaud	752a0e5339	bug #1076 : fix scaling in IncompleteCholesky, improve doc, add read-only access to the different factors, remove debugging code.	2015-10-06 13:25:45 +02:00
Gael Guennebaud	ddb5650530	bug #1070 : propagate last three Matrix template arguments for NumTraits<AutoDiffScalar<>>::Real	2015-09-28 15:07:03 +02:00
Benoit Steiner	13aee4463e	Cleaned up a test	2015-09-18 09:42:08 -07:00
Benoit Steiner	58a6453d48	Fixed compilation warning	2015-09-17 10:18:49 -07:00
Benoit Steiner	31afdcb4c2	Fix return type for TensorEvaluator<TensorSlicingOp>::data	2015-09-17 09:40:21 -07:00
Benoit Steiner	84e0c27b61	Fixed a compilation warning	2015-09-08 17:05:35 -07:00
Benoit Steiner	05f2f94f2b	Fixed a compilation warning	2015-09-08 17:04:03 -07:00
Gael Guennebaud	3942db9d7c	Use inline versus static free functions.	2015-09-03 13:42:54 +02:00
Benoit Steiner	56983f6d43	Fixed compilation warning	2015-10-23 12:03:42 -07:00
Benoit Steiner	57857775b4	Added support for arrays of size 0	2015-10-23 10:20:51 -07:00
Benoit Steiner	c40c2ceb27	Reordered the code of fft constructor to prevent compilation warnings	2015-10-23 09:38:19 -07:00
Benoit Steiner	a586fdaa91	Reworked the tensor contraction mapper code to make it compile on Android	2015-10-23 09:33:41 -07:00
Benoit Steiner	29c3b7513e	Pulled latest updates from trunk	2015-10-23 09:16:14 -07:00
Benoit Steiner	9ea39ce13c	Refined the #ifdef __CUDACC__ guard to ensure that when trying to compile gpu code with a non cuda compiler results in a linking error instead of bogus code.	2015-10-23 09:15:34 -07:00
Gael Guennebaud	c244081490	disable usage of INTMAX_T	2015-10-23 14:48:54 +02:00
Gael Guennebaud	0905ed5390	remove useless cstdint header	2015-10-23 14:41:25 +02:00
Benoit Steiner	ac99b49249	Added missing glue logic	2015-10-22 16:54:21 -07:00
Benoit Steiner	2dd9446613	Added mapping between a specific device and the corresponding packet type	2015-10-22 16:53:36 -07:00
Benoit Steiner	2495e2479f	Added tests for the fft code	2015-10-22 16:52:55 -07:00
Benoit Steiner	a147c62998	Added support for fourier transforms (code courtesy of thucjw@gmail.com)	2015-10-22 16:51:30 -07:00
Benoit Steiner	825146c8fd	Fixed incorrect expected value	2015-10-22 11:56:00 -07:00
Benoit Steiner	4cf7da63de	Added a constructor to simplify the construction of tensormap from tensor	2015-10-22 11:48:02 -07:00
Benoit Steiner	b178cc3479	Added some syntactic sugar to make it simpler to compare a tensor to a scalar.	2015-10-21 11:28:28 -07:00
Benoit Steiner	0af63493fd	Disable SFINAE for versions of gcc older than 4.8	2015-10-20 11:53:30 -07:00
Benoit Steiner	73b8e719ae	Removed bogus assertion	2015-10-20 11:42:34 -07:00
Benoit Steiner	eaf4b98180	Added support for boolean reductions (ie 'and' & 'or' reductions)	2015-10-20 11:41:22 -07:00
Benoit Steiner	f5c1587e4e	Fixed a bug in the tensor conversion op	2015-10-20 11:37:44 -07:00
Christoph Hertzberg	5ad7981f73	Use full packet size for Dynamic-sized objects (otherwise, the unalignedcount unit test fails with AVX enabled)	2015-09-02 22:51:43 +02:00
Gael Guennebaud	51455824ea	Fix AlignedVector3 wrt previous change	2015-09-02 21:51:58 +02:00
Benoit Steiner	f41831e445	Added support for argmax/argmin	2015-08-31 08:18:53 -07:00
Benoit Steiner	2ab603316a	Use numext::mini/numext::maxi instead of std::min/std::max in the tensor code	2015-08-28 08:14:15 -07:00
Christoph Hertzberg	1bdd06a199	Fix some trivial warnings	2015-08-19 21:38:18 +02:00
Christoph Hertzberg	0721690dbb	Use standard include syntax in Tensor module (<> for include-path and "" for relative path)	2015-08-18 14:34:00 +02:00
Gael Guennebaud	dc2c103b3b	merge	2015-08-16 14:22:02 +02:00
Christoph Hertzberg	d6a4805fdf	Protect further isnan/isfinite/isinf calls	2015-08-16 14:00:02 +02:00
Gael Guennebaud	0d5e673baa	Fix Tensor module wrt nullary functor recent change	2015-08-09 21:20:24 +02:00
Gael Guennebaud	7e0d7a76b8	Remove dense nested loops in IncompleteCholesky	2015-08-04 18:01:38 +02:00
Gael Guennebaud	e31fc50280	Numerous fixes for IncompleteCholesky. Still have to make it fully exploit the sparse structure of the L factor, and improve robustness to illconditionned problems.	2015-08-04 16:16:02 +02:00
Gael Guennebaud	9a4713e505	Add a unit test for IncompleteCholesky	2015-08-04 16:14:06 +02:00
Benoit Steiner	cbce0e3b12	Fixed compilation warning	2015-08-03 21:52:29 -07:00
Benoit Steiner	a5dc49e7e8	Fixed 2 compilation warnings generated by llvm	2015-07-29 15:06:08 -07:00
Benoit Steiner	e1d28b7ea7	Added a test for shuffling	2015-07-29 15:01:21 -07:00
Benoit Steiner	0570594f2c	Fixed a few compilation warnings triggered by clang	2015-07-29 11:48:38 -07:00
Benoit Steiner	099597406f	Simplified and generalized the DividerTraits code	2015-07-29 10:02:42 -07:00
Gael Guennebaud	6db3a557f4	Add missing specialization of struct DividerTraits<long>	2015-07-29 11:38:53 +02:00
Gael Guennebaud	aec4814370	Many files were missing in previous changeset.	2015-07-29 11:11:23 +02:00
Benoit Steiner	b9db19aec4	Pulled latest updates from trunk.	2015-07-27 09:39:57 -07:00
Benoit Steiner	f84417d97b	Removed an incorrect assertion.	2015-07-27 09:25:22 -07:00
Godeffroy Valet	2195822df6	Allowed tensor contraction operation with an empty array of dimension pairs, which performs a tensor product.	2015-07-25 11:58:36 +02:00
Benoit Steiner	f6282e451a	Fixed a typo in an assertion.	2015-07-24 17:35:47 -07:00
Benoit Steiner	4200bdec24	Extended the range of value inputs for TensorIntDiv to support tensors with more than 4 billion elements.	2015-07-22 17:02:30 -07:00
Benoit Steiner	0dda72316f	The eigen_check macro doesn't exist anymore: use assert instead	2015-07-21 17:34:15 -07:00
Gael Guennebaud	d3e5db9a80	add regression unit test for previous changeset	2015-07-21 22:23:17 +02:00
Valentin Roussellet	5e635f9ca1	AlignedVector3 accepts implicit conversions from more operators.	2015-07-21 16:42:52 +00:00
Benoit Steiner	6799c26cd6	Fixed a typo in a test and a compilation warning	2015-07-17 16:50:47 -07:00
Benoit Steiner	7a39439904	Rewrote Eigen::dimensions_match to prevent a static assertion when the rank of the tensors is different.	2015-07-17 16:46:30 -07:00
Benoit Steiner	e94f9eb637	Fixed a const correctness issue in TensorLayoutSwap	2015-07-17 15:44:26 -07:00
Benoit Steiner	943035e5bd	Pulled latest updates from trunk	2015-07-17 09:42:45 -07:00
Benoit Steiner	06a22ca5bd	Added support for sigmoid function to the tensor module	2015-07-17 09:29:00 -07:00
Nicolas Mellado	3275eddc24	Add const getters for LM parameters	2015-07-17 09:11:49 +02:00
Benoit Steiner	a5ec25f11c	Use the new EIGEN_HAS_INDEX_LIST define to enable the cxx11_tensor_index_list tests	2015-07-16 13:16:08 -07:00
Benoit Steiner	7a243959b4	Define EIGEN_HAS_INDEX_LIST whenever the class is defined. This makes it easier to support compilers that are cxx11 compliant and compilers that aren't.	2015-07-16 13:14:18 -07:00
Benoit Steiner	b756f6af5e	Added missing APIs to the Eigen::Sizes class	2015-07-16 12:14:18 -07:00
Benoit Steiner	05787f8367	Added support for tensor inflation.	2015-07-16 09:04:05 -07:00
Benoit Steiner	b900fe47d5	Avoid relying on a default value for the Vectorizable template parameter of the EvalRange functor	2015-07-15 17:17:04 -07:00
Benoit Steiner	4b3d697e12	Fixed compilation error in a cuda test	2015-07-15 17:14:24 -07:00
Benoit Steiner	8315e025e1	Updated the cuda tests to use the new GpuDevice constructor	2015-07-15 12:39:26 -07:00
Benoit Steiner	e892524efe	Added support for multi gpu configuration to the GpuDevice class	2015-07-15 12:38:34 -07:00
Benoit Steiner	b80036abec	Enabled the construction of a fixed sized tensor directly from an expression.	2015-07-13 11:16:37 -07:00
Benoit Steiner	3912ca0d53	Fixed a bug in the integer division code that caused some large numerators to be incorrectly handled	2015-07-13 11:14:59 -07:00
Gael Guennebaud	b8df8815f4	Fix operator<<(ostream,AlignedVector3)	2015-07-13 13:55:59 +02:00
Benoit Steiner	e6297741c9	Added support for generation of random complex numbers on CUDA devices	2015-07-07 17:40:49 -07:00
Benoit Steiner	6de6fa9483	Use NumTraits<T>::RequireInitialization instead of internal::is_arithmetic<T>::value to check whether it's possible to bypass the type constructor in the tensor code.	2015-07-07 15:23:56 -07:00
Benoit Steiner	a93af65938	Improved and cleaned up the 2d patch extraction code	2015-07-07 08:52:14 -07:00
Benoit Steiner	3f2101b03b	Use numext::swap instead of std::swap	2015-07-06 17:02:29 -07:00
Benoit Steiner	0485a2468d	use Eigen smart_copy instead of std::copy	2015-07-06 17:01:51 -07:00
Benoit Steiner	ebdacfc5ea	Fixed a compilation warning generated by clang	2015-07-06 15:03:11 -07:00
Benoit Steiner	81f9e968fd	Only attempt to use the texture path on GPUs when it's supported by CUDA	2015-07-06 13:32:38 -07:00
Benoit Steiner	864318e508	Misc small fixes to the tensor slicing code.	2015-07-06 11:45:56 -07:00
Benoit Steiner	8f1d547c92	Added a default value for the cuda stream in the GpuDevice constructor	2015-07-01 18:32:18 -07:00
Benoit Steiner	1e911b276c	Misc improvements and optimizations	2015-07-01 13:59:11 -07:00
Benoit Steiner	4ed213f97b	Improved a previous fix	2015-07-01 13:06:30 -07:00
Benoit Steiner	56e155dd60	Fixed a couple of mistakes in the previous commit.	2015-07-01 12:40:27 -07:00
Benoit Steiner	925d0d375a	Enabled the vectorized evaluation of several tensor expressions that was previously disabled by mistake	2015-07-01 11:32:04 -07:00
Benoit Steiner	6021b68d8b	Silenced a compilation warning	2015-06-30 15:42:25 -07:00
Benoit Steiner	f1f480b116	Added support for user defined custom tensor op.	2015-06-30 15:36:29 -07:00
Benoit Steiner	dc31fcb9ba	Added support for 3D patch extraction	2015-06-30 14:48:26 -07:00
Benoit Steiner	f587075987	Made ThreadPoolDevice inherit from a new pure abstract ThreadPoolInterface class: this enables users to leverage their existing threadpool when using eigen tensors.	2015-06-30 14:21:24 -07:00
Benoit Steiner	28b36632ec	Turned Eigen::array::size into a function to make the code compatible with std::array	2015-06-30 13:23:05 -07:00
Benoit Steiner	109005c6c9	Added a test for multithreaded full reductions	2015-06-30 13:08:12 -07:00
Benoit Steiner	a4aa7c6217	Fixed a few compilation warnings	2015-06-30 10:36:17 -07:00
Benoit Steiner	7d41e97fa9	Silenced a number of compilation warnings	2015-06-29 14:47:40 -07:00
Benoit Steiner	fffe63045c	Added a test for full reductions on GPU	2015-06-29 14:10:32 -07:00
Benoit Steiner	db9dbbda32	Improved performance of full reduction by 2 order of magnitude on CPU and 3 orders of magnitude on GPU	2015-06-29 14:06:32 -07:00
Benoit Steiner	f0ce85b757	Improved support for fixed size tensors	2015-06-29 14:04:15 -07:00
Benoit Steiner	670c71d906	Express the full reduction operations (such as sum, max, min) using TensorDimensionList	2015-06-29 11:30:36 -07:00
Benoit Steiner	d8098ee7d5	Added support for tanh function to the tensor code	2015-06-29 11:14:42 -07:00
Benoit Steiner	3625734bc8	Moved some utilities to TensorMeta.h to make it easier to reuse them accross several tensor operations. Created the TensorDimensionList class to encode the list of all the dimensions of a tensor of rank n. This could be done using TensorIndexList, however TensorIndexList require cxx11 which isn't yet supported as widely as we'd like.	2015-06-29 10:49:55 -07:00
Gael Guennebaud	84aaef93ba	Merged in vanhoucke/eigen_vanhoucke (pull request PR-118) Fix two small undefined behaviors caught by static analysis.	2015-06-20 13:56:48 +02:00
Gael Guennebaud	846b227bb7	Get rid of class internal::nested<> (still have to updated Tensor module)	2015-06-19 17:56:39 +02:00
vanhoucke	4cc0c961f3	Fix undefined behavior.	2015-06-19 15:46:46 +00:00
Benoit Steiner	6a9a29e96f	Fixed a compilation warning	2015-06-17 10:14:13 -07:00
Benoit Steiner	ab5db86fe9	Fixed merge conflict	2015-06-16 19:52:20 -07:00
Benoit Steiner	ea160a898c	Pulled latest updates from trunk	2015-06-16 19:46:23 -07:00
Benoit Steiner	367794e668	Fixed compilation warnings triggered by clang	2015-06-16 19:43:49 -07:00
Gael Guennebaud	9ab8ac5c8b	Fix compilation in TensorImagePatch	2015-06-16 14:50:08 +02:00
Gael Guennebaud	38874b1651	Fix shadow warnings in Tensor module	2015-06-16 14:43:46 +02:00
Gael Guennebaud	e2e66930c6	Fix compilation of alignedvector3 unit test	2015-06-16 14:40:55 +02:00
Gael Guennebaud	64753af3b7	code simplification	2015-06-09 15:35:34 +02:00
Gael Guennebaud	cacbc5679d	formatting	2015-06-09 15:23:08 +02:00
Gael Guennebaud	04665ef9e1	remove redundant dynamic allocations in GMRES	2015-06-09 15:18:21 +02:00
Gael Guennebaud	d4c574707e	fix some legitimate shadow warnings	2015-06-09 15:17:58 +02:00
Gael Guennebaud	8bc26562f4	Do not abort if the folder cannot be openned!	2015-06-05 14:31:29 +02:00
Gael Guennebaud	3e7bc8d686	Improve loading of symmetric sparse matrices in MatrixMarketIterator	2015-06-05 10:16:29 +02:00
Benoit Steiner	ea1190486f	Fixed a compilation error triggered by nvcc 7	2015-05-28 11:57:51 -07:00
Benoit Steiner	0e5fed74e7	Worked around some constexpr related bugs in nvcc 7	2015-05-28 10:14:38 -07:00
Benoit Steiner	f13b3d4433	Added missing include files	2015-05-28 07:57:28 -07:00
Benoit Steiner	abec18bae0	Fixed potential compilation error	2015-05-26 10:11:15 -07:00
Benoit Steiner	9df186c140	Added a few more missing EIGEN_DEVICE_FUNC statements	2015-05-26 09:47:48 -07:00
Benoit Steiner	466bcc589e	Added a few missing EIGEN_DEVICE_FUNC statements	2015-05-26 09:37:23 -07:00
Benoit Steiner	6b800744ce	Moved away from std::async and std::future as the underlying mechnism for the thread pool device. On several platforms, the functions passed to std::async are not scheduled in the order in which they are given to std::async, which leads to massive performance issues in the contraction code. Instead we now have a custom thread pool that ensures that the functions are picked up by the threads in the pool in the order in which they are enqueued in the pool.	2015-05-20 13:52:07 -07:00
Benoit Steiner	48f6b274e2	Fixed compilation error triggered by gcc 4.7	2015-05-20 08:54:44 -07:00
Benoit Steiner	2451679951	Avoid using the cuda memcpy for small tensor slices since the memcpy kernel is very expensive to launch	2015-05-19 15:19:01 -07:00
Benoit Steiner	a81d17b73a	Added new version of the TensorIntDiv class optimized for 32 bit signed integers. It saves 1 register on CPU and 2 on GPU.	2015-05-19 13:59:52 -07:00
Christoph Hertzberg	8c6a3b5ace	Fix trivial warnings in LevenbergMarquardt module and test	2015-04-24 21:35:30 +02:00
Benoit Steiner	fd1d4bd86c	Silenced a few compilation warnings	2015-04-22 16:16:15 -07:00
Benoit Steiner	91359e1d0a	Added the ability to generate a tensor from a custom user defined 'generator'. This simplifies the creation of constant tensors initialized using specific regular patterns. Created a gaussian window generator as a first use case.	2015-04-22 11:14:58 -07:00
Benoit Steiner	8838ed39f4	Added support for non-deterministic random number generation on GPU	2015-04-22 09:14:38 -07:00
Benoit Steiner	dfa991cbae	Make sure that the copy constructor of the evaluator is always called before launching the evaluation of a tensor expression on a cuda device.	2015-04-21 16:15:45 -07:00
Benoit Steiner	e709488361	Silenced a few compilation warnings	2015-04-20 17:39:45 -07:00
Benoit Steiner	10a1f81822	Sped up the assignment of a tensor to a tensor slice, as well as the assigment of a constant slice to a tensor	2015-04-20 17:34:11 -07:00
Benoit Steiner	43eb2ca6e1	Improved the tensor random number generators: * Use a mersenne twister whenebver possible instead of the default entropy source since the default one isn't very good at all. * Added the ability to seed the generators with a time based seed to make them non-deterministic.	2015-04-20 09:24:09 -07:00
Benoit Steiner	70bc3b0668	Silenced a warning in the tensor code	2015-04-19 12:38:00 -07:00
Benoit Steiner	3220eb2b93	Fixed some compilation warnings	2015-04-19 12:36:35 -07:00
Benoit Steiner	3b429b71e6	Fixed compilation warning triggered by gcc 4.7	2015-04-18 13:41:06 -07:00
Benoit Steiner	9c6b82bcd5	Use ptrdiff_t instead of size_t to encode fixed sizes. This silences several clang compilation warnings (transplanted from 4400e4436ac7c5bbd305a03c21aa4bce24ae199b)	2015-04-17 09:12:18 -07:00
Benoit Steiner	da5b98a94d	Updated the cxx11_tensor_convolution test to avoid using cxx11 features. This should enable the test to compile with gcc 4.7 and older	2015-04-16 12:29:16 -07:00
Benoit Steiner	d19d09ae6a	Updated a regression test to avoid compilation errors when compiling with gcc 4.7	2015-04-16 12:15:27 -07:00
Benoit Steiner	0f82399fe9	Pulled latest changes from trunk	2015-04-14 19:13:34 -07:00
Benoit Steiner	1de49ef4c2	Fixed a bug when chipping tensors laid out in row major order.	2015-04-07 10:44:13 -07:00
Benoit Steiner	a1f1e1e51d	Fixed the order of 2 #includes	2015-04-06 10:41:39 -07:00
Benoit Steiner	7c18ab921c	Pulled latest updates from trunk	2015-04-04 20:07:04 -07:00
Gael Guennebaud	15b5adb327	Fix regression in DynamicSparseMatrix and SuperLUSupport wrt recent change on nonZeros/nonZerosEstimate	2015-04-02 22:21:41 +02:00
Benoit Steiner	74e558cfa8	Pulled latest updates from trunk	2015-04-01 23:24:11 -07:00
Benoit Steiner	03a0df2010	Fixed some compilation warnings triggered by pre-cxx11 comoilers	2015-04-01 22:51:33 -07:00
Benoit Steiner	b8b7807269	Fixed some compilation warning triggered by the cxx11 emulation code	2015-04-01 21:48:18 -07:00
Benoit Steiner	383b6dfafe	Fixed 2 typos	2015-04-01 16:44:36 -07:00
Benoit Steiner	678207e02a	Added regression tests for tensor convolutions	2015-03-31 09:08:08 -07:00
Benoit Steiner	68d4afe985	Added support for convolution of tensors laid out in RowMajor mode	2015-03-31 09:07:09 -07:00
Benoit Steiner	f873686602	Added documentation for the convolution operation	2015-03-31 08:27:23 -07:00
Benoit Steiner	731d7b84b4	Sharded a large test	2015-03-30 23:26:45 -07:00
Benoit Steiner	35722fa022	Made the index type a template parameter of the tensor class instead of encoding it in the options.	2015-03-30 14:55:54 -07:00
Benoit Steiner	71950f02e5	Deleted unnecessary semicolons	2015-03-30 14:49:10 -07:00
Benoit Steiner	4df8b5a75e	Avoid making an unecessary copy of the tensor expression when evaluating it on a GPU device	2015-03-25 14:36:07 -07:00
Benoit Steiner	b3343bfdae	Fixed the vectorized implementation of the Tensor select() method	2015-03-25 13:25:53 -07:00

... 5 6 7 8 9 ...

1788 Commits