eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	445c015751	extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors.	2016-12-05 13:36:26 +01:00
Gael Guennebaud	e3f613cbd4	Improve performance of row-major-dense-matrix * vector products for recent CPUs. This revised version does not bother about aligned loads/stores, and rather processes 8 rows at ones for better instruction pipelining.	2016-12-05 13:02:01 +01:00
Gael Guennebaud	3abc827354	Clean debugging code	2016-12-05 12:59:32 +01:00
Benoit Steiner	462c28e77a	Merged in srvasude/eigen (pull request PR-265) Add Expm1 support to Eigen.	2016-12-05 02:31:11 +00:00
Gael Guennebaud	4465d20403	Add missing generic load methods.	2016-12-03 21:25:04 +01:00
Gael Guennebaud	6a5fe86098	Complete rewrite of column-major-matrix * vector product to deliver higher performance of modern CPU. The previous code has been optimized for Intel core2 for which unaligned loads/stores were prohibitively expensive. This new version exhibits much higher instruction independence (better pipelining) and explicitly leverage FMA. According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast. Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching. We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panel of 4 columns is probably not optimal anymore).	2016-12-03 21:14:14 +01:00
Benoit Steiner	2bfece5cd1	Merged eigen/eigen into default	2016-12-02 16:30:14 -08:00
Mehdi Goli	592acc5bfa	Makingt default numeric_list works with sycl.	2016-12-02 17:58:30 +00:00
Gael Guennebaud	8dfb3e00b8	merge	2016-12-02 11:34:21 +01:00
Gael Guennebaud	4c0d5f3c01	Add perf monitoring for gemv	2016-12-02 11:34:12 +01:00
Gael Guennebaud	d2718d662c	Re-enable A^T*A action in BTL	2016-12-02 11:32:03 +01:00
Christoph Hertzberg	22f7d398e2	bug #1355 : Fixed wrong line-endings on two files	2016-12-02 11:22:05 +01:00
Gael Guennebaud	27873008d4	Clean up SparseCore module regarding ReverseInnerIterator	2016-12-01 21:55:10 +01:00
Angelos Mantzaflaris	8c24723a09	typo UIntPtr (grafted from `b6f04a2dd4` )	2016-12-01 21:25:58 +01:00
Angelos Mantzaflaris	aeba0d8655	fix two warnings(unused typedef, unused variable) and a typo (grafted from `a9aa3bcf50` )	2016-12-01 21:23:43 +01:00
Gael Guennebaud	181138a1cb	fix member order	2016-12-01 17:06:20 +01:00
Gael Guennebaud	9f297d57ae	Merged in rmlarsen/eigen (pull request PR-256) Add a default constructor for the "fake" __half class when not using the __half class provided by CUDA.	2016-12-01 15:27:33 +00:00
Gael Guennebaud	f95e3b84a5	merge	2016-12-01 16:18:57 +01:00
Benoit Steiner	7ff26ddcbb	Merged eigen/eigen into default	2016-12-01 07:13:17 -08:00
Gael Guennebaud	037b46762d	Fix misleading-indentation warnings.	2016-12-01 16:05:42 +01:00
Mehdi Goli	79aa2b784e	Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.	2016-12-01 13:02:27 +00:00
Benoit Steiner	a70393fd02	Cleaned up forward declarations	2016-11-30 21:59:07 -08:00
Benoit Steiner	e073de96dc	Moved the MemCopyFunctor back to TensorSyclDevice since it's the only caller and it makes TensorFlow compile again	2016-11-30 21:36:52 -08:00
Benoit Steiner	fca27350eb	Added the deallocate_all() method back	2016-11-30 20:45:20 -08:00
Benoit Steiner	e633a8371f	Simplified includes	2016-11-30 20:21:18 -08:00
Benoit Steiner	7cd33df4ce	Improved formatting	2016-11-30 20:20:44 -08:00
Benoit Steiner	fd1dc3363e	Merged eigen/eigen into default	2016-11-30 20:16:17 -08:00
Benoit Steiner	f5107010ee	Udated the Sizes class to work on AMD gpus without requiring a separate implementation	2016-11-30 19:57:28 -08:00
Benoit Steiner	e37c2c52d3	Added an implementation of numeric_list that works with sycl	2016-11-30 19:55:15 -08:00
Gael Guennebaud	8df272af88	Fix slection of product implementation for dynamic size matrices with fixed max size.	2016-11-30 22:21:33 +01:00
Benoit Steiner	faa2ff99c6	Pulled latest update from trunk	2016-11-30 09:31:24 -08:00
Benoit Steiner	df3da0780d	Updated customIndices2Array to handle various index sizes.	2016-11-30 09:30:12 -08:00
Gael Guennebaud	c927af60ed	Fix a performance regression in (matmat)vec for which mat*mat was evaluated multiple times.	2016-11-30 17:59:13 +01:00
Luke Iwanski	26fff1c5b1	Added EIGEN_STRONG_INLINE to get_sycl_supported_device().	2016-11-30 16:55:22 +00:00
Gael Guennebaud	ab4ef5e66e	bug #1351 : fix compilation of random with old compilers	2016-11-30 17:37:53 +01:00
Sergiu Deitsch	5e3c5c42f6	cmake: remove architecture dependency from Eigen3ConfigVersion.cmake Also, install Eigen3*.cmake under $prefix/share/eigen3/cmake by default. (grafted from `86ab00cdcf` )	2016-11-30 15:46:46 +01:00
Sergiu Deitsch	3440b46e2f	doc: mention the NO_MODULE option and target availability (grafted from `65f09be8d2` )	2016-11-30 15:41:38 +01:00
Rasmus Munk Larsen	a0329f64fb	Add a default constructor for the "fake" __half class when not using the __half class provided by CUDA.	2016-11-29 13:18:09 -08:00
Mehdi Goli	577ce78085	Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.	2016-11-29 15:30:42 +00:00
Benoit Steiner	3011dc94ef	Call internal::array_prod to compute the total size of the tensor.	2016-11-28 09:00:31 -08:00
Benoit Steiner	02080e2b67	Merged eigen/eigen into default	2016-11-27 07:27:30 -08:00
Benoit Steiner	9fd081cddc	Fixed compilation warnings	2016-11-26 20:22:25 -08:00
Benoit Steiner	9f8fbd9434	Merged eigen/eigen into default	2016-11-26 11:28:25 -08:00
Benoit Steiner	67b2c41f30	Avoided unnecessary type conversion	2016-11-26 11:27:29 -08:00
Benoit Steiner	7fe704596a	Added missing array_get method for numeric_list	2016-11-26 11:26:07 -08:00
Mehdi Goli	7318daf887	Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.	2016-11-25 16:19:07 +00:00
Benoit Steiner	7ad37606dd	Fixed the documentation of Scalar Tensors	2016-11-24 12:31:43 -08:00
Benoit Steiner	3be1afca11	Disabled the "remove the call to 'std::abs' since unsigned values cannot be negative" warning introduced in clang 3.5	2016-11-23 18:49:51 -08:00
Gael Guennebaud	308961c05e	Fix compilation.	2016-11-23 22:17:52 +01:00
Gael Guennebaud	21d0286d81	bug #1348 : Document EIGEN_MAX_ALIGN_BYTES and EIGEN_MAX_STATIC_ALIGN_BYTES, and reflect in the doc that EIGEN_DONT_ALIGN* are deprecated.	2016-11-23 22:15:03 +01:00

... 3 4 5 6 7 ...

9124 Commits