Commit Graph

7737 Commits

Author SHA1 Message Date
Gael Guennebaud
512ba0ac76 Add regression unit tests for half-packet vectorization 2016-04-13 18:16:35 +02:00
Gael Guennebaud
06447e0a39 Improve half-packet vectorization logic to distinguish linear versus inner traversal modes. 2016-04-13 18:15:49 +02:00
Gael Guennebaud
bbb8854bf7 Enable half-packet in reductions. 2016-04-13 13:02:34 +02:00
Benoit Steiner
e9b12cc1f7 Fixed compilation warnings generated by clang 2016-04-12 20:53:18 -07:00
Benoit Steiner
eaeb6ca93a Enable the benchmarks for algebraic and transcendental functions on fp16. 2016-04-12 16:29:00 -07:00
Benoit Steiner
aa1ba8bbd2 Don't put a comma at the end of an enumerator list 2016-04-12 16:28:11 -07:00
Benoit Steiner
e49945ced4 Pulled latest update from trunk 2016-04-12 14:13:41 -07:00
Benoit Steiner
25d05c4b8f Fixed the vectorization logic test 2016-04-12 14:13:25 -07:00
Benoit Steiner
53121c0119 Turned on the contraction benchmarks for fp16 2016-04-12 14:11:52 -07:00
Gael Guennebaud
b67c983291 Enable the use of half-packet in coeff-based product.
For instance, Matrix4f*Vector4f is now vectorized again when using AVX.
2016-04-12 23:03:03 +02:00
Benoit Steiner
e3a184785c Fixed the zeta test 2016-04-12 11:12:36 -07:00
Benoit Steiner
3b76df64fc Defer the decision to vectorize tensor CUDA code to the meta kernel. This makes it possible to decide whether to vectorize depending on the capability of the target CUDA architecture. In particular, this enables us to vectorize the processing of fp16 when running on devices of compute capability >= 5.3 2016-04-12 10:58:51 -07:00
Benoit Steiner
748c4c4599 More accurate cost estimates for exp, log, tanh, and sqrt. 2016-04-11 13:11:04 -07:00
Benoit Steiner
833efb39bf Added epsilon, dummy_precision, infinity and quiet_NaN NumTraits for fp16 2016-04-11 11:03:56 -07:00
Benoit Steiner
e939b087fe Pulled latest update from trunk 2016-04-11 11:03:02 -07:00
Gael Guennebaud
1744b5b5d2 Update doc regarding the genericity of EIGEN_USE_BLAS 2016-04-11 17:16:07 +02:00
Gael Guennebaud
91bf925fc1 Improve constness of level2 blas API. 2016-04-11 17:13:01 +02:00
Gael Guennebaud
0483430283 Move LAPACK declarations from blas.h to lapack.h and fix compatibility with EIGEN_USE_MKL 2016-04-11 17:12:31 +02:00
Gael Guennebaud
097d1e8823 Cleanup obsolete assign_scalar_eig2mkl helper. 2016-04-11 16:09:29 +02:00
Gael Guennebaud
fec4c334ba Remove all references to MKL in BLAS wrappers. 2016-04-11 16:04:09 +02:00
Gael Guennebaud
ddabc992fa Fix long to int conversion in BLAS API. 2016-04-11 15:52:01 +02:00
Gael Guennebaud
8191f373be Silence unused warning. 2016-04-11 15:37:16 +02:00
Gael Guennebaud
6a9ca88e7e Relax dependency on MKL for EIGEN_USE_BLAS 2016-04-11 15:17:14 +02:00
Gael Guennebaud
4e8e5888d7 Improve constness of blas level-3 interface. 2016-04-11 15:12:44 +02:00
Gael Guennebaud
675e0a2224 Fix static/inline keywords order. 2016-04-11 15:06:20 +02:00
Gael Guennebaud
fc6a0ebb1c Typos in doc. 2016-04-11 10:54:58 +02:00
Till Hoffmann
643b697649 Proper handling of domain errors. 2016-04-10 00:37:53 +01:00
Till Hoffmann
7f4826890c Merge upstream 2016-04-09 20:08:07 +01:00
Till Hoffmann
de057ebe54 Added nans to zeta function. 2016-04-09 20:07:36 +01:00
Gael Guennebaud
af2161cdb4 bug #1197: fix/relax some LM unit tests 2016-04-09 11:14:02 +02:00
Gael Guennebaud
a05a683d83 bug #1160: fix and relax some LM unit tests by turning failures into warnings 2016-04-09 10:49:19 +02:00
Benoit Steiner
5da90fc8dd Use numext::abs instead of std::abs in scalar_fuzzy_default_impl to make it usable inside GPU kernels. 2016-04-08 19:40:48 -07:00
Benoit Steiner
01bd577288 Fixed the implementation of Eigen::numext::isfinite, Eigen::numext::isnan, and Eigen::numext::isinf on CUDA devices 2016-04-08 16:40:10 -07:00
Benoit Steiner
89a3dc35a3 Fixed isfinite_impl: NumTraits<T>::highest() and NumTraits<T>::lowest() are finite numbers. 2016-04-08 15:56:16 -07:00
Benoit Steiner
995f202cea Disabled the use of half2 on cuda devices of compute capability < 5.3 2016-04-08 14:43:36 -07:00
Benoit Steiner
8d22967bd9 Initial support for taking the power of fp16 2016-04-08 14:22:39 -07:00
Benoit Steiner
3394379319 Fixed the packet_traits for half floats. 2016-04-08 13:33:59 -07:00
Benoit Steiner
0d2a532fc3 Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc 2016-04-08 13:16:08 -07:00
Benoit Steiner
2d072b38c1 Don't test division by 0 on float16 when compiling with MSVC, since MSVC detects divisions by 0 and errors out. 2016-04-08 12:50:25 -07:00
Benoit Jacob
cd2b667ac8 Add references to filed LLVM bugs 2016-04-08 08:12:47 -04:00
Benoit Steiner
3bd16457e1 Properly handle complex numbers. 2016-04-07 23:28:04 -07:00
Benoit Steiner
63102ee43d Turn on the coeffWise benchmarks on fp16 2016-04-07 23:05:20 -07:00
Benoit Steiner
7c47d3e663 Fixed the type casting benchmarks for fp16 2016-04-07 22:50:25 -07:00
Benoit Steiner
2f2801f096 Merged in parthaEth/eigen (pull request PR-175)
Static casting scalar types so as to let the Cholesky module of Eigen work with Ceres
2016-04-07 22:10:14 -07:00
Benoit Steiner
d962fe6a99 Renamed float16 into cxx11_float16 since the test relies on C++11 features 2016-04-07 20:28:32 -07:00
Benoit Steiner
7d5b17087f Added missing EIGEN_DEVICE_FUNC to the tensor conversion code. 2016-04-07 20:01:19 -07:00
Benoit Steiner
a6d08be9b2 Fixed the benchmarking of fp16 coefficient wise operations 2016-04-07 17:13:44 -07:00
parthaEth
2d5bb375b7 Static casting scalar types so as to let the Cholesky module of Eigen work with Ceres 2016-04-08 00:14:44 +02:00
Benoit Steiner
a02ec09511 Worked around numerical noise in the test for the zeta function. 2016-04-07 12:11:02 -07:00
Benoit Steiner
c912b1d28c Fixed a typo in the polygamma test. 2016-04-07 11:51:07 -07:00