eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Benoit Steiner	c80587c92b	Merged eigen/eigen into default	2016-11-03 03:55:11 -07:00
Gael Guennebaud	a07bb428df	bug #1004 : improve accuracy of LinSpaced for abs(low) >> abs(high).	2016-11-02 11:34:38 +01:00
Gael Guennebaud	598de8b193	Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX.	2016-11-02 10:38:13 +01:00
Gael Guennebaud	3ecb343dc3	Fix regression in X = (X*X.transpose())/s with X rectangular by deferring resizing of the destination after the creation of the evaluator of the source expression.	2016-10-26 22:50:41 +02:00
Gael Guennebaud	58146be99b	bug #1004 : one more rewrite of LinSpaced for floating point numbers to guarantee both interpolation and monotonicity. This version simply does low+i*step plus a branch to return high if i==size-1. Vectorization is accomplished with a branch and the help of pinsertlast. Some quick benchmark revealed that the overhead is really marginal, even when filling small vectors.	2016-10-25 16:53:09 +02:00
Gael Guennebaud	13fc18d3a2	Add a pinsertlast function replacing the last entry of a packet by a scalar. (useful to vectorize LinSpaced)	2016-10-25 16:48:49 +02:00
Gael Guennebaud	b027d7a8cf	bug #1004 : remove the inaccurate "sequential" path for LinSpaced, mark respective function as deprecated, and enforce strict interpolation of the higher range using a correction term. Now, even with floating point precision, both the 'low' and 'high' bounds are exactly reproduced at i=0 and i=size-1 respectively.	2016-10-24 20:27:21 +02:00
Gael Guennebaud	53c77061f0	bug #698 : rewrite LinSpaced for integer scalar types to avoid overflow and guarantee an even spacing when possible. Otherwise, the "high" bound is implicitly lowered to the largest value allowing for an even distribution. This changeset also disables vectorization for this integer path.	2016-10-24 15:50:27 +02:00
Gael Guennebaud	e8e56c7642	Add unit test for overflow in LinSpaced	2016-10-24 15:43:51 +02:00
Benoit Steiner	78d2926508	Merged eigen/eigen into default	2016-10-12 13:46:29 -07:00
Gael Guennebaud	f939c351cb	Fix SPQR for rectangular matrices	2016-10-12 22:39:33 +02:00
Gael Guennebaud	5c366fe1d7	Merged in rmlarsen/eigen (pull request PR-230) Fix a bug in psqrt for SSE and AVX when EIGEN_FAST_MATH=1	2016-10-12 16:30:51 +00:00
Gael Guennebaud	4860727ac2	Remove static qualifier of free-functions (inline is enough and this helps ICC to find the right overload)	2016-10-07 09:21:12 +02:00
Benoit Steiner	507b661106	Renamed predux_half into predux_downto4	2016-10-06 17:57:04 -07:00
Gael Guennebaud	80b5133789	Fix compilation of qr.inverse() for column and full pivoting variants.	2016-10-06 09:55:50 +02:00
Benoit Steiner	78b569f685	Merged latest updates from trunk	2016-10-05 18:48:55 -07:00
Rasmus Munk Larsen	3ed67cb0bb	Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (enabled by EIGEN_FAST_MATH), which causes the vectorized parts of the computation to return -0.0 instead of NaN for negative arguments. Benchmark speed in Giga-sqrts/s on an Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz (SSE / AVX): Fast=1: 2.529G / 4.380G; Fast=0: 1.944G / 1.898G; Fast=1 fixed: 2.214G / 3.739G. This table illustrates the worst case in terms of speed impact: it was measured by repeatedly computing the sqrt of an n=4096 float vector that fits in L1 cache. For large vectors the operation becomes memory bound and the differences between the versions become almost negligible.	2016-10-04 14:22:56 -07:00
Benoit Steiner	616a7a1912	Improved support for compiling CUDA code with clang as the host compiler	2016-10-03 17:09:33 -07:00
Gael Guennebaud	8b84801f7f	bug #1310 : workaround a compilation regression from 3.2 regarding triangular * homogeneous	2016-09-30 22:49:59 +02:00
Gael Guennebaud	33500050c3	bug #1308 : fix compilation of some small products involving nullary-expressions.	2016-09-29 09:40:44 +02:00
Gael Guennebaud	779774f98c	bug #1311 : fix alignment logic in some cases of (scalar*small).lazyProduct(small)	2016-09-26 23:53:40 +02:00
Gael Guennebaud	48dfe98abd	bug #1308 : fix compilation of vector * rowvector::nullary.	2016-09-25 14:54:35 +02:00
Gael Guennebaud	86caba838d	bug #1304 : fix Projective * scaling and Projective *= scaling	2016-09-23 13:41:21 +02:00
Gael Guennebaud	66cbabafed	Add a note regarding gcc bug #72867	2016-09-22 11:18:52 +02:00
Gael Guennebaud	aecc51a3e8	fix typo	2016-09-21 21:53:00 +02:00
Gael Guennebaud	1fc3a21ed0	Disable a failure test if extended double precision is in use (x87)	2016-09-21 20:09:07 +02:00
Gael Guennebaud	5269d11935	Fix compilation if ICC.	2016-09-21 17:08:51 +02:00
Gael Guennebaud	bf03820339	Silent warning.	2016-09-17 14:14:01 +02:00
Gael Guennebaud	de05a18fe0	fix compilation with boost::multiprec	2016-09-17 14:13:48 +02:00
Gael Guennebaud	4cc2c73e6a	Fix alignment of statically allocated temporaries in gemv.	2016-09-17 12:52:27 +02:00
Gael Guennebaud	4adeababf9	Fix underflow	2016-09-16 11:46:46 +02:00
Gael Guennebaud	471eac5399	bug #1195 : move NumTraits::Div<>::Cost to internal::scalar_div_cost (with some specializations in arch/SSE and arch/AVX)	2016-09-08 08:36:27 +02:00
Gael Guennebaud	b046a3f87d	Workaround MSVC instantiation failure of has_ary_operator at the level of traits<Ref>::match so that the has_ary_operator are really properly instantiated throughout the compilation unit.	2016-09-06 15:47:04 +02:00
Gael Guennebaud	3cb914f332	bug #1266 : remove CUDA guards on MatrixBase::<decomposition> definitions. (those used to break old nvcc versions that we probably don't care about anymore)	2016-09-06 09:55:50 +02:00
Gael Guennebaud	dabc81751f	Fix compilation when cuda_fp16.h does not exist.	2016-09-05 17:14:20 +02:00
Gael Guennebaud	e13071dd13	Workaround a weird msvc 2012 compilation error.	2016-09-05 15:50:41 +02:00
Gael Guennebaud	218c37beb4	bug #1286 : automatically detect the available prototypes of functors passed to CwiseNullaryExpr such that functors only have to implement the operators that matter among: operator()(), operator()(i), operator()(i,j). Linear access is also automatically detected based on the availability of operator()(i,j).	2016-08-31 15:45:25 +02:00
Gael Guennebaud	efe2c225c9	bug #1283 : add regression unit test	2016-08-31 13:04:29 +02:00
Gael Guennebaud	8c48d42530	Fix 4x4 inverse with non-linear destination	2016-08-30 23:16:38 +02:00
Gael Guennebaud	c57317035a	Fix unit test for 1x1 matrices	2016-08-30 10:20:23 +02:00
Gael Guennebaud	7e029d1d6e	bug #1271 : add SparseMatrix::coeffs() methods returning a 1D view of the non zero coefficients.	2016-08-29 12:06:37 +02:00
Gael Guennebaud	a93e354d92	Add some pre-allocation unit tests (not working yet)	2016-08-29 11:08:44 +02:00
Gael Guennebaud	6cd7b9ea6b	Fix compilation with cuda 8	2016-08-29 11:06:08 +02:00
Gael Guennebaud	441b7eaab2	Add support for non trivial scalar factor in sparse selfadjoint * dense products, and enable +=/-= assignment for such products. This changeset also improves the performance by working on column of the result at once.	2016-08-24 13:06:34 +02:00
Gael Guennebaud	8132a12625	bug #1268 : detect failures in LDLT and report them through info()	2016-08-23 23:15:55 +02:00
Gael Guennebaud	326320ec7b	Fix compilation in non C++11 mode.	2016-08-23 19:28:57 +02:00
Gael Guennebaud	00b2666853	bug #645 : patch from Tobias Wood implementing the extraction of eigenvectors in GeneralizedEigenSolver	2016-08-23 17:37:38 +02:00
Gael Guennebaud	504a4404f1	Optimize expression matching "d?=a-bc" as "d?=a; d?=bc;"	2016-08-23 16:52:22 +02:00
Gael Guennebaud	e47a8928ec	Fix compilation in check_for_aliasing due to ambiguous specializations	2016-08-23 16:19:10 +02:00
Gael Guennebaud	82147cefff	Fix possible overflow and bias in integer random generator	2016-08-23 13:25:31 +02:00
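
Commit 58146be99b above describes the floating-point LinSpaced rewrite as computing low+i*step with a branch that returns high when i==size-1. The scalar sketch below illustrates that idea only. It is not Eigen's code: the name `linspaced_sketch`, the use of `std::vector<double>`, and the handling of size 0 and size 1 are assumptions made for this example, and the vectorized path with pinsertlast is not shown.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Eigen): fills `size` values from `low`
// towards `high` as low + i*step, and forces the last entry to be exactly
// `high` with a branch, as the commit message describes.
std::vector<double> linspaced_sketch(std::size_t size, double low, double high) {
    std::vector<double> out(size);
    if (size == 0) return out;  // assumption: an empty request yields an empty result

    // With a single element there is no spacing; the loop below then stores `high`.
    const double step =
        (size > 1) ? (high - low) / static_cast<double>(size - 1) : 0.0;

    for (std::size_t i = 0; i < size; ++i) {
        // The branch on the last index reproduces `high` without rounding error.
        out[i] = (i == size - 1) ? high : low + static_cast<double>(i) * step;
    }
    return out;
}
```

Calling `linspaced_sketch(5, 0.0, 1.0)` fills five entries whose first and last values are exactly 0.0 and 1.0, because the first entry is low + 0*step and the last is assigned from `high` directly.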


1935 Commits