eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Rasmus Munk Larsen	8450a6d519	Clean up half packet traits and add a few more missing packet ops.	2019-03-14 15:18:06 -07:00
David Tellenbach	b013176e52	Remove undefined std::complex<int>	2019-03-14 11:40:28 +01:00
David Tellenbach	97f9a46cb9	PR 593: Add variadtic ctor for DiagonalMatrix with unit tests	2019-03-14 10:18:24 +01:00
Gael Guennebaud	45ab514fe2	revert debug stuff	2019-03-14 10:08:12 +01:00
Rasmus Munk Larsen	6a34003141	Remove EIGEN_MPL2_ONLY guard in IncompleteCholesky that is no longer needed after the AMD reordering code was relicensed to MPL2.	2019-03-13 11:52:41 -07:00
Gael Guennebaud	d7d2f0680e	bug #1684 : partially workaround clang's 6/7 bug #40815	2019-03-13 10:40:01 +01:00
Rasmus Larsen	690f0795d0	Merged in rmlarsen/eigen (pull request PR-615) Clean up PacketMathHalf.h and add a few missing logical packet ops.	2019-03-12 16:09:48 +00:00
Thomas Capricelli	1901433674	erm.. use proper id	2019-03-12 13:53:38 +01:00
Thomas Capricelli	90302aa8c9	update tracking code	2019-03-12 13:47:01 +01:00
Rasmus Munk Larsen	77f7d4a894	Clean up PacketMathHalf.h and add a few missing logical packet ops.	2019-03-11 17:51:16 -07:00
Eugene Zhulenev	001f10e3c9	Fix segfaults with cuda compilation	2019-03-11 09:43:33 -07:00
Eugene Zhulenev	899c16fa2c	Fix a bug in TensorGenerator for 1d tensors	2019-03-11 09:42:01 -07:00
Eugene Zhulenev	0f8bfff23d	Fix a data race in NonBlockingThreadPool	2019-03-11 09:38:44 -07:00
Gael Guennebaud	656d9bc66b	Apply SSE's pmin/pmax fix for GCC <= 5 to AVX's pmin/pmax	2019-03-10 21:19:18 +01:00
Gael Guennebaud	2df4f00246	Change license from LGPL to MPL2 with agreement from David Harmon.	2019-03-07 18:17:10 +01:00
Rasmus Munk Larsen	3c3f639fe2	Merge.	2019-03-06 11:54:30 -08:00
Rasmus Munk Larsen	f4ec8edea8	Add macro EIGEN_AVOID_THREAD_LOCAL to make it possible to manually disable the use of thread_local.	2019-03-06 11:52:04 -08:00
Rasmus Munk Larsen	41cdc370d0	Fix placement of "#if defined(EIGEN_GPUCC)" guard region. Found with -Wundefined-func-template. Author: tkoeppe@google.com	2019-03-06 11:42:22 -08:00
Rasmus Munk Larsen	cc407c9d4d	Fix placement of "#if defined(EIGEN_GPUCC)" guard region. Found with -Wundefined-func-template. Author: tkoeppe@google.com	2019-03-06 11:40:06 -08:00
Eugene Zhulenev	1bc2a0a57c	Add missing return to NonBlockingThreadPool::LocalSteal	2019-03-06 10:49:49 -08:00
Eugene Zhulenev	4e4dcd9026	Remove redundant steal loop	2019-03-06 10:39:07 -08:00
Rasmus Larsen	4d808e834a	Merged in rmlarsen/eigen_threadpool (pull request PR-606) Remove EIGEN_MPL2_ONLY guards around code re-licensed from LGPL to MPL2 in `2ca1e73239` Approved-by: Sameer Agarwal <sameeragarwal@google.com>	2019-03-06 17:59:03 +00:00
Rasmus Larsen	2ea18e505f	Merged in ezhulenev/eigen-01 (pull request PR-610) Block evaluation for TensorGeneratorOp	2019-03-06 16:49:38 +00:00
Eugene Zhulenev	25abaa2e41	Check that inner block dimension is continuous	2019-03-05 17:34:35 -08:00
Eugene Zhulenev	5d9a6686ed	Block evaluation for TensorGeneratorOp	2019-03-05 16:35:21 -08:00
Rasmus Larsen	b4861f4778	Merged in ezhulenev/eigen-01 (pull request PR-609) Tune tensor contraction threadpool heuristics	2019-03-05 23:54:40 +00:00
Gael Guennebaud	bfbf7da047	bug #1689 fix used-but-marked-unused warning	2019-03-05 23:46:24 +01:00
Eugene Zhulenev	a407e022e6	Tune tensor contraction threadpool heuristics	2019-03-05 14:19:59 -08:00
Eugene Zhulenev	56c6373f82	Add an extra check for the RunQueue size estimate	2019-03-05 11:51:26 -08:00
Eugene Zhulenev	b1a8627493	Do not create Tensor<const T> in cxx11_tensor_forced_eval test	2019-03-05 11:19:25 -08:00
Rasmus Munk Larsen	0318fc7f44	Remove EIGEN_MPL2_ONLY guards around code re-licensed from LGPL to MPL2 in `2ca1e73239`	2019-03-05 10:24:54 -08:00
Eugene Zhulenev	efb5080d31	Do not initialize invalid fast_strides in TensorGeneratorOp	2019-03-04 16:58:49 -08:00
Eugene Zhulenev	b95941e5c2	Add tiled evaluation for TensorForcedEvalOp	2019-03-04 16:02:22 -08:00
Eugene Zhulenev	694084ecbd	Use fast divisors in TensorGeneratorOp	2019-03-04 11:10:21 -08:00
Gael Guennebaud	b0d406d91c	Enable construction of Ref<VectorType> from a runtime vector.	2019-03-03 15:25:25 +01:00
Sam Hasinoff	9ba81cf0ff	Fully qualify Eigen::internal::aligned_free This helps avoids a conflict on certain Windows toolchains (potentially due to some ADL name resolution bug) in the case where aligned_free is defined in the global namespace. In any case, tightening this up is harmless.	2019-03-02 17:42:16 +00:00
Gael Guennebaud	22144e949d	bug #1629 : fix compilation of PardisoSupport (regression introduced in changeset `a7842daef2` )	2019-03-02 22:44:47 +01:00
Bernhard M. Wiedemann	b071672e78	Do not keep latex logs to make package builds more reproducible. See https://reproducible-builds.org/ for why this is good.	2019-02-27 11:09:00 +01:00
Rasmus Munk Larsen	cf4a1c81fa	Fix specialization for conjugate on non-complex types in TensorBase.h.	2019-03-01 14:21:09 -08:00
Sameer Agarwal	c181dfb8ab	Consistently use EIGEN_BLAS_FUNC in BLAS. Previously, for a few functions, eithe BLASFUNC or, EIGEN_CAT was being used. This change uses EIGEN_BLAS_FUNC consistently everywhere. Also introduce EIGEN_BLAS_FUNC_SUFFIX, which by default is equal to "_", this allows the user to inject a new suffix as needed.	2019-02-27 11:30:58 -08:00
Rasmus Larsen	9558f4c25f	Merged in rmlarsen/eigen_threadpool (pull request PR-596) Improve EventCount used by the non-blocking threadpool. Approved-by: Gael Guennebaud <g.gael@free.fr>	2019-02-26 20:37:26 +00:00
Rasmus Larsen	2ca1e73239	Merged in rmlarsen/eigen (pull request PR-597) Change licensing of OrderingMethods/Amd.h and SparseCholesky/SimplicialCholesky_impl.h from LGPL to MPL2. Approved-by: Gael Guennebaud <g.gael@free.fr>	2019-02-25 17:02:16 +00:00
Gael Guennebaud	e409dbba14	Enable SSE vectorization of Quaternion and cross3() with AVX	2019-02-23 10:45:40 +01:00
Rasmus Munk Larsen	6560692c67	Improve EventCount used by the non-blocking threadpool. The current algorithm requires threads to commit/cancel waiting in order they called Prewait. Spinning caused by that serialization can consume lots of CPU time on some workloads. Restructure the algorithm to not require that serialization and remove spin waits from Commit/CancelWait. Note: this reduces max number of threads from 2^16 to 2^14 to leave more space for ABA counter (which is now 22 bits). Implementation details are explained in comments.	2019-02-22 13:56:26 -08:00
Gael Guennebaud	0b25a5c431	fix alignment in ploadquad	2019-02-22 21:39:36 +01:00
Rasmus Munk Larsen	1dc1677d52	Change licensing of OrderingMethods/Amd.h and SparseCholesky/SimplicialCholesky_impl.h from LGPL to MPL2. Google LLC executed a license agreement with the author of the code from which these files are derived to allow the Eigen project to distribute the code and derived works under MPL2.	2019-02-22 12:33:57 -08:00
Gael Guennebaud	0cb4ba98e7	update wrt recent changes	2019-02-21 17:19:36 +01:00
Gael Guennebaud	cca6c207f4	AVX512: implement faster ploadquad<Packet16f> thus speeding up GEMM	2019-02-21 17:18:28 +01:00
Gael Guennebaud	1c09ee8541	bug #1674 : workaround clang fast-math aggressive optimizations	2019-02-22 15:48:53 +01:00
Gael Guennebaud	7e3084bb6f	Fix compilation on ARM.	2019-02-22 14:56:12 +01:00

1 2 3 4 5 ...

10522 Commits