eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-01-30 17:40:05 +08:00

Author	SHA1	Message	Date
Deven Desai	d1d22ef0f4	syncing this fork with upstream	2018-06-13 12:09:52 -04:00
Benoit Steiner	d3a380af4d	Merged in mfigurnov/eigen/gamma-der-a (pull request PR-403) Derivative of the incomplete Gamma function and the sample of a Gamma random variable Approved-by: Benoit Steiner <benoit.steiner.goog@gmail.com>	2018-06-11 17:57:47 +00:00
Gael Guennebaud	cb4c9a6a94	bug #1531 : make dedicatd unit testing for NumDimensions	2018-06-08 17:11:45 +02:00
Gael Guennebaud	d6813fb1c5	bug #1531 : expose NumDimensions for solve and sparse expressions.	2018-06-08 16:55:10 +02:00
Gael Guennebaud	89d65bb9d6	bug #1531 : expose NumDimensions for compatibility with Tensor	2018-06-08 16:50:17 +02:00
Gael Guennebaud	f05dea6b23	bug #1550 : prevent avoidable memory allocation in RealSchur	2018-06-08 10:14:57 +02:00
Gael Guennebaud	7933267c67	fix prototype	2018-06-08 09:56:01 +02:00
Gael Guennebaud	f4d1461874	Fix the way matrix folder is passed to the tests.	2018-06-08 09:55:46 +02:00
Benoit Steiner	522d3ca54d	Don't use std::equal_to inside cuda kernels since it's not supported.	2018-06-07 13:02:07 -07:00
Christoph Hertzberg	7d7bb91537	Missing line during manual rebase of PR-374	2018-06-07 20:30:09 +02:00
Michael Figurnov	30fa3d0454	Merge from eigen/eigen	2018-06-07 17:57:56 +01:00
Benoit Steiner	d2b0a4a59b	Merged in mfigurnov/eigen/fix-bessel (pull request PR-404) Fix compilation of special functions without C99 math.	2018-06-07 16:12:42 +00:00
Michael Figurnov	6c71c7d360	Merge from eigen/eigen.	2018-06-07 15:54:18 +01:00
Gael Guennebaud	c25034710e	Fiw some warnings in dox examples	2018-06-07 16:09:22 +02:00
Gael Guennebaud	37348d03ae	Fix int versus Index	2018-06-07 15:56:43 +02:00
Gael Guennebaud	c723ffd763	Fix warning	2018-06-07 15:56:20 +02:00
Gael Guennebaud	af7c83b9a2	Fix warning	2018-06-07 15:45:24 +02:00
Gael Guennebaud	7fe29aceeb	Fix MSVC warning C4290: C++ exception specification ignored except to indicate a function is not __declspec(nothrow)	2018-06-07 15:36:20 +02:00
Michael Figurnov	aa813d417b	Fix compilation of special functions without C99 math. The commit with Bessel functions i0e and i1e placed the ifdef/endif incorrectly, causing i0e/i1e to be undefined when EIGEN_HAS_C99_MATH=0. These functions do not actually require C99 math, so now they are always available.	2018-06-07 14:35:07 +01:00
Gael Guennebaud	55774b48e4	Fix short vs long	2018-06-07 15:26:25 +02:00
Christoph Hertzberg	e5f9f4768f	Avoid unnecessary C++11 dependency	2018-06-07 15:03:50 +02:00
Gael Guennebaud	b3fd93207b	Fix typos found using codespell	2018-06-07 14:43:02 +02:00
Michael Figurnov	5172a32849	Updated the stopping criteria in igammac_cf_impl. Previously, when computing the derivative, it used a relative error threshold. Now it uses an absolute error threshold. The behavior for computing the value is unchanged. This makes more sense since we do not expect the derivative to often be close to zero. This change makes the derivatives about 30% faster across the board. The error for the igamma_der_a is almost unchanged, while for gamma_sample_der_alpha it is a bit worse for float32 and unchanged for float64.	2018-06-07 12:03:58 +01:00
Michael Figurnov	4bd158fa37	Derivative of the incomplete Gamma function and the sample of a Gamma random variable. In addition to igamma(a, x), this code implements: * igamma_der_a(a, x) = d igamma(a, x) / da -- derivative of igamma with respect to the parameter * gamma_sample_der_alpha(alpha, sample) -- reparameterization derivative of a Gamma(alpha, 1) random variable sample with respect to the alpha parameter The derivatives are computed by forward mode differentiation of the igamma(a, x) code. Although gamma_sample_der_alpha can be implemented via igamma_der_a, a separate function is more accurate and efficient due to analytical cancellation of some terms. All three functions are implemented by a method parameterized with "mode" that always computes the derivatives, but does not return them unless required by the mode. The compiler is expected to (and, based on benchmarks, does) skip the unnecessary computations depending on the mode.	2018-06-06 18:49:26 +01:00
Deven Desai	8fbd47052b	Adding support for using Eigen in HIP kernels. This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.	2018-06-06 10:12:58 -04:00
Benoit Steiner	e206f8d4a4	Merged in mfigurnov/eigen (pull request PR-400) Exponentially scaled modified Bessel functions of order zero and one. Approved-by: Benoit Steiner <benoit.steiner.goog@gmail.com>	2018-06-05 17:05:21 +00:00
Penporn Koanantakool	e2ed0cf8ab	Add a ThreadPoolInterface* getter for ThreadPoolDevice.	2018-06-02 12:07:49 -07:00
Gael Guennebaud	84868da904	Don't run hg on non mercurial clone	2018-05-31 21:21:57 +02:00
Michael Figurnov	f216854453	Exponentially scaled modified Bessel functions of order zero and one. The functions are conventionally called i0e and i1e. The exponentially scaled version is more numerically stable. The standard Bessel functions can be obtained as i0(x) = exp(\|x\|) i0e(x) The code is ported from Cephes and tested against SciPy.	2018-05-31 15:34:53 +01:00
Gael Guennebaud	6af1433cb5	Doc: add aliasing in common pitfaffs.	2018-05-29 22:37:47 +02:00
Katrin Leinweber	ea94543190	Hyperlink DOIs against preferred resolver	2018-05-24 18:55:40 +02:00
Gael Guennebaud	999b552c16	Search for sequential Pastix.	2018-05-29 20:49:25 +02:00
Gael Guennebaud	eef4b7bd87	Fix handling of path names containing spaces and the likes.	2018-05-29 20:49:06 +02:00
Gael Guennebaud	647b724a36	Define pcast<> for SSE types even when AVX is enabled. (otherwise float are silently reinterpreted as int instead of being converted)	2018-05-29 20:46:46 +02:00
Gael Guennebaud	49262dfee6	Fix compilation and SSE support with PGI compiler	2018-05-29 15:09:31 +02:00
Christoph Hertzberg	750af06362	Add an option to test with external BLAS library	2018-05-22 21:04:32 +02:00
Christoph Hertzberg	d06a753d10	Make qr_fullpivoting unit test run for fixed-sized matrices	2018-05-22 20:29:17 +02:00
Gael Guennebaud	f0862b062f	Fix internal::is_integral<size_t/ptrdiff_t> with MSVC 2013 and older.	2018-05-22 19:29:51 +02:00
Gael Guennebaud	36e413a534	Workaround a MSVC 2013 compilation issue with MatrixBase(Index,int)	2018-05-22 18:51:35 +02:00
Gael Guennebaud	725bd92903	fix stupid typo	2018-05-18 17:46:43 +02:00
Gael Guennebaud	a382bc9364	is_convertible<T,Index> does not seems to work well with MSVC 2013, so let's rather use __is_enum(T) for old MSVC versions	2018-05-18 17:02:27 +02:00
Gael Guennebaud	4dd767f455	add some internal checks	2018-05-18 13:59:55 +02:00
Gael Guennebaud	345c0ab450	check that all integer types are properly handled by mat(i,j)	2018-05-18 13:46:46 +02:00
Mark D Ryan	405859f18d	Set EIGEN_IDEAL_MAX_ALIGN_BYTES correctly for AVX512 builds bug #1548 The macro EIGEN_IDEAL_MAX_ALIGN_BYTES is being incorrectly set to 32 on AVX512 builds. It should be set to 64. In the current code it is only set to 64 if the macro EIGEN_VECTORIZE_AVX512 is defined. This macro does get defined in AVX512 builds in Core, but only after Macros.h, the file that defines EIGEN_IDEAL_MAX_ALIGN_BYTES, has been included. This commit fixes the issue by setting EIGEN_IDEAL_MAX_ALIGN_BYTES to 64 if __AVX512F__ is defined.	2018-05-17 17:04:00 +01:00
Vamsi Sripathi	6293ad3f39	Performance improvements to tensor broadcast operation 1. Added new packet functions using SIMD for NByOne, OneByN cases 2. Modified existing packet functions to reduce index calculations when input stride is non-SIMD 3. Added 4 test cases to cover the new packet functions	2018-05-23 14:02:05 -07:00
Gael Guennebaud	7134fa7a2e	Fix compilation with MSVC by reverting to char* for _mm_prefetch except for PGI (the later being the one that has the wrong prototype).	2018-06-07 09:33:10 +02:00
Jeff Trull	e7147f69ae	Add tests for sparseQR results (value and size) covering bugs #1522 and #1544	2018-04-21 10:26:30 -07:00
Robert Lukierski	b2053990d0	Adding EIGEN_DEVICE_FUNC to Products, especially Dense2Dense Assignment specializations. Otherwise causes problems with small fixed size matrix multiplication (call to 0x00 in call_assignment_no_alias in debug mode or trap in release with CUDA 9.1).	2018-03-14 16:19:43 +00:00
Jeff Trull	9f0c5c3669	Make sparse QR result sizes consistent with dense QR, with the following rules: 1) Q is always square 2) QRP' is valid and recovers the original matrix This implies that the size of Q is the number of rows in the original matrix, square, and that the size of R is the size of the original matrix.	2018-02-15 15:00:31 -08:00
Christoph Hertzberg	d655900953	bug #1544 : Generate correct Q matrix in complex case. Original patch was by Jeff Trull in PR-386.	2018-05-17 19:17:01 +02:00

1 2 3 4 5 ...

9633 Commits