eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-01-24 14:45:14 +08:00

Author	SHA1	Message	Date
David Tellenbach	ae95b74af9	Add CMake infrastructure for smoke testing Necessary CMake changes to implement pre-merge smoke tests running via CI.	2021-03-31 22:09:00 +00:00
Rasmus Munk Larsen	5bbc9cea93	Add an info() method to the SVDBase class to make it possible to tell the user that the computation failed, possibly due to invalid input. Make Jacobi and divide-and-conquer fail fast and return info() == InvalidInput if the matrix contains NaN or +/-Inf.	2021-03-31 21:09:19 +00:00
Guoqiang QI	b5a926a0f6	Add GitLab templates for issues and merge requests This patch adds GitLab templates for bug reports, feature and merge requests. This closes #2117.	2021-03-31 16:01:12 +00:00
Antonio Sanchez	78ee3d6261	Fix CUDA constexpr issues for numeric_limits. Some CUDA/HIP constants fail on device with `constexpr` since they internally rely on non-constexpr functions, e.g. ``` \#define CUDART_INF_F __int_as_float(0x7f800000) ``` This fails for cuda-clang (though passes with nvcc). These constants are currently used by `device::numeric_limits`. For portability, we need to remove `constexpr` from the affected functions. For C++11 or higher, we should be able to rely on the `std::numeric_limits` versions anyways, since the methods themselves are now `constexpr`, so should be supported on device (clang/hipcc natively, nvcc with `--expr-relaxed-constexpr`).	2021-03-30 18:01:27 +00:00
Antonio Sanchez	af1247fbc1	Use Index type in loop over coefficients. Previously was `int`. Brought up by Kyle Snow (Polaris Geospatial Services) on the mailing list.	2021-03-29 17:40:55 +00:00
Antonio Sanchez	87729ea39f	Eliminate `round_impl` double-promotion warnings for c++03.	2021-03-25 16:52:19 +00:00
Deven Desai	748489ef9c	Un-defining EIGEN_HAS_CONSTEXPR on the HIP platform The Eigen unit-tests started failing on the HIP/ROCm platform, after the following commit `e7b8643d70` ``` In file included from /home/rocm-user/eigen/test/main.h:360: In file included from /home/rocm-user/eigen/Eigen/QR:11: In file included from /home/rocm-user/eigen/Eigen/Core:162: /home/rocm-user/eigen/Eigen/src/Core/util/Meta.h:300:17: error: constexpr function never produces a constant expression [-Winvalid-constexpr] static float (max)() { ^ /home/rocm-user/eigen/Eigen/src/Core/util/Meta.h:304:12: note: non-constexpr function '__int_as_float' cannot be used in a constant expression return HIPRT_MAX_NORMAL_F; ^ /home/rocm-user/eigen/Eigen/src/Core/arch/HIP/hcc/math_constants.h:14:28: note: expanded from macro 'HIPRT_MAX_NORMAL_F' #define HIPRT_MAX_NORMAL_F __int_as_float(0x7f7fffff) ^ /opt/rocm/hip/include/hip/hcc_detail/device_functions.h:913:32: note: declared here __device__ static inline float __int_as_float(int x) { ^ ``` The problem seems to that some of the constants defined in the HIP `math_constants.h` have a call to `__int_as_float` routine which is not declared `constexpr` in the HIP runtime header file. Working around this issue for now, be skipping the const_expr support (enabled via the above commit) on HIP	2021-03-25 13:45:52 +00:00
Chip Kerchner	d59ef212e1	Fixed performance issues for complex VSX and P10 MMA in gebp_kernel (level 3).	2021-03-25 11:08:19 +00:00
Steve Bronder	e7b8643d70	Revert "Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()"" This reverts commit `5f0b4a4010`.	2021-03-24 18:14:56 +00:00
Antonio Sanchez	5521c65afb	Eliminate mixingtypes_7 warning. `g_called` is not used in subtest 7, so was generating a `-Wunneeded-internal-declaration` warnings. Here we silence it by initializing the static variable.	2021-03-24 11:05:41 -07:00
Christoph Hertzberg	69a4f70956	Revert "Uses _mm512_abs_pd for Packet8d pabs" This reverts commit `f019b97aca`	2021-03-23 18:52:19 +00:00
David Tellenbach	824272cde8	Re-enable CI for Power	2021-03-22 19:28:25 +01:00
David Tellenbach	4811e81966	Remove yet another comma at end of enum	2021-03-18 23:30:00 +01:00
Steve Bronder	f019b97aca	Uses _mm512_abs_pd for Packet8d pabs	2021-03-18 15:47:52 +00:00
David Tellenbach	0cc9b5eb40	Split test commainitializer into two substests	2021-03-18 13:28:51 +01:00
Antonio Sanchez	c3fbc6cec7	Use singleton pattern for static registered tests. The original fails with nvcc+msvc - there's a static order of initialization issue leading to registered tests being cleared. The test then fails on ``` VERIFY(EigenTest::all().size()>0); ``` since `EigenTest` no longer contains any tests. The singleton pattern fixes this.	2021-03-18 00:56:31 +00:00
Niek Bouman	ed964ba3f1	Proposed fix for issue #2187	2021-03-18 00:55:36 +00:00
Antonio Sanchez	8dfe1029a5	Augment NumTraits with min/max_exponent() again. Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase where possible. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. The previous MR !443 failed for c++03 due to lack of `constexpr`. Because of this, we need to keep around the `std::numeric_limits` version in enum expressions until the switch to c++11. Fixes #2148	2021-03-16 20:12:46 -07:00
David Tellenbach	eb71e5db98	Fix another warning on missing commas	2021-03-17 03:07:04 +01:00
David Tellenbach	df4bc2731c	Revert "Augment NumTraits with min/max_exponent()." This reverts commit `75ce9cd2a7`.	2021-03-17 03:06:08 +01:00
Antonio Sanchez	75ce9cd2a7	Augment NumTraits with min/max_exponent(). Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. Fixes #2148	2021-03-17 01:00:41 +00:00
David Tellenbach	9fb7062440	Silence warning on comma at end of enumerator list	2021-03-17 01:46:52 +01:00
Theo Fletcher	b8502a9dd6	Updated SelfAdjointEigenSolver documentation to include that the eigenvectors matrix is unitary.	2021-03-16 18:48:02 +00:00
Rasmus Munk Larsen	2e83cbbba9	Add NaN propagation options to minCoeff/maxCoeff visitors.	2021-03-16 17:02:50 +00:00
Jens Wehner	c0a889890f	Fixed output of complex matrices	2021-03-15 21:51:55 +00:00
Antonio Sanchez	f612df2736	Add fmod(half, half). This is to support TensorFlow's `tf.math.floormod` for half.	2021-03-15 13:32:24 -07:00
Antonio Sanchez	14b7ebea11	Fix numext::round pre c++11 for large inputs. This is to resolve an issue for large inputs when +0.5 can actually lead to +1 if the input doesn't have enough precision to resolve the addition - leading to an off-by-one error. See discussion on `9a663973`.	2021-03-15 19:08:04 +00:00
Chip Kerchner	c9d4367fa4	Fix pround and add print	2021-03-15 19:07:43 +00:00
Antonio Sanchez	d24f9f9b55	Fix NVCC+ICC issues. NVCC does not understand `__forceinline`, so we need to use `inline` when compiling for GPU. ICC specializes `std::complex` operators for `float` and `double` by default, which cannot be used on device and conflict with Eigen's workaround in CUDA/Complex.h. This can be prevented by defining `_OVERRIDE_COMPLEX_SPECIALIZATION_` before including `<complex>`. Added this define to the tests and to `Eigen/Core`, but this will not work if the user includes `<complex>` before `<Eigen/Core>`. ICC also seems to generate a duplicate `Map` symbol in `PlainObjectBase`: ``` error: "Map" has already been declared in the current scope static ConstMapType Map(const Scalar *data) ``` I tracked this down to `friend class Eigen::Map`. Putting the `friend` statements at the bottom of the class seems to resolve this issue. Fixes #2180	2021-03-15 18:42:04 +00:00
Antonio Sanchez	14487ed14e	Add increment/decrement operators to Eigen::half. This is for consistency with bfloat16, and to support initialization with `std::iota`.	2021-03-15 10:52:23 -07:00
Antonio Sanchez	b271110788	Bump up rand histogram threshold. The previous one sometimes fails for MSVC which has a poor random number generator. Fixes #2182	2021-03-10 22:17:03 -08:00
Antonio Sanchez	d098c4d64c	Disable EIGEN_OPTIMIZATION_BARRIER for PPC clang. Doesn't seem to correctly select the register type, and most types lead to compiler crashes.	2021-03-10 16:05:01 -08:00
Antonio Sanchez	543e34ab9d	Re-implement move assignments. The original swap approach leads to potential undefined behavior (reading uninitialized memory) and results in unnecessary copying of data for static storage. Here we pass down the move assignment to the underlying storage. Static storage does a one-way copy, dynamic storage does a swap. Modified the tests to no longer read from the moved-from matrix/tensor, since that can lead to UB. Added a test to ensure we do not access uninitialized memory in a move. Fixes: #2119	2021-03-10 16:55:20 +00:00
Ben Niu	b8d1857f0d	[MSVC-specific] Define EIGEN_ARCH_x86_64 for native x64 (_M_X64 is defined and _M_ARM64EC is not), and define EIGEN_ARCH_ARM64 for both the native ARM64 (_M_ARM64 is defined) or ARM64EC (_M_ARM64EC is defined). _M_ARM64EC is defined when the code is compiled by MSVC for ARM64EC, a new ARM64 ABI designed to be compatible with x64 application emulation on ARM64. If _M_ARM64EC is defined, _M_X64 and _M_AMD64 are also defined, so x64-specific code (especially intrinsics) is also compiled to ARM64 instructions (compliant with the ARM64EC ABI) for maximum x64 compatibility. Although a majority of x64-specific intrinsics can emulated by ARM64 instructions, it is still a good to simply recompile the native ARM64 code paths to ARM64EC for pure computation tasks, for performance reasons.	2021-03-10 10:21:31 +00:00
Antonio Sanchez	853a5c4b84	Fix ambiguous call to CUDA __half constructor.	2021-03-08 21:06:28 -08:00
Antonio Sanchez	94327dbfba	Fix typo: DEVICE -> GPU	2021-03-08 11:21:00 -08:00
Antonio Sanchez	1296abdf82	Fix non-trivial Half constructor for CUDA. Both CUDA and HIP require trivial default constructors for types used in shared memory. Otherwise failing with ``` error: initialization is not supported for __shared__ variables. ```	2021-03-08 07:32:54 -08:00
Antonio Sanchez	6045243141	Revert stack allocation limit change that crept in. This was accidentally introduced when copying changes between repos.	2021-03-05 14:29:37 -08:00
Deven Desai	1a96d49afe	Changing the Eigen::half implementation for HIP Currently, when compiling with HIP, Eigen::half is derived from the `__half_raw` struct that is defined within the hip_fp16.h header file. This is true for both the "host" compile phase and the "device" compile phase. This was causing a very hard to detect bug in the ROCm TensorFlow build. In the ROCm Tensorflow build, * files that do not contain ant GPU code get compiled via gcc, and * files that contnain GPU code get compiled via hipcc. In certain case, we have a function that is defined in a file that is compiled by hipcc, and is called in a file that is compiled by gcc. If such a function had Eigen::half has a "pass-by-value" argument, its value was getting corrupted, when received by the function. The reason for this seems to be that for the gcc compile, Eigen::half is derived from a `__half_raw` struct that has `uint16_t` as the data-store, and for hipcc the `__half_raw` implementation uses `_Float16` as the data store. There is some ABI incompatibility between gcc / hipcc (which is essentially latest clang), which results in the Eigen::half value (which is correct at the call-site) getting randomly corrupted when passed to the function. Changing the Eigen::half argument to be "pass by reference" seems to workaround the error. In order to fix it such that we do not run into it again in TF, this commit changes the Eigne::half implementation to use the same `__half_raw` implementation as the non-GPU compile, during host compile phase of the hipcc compile.	2021-03-05 19:27:13 +00:00
Antonio Sanchez	2468253c9a	Define EIGEN_CPLUSPLUS and replace most __cplusplus checks. The macro `__cplusplus` is not defined correctly in MSVC unless building with the the `/Zc:__cplusplus` flag. Instead, it defines `_MSVC_LANG` to the specified c++ standard version number. Here we introduce `EIGEN_CPLUSPLUS` which will contain the c++ version number both for MSVC and otherwise. This simplifies checks for supported features. Also replaced most instances of standard version checking via `__cplusplus` with the existing `EIGEN_COMP_CXXVER` macro for better clarity. Fixes: #2170	2021-03-05 18:33:18 +00:00
Antonio Sanchez	82d61af3a4	Fix rint SSE/NEON again, using optimization barrier. This is a new version of !423, which failed for MSVC. Defined `EIGEN_OPTIMIZATION_BARRIER(X)` that uses inline assembly to prevent operations involving `X` from crossing that barrier. Should work on most `GNUC` compatible compilers (MSVC doesn't seem to need this). This is a modified version adapted from what was used in `psincos_float` and tested on more platforms (see #1674, https://godbolt.org/z/73ezTG). Modified `rint` to use the barrier to prevent the add/subtract rounding trick from being optimized away. Also fixed an edge case for large inputs that get bumped up a power of two and ends up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.	2021-03-05 08:54:12 -08:00
David Tellenbach	5f0b4a4010	Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()" This reverts commit `6cbb3038ac` because it breaks clang-10 builds on x86 and aarch64 when C++11 is enabled.	2021-03-05 13:16:43 +01:00
Steve Bronder	6cbb3038ac	Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()	2021-03-04 18:58:08 +00:00
David Tellenbach	5bfc67f9e7	Deactive CI for Power due to problems with GitLab runner	2021-03-04 17:33:40 +01:00
Eugene Zhulenev	a6601070f2	Add log2 operation to TensorBase	2021-03-04 00:13:36 +00:00
Antonio Sánchez	9a663973b4	Revert "Fix rint for SSE/NEON." This reverts commit `e72dfeb8b9`	2021-03-03 18:51:51 +00:00
Antonio Sanchez	e72dfeb8b9	Fix rint for SSE/NEON. It seems sometimes with aggressive optimizations the combination `psub(padd(a, b), b)` trick to force rounding is compiled away. Here we replace with inline assembly to prevent this (I tried `volatile`, but that leads to additional loads from memory). Also fixed an edge case for large inputs `a` where adding `b` bumps the value up a power of two and ends up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.	2021-03-03 09:41:46 -08:00
Christoph Hertzberg	199c5f2b47	geo_alignedbox_5 was failing with AVX enabled, due to storing `Vector4d` in a `std::vector` without using an aligned allocator. Got rid of using `std::vector` and simplified the code. Avoid leading `_`	2021-03-01 03:59:21 +01:00
Antonio Sanchez	1e0c7d4f49	Add print for SSE/NEON, use NEON rounding intrinsics if available. In SSE, by adding/subtracting 2^MantissaBits, we force rounding according to the current rounding mode. For NEON, we use the provided intrinsics for rint/floor/ceil if available (armv8). Related to #1969.	2021-02-27 22:42:07 +00:00
David Tellenbach	976ae0ca6f	Document that using raw function pointers doesn't work with unaryExpr.	2021-02-27 22:58:42 +01:00

1 2 3 4 5 ...

11370 Commits