eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Rasmus Munk Larsen	1239adfcab	Remove -fabi-version=6 flag from AVX512 builds. It was added to fix builds with gcc 4.9, but these don't even work today, and the flag breaks compilation with newer versions of gcc.	2021-09-16 16:16:47 -07:00
Rasmus Munk Larsen	6cadab6896	Clean up EIGEN_STATIC_ASSERT to only use standard c++11 static_assert.	2021-09-16 20:43:54 +00:00
Rasmus Munk Larsen	7b975acb1f	Remove unused variable.	2021-09-16 20:27:13 +00:00
Rasmus Munk Larsen	92849d814b	Remove unused variable.	2021-09-16 20:21:31 +00:00
Rasmus Munk Larsen	da027fa20a	Remove unused variable.	2021-09-16 20:02:42 +00:00
Ryan Pavlik	3c87d6b662	Fix typos in copyright dates (cherry picked from commit `3335e0767c`)	2021-09-15 20:49:43 +00:00
Antonio Sanchez	cb50730993	Default eigen_packet_wrapper constructor. This makes it trivial, allowing use of `memcpy`. Fixes #2326	2021-09-14 10:57:22 -07:00
Rohit Santhanam	a751225845	Minor fix for compilation error on HIP.	2021-09-12 14:06:58 +00:00
Antonio Sanchez	2e31570c16	Fix tuple_test after gpu_test_helper update. Duplicating the namespace `tuple_impl` caused a conflict with the `arch/GPU/Tuple.h` definitions for the `tuple_test`. We can't just use `Eigen::internal` either, since there exists a different `Eigen::internal::get`. Renaming the namespace to `test_detail` fixes the issue.	2021-09-11 20:24:42 -07:00
Antonio Sanchez	d06c639667	Fix unused variable warning and unnecessessary gpuFree.	2021-09-11 20:02:22 -07:00
Antonio Sanchez	bf66137efc	New GPU test utilities. This introduces new functions: ``` // returns kernel(args...) running on the CPU. Eigen::run_on_cpu(Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU. Eigen::run_on_gpu(Kernel kernel, Args&&... args); Eigen::run_on_gpu_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU if using // a GPU compiler, or CPU otherwise. Eigen::run(Kernel kernel, Args&&... args); Eigen::run_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); ``` Running on the GPU is accomplished by: - Serializing the kernel inputs on the CPU - Transferring the inputs to the GPU - Passing the kernel and serialized inputs to a GPU kernel - Deserializing the inputs on the GPU - Running `kernel(inputs...)` on the GPU - Serializing all output parameters and the return value - Transferring the serialized outputs back to the CPU - Deserializing the outputs and return value on the CPU - Returning the deserialized return value All inputs must be serializable (currently POD types, `Eigen::Matrix` and `Eigen::Array`). The kernel must also be POD (though usually contains no actual data). Tested on CUDA 9.1, 10.2, 11.3, with g++-6, g++-8, g++-10 respectively. This MR depends on !622, !623, !624.	2021-09-10 14:22:50 -07:00
Rasmus Munk Larsen	d7d0bf832d	Issue an error in case of direct inclusion of internal headers.	2021-09-10 19:12:26 +00:00
Maxiwell S. Garcia	36402e281d	ci: ppc64le: disable MMA for gcc-10 This patch disables MMA for CI because the building environment is using Ubuntu 18.04 image with LD 2.30. This linker version together with gcc-10 causes some 'unrecognized opcode' errors.	2021-09-09 12:18:07 -05:00
Antonio Sanchez	6c10495a78	Remove unnecessary std::tuple reference.	2021-09-09 15:49:44 +00:00
Antonio Sanchez	26e5beb8cb	Device-compatible Tuple implementation. An analogue of `std::tuple` that works on device. Context: I've tried `std::tuple` in various versions of NVCC and clang, and although code seems to compile, it often fails to run - generating "illegal memory access" errors, or "illegal instruction" errors. This replacement does work on device.	2021-09-08 13:34:19 -07:00
Antonio Sanchez	fcd73b4884	Add a simple serialization mechanism. The `Serializer<T>` class implements a binary serialization that can write to (`serialize`) and read from (`deserialize`) a byte buffer. Also added convenience routines for serializing a list of arguments. This will mainly be for testing, specifically to transfer data to and from the GPU.	2021-09-08 09:38:59 -07:00
Huang, Zhaoquan	558b3d4fb8	Add LLDB Pretty Printer	2021-09-07 17:28:24 +00:00
Antonio Sanchez	7792b1e909	Fix AVX2 PacketMath.h. There were a couple typos ps -> epi32, and an unaligned load issue.	2021-09-03 19:47:57 +00:00
Antonio Sanchez	5bf35383e0	Disable MSVC constant condition warning. We use extensive use of `if (CONSTANT)`, and cannot use c++17's `if constexpr`.	2021-09-03 11:07:18 -07:00
Antonio Sanchez	def145547f	Add missing packet types in pset1 call. Oops, introduced this when "fixing" integer packets.	2021-09-02 16:21:07 -07:00
Antonio Sanchez	eea2a3385c	Remove more DynamicSparseMatrix references. Also fixed some typos in SparseExtra/MarketIO.h.	2021-09-02 15:36:47 -07:00
Antonio Sanchez	3b48a3b964	Remove stray DynamicSparseMatrix references. DynamicSparseMatrix has been removed. These shouldn't be here anymore.	2021-09-02 19:47:26 +00:00
Antonio Sanchez	ebd4b17d2f	Fix tridiagonalization_inplace_selector. The `Options` of the new `hCoeffs` vector do not necessarily match those of the `MatrixType`, leading to build errors. Having the `CoeffVectorType` be a template parameter relieves this restriction.	2021-09-02 12:23:27 -07:00
Sergiu Deitsch	bf426faf93	cmake: populate package registry by default	2021-09-02 17:36:01 +00:00
Jens Wehner	8286073c73	Matrixmarket extension	2021-09-02 17:23:33 +00:00
Sergiu Deitsch	e8beb4b990	cmake: use ARCH_INDEPENDENT versioning if available	2021-09-02 16:08:58 +00:00
Sergiu Deitsch	7bc90cee7d	cmake: remove unused interface definitions	2021-09-02 15:41:56 +02:00
Antonio Sanchez	998bab4b04	Missing EIGEN_DEVICE_FUNCs to get `gpu_basic` passing with CUDA 9. CUDA 9 seems to require labelling defaulted constructors as `EIGEN_DEVICE_FUNC`, despite giving warnings that such labels are ignored. Without these labels, the `gpu_basic` test fails to compile, with errors about calling `__host__` functions from `__host__ __device__` functions.	2021-09-01 19:49:53 -07:00
Antonio Sanchez	74da2e6821	Rename Tuple -> Pair. This is to make way for a new `Tuple` class that mimics `std::tuple`, but can be reliably used on device and with aligned Eigen types. The existing Tuple has very few references, and is actually an analogue of `std::pair`.	2021-09-02 02:20:54 +00:00
Antonio Sanchez	3d4ba855e0	Fix AVX integer packet issues. Most are instances of AVX2 functions not protected by `EIGEN_VECTORIZE_AVX2`. There was also a missing semi-colon for AVX512.	2021-09-01 14:14:43 -07:00
Sergiu Deitsch	f2984cd077	cmake: remove deprecated package config variables	2021-09-01 19:05:51 +02:00
Maxiwell S. Garcia	09fc0f97b5	Rename 'vec_all_nan' of cxx11_tensor_expr test because this symbol is used by altivec.h	2021-09-01 16:42:51 +00:00
Antonio Sanchez	3a6296d4f1	Fix EIGEN_OPTIMIZATION_BARRIER for arm-clang. Clang doesn't like !621, needs the "g" constraint back. The "g" constraint also works for GCC >= 5. This fixes our gitlab CI.	2021-09-01 09:19:55 -07:00
jenswehner	a443a2373f	updated documentation	2021-08-31 22:58:28 +00:00
Antonio Sanchez	ff07a8a639	GCC 4.8 arm EIGEN_OPTIMIZATION_BARRIER fix (#2315 ). GCC 4.8 doesn't seem to like the `g` register constraint, failing to compile with "error: 'asm' operand requires impossible reload". Tested `r` instead, and that seems to work, even with latest compilers. Also fixed some minor macro issues to eliminate warnings on armv7. Fixes #2315.	2021-08-31 20:20:47 +00:00
Antonio Sanchez	cc3573ab44	Disable cuda Eigen::half vectorization on host. All cuda `__half` functions are device-only in CUDA 9, including conversions. Host-side conversions were added in CUDA 10. The existing code doesn't build prior to 10.0. All arithmetic functions are always device-only, so there's therefore no reason to use vectorization on the host at all. Modified the code to disable vectorization for `__half` on host, which required also updating the `TensorReductionGpu` implementation which previously made assumptions about available packets.	2021-08-31 19:13:12 +00:00
Adam Kallai	1415817d8d	win: include intrin header in Windows on ARM intrin header is needed for _BitScanReverse and _BitScanReverse64	2021-08-31 10:57:34 +02:00
Rasmus Munk Larsen	6f429a202d	Add missing dependency on LAPACK test suite binaries to target `buildtests`, so `make check` will work correctly when `EIGEN_ENABLE_LAPACK_TESTS` is `ON`.	2021-08-30 21:49:08 +00:00
Rasmus Munk Larsen	7e096ddcb0	Allow old Fortran code for LAPACK tests to compile despite argument mismatch errors (REAL passed to COMPLEX workspace argument) with GNU Fortran 10.	2021-08-30 19:47:30 +00:00
Turing Eret	3324389f6d	Add EIGEN_TENSOR_PLUGIN support per issue #2052 .	2021-08-30 19:36:55 +00:00
Antonio Sanchez	5db9e5c779	Fix fix<N> when variable templates are not supported. There were some typos that checked `EIGEN_HAS_CXX14` that should have checked `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES`, causing a mismatch in some of the `Eigen::fix<N>` assumptions. Also fixed the `symbolic_index` test when `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES` is 0. Fixes #2308	2021-08-30 08:06:55 -07:00
Jens Wehner	53ad9c75b4	included unordered_map header	2021-08-27 16:53:28 +00:00
Kolja Brix	a032397ae4	Fix PEP8 and formatting issues in GDB pretty printer.	2021-08-26 15:22:28 +00:00
jenswehner	9abf4d0bec	made RandomSetter C++11 compatible	2021-08-25 20:24:55 +00:00
Antonio Sanchez	eeacbd26c8	Bump CMake files to at least c++11. Removed all configurations that explicitly test or set the c++ standard flags. The only place the standard is now configured is at the top of the main `CMakeLists.txt` file, which can easily be updated (e.g. if we decide to move to c++14+). This can also be set via command-line using ``` > cmake -DCMAKE_CXX_STANDARD 14 ``` Kept the `EIGEN_TEST_CXX11` flag for now - that still controls whether to build/run the `cxx11_*` tests. We will likely end up renaming these tests and removing the `CXX11` subfolder.	2021-08-25 20:07:48 +00:00
Jakub Lichman	dc5b1f7d75	AVX512 and AVX2 support for Packet16i and Packet8i added	2021-08-25 19:38:23 +00:00
Han-Kuan Chen	ab28419298	optimize predux if architecture is aarch64	2021-08-25 19:18:54 +00:00
Antonio Sanchez	4011e4d258	Remove c++11-off CI jobs. This is step 1 in preparing to transition beyond c++03.	2021-08-24 17:42:39 +00:00
jenswehner	90b3b6b572	added doxygen flowchart	2021-08-24 17:11:51 +00:00
jenswehner	d85de1ef56	removed sparse dynamic matrix	2021-08-24 10:33:00 +02:00

1 2 3 4 5 ...

11551 Commits