eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Antonio Sanchez	86c0decc48	Disable more NVCC warnings. The 2979 warning is yet another "calling a __host__ function from a __host__ device__ function. Although we probably should eventually address these, they are flooding the logs. Most of these are harmless since we only call the original from the host. In cases where these are actually called from device, an error is generated instead anyways. The 2977 warning is a bit strange - although the warning suggests the `__device__` annotation is ignored, this doesn't actually seem to be the case. Without the `__device__` declarations, the kernel actually fails to run when attempting to construct such objects. Again, these warnings are flooding the logs, so disabling for now.	2021-09-23 10:52:39 -07:00
Kolja Brix	afa616bc9e	Fix some typos found	2021-09-23 15:22:00 +00:00
Antonio Sanchez	76bb29c0c2	Add -mfma for AVX512DQ tests.	2021-09-22 14:06:29 -07:00
sciencewhiz	4b6036e276	fix various typos	2021-09-22 16:15:06 +00:00
Antonio Sanchez	3753e6a2b3	Add AVX512 test job to CI.	2021-09-21 15:11:31 -07:00
Antonio Sanchez	343847273d	Enable AVX512 testing.	2021-09-21 15:00:36 -07:00
Alexander Grund	b5eaa42695	Fix alias violation in BFloat16 reinterpret_cast between unrelated types is undefined behavior and leads to misoptimizations on some platforms. Use the safer (and faster) version via bit_cast	2021-09-20 10:37:50 +02:00
Alexander Karatarakis	4d622be118	[AutodiffScalar] Remove const when returning by value clang-tidy: Return type 'const T' is 'const'-qualified at the top level, which may reduce code readability without improving const correctness The types are somewhat long, but the affected return types are of the form: ``` const T my_func() { // } ``` Change to: ``` T my_func() { // } ```	2021-09-18 21:23:32 +00:00
Antonio Sanchez	f49217e52b	Fix implicit conversion warnings in tuple_test. Fixes #2329.	2021-09-17 19:40:22 -07:00
Rasmus Munk Larsen	5595cfd194	Run CI tests in parallel no available cores.	2021-09-17 22:35:22 +00:00
Antonio Sanchez	3c724c44cf	Fix strict aliasing bug causing product_small failure. Packet loading is skipped due to aliasing violation, leading to nullopt matrix multiplication. Fixes #2327.	2021-09-17 21:09:34 +00:00
Antonio Sanchez	9882aec279	Silence string overflow warning for GCC in initializer_list_construction test. This looks to be a GCC bug. It doesn't seem to reproduce is a smaller example, making it hard to isolate.	2021-09-17 18:33:50 +00:00
Rasmus Munk Larsen	5dac69ff0b	Added a macro to pass arguments to ctest, e.g. to run tests in parallel.	2021-09-17 18:33:12 +00:00
Antonio Sanchez	5dac0b53c9	Move Eigen::all,last,lastp1,lastN to Eigen::placeholders::. These names are so common, IMO they should not exist directly in the `Eigen::` namespace. This prevents us from using the `last` or `all` names for any parameters or local variables, otherwise spitting out warnings about shadowing or hiding the global values. Many external projects (and our own examples) also heavily use ``` using namespace Eigen; ``` which means these conflict with external libraries as well, e.g. `std::fill(first,last,value)`. It seems originally these were placed in a separate namespace `Eigen::placeholders`, which has since been deprecated. I propose to un-deprecate this, and restore the original locations. These symbols are also imported into `Eigen::indexing`, which additionally imports `fix` and `seq`. An alternative is to remove the `placeholders` namespace and stick with `indexing`. NOTE: this is an API-breaking change. Fixes #2321.	2021-09-17 10:21:42 -07:00
Rohit Santhanam	44da7a3b9d	Disable specific subtests that fail on HIP due to non-functional device side malloc/free (on HIP).	2021-09-17 16:19:03 +00:00
Antonio Sanchez	16f9a20a6f	Add buildtests_gpu and check_gpu to simplify GPU testing. This is in preparation of adding GPU tests to the CI, allowing us to limit building/testing of GPU-specific tests for a given GPU-capable runner. GPU tests are tagged with the label "gpu". The new targets ``` make buildtests_gpu make check_gpu ``` allow building and running only the gpu tests.	2021-09-17 00:48:57 +00:00
Rasmus Munk Larsen	1239adfcab	Remove -fabi-version=6 flag from AVX512 builds. It was added to fix builds with gcc 4.9, but these don't even work today, and the flag breaks compilation with newer versions of gcc.	2021-09-16 16:16:47 -07:00
Rasmus Munk Larsen	6cadab6896	Clean up EIGEN_STATIC_ASSERT to only use standard c++11 static_assert.	2021-09-16 20:43:54 +00:00
Rasmus Munk Larsen	7b975acb1f	Remove unused variable.	2021-09-16 20:27:13 +00:00
Rasmus Munk Larsen	92849d814b	Remove unused variable.	2021-09-16 20:21:31 +00:00
Rasmus Munk Larsen	da027fa20a	Remove unused variable.	2021-09-16 20:02:42 +00:00
Ryan Pavlik	3c87d6b662	Fix typos in copyright dates (cherry picked from commit `3335e0767c`)	2021-09-15 20:49:43 +00:00
Antonio Sanchez	cb50730993	Default eigen_packet_wrapper constructor. This makes it trivial, allowing use of `memcpy`. Fixes #2326	2021-09-14 10:57:22 -07:00
Rohit Santhanam	a751225845	Minor fix for compilation error on HIP.	2021-09-12 14:06:58 +00:00
Antonio Sanchez	2e31570c16	Fix tuple_test after gpu_test_helper update. Duplicating the namespace `tuple_impl` caused a conflict with the `arch/GPU/Tuple.h` definitions for the `tuple_test`. We can't just use `Eigen::internal` either, since there exists a different `Eigen::internal::get`. Renaming the namespace to `test_detail` fixes the issue.	2021-09-11 20:24:42 -07:00
Antonio Sanchez	d06c639667	Fix unused variable warning and unnecessessary gpuFree.	2021-09-11 20:02:22 -07:00
Antonio Sanchez	bf66137efc	New GPU test utilities. This introduces new functions: ``` // returns kernel(args...) running on the CPU. Eigen::run_on_cpu(Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU. Eigen::run_on_gpu(Kernel kernel, Args&&... args); Eigen::run_on_gpu_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU if using // a GPU compiler, or CPU otherwise. Eigen::run(Kernel kernel, Args&&... args); Eigen::run_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); ``` Running on the GPU is accomplished by: - Serializing the kernel inputs on the CPU - Transferring the inputs to the GPU - Passing the kernel and serialized inputs to a GPU kernel - Deserializing the inputs on the GPU - Running `kernel(inputs...)` on the GPU - Serializing all output parameters and the return value - Transferring the serialized outputs back to the CPU - Deserializing the outputs and return value on the CPU - Returning the deserialized return value All inputs must be serializable (currently POD types, `Eigen::Matrix` and `Eigen::Array`). The kernel must also be POD (though usually contains no actual data). Tested on CUDA 9.1, 10.2, 11.3, with g++-6, g++-8, g++-10 respectively. This MR depends on !622, !623, !624.	2021-09-10 14:22:50 -07:00
Rasmus Munk Larsen	d7d0bf832d	Issue an error in case of direct inclusion of internal headers.	2021-09-10 19:12:26 +00:00
Maxiwell S. Garcia	36402e281d	ci: ppc64le: disable MMA for gcc-10 This patch disables MMA for CI because the building environment is using Ubuntu 18.04 image with LD 2.30. This linker version together with gcc-10 causes some 'unrecognized opcode' errors.	2021-09-09 12:18:07 -05:00
Antonio Sanchez	6c10495a78	Remove unnecessary std::tuple reference.	2021-09-09 15:49:44 +00:00
Antonio Sanchez	26e5beb8cb	Device-compatible Tuple implementation. An analogue of `std::tuple` that works on device. Context: I've tried `std::tuple` in various versions of NVCC and clang, and although code seems to compile, it often fails to run - generating "illegal memory access" errors, or "illegal instruction" errors. This replacement does work on device.	2021-09-08 13:34:19 -07:00
Antonio Sanchez	fcd73b4884	Add a simple serialization mechanism. The `Serializer<T>` class implements a binary serialization that can write to (`serialize`) and read from (`deserialize`) a byte buffer. Also added convenience routines for serializing a list of arguments. This will mainly be for testing, specifically to transfer data to and from the GPU.	2021-09-08 09:38:59 -07:00
Huang, Zhaoquan	558b3d4fb8	Add LLDB Pretty Printer	2021-09-07 17:28:24 +00:00
Antonio Sanchez	7792b1e909	Fix AVX2 PacketMath.h. There were a couple typos ps -> epi32, and an unaligned load issue.	2021-09-03 19:47:57 +00:00
Antonio Sanchez	5bf35383e0	Disable MSVC constant condition warning. We use extensive use of `if (CONSTANT)`, and cannot use c++17's `if constexpr`.	2021-09-03 11:07:18 -07:00
Antonio Sanchez	def145547f	Add missing packet types in pset1 call. Oops, introduced this when "fixing" integer packets.	2021-09-02 16:21:07 -07:00
Antonio Sanchez	eea2a3385c	Remove more DynamicSparseMatrix references. Also fixed some typos in SparseExtra/MarketIO.h.	2021-09-02 15:36:47 -07:00
Antonio Sanchez	3b48a3b964	Remove stray DynamicSparseMatrix references. DynamicSparseMatrix has been removed. These shouldn't be here anymore.	2021-09-02 19:47:26 +00:00
Antonio Sanchez	ebd4b17d2f	Fix tridiagonalization_inplace_selector. The `Options` of the new `hCoeffs` vector do not necessarily match those of the `MatrixType`, leading to build errors. Having the `CoeffVectorType` be a template parameter relieves this restriction.	2021-09-02 12:23:27 -07:00
Sergiu Deitsch	bf426faf93	cmake: populate package registry by default	2021-09-02 17:36:01 +00:00
Jens Wehner	8286073c73	Matrixmarket extension	2021-09-02 17:23:33 +00:00
Sergiu Deitsch	e8beb4b990	cmake: use ARCH_INDEPENDENT versioning if available	2021-09-02 16:08:58 +00:00
Sergiu Deitsch	7bc90cee7d	cmake: remove unused interface definitions	2021-09-02 15:41:56 +02:00
Antonio Sanchez	998bab4b04	Missing EIGEN_DEVICE_FUNCs to get `gpu_basic` passing with CUDA 9. CUDA 9 seems to require labelling defaulted constructors as `EIGEN_DEVICE_FUNC`, despite giving warnings that such labels are ignored. Without these labels, the `gpu_basic` test fails to compile, with errors about calling `__host__` functions from `__host__ __device__` functions.	2021-09-01 19:49:53 -07:00
Antonio Sanchez	74da2e6821	Rename Tuple -> Pair. This is to make way for a new `Tuple` class that mimics `std::tuple`, but can be reliably used on device and with aligned Eigen types. The existing Tuple has very few references, and is actually an analogue of `std::pair`.	2021-09-02 02:20:54 +00:00
Antonio Sanchez	3d4ba855e0	Fix AVX integer packet issues. Most are instances of AVX2 functions not protected by `EIGEN_VECTORIZE_AVX2`. There was also a missing semi-colon for AVX512.	2021-09-01 14:14:43 -07:00
Sergiu Deitsch	f2984cd077	cmake: remove deprecated package config variables	2021-09-01 19:05:51 +02:00
Maxiwell S. Garcia	09fc0f97b5	Rename 'vec_all_nan' of cxx11_tensor_expr test because this symbol is used by altivec.h	2021-09-01 16:42:51 +00:00
Antonio Sanchez	3a6296d4f1	Fix EIGEN_OPTIMIZATION_BARRIER for arm-clang. Clang doesn't like !621, needs the "g" constraint back. The "g" constraint also works for GCC >= 5. This fixes our gitlab CI.	2021-09-01 09:19:55 -07:00
jenswehner	a443a2373f	updated documentation	2021-08-31 22:58:28 +00:00

1 2 3 4 5 ...

11567 Commits