eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	5c73c9223a	Fix shadowing typedefs	2018-07-12 17:01:07 +02:00
Gael Guennebaud	98728312c8	Fix compilation regarding std::array	2018-07-12 17:00:37 +02:00
Gael Guennebaud	eb3d8f68bb	fix unused warning	2018-07-12 16:59:47 +02:00
Gael Guennebaud	006e18e52b	Cleanup the mess in Eigen/Core by moving CUDA/HIP stuff at more appropriate places (Macros.h), and alignment/vectorization logic is now in util/ConfigureVectorization.h	2018-07-12 16:57:41 +02:00
Thales Sabino	9a6a43319f	Fix cxx11_tensor_fft not building on Windows. The type used in Eigen::DSizes needs to be at least 8 bytes long. Internally Tensor tries to convert this to an __int64 on Windows and this fails to build. On Linux, long and long long are both 8 byte integer types. * * * Changing from "long long" to "std::int64_t".	2018-07-12 11:20:59 +01:00
Gael Guennebaud	b347eb0b1c	Fix doc	2018-07-12 11:56:18 +02:00
Mark D Ryan	e79c5149bf	Fix AVX512 implementations of psqrt This commit fixes the AVX512 implementations of psqrt in the same way that `3ed67cb0bb` fixed the AVX2 version of this function. The AVX512 versions of psqrt incorrectly return -0.0 for negative values, instead of NaN. Fixing the issues requires adding some additional instructions that slow down the algorithms. A similar test to the one used in `3ed67cb0bb` shows that the corrected Packet16f code runs at 73% of the speed of the existing code, while the corrected Packed8d function runs at 68% of the original.	2018-06-25 05:05:02 -07:00
Yuefeng Zhou	1eff6cf8a7	Use device's allocate function instead of internal::aligned_malloc. This would make it easier to track memory usage in device instances.	2018-02-20 16:50:05 -08:00
Gael Guennebaud	adb134d47e	Fix implicit conversion from 0.0 to scalar	2018-02-16 22:26:01 +04:00
Gael Guennebaud	937ad18221	add unit test for SimplicialCholesky and Boost multiprec.	2018-02-16 22:25:11 +04:00
Julian Kent	6d451cf2b6	Add missing consts for rows and cols functions in SparseLU	2018-02-10 13:44:05 +01:00
Daniele E. Domenichelli	a12b8a8c75	FindEigen3: Set Eigen3_FOUND variable	2018-07-11 16:31:50 +02:00
Gael Guennebaud	8bdb214fd0	remove double ;;	2018-07-12 11:17:53 +02:00
Gael Guennebaud	a9060378d3	bug #1570 : fix warning	2018-07-12 11:07:09 +02:00
Gael Guennebaud	6cd6551b26	Add deprecated header files for TensorFlow	2018-07-12 10:50:53 +02:00
Gael Guennebaud	da0c604078	Merged in deven-amd/eigen (pull request PR-402) Adding support for using Eigen in HIP kernels.	2018-07-12 08:07:16 +00:00
Gael Guennebaud	a4ea611ca7	Remove useless specialization thanks to is_convertible being more robust.	2018-07-12 09:59:44 +02:00
Gael Guennebaud	8a40dda5a6	Add some basic unit-tests	2018-07-12 09:59:00 +02:00
Gael Guennebaud	8ef267ccbd	spellcheck	2018-07-12 09:58:29 +02:00
Gael Guennebaud	21cf4a1a8b	Make is_convertible more robust and conformant to std::is_convertible	2018-07-12 09:57:19 +02:00
Gael Guennebaud	8a5955a052	Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.	2018-07-11 17:16:50 +02:00
Gael Guennebaud	d193cc87f4	Fix regression in `9357838f94`	2018-07-11 17:09:23 +02:00
Gael Guennebaud	fb33687736	Fix double ;;	2018-07-11 17:08:30 +02:00
Deven Desai	876f392c39	Updates corresponding to the latest round of PR feedback The major changes are 1. Moving CUDA/PacketMath.h to GPU/PacketMath.h 2. Moving CUDA/MathFunctions.h to GPU/MathFunction.h 3. Moving CUDA/CudaSpecialFunctions.h to GPU/GpuSpecialFunctions.h The above three changes effectively enable the Eigen "Packet" layer for the HIP platform 4. Merging the "hip_basic" and "cuda_basic" unit tests into one ("gpu_basic") 5. Updating the "EIGEN_DEVICE_FUNC" marking in some places The change has been tested on the HIP and CUDA platforms.	2018-07-11 10:39:54 -04:00
Deven Desai	1fe0b74904	deleting hip specific files that are no longer required	2018-07-11 09:28:44 -04:00
Deven Desai	dec47a6493	renaming CUDA* to GPU* for some header files	2018-07-11 09:26:54 -04:00
Deven Desai	471cfe5ff7	renaming CUDA* to GPU* for some header files	2018-07-11 09:22:04 -04:00
Deven Desai	38807a2575	merging updates from upstream	2018-07-11 09:17:33 -04:00
Gael Guennebaud	f00d08cc0a	Optimize extraction of Q in SparseQR by exploiting the structure of the identity matrix.	2018-07-11 14:01:47 +02:00
Gael Guennebaud	1625476091	Add internall::is_identity compile-time helper	2018-07-11 14:00:24 +02:00
Gael Guennebaud	fe723d6129	Fix conversion warning	2018-07-10 09:10:32 +02:00
Gael Guennebaud	9357838f94	bug #1543 : improve linear indexing for general block expressions	2018-07-10 09:10:15 +02:00
Gael Guennebaud	de9e31a06d	Introduce the macro ei_declare_local_nested_eval to help allocating on the stack local temporaries via alloca, and let outer-products makes a good use of it. If successful, we should use it everywhere nested_eval is used to declare local dense temporaries.	2018-07-09 15:41:14 +02:00
Gael Guennebaud	6190aa5632	bug #1567 : add optimized path for tensor broadcasting and 'Channel First' shape	2018-07-09 11:23:16 +02:00
Gael Guennebaud	ec323b7e66	Skip null numerators in triangular-vector-solve (as in BLAS TRSV).	2018-07-09 11:13:19 +02:00
Gael Guennebaud	359dd77ec3	Fix legitimate "declaration shadows a typedef" warning	2018-07-09 11:03:39 +02:00
Deven Desai	e2b2c61533	merging from master	2018-06-20 16:47:45 -04:00
Deven Desai	1bb6fa99a3	merging the CUDA and HIP implementation for the Tensor directory and the unit tests	2018-06-20 16:44:58 -04:00
Deven Desai	cfdabbcc8f	removing the *Hip files from the unsupported/Eigen/CXX11/src/Tensor and unsupported/test directories	2018-06-20 12:57:02 -04:00
Deven Desai	7e41c8f1a9	renaming Cuda files to Gpu in the unsupported/Eigen/CXX11/src/Tensor and unsupported/test directories	2018-06-20 12:52:30 -04:00
Deven Desai	ee73ae0a80	Merged eigen/eigen into default	2018-06-20 12:37:11 -04:00
Mark D Ryan	90a53ca6fd	Fix the Packet16h version of ptranspose The AVX512 version of ptranpose for PacketBlock<Packet16h,16> was reordering the PacketBlock argument incorrectly. This lead to errors in the multiplication of matrices composed of 16 bit floats on AVX512 machines, if at least of the matrices was using RowMajor order. This error is responsible for one tensorflow unit test failure on AVX512 machines: //tensorflow/python/kernel_tests:batch_matmul_op_test	2018-06-16 15:13:06 -07:00
Gael Guennebaud	1f54164eca	Fix a few issues with Packet16h	2018-07-07 00:15:07 +02:00
Gael Guennebaud	f2dc048df9	complete implementation of Packet16h (AVX512)	2018-07-06 17:43:11 +02:00
Gael Guennebaud	a937c50208	palign is not used anymore, so let's relax the unit test	2018-07-06 17:41:52 +02:00
Gael Guennebaud	56a33ae57d	test product kernel with half-floats.	2018-07-06 17:14:04 +02:00
Gael Guennebaud	f4d623ffa7	Complete Packet8h implementation and test it in packetmath unit test	2018-07-06 17:13:36 +02:00
Gael Guennebaud	a8ab6060df	Add unitests for inverse and selfadjoint-eigenvalues on CUDA	2018-07-06 09:58:45 +02:00
Gael Guennebaud	b8271bb368	fix md5sum of lapack_addons	2018-06-15 14:21:29 +02:00
Deven Desai	b6cc0961b1	updates based on PR feedback There are two major changes (and a few minor ones which are not listed here...see PR discussion for details) 1. Eigen::half implementations for HIP and CUDA have been merged. This means that - `CUDA/Half.h` and `HIP/hcc/Half.h` got merged to a new file `GPU/Half.h` - `CUDA/PacketMathHalf.h` and `HIP/hcc/PacketMathHalf.h` got merged to a new file `GPU/PacketMathHalf.h` - `CUDA/TypeCasting.h` and `HIP/hcc/TypeCasting.h` got merged to a new file `GPU/TypeCasting.h` After this change the `HIP/hcc` directory only contains one file `math_constants.h`. That will go away too once that file becomes a part of the HIP install. 2. new macros EIGEN_GPUCC, EIGEN_GPU_COMPILE_PHASE and EIGEN_HAS_GPU_FP16 have been added and the code has been updated to use them where appropriate. - `EIGEN_GPUCC` is the same as `(EIGEN_CUDACC \|\| EIGEN_HIPCC)` - `EIGEN_GPU_DEVICE_COMPILE` is the same as `(EIGEN_CUDA_ARCH \|\| EIGEN_HIP_DEVICE_COMPILE)` - `EIGEN_HAS_GPU_FP16` is the same as `(EIGEN_HAS_CUDA_FP16 or EIGEN_HAS_HIP_FP16)`	2018-06-14 10:21:54 -04:00

1 2 3 4 5 ...

9816 Commits