eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Eugene Zhulenev	73ecb2c57d	Cleanup includes in Tensor module after switch to C++11 and above	2019-10-29 15:49:54 -07:00
Eugene Zhulenev	e7ed4bd388	Remove internal::smart_copy and replace with std::copy	2019-10-29 11:25:24 -07:00
Eugene Zhulenev	fbc0a9a3ec	Fix CXX11Meta compilation with MSVC	2019-10-28 18:30:10 -07:00
Eugene Zhulenev	bd864ab42b	Prevent potential ODR in TensorExecutor	2019-10-28 15:45:09 -07:00
Mehdi Goli	6332aff0b2	This PR fixes: * The specialization of array class in the different namespace for GCC<=6.4 * The implicit call to `std::array` constructor using the initializer list for GCC <=6.1	2019-10-23 15:56:56 +01:00
Rasmus Larsen	8e4e29ae99	Merged in deven-amd/eigen-hip-fix-191018 (pull request PR-738) Fix for the HIP build+test errors.	2019-10-22 22:18:38 +00:00
Rasmus Munk Larsen	97c0c5d485	Add block evaluation V2 to TensorAsyncExecutor. Add async evaluation to a number of ops.	2019-10-22 12:42:44 -07:00
Deven Desai	102cf2a72d	Fix for the HIP build+test errors. The errors were introduced by this commit : After the above mentioned commit, some of the tests started failing with the following error ``` Built target cxx11_tensor_reduction Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:117: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:155:5: error: the field type is not amp-compatible DestinationBufferKind m_kind; ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:211:3: error: the field type is not amp-compatible DestinationBuffer m_destination; ^ ``` For some reason HIPCC does not like device code to contain enum types which do not have the base-type explicitly declared. The fix is trivial, explicitly state "int" as the basetype	2019-10-22 19:21:27 +00:00
Rasmus Munk Larsen	668ab3fc47	Drop support for c++03 in Eigen tensor. Get rid of some code used to emulate c++11 functionality with older compilers.	2019-10-18 16:42:00 -07:00
Eugene Zhulenev	df0e8b8137	Propagate block evaluation preference through rvalue tensor expressions	2019-10-17 11:17:33 -07:00
Eugene Zhulenev	0d2a14ce11	Cleanup Tensor block destination and materialized block storage allocation	2019-10-16 17:14:37 -07:00
Eugene Zhulenev	02431cbe71	TensorBroadcasting support for random/uniform blocks	2019-10-16 13:26:28 -07:00
Eugene Zhulenev	d380c23b2c	Block evaluation for TensorGenerator/TensorReverse/TensorShuffling	2019-10-14 14:31:59 -07:00
Gael Guennebaud	39fb9eeccf	bug #1747 : fix compilation with MSVC	2019-10-14 22:50:23 +02:00
Eugene Zhulenev	a411e9f344	Block evaluation for TensorGenerator + TensorReverse + fixed bug in tensor reverse op	2019-10-10 10:56:58 -07:00
Rasmus Larsen	b03eb63d7c	Merged in ezhulenev/eigen-01 (pull request PR-726) Block evaluation for TensorChipping + fixed bugs in TensorPadding and TensorSlicing	2019-10-10 16:58:11 +00:00
Gael Guennebaud	e7d8ba747c	bug #1752 : make is_convertible equivalent to the std c++11 equivalent and fallback to std::is_convertible when c++11 is enabled.	2019-10-10 17:41:47 +02:00
Gael Guennebaud	fb557aec5c	bug #1752 : disable some is_convertible tests for recent compilers.	2019-10-10 11:40:21 +02:00
Eugene Zhulenev	33e1746139	Block evaluation for TensorChipping + fixed bugs in TensorPadding and TensorSlicing	2019-10-09 12:45:31 -07:00
Gael Guennebaud	f0a4642bab	Implement c++03 compatible fix for changeset `7a43af1a33`	2019-10-09 16:00:57 +02:00
Gael Guennebaud	196de2efe3	Explicitly bypass resize and memmoves when there is already the exact right number of elements available.	2019-10-08 21:44:33 +02:00
Gael Guennebaud	36da231a41	Disable an expected warning in unit test	2019-10-08 16:28:14 +02:00
Gael Guennebaud	d1def335dc	fix one more possible conflicts with real/imag	2019-10-08 16:19:10 +02:00
Gael Guennebaud	87427d2eaa	PR 719: fix real/imag namespace conflict	2019-10-08 09:15:17 +02:00
Gael Guennebaud	7a43af1a33	Fix compilation of FFTW unit test	2019-10-08 08:58:35 +02:00
Eugene Zhulenev	f74ab8cb8d	Add block evaluation to TensorEvalTo and fix few small bugs	2019-10-07 15:34:26 -07:00
Brian Zhao	3afb640b56	Fixing incorrect size in Tensor documentation.	2019-10-04 21:30:35 -07:00
Rasmus Munk Larsen	20c4a9118f	Use "pdiv" rather than operator/ to support packet types.	2019-10-04 16:54:03 -07:00
Rasmus Larsen	d1dd51cb5f	Merged in ezhulenev/eigen-01 (pull request PR-723) Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelect Approved-by: Rasmus Larsen <rmlarsen@google.com>	2019-10-04 17:19:13 +00:00
Eugene Zhulenev	98bdd7252e	Fix compilation warnings and errors with clang in TensorBlockV2 code and tests	2019-10-04 10:15:33 -07:00
Rasmus Munk Larsen	fab4e3a753	Address comments on Chebyshev evaluation code: 1. Use pmadd when possible. 2. Add casts to avoid c++03 warnings.	2019-10-02 12:48:17 -07:00
Eugene Zhulenev	60ae24ee1a	Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelect	2019-10-02 12:44:06 -07:00
Eugene Zhulenev	6e40454a6e	Add beta to TensorContractionKernel and make memset optional	2019-10-02 11:06:02 -07:00
Rasmus Munk Larsen	bd0fac456f	Prevent infinite loop in the nvcc compiler while unrolling the recurrent templates for Chebyshev polynomial evaluation.	2019-10-01 13:15:30 -07:00
Gael Guennebaud	9549ba8313	Fix perf issue in SimplicialLDLT::solve for complexes (again, m_diag is real)	2019-10-01 12:54:25 +02:00
Gael Guennebaud	c8b2c603b0	Fix speed issue with SimplicialLDLT for complexes: the diagonal is real!	2019-09-30 16:14:34 +02:00
Rasmus Munk Larsen	13ef08e5ac	Move implementation of vectorized error function erf() to SpecialFunctionsImpl.h.	2019-09-27 13:56:04 -07:00
Eugene Zhulenev	7c8bc0d928	Fix cxx11_tensor_block_io test	2019-09-25 11:48:11 -07:00
Eugene Zhulenev	0c845e28c9	Fix erf in c++03	2019-09-25 11:31:45 -07:00
Eugene Zhulenev	71d5bedf72	Fix compilation warnings and errors with clang in TensorBlockV2	2019-09-25 11:25:22 -07:00
Deven Desai	5e186b1987	Fix for the HIP build+test errors. The errors were introduced by this commit : `d38e6fbc27` After the above mentioned commit, some of the tests started failing with the following error ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:70: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsHalf.h:28:22: error: call to 'erf' is ambiguous return Eigen::half(Eigen::numext::erf(static_cast<float>(a))); ^~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1600:7: note: candidate function [with T = float] float erf(const float &x) { return ::erff(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = float] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:23: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:33: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ 3 errors generated. ``` This PR fixes the compile error by removing the "old" implementation for "erf" (assuming that the "new" implementation is what we want going forward. from a GPU point-of-view both implementations are the same). This PR also fixes what seems like a cut-n-paste error in the aforementioned commit	2019-09-25 15:39:13 +00:00
Eugene Zhulenev	f35b9ab510	Fix a bug in a packed block type in TensorContractionThreadPool	2019-09-24 16:54:36 -07:00
Rasmus Larsen	d38e6fbc27	Merged in rmlarsen/eigen (pull request PR-704) Add generic PacketMath implementation of the Error Function (erf).	2019-09-24 23:40:29 +00:00
Rasmus Munk Larsen	591a554c68	Add TODO to cleanup FMA cost modelling.	2019-09-24 16:39:25 -07:00
Eugene Zhulenev	c64396b4c6	Choose TensorBlock StridedLinearCopy type statically	2019-09-24 16:04:29 -07:00
Eugene Zhulenev	c97b208468	Add new TensorBlock api implementation + tests	2019-09-24 15:17:35 -07:00
Eugene Zhulenev	ef9dfee7bd	Tensor block evaluation V2 support for unary/binary/broadcsting	2019-09-24 12:52:45 -07:00
Christoph Hertzberg	efd9867ff0	bug #1746 : Removed implementation of standard copy-constructor and standard copy-assign-operator from PermutationMatrix and Transpositions to allow malloc-less std::move. Added unit-test to rvalue_types	2019-09-24 11:09:58 +02:00
Christoph Hertzberg	e4c1b3c1d2	Fix implicit conversion warnings and use pnegate to negate packets	2019-09-23 16:07:43 +02:00
Christoph Hertzberg	ba0736fa8e	Fix (or mask away) conversion warnings introduced in `553caeb6a3` .	2019-09-23 15:58:05 +02:00

... 4 5 6 7 8 ...

10994 Commits