eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Benoit Steiner	de32f8d656	Fixed the printing of rank-0 tensors	2016-06-20 10:46:45 -07:00
Benoit Steiner	b055590e91	Made log1p_impl usable inside a GPU kernel	2016-06-16 11:37:40 -07:00
Benoit Steiner	7d495d890a	Merged in ibab/eigen (pull request PR-197) Implement exclusive scan option for Tensor library	2016-06-14 17:54:59 -07:00
Benoit Steiner	aedc5be1d6	Avoid generating pseudo random numbers that are multiples of 5: this helps spread the load over multiple cpus without having to rely on work stealing.	2016-06-14 17:51:47 -07:00
Igor Babuschkin	c4d10e921f	Implement exclusive scan option	2016-06-14 19:44:07 +01:00
Gael Guennebaud	4c61f00838	Add missing explicit scalar conversion	2016-06-12 22:42:13 +02:00
Gael Guennebaud	a3a4714aba	Add debug output.	2016-06-11 14:41:53 +02:00
Gael Guennebaud	83904a21c1	Make sure T(i+1,i)==0 when diagonalizing T(i:i+1,i:i+1)	2016-06-11 14:41:36 +02:00
Benoit Steiner	65d33e5898	Merged in ibab/eigen (pull request PR-195) Add small fixes to TensorScanOp	2016-06-10 19:31:17 -07:00
Benoit Steiner	a05607875a	Don't refer to the half2 type unless it's been defined	2016-06-10 11:53:56 -07:00
Gael Guennebaud	0028049380	bug #1240 : Remove any assumption on NEON vector types.	2016-06-09 23:08:11 +02:00
Igor Babuschkin	86aedc9282	Add small fixes to TensorScanOp	2016-06-07 20:06:38 +01:00
Christoph Hertzberg	db0118342c	Fixed compilation of BVH_Example (required for make doc)	2016-06-07 19:17:18 +02:00
Benoit Steiner	84b2060a9e	Fixed compilation error with gcc 4.4	2016-06-06 17:16:19 -07:00
Benoit Steiner	7ef9f47b58	Misc small improvements to the reduction code.	2016-06-06 14:09:46 -07:00
Benoit Steiner	ea75dba201	Added missing EIGEN_DEVICE_FUNC qualifiers to the unary array ops	2016-06-06 13:32:28 -07:00
Benoit Steiner	33f0340188	Implement result_of for the new ternary functors	2016-06-06 12:06:42 -07:00
Gael Guennebaud	df24f4a01d	bug #1201 : improve code generation of affine*vec with MSVC	2016-06-06 16:46:46 +02:00
Benoit Steiner	9137f560f0	Moved assertions to the constructor to make the code more portable	2016-06-06 07:26:48 -07:00
Benoit Steiner	1f1e0b9e30	Silenced compilation warning	2016-06-05 12:59:11 -07:00
Benoit Steiner	5b95b4daf9	Moved static assertions into the class constructor to make the code more portable	2016-06-05 12:57:48 -07:00
Christoph Hertzberg	d7e3e4bb04	Removed executable bits from header files.	2016-06-05 10:15:41 +02:00
Eugene Brevdo	c53687dd14	Add randomized properties tests for betainc special function.	2016-06-05 11:10:30 -07:00
Sean Templeton	bd21243821	Fix compile errors initializing packets on ARM DS-5 5.20 The ARM DS-5 5.20 compiler fails compiling with the following errors: "src/Core/arch/NEON/PacketMath.h", line 113: Error: #146: too many initializer values Packet4f countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/PacketMath.h", line 118: Error: #146: too many initializer values Packet4i countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/Complex.h", line 30: Error: #146: too many initializer values static uint32x4_t p4ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET4(0x00000000, 0x80000000, 0x00000000, 0x80000000); ^ "src/Core/arch/NEON/Complex.h", line 31: Error: #146: too many initializer values static uint32x2_t p2ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET2(0x00000000, 0x80000000); ^ The vectors are implemented as two doubles, hence the too many initializer values error. Changed the code to use intrinsic load functions which all compilers implementing NEON should have.	2016-06-03 10:51:35 -05:00
Gael Guennebaud	1fc2746417	Make Arrays's ctor/assignment noexcept	2016-06-09 22:52:37 +02:00
Benoit Steiner	37638dafd7	Simplified the code that dispatches vectorized reductions on GPU	2016-06-09 10:29:52 -07:00
Benoit Steiner	66796e843d	Fixed definition of some of the reducer_traits	2016-06-09 08:50:01 -07:00
Benoit Steiner	4434b16694	Pulled latest updates from trunk	2016-06-09 08:25:47 -07:00
Benoit Steiner	14a112ee15	Use signed integers more consistently to encode the number of threads to use to evaluate a tensor expression.	2016-06-09 08:25:22 -07:00
Benoit Steiner	8f92c26319	Improved code formatting	2016-06-09 08:23:42 -07:00
Benoit Steiner	aa33446dac	Improved support for vectorization of 16-bit floats	2016-06-09 08:22:27 -07:00
Gael Guennebaud	e2b3836326	Include recent changesets that played with product's kernel	2016-06-09 17:13:33 +02:00
Gael Guennebaud	2bd59b0e0d	Take advantage that T is already diagonal in the extraction of generalized complex eigenvalues.	2016-06-09 17:12:03 +02:00
Gael Guennebaud	c1f9ca9254	Update RealQZ to reduce 2x2 diagonal block of T corresponding to non reduced diagonal block of S to positive diagonal form. This step involves a real 2x2 SVD problem. The respective routine is thus in src/misc/ to be shared by both EVD and SVD modules.	2016-06-09 17:11:03 +02:00
Gael Guennebaud	15890c304e	Add unit test for non symmetric generalized eigenvalues	2016-06-09 16:17:27 +02:00
Gael Guennebaud	a20d2ec1c0	Fix shadow variable, and indexing.	2016-06-09 16:16:22 +02:00
Abhijit Kundu	0beabb4776	Fixed type conversion from int	2016-06-08 16:12:04 -04:00
Gael Guennebaud	df095cab10	Fixes for PARDISO: warnings, and defaults to metis+ in-core mode.	2016-06-08 18:31:19 +02:00
Gael Guennebaud	9fc8379328	Fix extraction of complex eigenvalue pairs in real generalized eigenvalue problems.	2016-06-08 16:39:11 +02:00
Christoph Hertzberg	9dd9d58273	Copied a regression test from 3.2 branch.	2016-06-08 15:36:42 +02:00
Benoit Steiner	8fd57a97f2	Enable the vectorization of adds and mults of fp16	2016-06-07 18:22:18 -07:00
Benoit Steiner	d6d39c7ddb	Added missing EIGEN_DEVICE_FUNC	2016-06-07 14:35:08 -07:00
Eugene Brevdo	39baff850c	Add TernaryFunctors and the betainc SpecialFunction. TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.	2016-06-02 17:04:19 -07:00
Benoit Steiner	02db4e1a82	Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code	2016-06-04 08:21:17 -07:00
Benoit Steiner	c21eaedce6	Use array_prod to compute the number of elements contained in the input tensor expression	2016-06-04 07:47:04 -07:00
Benoit Steiner	36a4500822	Merged in ibab/eigen (pull request PR-192) Add generic scan method	2016-06-03 17:28:33 -07:00
Benoit Steiner	c2a102345f	Improved the performance of full reductions. AFTER: BM_fullReduction/10 4541 4543 154017 21.0M items/s BM_fullReduction/64 5191 5193 100000 752.5M items/s BM_fullReduction/512 9588 9588 71361 25.5G items/s BM_fullReduction/4k 244314 244281 2863 64.0G items/s BM_fullReduction/5k 359382 359363 1946 64.8G items/s BEFORE: BM_fullReduction/10 9085 9087 74395 10.5M items/s BM_fullReduction/64 9478 9478 72014 412.1M items/s BM_fullReduction/512 14643 14646 46902 16.7G items/s BM_fullReduction/4k 260338 260384 2678 60.0G items/s BM_fullReduction/5k 385076 385178 1818 60.5G items/s	2016-06-03 17:27:08 -07:00
Igor Babuschkin	dc03b8f3a1	Add generic scan method	2016-06-03 17:37:04 +01:00
Gael Guennebaud	8d97ba6b22	bug #725 : make move ctor/assignment noexcept.	2016-06-03 14:28:25 +02:00
Gael Guennebaud	e8b922ca63	Fix MatrixFunctions module.	2016-06-03 09:21:35 +02:00

8195 Commits