eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-02-17 18:09:55 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	8fffc81606	Add NumTraits::digits10() function based on numeric_limits::digits10 and make use of it for printing matrices.	2016-07-25 11:13:01 +02:00
Gael Guennebaud	1b0353c659	Fix misuse of dummy_precision in eigenvalue solvers	2016-07-23 17:52:31 +02:00
Gael Guennebaud	72744d93ef	Allows the compiler to inline outer products (the change from default to dont-inline in changeset `737bed19c1` was not motivated)	2016-07-22 17:02:28 +02:00
Gael Guennebaud	395c835f4b	Fix CUDA compilation	2016-07-22 15:30:24 +02:00
Gael Guennebaud	47afc9a365	More cleaning in half: - put its definition and functions in its own half_impl namespace such that the free functions do not pollute the Eigen namespace while still making them visible for half through ADL. - expose Eigen::half through a using statement - move operator<< from std to half_float namespace	2016-07-22 14:33:28 +02:00
Gael Guennebaud	0f350a8b7e	Fix CUDA compilation	2016-07-21 18:47:07 +02:00
Gael Guennebaud	bf91a44f4a	Use ADL and log10 for printing matrices.	2016-07-21 15:48:24 +02:00
Gael Guennebaud	87fbda812f	Add missing log10 and random generator for half.	2016-07-21 15:46:45 +02:00
Gael Guennebaud	01d12d3e82	Some cleanup in Half: standard functions should be defined in the namespace of the class half to make ADL work, and thus the global is* functions can be removed.	2016-07-21 15:10:48 +02:00
Gael Guennebaud	7722913475	Fix ambiguous specialization with custom scalar type	2016-07-20 15:13:44 +02:00
Gael Guennebaud	fd057f86b3	Complete the coeff-wise math function table.	2016-07-20 12:14:10 +02:00
Gael Guennebaud	9e8476ef22	Add missing Eigen::rsqrt global function	2016-07-20 11:59:49 +02:00
Gael Guennebaud	4b4c296d6e	Simplify ScalarBinaryOpTraits by removing the Defined enum, and extend its documentation.	2016-07-20 09:56:39 +02:00
Gael Guennebaud	e3bf874c83	Workaround MSVC 2010 compilation issue.	2016-07-18 15:17:25 +02:00
Gael Guennebaud	0f89c6d6b5	Add a summary of possible values for EIGEN_COMP_MSVC	2016-07-18 15:16:13 +02:00
Gael Guennebaud	18884f17d7	Remove static constant declaration: this forces the compiler to generate costly code for thread safety.	2016-07-18 15:05:17 +02:00
Gael Guennebaud	79574e384e	Make scalar_product_op the default (instead of void)	2016-07-18 12:03:05 +02:00
Gael Guennebaud	6a3c451c1c	Permits call to explicit ctor.	2016-07-18 12:02:20 +02:00
Gael Guennebaud	0c3fe4aca5	merge	2016-07-18 10:44:15 +02:00
Gael Guennebaud	db9b154193	Add missing non-const reverse method in VectorwiseOp.	2016-07-16 15:19:28 +02:00
Gael Guennebaud	461cd819c2	Workaround VS2015 bug	2016-07-13 18:46:01 +02:00
Gael Guennebaud	5ea0864c81	Fix regression in a previous commit: some diagonal entry might not be treated by the 2x2 real preconditioner.	2016-07-13 18:37:54 +02:00
Gael Guennebaud	b4343aa67e	Avoid division by very small entries when extracting singularvalues, and explicitly handle the 1x1 complex case.	2016-07-12 17:22:03 +02:00
Gael Guennebaud	e2aa58b631	Consider denormals as zero in makeJacobi and 2x2 SVD. This also fixes serious issues with x87 for which values can be much smaller than the smallest denormal!	2016-07-12 17:21:03 +02:00
klimpel	8b3fc31b55	compile fix (SFINAE variant apparently didn't work for all compilers) for the following compiler/platform: gcc (GCC) 4.1.2 20080704 (Red Hat 4.1.2-46) Copyright (C) 2006 Free Software Foundation, Inc.	2016-07-11 17:42:22 +02:00
Gael Guennebaud	a96a7ce3f7	Move CUDA's special functions to SpecialFunctions module.	2016-07-11 18:39:11 +02:00
Gael Guennebaud	fd60966310	merge	2016-07-11 18:11:47 +02:00
Gael Guennebaud	3e348fdcf9	Workaround MSVC bug	2016-07-11 15:24:52 +02:00
Konstantinos Margaritis	ef05463fcf	Merged kmargar/eigen/tip into default, Altivec/VSX port should be working ok now.	2016-07-10 16:11:46 +03:00
Konstantinos Margaritis	9f7caa7e7d	minor fixes for big endian altivec/vsx	2016-07-10 07:05:10 -03:00
Christoph Hertzberg	3c795c6923	bug #1119 : Adjust call to ?gssvx for SuperLU 5 Also improved corresponding cmake module to detect versions 5.x Based on patch by Christoph Grüninger.	2016-07-10 02:29:57 +02:00
Gael Guennebaud	2f7e2614e7	bug #1232 : refactor special functions as a new SpecialFunctions module, currently in unsupported/.	2016-07-08 11:13:55 +02:00
Gael Guennebaud	66917299a9	Add debug output	2016-07-06 22:27:15 +02:00
Gael Guennebaud	c3b23d7dbf	Fix support of Intel's VML	2016-07-06 14:07:32 +02:00
Gael Guennebaud	8ec4d6480d	Fix compilation with recent updates of icc 2016	2016-07-06 14:07:14 +02:00
Gael Guennebaud	5b3a6f51d3	Improve numerical robustness of RealSchur: add scaling and compare sub-diag entries to largest diagonal entry instead of the 2 neighbors.	2016-07-06 13:45:30 +02:00
Gael Guennebaud	367ef66af3	Re-enable some specializations for Assignment<.,Product<>>	2016-07-05 22:58:14 +02:00
Gael Guennebaud	155d8d8603	Fix compilation with msvc	2016-07-05 14:43:42 +02:00
Gael Guennebaud	b39fd8217f	Fix nesting of SolveWithGuess, and add unit test.	2016-07-04 17:47:47 +02:00
Gael Guennebaud	ec02af1047	Fix template resolution.	2016-07-04 17:37:33 +02:00
Gael Guennebaud	fbcfc2f862	Add unit test for solveWithGuess, and fix template resolution.	2016-07-04 17:19:38 +02:00
Gael Guennebaud	7f7839c12f	Add documentation and examples for inplace decomposition.	2016-07-04 17:18:26 +02:00
Gael Guennebaud	32a41ee659	bug #707 : add inplace decomposition through Ref<> for Cholesky, LU and QR decompositions.	2016-07-04 15:13:35 +02:00
Gael Guennebaud	91b3039013	Change the semantics of the last template parameter of Assignment from "Scalar" to "SFINAE" only. The previous "Scalar" semantics were obsolete since we allow for different scalar types in the source and destination expressions. One can still specialize on scalar types through SFINAE and/or assignment functor.	2016-07-04 11:02:00 +02:00
Gael Guennebaud	0fa9e4a15c	Fix performance regression in dgemm introduced by changeset `5d51a7f12c`	2016-07-02 17:35:08 +02:00
Gael Guennebaud	672076db5d	Fix performance regression introduced in changeset `e56aabf205` . Register blocking sizes are better handled by the cache size heuristics. The current code introduced very small blocks, for instance for 9x9 matrix, thus killing performance.	2016-07-02 15:40:56 +02:00
Justin Carpentier	6126886a67	Use complete nested namespace Eigen::internal	2016-06-28 20:09:25 +02:00
Benoit Jacob	328c5d876a	Undo changes in AltiVec --- I don't have any way to test there.	2016-06-28 11:15:25 -04:00
Benoit Jacob	38fb606052	Avoid global variables with static constructors in NEON/Complex.h	2016-06-28 11:12:49 -04:00
Gael Guennebaud	d937a420a2	Fix compilation with MSVC by using our portable numext::log1p implementation.	2016-08-22 15:44:21 +02:00
Gael Guennebaud	2d5731e40a	bug #1270 : bypass custom asm for pmadd and recent clang version	2016-08-22 15:38:03 +02:00
Gael Guennebaud	49b005181a	Define EIGEN_COMP_CLANG to clang version as major*100+minor (e.g., 307 corresponds to clang 3.7)	2016-08-22 15:37:05 +02:00
Gael Guennebaud	130f891bb0	bug #1278 : ease parsing	2016-08-22 15:00:29 +02:00
Gael Guennebaud	d476cadbb8	bug #1247 : fix regression in compilation of pow(integer,integer), and add respective unit tests.	2016-06-25 10:12:06 +02:00
Gael Guennebaud	c50c73cae2	Fix missing specialization.	2016-06-24 23:10:39 +02:00
Gael Guennebaud	cd577a275c	Relax promote_scalar_arg logic to enable promotion to Expr::Scalar if conversion to Expr::Literal fails. This is useful to cancel expression template at the scalar level, e.g. with AutoDiff<AutoDiff<>>. This patch also defers calls to NumTraits in cases for which types are not directly compatible.	2016-06-24 11:28:54 +02:00
Gael Guennebaud	deb45ad4bc	bug #1245 : fix compilation with msvc	2016-06-24 09:52:25 +02:00
Gael Guennebaud	55fc04e8b5	Fix operator priority	2016-06-23 15:36:42 +02:00
Gael Guennebaud	bf2d5edecc	Fix warning.	2016-06-23 15:35:17 +02:00
Gael Guennebaud	7c6561485a	merge PR 194	2016-06-23 15:29:57 +02:00
Konstantinos Margaritis	be107e387b	fix compilation with clang 3.9, fix performance with pset1, use vector operators instead of intrinsics in some cases	2016-06-23 10:19:05 -03:00
Gael Guennebaud	76faf4a965	Introduce a NumTraits<T>::Literal type to be used for literals, and improve mixing type support in operations between arrays and scalars: - 2 * ArrayXcf is now optimized in the sense that the integer 2 is properly promoted to a float instead of a complex<float> (fix a regression) - 2.1 * ArrayXi is now forbidden (previously, 2.1 was converted to 2) - This mechanism should be applicable to any custom scalar type, assuming NumTraits<T>::Literal is properly defined (it defaults to T)	2016-06-23 14:27:20 +02:00
Gael Guennebaud	a3f7edf7e7	bug #1242 : fix comma init with empty matrices.	2016-06-23 10:25:04 +02:00
Konstantinos Margaritis	8c34b5a0e3	mostly cleanups and modernizing code	2016-06-19 16:13:17 -03:00
Konstantinos Margaritis	b410d46482	mostly cleanups and modernizing code	2016-06-19 16:12:52 -03:00
Konstantinos Margaritis	b80379bda0	fixed pexp<Packet2d>, was failing tests	2016-06-19 16:11:58 -03:00
Benoit Steiner	b055590e91	Made log1p_impl usable inside a GPU kernel	2016-06-16 11:37:40 -07:00
Gael Guennebaud	67c12531e5	Fix warnings with gcc	2016-06-15 18:11:33 +02:00
Gael Guennebaud	eb91345d64	Move scalar/expr to ArrayBase and fix documentation	2016-06-15 15:22:03 +02:00
Gael Guennebaud	4794834397	Propagate functor to ScalarBinaryOpTraits	2016-06-15 09:58:49 +02:00
Gael Guennebaud	c55035b9c0	Include the cost of stores in unrolling of triangular expressions.	2016-06-15 09:57:33 +02:00
Gael Guennebaud	4e7c3af874	Cleanup useless helper: internal::product_result_scalar	2016-06-15 00:04:10 +02:00
Gael Guennebaud	101ea26f5e	Include the cost of stores in unrolling (also fix infinite unrolling with expression costing 0 like Constant)	2016-06-15 00:01:16 +02:00
Gael Guennebaud	76236cdea4	merge	2016-06-14 15:33:47 +02:00
Gael Guennebaud	1004c4df99	Cleanup unused functors.	2016-06-14 15:27:28 +02:00
Gael Guennebaud	70dad84b73	Generalize expr/expr and scalar/expr wrt scalar types.	2016-06-14 15:26:37 +02:00
Gael Guennebaud	62134082aa	Update AutoDiffScalar wrt to scalar-multiple.	2016-06-14 15:06:35 +02:00
Gael Guennebaud	396d9cfb6e	Generalize expr.pow(scalar), pow(expr,scalar) and pow(scalar,expr). Internal: scalar_pow_op (unary) is removed, and scalar_binary_pow_op is renamed scalar_pow_op.	2016-06-14 14:10:07 +02:00
Gael Guennebaud	a8c08e8b8e	Implement expr+scalar, scalar+expr, expr-scalar, and scalar-expr as binary expressions, and generalize supported scalar types. The following functors are now deprecated: scalar_add_op, scalar_sub_op, and scalar_rsub_op.	2016-06-14 12:06:10 +02:00
Gael Guennebaud	756ac4a93d	Fix doc.	2016-06-14 12:03:39 +02:00
Gael Guennebaud	bcc0f38f98	Add unit-testing plugins to scalar_product_op and scalar_quotient_op to help checking that types are properly propagated.	2016-06-14 11:31:27 +02:00
Gael Guennebaud	f57fd78e30	Generalize coeff-wise sparse products to support different scalar types	2016-06-14 11:29:54 +02:00
Gael Guennebaud	f5b1c73945	Set cost of constant expression to 0 (the cost should be amortized through the expression)	2016-06-14 11:29:06 +02:00
Gael Guennebaud	deb8306e60	Move MatrixBase::operator*(UniformScaling) as a free function in Scaling.h, and fix return type.	2016-06-14 11:28:03 +02:00
Gael Guennebaud	64fcfd314f	Implement scalar multiples and division by a scalar as a binary-expression with a constant expression. This slightly complicates the type of the expressions and implies that we now have to distinguish between scalar*expr and expr*scalar to catch scalar-multiple expression (e.g., see BlasUtil.h), but this brings several advantages: - it makes it clear on which side the scalar is applied, - it clearly reflects that we are dealing with a binary-expression, - the complexity of the type is hidden through macros defined at the end of Macros.h, - distinguishing between "scalar op expr" and "expr op scalar" is important to support non commutative fields (like quaternions) - "scalar op expr" is now fully equivalent to "ConstantExpr(scalar) op expr" - scalar_multiple_op, scalar_quotient1_op and scalar_quotient2_op are not used anymore in officially supported modules (still used in Tensor)	2016-06-14 11:26:57 +02:00
Gael Guennebaud	3c12e24164	Add bind1st_op and bind2nd_op helpers to turn binary functors into unary ones, and implement scalar_multiple2 and scalar_quotient2 on top of them.	2016-06-13 16:18:59 +02:00
Gael Guennebaud	7a9ef7bbb4	Add default template parameters for the second scalar type of binary functors. This enhances backward compatibility.	2016-06-13 16:17:23 +02:00
Gael Guennebaud	4c61f00838	Add missing explicit scalar conversion	2016-06-12 22:42:13 +02:00
Gael Guennebaud	83904a21c1	Make sure T(i+1,i)==0 when diagonalizing T(i:i+1,i:i+1)	2016-06-11 14:41:36 +02:00
Gael Guennebaud	fabae6c9a1	Cleanup	2016-06-10 15:58:33 +02:00
Gael Guennebaud	5fdd703629	Enable mixing types in numext::pow	2016-06-10 15:58:04 +02:00
Gael Guennebaud	2e238bafb6	bug #279 : enable mixing types for comparisons, min, and max.	2016-06-10 15:05:43 +02:00
Gael Guennebaud	0028049380	bug #1240 : Remove any assumption on NEON vector types.	2016-06-09 23:08:11 +02:00
Gael Guennebaud	2c462f4201	Clean handling for void type in EIGEN_CHECK_BINARY_COMPATIBILIY	2016-06-06 23:11:38 +02:00
Gael Guennebaud	3d71d3918e	Disable shortcuts for res ?= prod when the scalar types do not match exactly.	2016-06-06 23:10:55 +02:00
Gael Guennebaud	66e99ab6a1	Relax mixing-type constraints for binary coefficient-wise operators: - Replace internal::scalar_product_traits<A,B> by Eigen::ScalarBinaryOpTraits<A,B,OP> - Remove the "functor_is_product_like" helper (was pretty ugly) - Currently, OP is not used, but it is available to the user for fine grained tuning - Currently, only the following operators have been generalized: *,/,+,-,=,*=,/=,+=,-= - TODO: generalize all other binary operators (comparisons,pow,etc.) - TODO: handle "scalar op array" operators (currently only * is handled) - TODO: move the handling of the "void" scalar type to ScalarBinaryOpTraits	2016-06-06 15:11:41 +02:00
Benoit Steiner	1f1e0b9e30	Silenced compilation warning	2016-06-05 12:59:11 -07:00
Benoit Steiner	5b95b4daf9	Moved static assertions into the class constructor to make the code more portable	2016-06-05 12:57:48 -07:00
Sean Templeton	bd21243821	Fix compile errors initializing packets on ARM DS-5 5.20 The ARM DS-5 5.20 compiler fails compiling with the following errors: "src/Core/arch/NEON/PacketMath.h", line 113: Error: #146: too many initializer values Packet4f countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/PacketMath.h", line 118: Error: #146: too many initializer values Packet4i countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/Complex.h", line 30: Error: #146: too many initializer values static uint32x4_t p4ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET4(0x00000000, 0x80000000, 0x00000000, 0x80000000); ^ "src/Core/arch/NEON/Complex.h", line 31: Error: #146: too many initializer values static uint32x2_t p2ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET2(0x00000000, 0x80000000); ^ The vectors are implemented as two doubles, hence the too many initializer values error. Changed the code to use intrinsic load functions which all compilers implementing NEON should have.	2016-06-03 10:51:35 -05:00
Gael Guennebaud	1fc2746417	Make Arrays's ctor/assignment noexcept	2016-06-09 22:52:37 +02:00
Gael Guennebaud	2bd59b0e0d	Take advantage that T is already diagonal in the extraction of generalized complex eigenvalues.	2016-06-09 17:12:03 +02:00
Gael Guennebaud	c1f9ca9254	Update RealQZ to reduce 2x2 diagonal block of T corresponding to non reduced diagonal block of S to positive diagonal form. This step involves a real 2x2 SVD problem. The respective routine is thus in src/misc/ to be shared by both EVD and SVD modules.	2016-06-09 17:11:03 +02:00
Gael Guennebaud	a20d2ec1c0	Fix shadow variable, and indexing.	2016-06-09 16:16:22 +02:00
Abhijit Kundu	0beabb4776	Fixed type conversion from int	2016-06-08 16:12:04 -04:00
Gael Guennebaud	df095cab10	Fixes for PARDISO: warnings, and defaults to metis+ in-core mode.	2016-06-08 18:31:19 +02:00
Gael Guennebaud	9fc8379328	Fix extraction of complex eigenvalue pairs in real generalized eigenvalue problems.	2016-06-08 16:39:11 +02:00
Benoit Steiner	8fd57a97f2	Enable the vectorization of adds and mults of fp16	2016-06-07 18:22:18 -07:00
Benoit Steiner	ea75dba201	Added missing EIGEN_DEVICE_FUNC qualifiers to the unary array ops	2016-06-06 13:32:28 -07:00
Benoit Steiner	33f0340188	Implement result_of for the new ternary functors	2016-06-06 12:06:42 -07:00
Gael Guennebaud	df24f4a01d	bug #1201 : improve code generation of affine*vec with MSVC	2016-06-06 16:46:46 +02:00
Eugene Brevdo	39baff850c	Add TernaryFunctors and the betainc SpecialFunction. TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.	2016-06-02 17:04:19 -07:00
Gael Guennebaud	8d97ba6b22	bug #725 : make move ctor/assignment noexcept.	2016-06-03 14:28:25 +02:00
Gael Guennebaud	fe62c06d9b	Fix compilation.	2016-06-03 07:47:38 +02:00
Gael Guennebaud	969b8959a0	Fix compilation: Matrix does not indirectly live in the internal namespace anymore!	2016-06-03 07:44:58 +02:00
Gael Guennebaud	f2c2465acc	Fix function dependencies	2016-06-03 07:44:18 +02:00
Gael Guennebaud	53feb73b45	Remove dead code.	2016-06-02 22:19:55 +02:00
Gael Guennebaud	2c00ac0b53	Implement generic scalar*expr and expr*scalar operator based on scalar_product_traits. This is especially useful for custom scalar types, e.g., to enable float*expr<multi_prec> without conversion.	2016-06-02 22:16:37 +02:00
Gael Guennebaud	8b6f53222b	bug #1193 : fix lpNorm<Infinity> for empty input.	2016-06-02 15:29:59 +02:00
Gael Guennebaud	360e311b66	Doc: add some cross references (also fix empty macro argument warning)	2016-06-01 23:34:09 +02:00
Gael Guennebaud	3c69afca4c	Add missing ArrayBase::log1p	2016-06-01 17:08:47 +02:00
Gael Guennebaud	89099b0cf7	Expose log1p to Array.	2016-06-01 17:00:08 +02:00
Gael Guennebaud	afd33539dd	Doc: makes the global unary math functions visible to doxygen (and document them)	2016-06-01 15:27:13 +02:00
Gael Guennebaud	77e652d8ad	Doc: improve documentation of Map<SparseMatrix>	2016-06-01 10:03:32 +02:00
Gael Guennebaud	da4970ead2	Doc: disable inlining of inherited members, workaround Doxygen's limited C++ parsing abilities, and improve doc of MapBase.	2016-06-01 09:38:49 +02:00
Benoit Steiner	099b354ca7	Pulled latest updates from trunk	2016-05-31 10:34:16 -07:00
Benoit Steiner	b6e306f189	Improved support for CUDA 8.0	2016-05-31 09:47:59 -07:00
Gael Guennebaud	1d3b253329	bug #1181 : help MSVC inlining.	2016-05-31 17:23:42 +02:00
Gael Guennebaud	d79eee05ef	Fix compilation with old icc	2016-05-31 17:13:51 +02:00
Gael Guennebaud	2c1b56f4c1	bug #1238 : fix SparseMatrix::sum() overload for un-compressed mode.	2016-05-31 10:56:53 +02:00
Benoit Steiner	c4bd3b1f21	Silenced some compilation warnings triggered by nvcc 8.0	2016-05-27 14:40:49 -07:00
Benoit Steiner	3a5d6a3c38	Disable the use of MMX instructions since the code is broken on many platforms	2016-05-27 09:13:26 -07:00
Gael Guennebaud	e0cb73b46b	Fix compilation with old ICC version (use C99 types instead of C++11 ones)	2016-05-27 10:28:09 +02:00
Benoit Steiner	094f4a56c8	Deleted extra namespace	2016-05-26 14:49:51 -07:00
Gael Guennebaud	7ff5fadcc0	Disable usage of MMX with msvc.	2016-05-26 17:58:46 +02:00
Gael Guennebaud	e8cef383b7	bug #1236 : fix possible integer overflow in density estimation.	2016-05-26 17:51:04 +02:00
Gael Guennebaud	30d97c03ce	Defer the allocation of the working space: - it is not always needed, - and this fixes a long-to-float conversion warning	2016-05-26 17:39:42 +02:00
Gael Guennebaud	e08f54e9eb	Fix copy ctor prototype.	2016-05-26 17:37:25 +02:00
Gael Guennebaud	c7f54b11ec	linspaced's divisor for integer is better stored as the underlying scalar type.	2016-05-26 17:36:54 +02:00
Gael Guennebaud	bebc5a2147	Fix/handle some int-to-long conversions.	2016-05-26 17:35:53 +02:00
Gael Guennebaud	00c29c2cae	Store permutation's determinant as char. This also fixes some long to float conversion warnings	2016-05-26 17:34:23 +02:00
Gael Guennebaud	2f56d91063	Fix a pointer to integer conversion warning	2016-05-26 17:31:45 +02:00
Gael Guennebaud	2a44a70142	Handle some Index to int conversions in BLAS/LAPACK support.	2016-05-26 17:29:04 +02:00
Gael Guennebaud	f253e19296	Disable some long to float conversion warnings	2016-05-26 17:27:14 +02:00
Gael Guennebaud	37197b602b	Remove debugging code.	2016-05-26 11:53:10 +02:00
Gael Guennebaud	27f0434233	Introduce internal's UIntPtr and IntPtr types for pointer to integer conversions. This fixes "conversion from pointer to same-sized integral type" warnings by ICC. Ideally, we would use the std::[u]intptr_t types all the time, but since they are C99/C++11 only, let's be safe.	2016-05-26 10:52:12 +02:00
Gael Guennebaud	40e4637d79	Turn off ICC's conversion warning in is_convertible implementation	2016-05-26 10:48:43 +02:00
Gael Guennebaud	cc1ab64f29	Add missing inclusion of mmintrin.h	2016-05-26 09:51:50 +02:00
Benoit Steiner	3585ff585e	Silenced a compilation warning	2016-05-25 22:09:19 -07:00
Benoit Steiner	efeb89dcdb	Specify the rounding mode in the correct location	2016-05-25 17:53:24 -07:00
Benoit Steiner	0322c66a3f	Explicitly specify the rounding mode when converting floats to fp16	2016-05-25 15:56:15 -07:00
Benoit Steiner	ed783872ab	Disable the use of MMX instructions on x86_64 since too many compilers only support them in 32bit mode	2016-05-25 08:27:26 -07:00
Benoit Steiner	bcfff64f9e	Use numext:: instead of std:: functions.	2016-05-25 08:08:21 -07:00
Gael Guennebaud	bbf9109e25	Fix compilation with ICC.	2016-05-25 10:00:55 +02:00
Gael Guennebaud	2a1bff67fd	Fix static/inline order.	2016-05-25 10:00:11 +02:00
Benoit Steiner	d041a528da	Cleaned up the fp16 code a little more	2016-05-24 22:43:26 -07:00
Benoit Steiner	cb26784d07	Pulled latest updates from trunk	2016-05-24 18:51:39 -07:00
Benoit Steiner	ff4a289572	Cleaned up the fp16 code	2016-05-24 18:50:09 -07:00
Gael Guennebaud	e68e165a23	bug #256 : enable vectorization with unaligned loads/stores. This concerns all architectures and all sizes. This new behavior can be disabled by defining EIGEN_UNALIGNED_VECTORIZE=0	2016-05-24 21:54:03 +02:00
Gael Guennebaud	78390e4189	Block<> should not disable vectorization based on inner-size, this is the responsibility of the assignment logic.	2016-05-24 17:14:01 +02:00
Gael Guennebaud	64bb7576eb	Clean propagation of Dest/Src alignments.	2016-05-24 17:12:12 +02:00
Benoit Jacob	40a16282c7	Remove now-unused protate PacketMath func	2016-05-24 11:01:18 -04:00
Benoit Jacob	6136f4fdd4	Remove the rotating kernel. It was only useful on some ARM CPUs (Qualcomm Krait) that are not as ubiquitous today as they were when I introduced it.	2016-05-24 10:00:32 -04:00
Benoit Steiner	e617711306	Don't attempt to use MMX instructions with visualstudio since they're only partially supported.	2016-05-24 06:43:58 -07:00
Benoit Steiner	334e76537f	Worked around missing clang intrinsic	2016-05-24 00:29:28 -07:00
Benoit Steiner	b517ab349b	Use the generic ploadquad intrinsics since it does the job	2016-05-24 00:11:17 -07:00
Benoit Steiner	646872cb3b	Worked around missing clang intrinsics	2016-05-24 00:07:08 -07:00
Benoit Steiner	3dfc391a61	Added missing EIGEN_DEVICE_FUNC qualifier	2016-05-23 20:56:59 -07:00
Benoit Steiner	3d0741f027	Include mmintrin.h to make it possible to use mmx instructions when needed. For example, this will enable the definition of a half packet for the Packet4f type.	2016-05-23 20:43:48 -07:00
Benoit Steiner	33a94f5dc7	Use the Index type instead of integers to specify the strides in pgather/pscatter	2016-05-23 20:37:30 -07:00
Benoit Steiner	6bc684ab6a	Added missing alignment in the fp16 packet traits	2016-05-23 20:32:30 -07:00
Benoit Steiner	283e33dea4	ptranspose is not a template.	2016-05-23 19:55:55 -07:00
Benoit Steiner	a5a3ba2b80	Avoid unnecessary float to double conversions	2016-05-23 17:16:09 -07:00
Benoit Steiner	5ba0ebe7c9	Avoid unnecessary float to double conversion.	2016-05-23 17:14:31 -07:00
Benoit Steiner	7d980d74e5	Started to vectorize the processing of 16bit floats on CPU.	2016-05-23 15:21:40 -07:00
Benoit Steiner	5d51a7f12c	Don't optimize the processing of the last rows of a matrix matrix product in cases that violate the assumptions made by the optimized code path.	2016-05-23 15:13:16 -07:00
Christoph Hertzberg	88654762da	Replace multiple constructors of half-type by a generic/templated constructor. This fixes an incompatibility with long double, exposed by the previous commit.	2016-05-23 10:03:03 +02:00
Christoph Hertzberg	718521d5cf	Silenced several double-promotion warnings	2016-05-22 18:17:04 +02:00
Gael Guennebaud	ccaace03c9	Make EIGEN_HAS_CONSTEXPR user configurable	2016-05-20 15:10:08 +02:00
Gael Guennebaud	c3410804cd	Make EIGEN_HAS_VARIADIC_TEMPLATES user configurable	2016-05-20 15:05:38 +02:00
Gael Guennebaud	abd1c1af7a	Make EIGEN_HAS_STD_RESULT_OF user configurable	2016-05-20 15:01:27 +02:00
Gael Guennebaud	1395056fc0	Make EIGEN_HAS_C99_MATH user configurable	2016-05-20 14:58:19 +02:00
Gael Guennebaud	48bf5ec216	Make EIGEN_HAS_RVALUE_REFERENCES user configurable	2016-05-20 14:54:20 +02:00
Gael Guennebaud	f43ae88892	Rename EIGEN_HAVE_RVALUE_REFERENCES to EIGEN_HAS_RVALUE_REFERENCES	2016-05-20 14:48:51 +02:00
Gael Guennebaud	998f2efc58	Add a EIGEN_MAX_CPP_VER option to limit the C++ version to be used.	2016-05-20 14:44:28 +02:00
Gael Guennebaud	c028d96089	Improve doc of special math functions	2016-05-20 14:18:48 +02:00
Gael Guennebaud	0ba32f99bd	Rename UniformRandom to UnitRandom.	2016-05-20 13:21:34 +02:00
Gael Guennebaud	7a9d9cde94	Fix coding practice in Quaternion::UniformRandom	2016-05-20 13:19:52 +02:00
Joseph Mirabel	eb0cc2573a	bug #823 : add static method to Quaternion for uniform random rotations.	2016-05-20 13:15:40 +02:00
Gael Guennebaud	6761c64d60	zeta and polygamma are not unary functions, but binary ones.	2016-05-19 18:34:16 +02:00
Gael Guennebaud	7a54032408	zeta and digamma do not require C++11/C99	2016-05-19 17:36:47 +02:00
Gael Guennebaud	ce12562710	Add some c++11 flags in documentation	2016-05-19 17:35:30 +02:00
Gael Guennebaud	b6ed8244b4	bug #1201 : optimize affine*vector products	2016-05-19 16:09:15 +02:00
Gael Guennebaud	73693b5de6	bug #1221 : disable gcc 6 warning: ignoring attributes on template argument	2016-05-19 15:21:53 +02:00
Gael Guennebaud	df9a5e13c6	Fix SelfAdjointEigenSolver for some input expression types, and add new regression unit tests for sparse and selfadjointview inputs.	2016-05-19 13:07:33 +02:00
Gael Guennebaud	6a2916df80	DiagonalWrapper is a vector, so it must expose the LinearAccessBit flag.	2016-05-19 13:06:21 +02:00
Gael Guennebaud	a226f6af6b	Add support for SelfAdjointView::diagonal()	2016-05-19 13:05:33 +02:00
Gael Guennebaud	ee7da3c7c5	Fix SelfAdjointView::triangularView for complexes.	2016-05-19 13:01:51 +02:00
Gael Guennebaud	b6b8578a67	bug #1230 : add support for SelfadjointView::triangularView.	2016-05-19 11:36:38 +02:00
Gael Guennebaud	84df9142e7	bug #1231 : fix compilation regression regarding complex_array/=real_array and add respective unit tests	2016-05-18 23:00:13 +02:00
Gael Guennebaud	21d692d054	Use coeff(i,j) instead of operator().	2016-05-18 17:09:20 +02:00
Gael Guennebaud	8456bbbadb	bug #1224 : fix regression in (dense*dense).sparseView() by specializing evaluator<SparseView<Product>> for sparse products only.	2016-05-18 16:53:28 +02:00
Gael Guennebaud	b507b82326	Use default sorting strategy for square products.	2016-05-18 16:51:54 +02:00
Gael Guennebaud	747e3290c0	bug #1213 : rename some enums type for consistency.	2016-05-18 13:26:56 +02:00
Rasmus Munk Larsen	0dbd68145f	Roll back changes to core. Move include of TensorFunctors.h up to satisfy dependence in TensorCostModel.h.	2016-05-17 10:25:19 -07:00
Rasmus Munk Larsen	e55deb21c5	Improvements to parallelFor. Move some scalar functors from TensorFunctors. to Eigen core.	2016-05-12 14:07:22 -07:00
Benoit Steiner	fae0493f98	Fixed a couple of bugs related to the Pascal family of GPUs	2016-05-11 23:02:26 -07:00
Benoit Steiner	b6a517c47d	Added the ability to load fp16 using the texture path. Improved the performance of some reductions on fp16	2016-05-11 21:26:48 -07:00
Benoit Steiner	518149e868	Misc fixes for fp16	2016-05-11 20:11:14 -07:00
Benoit Steiner	56a1757d74	Made predux_min and predux_max on fp16 less noisy	2016-05-11 17:37:34 -07:00
Benoit Steiner	9091351dbe	__ldg is only available with cuda architectures >= 3.5	2016-05-11 15:22:13 -07:00
Benoit Steiner	02f76dae2d	Fixed a typo	2016-05-11 15:08:38 -07:00
Christoph Hertzberg	131e5a1a4a	Do not copy for trivial 1x1 case. This also avoids a "maybe-uninitialized" warning in some situations.	2016-05-11 23:50:13 +02:00
Benoit Steiner	70195a5ff7	Added missing EIGEN_DEVICE_FUNC	2016-05-11 14:10:09 -07:00
Benoit Steiner	09a19c33a8	Added missing EIGEN_DEVICE_FUNC qualifiers	2016-05-11 14:07:43 -07:00
Christoph Hertzberg	33ca7e3c8d	bug #1207 : Add and fix logical-op warnings	2016-05-11 19:36:34 +02:00
Benoit Steiner	217d984abc	Fixed a typo in my previous commit	2016-05-11 10:22:15 -07:00
Christoph Hertzberg	0f61343893	Workaround maybe-uninitialized warning	2016-05-11 09:00:18 +02:00
Christoph Hertzberg	3bfc9b47ca	Workaround "misleading-indentation" warnings	2016-05-11 08:41:36 +02:00
Benoit Steiner	0b9e3dcd06	Added packet primitives to compute exp, log, sqrt and rsqrt on fp16. This improves the performance by 10 to 30%.	2016-05-10 11:05:33 -07:00
Benoit Steiner	8adf5cc70f	Added support for packet processing of fp16 on kepler and maxwell gpus	2016-05-06 19:16:43 -07:00
Christoph Hertzberg	a11bd82dc3	bug #1213 : Give names to anonymous enums	2016-05-06 11:31:56 +02:00
Benoit Steiner	0451940fa4	Relaxed the dummy precision for fp16	2016-05-05 15:40:01 -07:00
Christoph Hertzberg	dacb469bc9	Enable and fix -Wdouble-conversion warnings	2016-05-05 13:35:45 +02:00
Ola Røer Thorsen	be78aea6b3	fix double-promotion/float-conversion in Core/SpecialFunctions.h	2016-05-04 10:52:08 +02:00
Gael Guennebaud	75a94b9662	Improve documentation of BDCSVD	2016-05-04 12:53:14 +02:00
Gael Guennebaud	e2ca478485	bug #1214 : consider denormals as zero in D&C SVD. This also workaround infinite binary search when compiling with ICC's unsafe optimizations.	2016-05-03 23:15:29 +02:00
Benoit Steiner	4c05fb03a3	Merged eigen/eigen into default	2016-05-03 13:15:00 -07:00
Benoit Steiner	6c3e5b85bc	Fixed compilation error with cuda >= 7.5	2016-05-03 09:38:42 -07:00
Benoit Steiner	da50419df8	Made a cast explicit	2016-05-02 19:50:22 -07:00
Gael Guennebaud	b1bd53aa6b	Fix performance regression: with AVX, unaligned stores were emitted instead of aligned ones for fixed size assignment.	2016-05-01 23:25:06 +02:00
Benoit Steiner	2b890ae618	Fixed compilation errors generated by clang	2016-04-29 18:30:40 -07:00
Benoit Steiner	46bcb70969	Don't turn on const expressions when compiling with gcc >= 4.8 unless the -std=c++11 option has been used	2016-04-29 15:20:59 -07:00
Benoit Steiner	07a247dcf4	Pulled latest updates from upstream	2016-04-29 13:41:26 -07:00
Benoit Steiner	fa5a8f055a	Implemented palign_impl for AVX512	2016-04-29 13:30:13 -07:00
Benoit Steiner	ef3ac9d05a	Fixed the AVX512 packet traits	2016-04-29 13:28:36 -07:00
Benoit Steiner	d7b75e8d86	Added pdiv packet primitives for avx512	2016-04-29 13:26:47 -07:00
Benoit Steiner	5e89ded685	Implemented preduxp for AVX512	2016-04-29 13:00:33 -07:00
Benoit Steiner	5f85662ad8	Implemented the pabs and preverse primitives for avx512.	2016-04-29 12:53:34 -07:00
Benoit Steiner	d37ee89ca8	Disabled some of the AVX512 primitives on compilers that don't support them	2016-04-29 12:50:29 -07:00
Gael Guennebaud	0f3c4c8ff4	Fix compilation of sparse.cast<>().transpose().	2016-04-29 18:26:08 +02:00
Benoit Steiner	dacb23277e	Fixed the igamma and igammac implementations to make them callable from a gpu kernel.	2016-04-28 18:54:54 -07:00
Benoit Steiner	a5d4545083	Deleted unused variable	2016-04-28 14:14:48 -07:00
Justin Lebar	40d1e2f8c7	Eliminate mutual recursion in igamma{,c}_impl::Run. Presently, igammac_impl::Run calls igamma_impl::Run, which in turn calls igammac_impl::Run. This isn't actually mutual recursion; the calls are guarded such that we never get into a loop. Nonetheless, it's a stretch for clang to prove this. As a result, clang emits a recursive call in both igammac_impl::Run and igamma_impl::Run. That this is suboptimal code is bad enough, but it's particularly bad when compiling for CUDA/nvptx. nvptx allows recursion, but only begrudgingly: If you have recursive calls in a kernel, it's on you to manually specify the kernel's stack size. Otherwise, ptxas will dump a warning, make a guess, and who knows if it's right. This change explicitly eliminates the mutual recursion in igammac_impl::Run and igamma_impl::Run.	2016-04-28 13:57:08 -07:00
Konstantinos Margaritis	87294c84a6	define Packet2d constants with VSX only	2016-04-28 14:39:56 -03:00
Konstantinos Margaritis	6ed7a7281c	remove accidentally pasted code	2016-04-28 14:35:55 -03:00
Konstantinos Margaritis	62f9093b31	improve state of MathFunctions as well	2016-04-28 14:33:09 -03:00
Konstantinos Margaritis	8ed26120c8	bring Altivec/VSX to a better state, implement some of the missing functions	2016-04-28 14:32:42 -03:00
Konstantinos Margaritis	950158f6d1	add name to copyrights	2016-04-28 14:32:11 -03:00
Konstantinos Margaritis	ee0459300b	minor fix, add to copyright	2016-04-28 14:31:21 -03:00
Benoit Steiner	2b917291d9	Merged in rmlarsen/eigen2 (pull request PR-183) Detect cxx_constexpr support when compiling with clang.	2016-04-27 15:19:54 -07:00
Rasmus Munk Larsen	09b9e951e3	Depend on the more extensive support for constexpr in clang: http://clang.llvm.org/docs/LanguageExtensions.html#c-1y-relaxed-constexpr	2016-04-27 14:59:11 -07:00
Rasmus Munk Larsen	1a325ef71c	Detect cxx_constexpr support when compiling with clang.	2016-04-27 14:33:51 -07:00
Benoit Steiner	c61170e87d	fpclassify isn't portable enough. In particular, the return values of the function are not available on all the platforms Eigen supports: remove it from Eigen.	2016-04-27 14:22:20 -07:00
Benoit Steiner	f629fe95c8	Made the index type a template parameter to evaluateProductBlockingSizes. Use numext::mini and numext::maxi instead of std::min/std::max to compute blocking sizes.	2016-04-27 13:11:19 -07:00
Benoit Steiner	25141b69d4	Improved support for min and max on 16 bit floats when running on recent cuda gpus	2016-04-27 12:57:21 -07:00
Benoit Steiner	6744d776ba	Added support for fpclassify in Eigen::Numext	2016-04-27 12:10:25 -07:00
Konstantinos Margaritis	3f80696ae1	Merged eigen/eigen into default	2016-04-22 15:05:21 +03:00
Benoit Steiner	5c372d19e3	Merged in rmlarsen/eigen (pull request PR-179) Prevent crash in CompleteOrthogonalDecomposition if object was default constructed.	2016-04-21 18:06:36 -07:00
Rasmus Munk Larsen	a3256d78d8	Prevent crash in CompleteOrthogonalDecomposition if object was default constructed.	2016-04-21 16:49:28 -07:00
Konstantinos Margaritis	e5b2ef47d5	Merged eigen/eigen into default	2016-04-21 18:03:08 +03:00
Benoit Steiner	80200a1828	Don't attempt to leverage the _cvtss_sh and _cvtsh_ss instructions when compiling with clang since it's unclear which versions of clang actually support these instructions.	2016-04-20 12:10:27 -07:00
Benoit Steiner	1d0238375d	Made sure all the required header files are included when trying to use fp16	2016-04-19 17:44:12 -07:00
Gael Guennebaud	e4fe611e2c	Enable lazy-coeff-based-product for vector*(1x1) products	2016-04-16 15:17:39 +02:00
Benoit Steiner	1a16fb1532	Deleted extraneous comma.	2016-04-15 15:50:13 -07:00
Gael Guennebaud	2a7115daca	bug #1203 : by-pass large stack-allocation in stableNorm if EIGEN_STACK_ALLOCATION_LIMIT is too small	2016-04-15 22:34:11 +02:00
Benoit Steiner	1d23430628	Improved the matrix multiplication blocking in the case where mr is not a power of 2 (e.g on Haswell CPUs).	2016-04-15 10:53:31 -07:00
Gael Guennebaud	1e80bddde3	Fix trmv for mixing types.	2016-04-15 17:58:36 +02:00
Konstantinos Margaritis	0e8fc31087	remove pgather/pscatter for std::complex<double> for s390x	2016-04-15 07:08:57 -04:00
Benoit Steiner	a62e924656	Added ability to access the cache sizes from the tensor devices	2016-04-14 21:25:06 -07:00
Benoit Steiner	18e6f67426	Added support for exclusive or	2016-04-14 20:37:46 -07:00
Gael Guennebaud	20f387fafa	Improve numerical robustness of JacobiSVD: - avoid noise amplification in complex to real conversion - compare off-diagonal entries to the current biggest diagonal entry: no need to bother about a 2x2 block containing ridiculously small entries compared to the rest of the matrix.	2016-04-14 22:46:55 +02:00
Benoit Steiner	7718749fee	Force the inlining of the << operator on half floats	2016-04-14 11:51:54 -07:00
Benoit Steiner	5379d2b594	Inline the << operator on half floats	2016-04-14 11:40:48 -07:00
Benoit Steiner	5c13765ee3	Added ability to printf fp16	2016-04-14 10:24:52 -07:00
Gael Guennebaud	3551dea887	Cleaning pass on rcond estimator.	2016-04-14 16:45:41 +02:00
Gael Guennebaud	d402adc3d7	Better use .data() than &coeffRef(0)	2016-04-14 15:18:08 +02:00
Gael Guennebaud	ea7087ef31	Merged in rmlarsen/eigen (pull request PR-174) Add matrix condition number estimation module.	2016-04-14 15:11:33 +02:00
Benoit Steiner	36f5a10198	Properly gate the definition of the error and gamma functions for fp16	2016-04-13 18:44:48 -07:00
Benoit Steiner	10b69810d1	Improved support for trigonometric functions on GPU	2016-04-13 16:00:51 -07:00
Benoit Steiner	d6105b53b8	Added basic implementation of the lgamma, digamma, igamma, igammac, polygamma, and zeta function for fp16	2016-04-13 15:26:02 -07:00
Gael Guennebaud	703251f10f	merge	2016-04-13 23:45:10 +02:00
Gael Guennebaud	39211ba46b	Fix JacobiSVD for complex when the complex-to-real update already gives a diagonal 2x2 block.	2016-04-13 23:43:26 +02:00
Benoit Steiner	2986253259	Cleaned up the implementation of digamma	2016-04-13 14:24:06 -07:00
Benoit Steiner	d5de1a8220	Pulled latest updates from trunk	2016-04-13 14:17:11 -07:00
Benoit Steiner	87ca15c4e8	Added support for sin, cos, tan, and tanh on fp16	2016-04-13 14:12:38 -07:00
Gael Guennebaud	feef39e2d1	Fix underflow in JacobiSVD's complex to real preconditioner	2016-04-13 22:49:51 +02:00
Benoit Steiner	bf3f6688f0	Added support for computing cos, sin, tan, and tanh on GPU.	2016-04-13 11:55:08 -07:00
Benoit Steiner	473c8380ea	Added constructors to convert unsigned integers into fp16	2016-04-13 11:03:37 -07:00
Gael Guennebaud	42a3352a3b	Workaround a division by zero when outerstride==0	2016-04-13 19:02:02 +02:00
Gael Guennebaud	6f960b83ff	Make use of is_same_dense helper instead of extract_data to detect input/outputs are the same.	2016-04-13 18:47:12 +02:00
Gael Guennebaud	b7716c0328	Fix incomplete previous patch on matrix comparison.	2016-04-13 18:32:56 +02:00
Gael Guennebaud	2630d97c62	Fix detection of same matrices when both matrices are not handled by extract_data.	2016-04-13 18:26:08 +02:00
Gael Guennebaud	06447e0a39	Improve half-packet vectorization logic to distinguish linear versus inner traversal modes.	2016-04-13 18:15:49 +02:00
Gael Guennebaud	bbb8854bf7	Enable half-packet in reduxions.	2016-04-13 13:02:34 +02:00
Benoit Steiner	aa1ba8bbd2	Don't put a comma at the end of an enumerator list	2016-04-12 16:28:11 -07:00
Gael Guennebaud	b67c983291	Enable the use of half-packet in coeff-based product. For instance, Matrix4f*Vector4f is now vectorized again when using AVX.	2016-04-12 23:03:03 +02:00
Benoit Steiner	8bfe739cd2	Updated the AVX512 PacketMath to properly leverage the AVX512DQ instructions	2016-04-11 18:40:16 -07:00
Rasmus Larsen	6498dadc2f	Merged eigen/eigen into default	2016-04-11 17:42:05 -07:00
Benoit Steiner	d6e596174d	Pull latest updates from upstream	2016-04-11 17:20:17 -07:00
Benoit Steiner	748c4c4599	More accurate cost estimates for exp, log, tanh, and sqrt.	2016-04-11 13:11:04 -07:00

5123 Commits