eigen/Eigen
Rasmus Munk Larsen 3ed67cb0bb Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (enabled by EIGEN_FAST_MATH), which causes the vectorized parts of the computation to return -0.0 instead of NaN for negative arguments.
Benchmark speed in Giga-sqrts/s
Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz
-----------------------------------------
                    SSE        AVX
Fast=1              2.529G     4.380G
Fast=0              1.944G     1.898G
Fast=1 fixed        2.214G     3.739G

This table illustrates the worst case in terms speed impact: It was measured by repeatedly computing the sqrt of an n=4096 float vector that fits in L1 cache. For large vectors the operation becomes memory bound and the differences between the different versions almost negligible.
2016-10-04 14:22:56 -07:00
..
src Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (enabled by EIGEN_FAST_MATH), which causes the vectorized parts of the computation to return -0.0 instead of NaN for negative arguments. 2016-10-04 14:22:56 -07:00
Cholesky Clean references to MKL in LAPACKe support. 2016-07-25 18:20:08 +02:00
CholmodSupport
CMakeLists.txt bug #1167: simplify installation of header files using cmake's install(DIRECTORY ...) command. 2016-08-29 10:59:37 +02:00
Core Added support for constand std::complex numbers on GPU 2016-10-03 11:06:24 -07:00
Dense
Eigen
Eigenvalues Clean references to MKL in LAPACKe support. 2016-07-25 18:20:08 +02:00
Geometry
Householder
IterativeLinearSolvers
Jacobi
LU Clean references to MKL in LAPACKe support. 2016-07-25 18:20:08 +02:00
MetisSupport
OrderingMethods
PardisoSupport
PaStiXSupport
QR Clean references to MKL in LAPACKe support. 2016-07-25 18:20:08 +02:00
QtAlignedMalloc
Sparse
SparseCholesky
SparseCore
SparseLU
SparseQR
SPQRSupport
StdDeque
StdList
StdVector
SuperLUSupport bug #1119: Adjust call to ?gssvx for SuperLU 5 2016-07-10 02:29:57 +02:00
SVD Clean references to MKL in LAPACKe support. 2016-07-25 18:20:08 +02:00
UmfPackSupport