eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-03-01 18:26:24 +08:00

Ilya Tokar 231ce21535 Run two independent chains, when reducing tensors.	2020-06-16 15:55:11 -04:00

Running two chains exposes more instruction level parallelism, by allowing to execute both chains at the same time. Results are a bit noisy, but for medium length we almost hit theoretical upper bound of 2x.

BM_fullReduction_16T/3 [using 16 threads] 17.3ns ±11% 17.4ns ± 9% ~ (p=0.178 n=18+19)
BM_fullReduction_16T/4 [using 16 threads] 17.6ns ±17% 17.0ns ±18% ~ (p=0.835 n=20+19)
BM_fullReduction_16T/7 [using 16 threads] 18.9ns ±12% 18.2ns ±10% ~ (p=0.756 n=20+18)
BM_fullReduction_16T/8 [using 16 threads] 19.8ns ±13% 19.4ns ±21% ~ (p=0.512 n=20+20)
BM_fullReduction_16T/10 [using 16 threads] 23.5ns ±15% 20.8ns ±24% -11.37% (p=0.000 n=20+19)
BM_fullReduction_16T/15 [using 16 threads] 35.8ns ±21% 26.9ns ±17% -24.76% (p=0.000 n=20+19)
BM_fullReduction_16T/16 [using 16 threads] 38.7ns ±22% 27.7ns ±18% -28.40% (p=0.000 n=20+19)
BM_fullReduction_16T/31 [using 16 threads] 146ns ±17% 74ns ±11% -49.05% (p=0.000 n=20+18)
BM_fullReduction_16T/32 [using 16 threads] 154ns ±19% 84ns ±30% -45.79% (p=0.000 n=20+19)
BM_fullReduction_16T/64 [using 16 threads] 603ns ± 8% 308ns ±12% -48.94% (p=0.000 n=17+17)
BM_fullReduction_16T/128 [using 16 threads] 2.44µs ±13% 1.22µs ± 1% -50.29% (p=0.000 n=17+17)
BM_fullReduction_16T/256 [using 16 threads] 9.84µs ±14% 5.13µs ±30% -47.82% (p=0.000 n=19+19)
BM_fullReduction_16T/512 [using 16 threads] 78.0µs ± 9% 56.1µs ±17% -28.02% (p=0.000 n=18+20)
BM_fullReduction_16T/1k [using 16 threads] 325µs ± 5% 263µs ± 4% -19.00% (p=0.000 n=20+16)
BM_fullReduction_16T/2k [using 16 threads] 1.09ms ± 3% 0.99ms ± 1% -9.04% (p=0.000 n=20+20)
BM_fullReduction_16T/4k [using 16 threads] 7.66ms ± 3% 7.57ms ± 3% -1.24% (p=0.017 n=20+20)
BM_fullReduction_16T/10k [using 16 threads] 65.3ms ± 4% 65.0ms ± 3% ~ (p=0.718 n=20+20)
..
CXX11	Run two independent chains, when reducing tensors.	2020-06-16 15:55:11 -04:00
src	Update MarketIO.h	2020-02-28 12:41:51 +00:00
AdolcForward	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
AlignedVector3	fix AlignedVector3 inconsistent interface with other Vector classes, default constructor and operator- were missing.	2019-12-06 21:07:39 +01:00
ArpackSupport	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
AutoDiff	Fix numerous shadow-warnings for GCC<=4.8	2018-08-28 18:32:39 +02:00
BVH	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
CMakeLists.txt
EulerAngles	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
FFT	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
IterativeSolvers	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
KroneckerProduct
LevenbergMarquardt	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
MatrixFunctions	Fix most Doxygen warnings. Also add links to stable documentation from unsupported modules (by using the corresponding Doxytags file).	2018-10-19 21:10:28 +02:00
MoreVectorization	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
MPRealSupport	Fix MPrealSupport	2018-09-20 18:30:10 +02:00
NonLinearOptimization	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
NumericalDiff	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
OpenGLSupport	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
Polynomials	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
Skyline	bug #1596 : fix inclusion of Eigen's header within unsupported modules.	2018-09-17 09:54:29 +02:00
SparseExtra
SpecialFunctions	fix compilation due to new HIP scalar accessor	2019-12-17 20:27:30 +00:00
Splines	Fix numerous shadow-warnings for GCC<=4.8	2018-08-28 18:32:39 +02:00