eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-01-06 14:14:46 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	dacc544b84	asm escape was not strong enough to prevent too aggressive compiler optimization let's fallback to no-inline.	2016-07-04 14:32:15 +02:00
Gael Guennebaud	b74e45906c	Few fixes in perf-monitoring.	2016-07-04 14:30:50 +02:00
Gael Guennebaud	e2b3836326	Include recent changesets that played with product's kernel	2016-06-09 17:13:33 +02:00
Benoit Steiner	457204cb83	Updated the README file for the tensor benchmarks	2016-05-25 16:13:41 -07:00
Benoit Steiner	034aa3b2c0	Improved the performance of tensor padding	2016-05-25 11:43:08 -07:00
Benoit Steiner	069a0b04d7	Added benchmarks for contraction on CPU.	2016-05-13 14:32:17 -07:00
Benoit Steiner	f81e413180	Added a benchmark to measure the performance of full reductions of 16 bit floats	2016-05-05 14:15:11 -07:00
Benoit Steiner	79b900375f	Use index list for the striding benchmarks	2016-04-21 11:58:27 -07:00
Gael Guennebaud	d8a3bdaa24	remove useless include	2016-04-14 15:18:56 +02:00
Benoit Steiner	eaeb6ca93a	Enable the benchmarks for algebraic and transcendental fnctions on fp16.	2016-04-12 16:29:00 -07:00
Benoit Steiner	53121c0119	Turned on the contraction benchmarks for fp16	2016-04-12 14:11:52 -07:00
Benoit Steiner	63102ee43d	Turn on the coeffWise benchmarks on fp16	2016-04-07 23:05:20 -07:00
Benoit Steiner	7c47d3e663	Fixed the type casting benchmarks for fp16	2016-04-07 22:50:25 -07:00
Benoit Steiner	a6d08be9b2	Fixed the benchmarking of fp16 coefficient wise operations	2016-04-07 17:13:44 -07:00
Benoit Steiner	0968e925a0	Updated the benchmarking code to use Eigen::half instead of half	2016-03-24 18:00:33 -07:00
Benoit Steiner	7168afde5e	Made the tensor benchmarks compile on MacOS	2016-03-23 14:21:04 -07:00
Christoph Hertzberg	b224771f40	bug #1178 : Simplified modification of the SSE control register for better portability	2016-03-20 10:57:08 +01:00
Benoit Steiner	56a3ada670	Added benchmarks for full reduction	2016-02-29 14:57:52 -08:00
Benoit Steiner	1031b31571	Improved the README	2016-02-27 20:22:04 +00:00
Benoit Steiner	93485d86bc	Added benchmarks for type casting of float16	2016-02-26 12:24:58 -08:00
Benoit Steiner	002824e32d	Added benchmarks for fp16	2016-02-26 12:21:25 -08:00
Benoit Steiner	8cb9bfab87	Extended the tensor benchmark suite to support types other than floats	2016-02-23 05:28:02 +00:00
Benoit Steiner	f442a5a5b3	Updated the tensor benchmarking code to work with compilers that don't support cxx11.	2016-02-23 04:15:48 +00:00
Gael Guennebaud	485823b5f5	Add COD and BDCSVD in list of benched solvers.	2016-02-19 23:00:33 +01:00
Benoit Steiner	4281eb1e2c	Added 2 benchmarks to the suite of tensor benchmarks running on GPU	2016-01-30 10:20:43 -08:00
Benoit Steiner	e4f83bae5d	Fixed the tensor benchmarks on apple devices	2016-01-28 21:08:07 -08:00
Benoit Steiner	10bea90c4a	Fixed clang related compilation error	2016-01-28 20:52:08 -08:00
Benoit Steiner	211d350fc3	Fixed a typo	2016-01-28 17:13:04 -08:00
Benoit Steiner	bd2e5a788a	Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.	2016-01-28 17:10:40 -08:00
Benoit Steiner	120e13b1b6	Added a readme to explain how to compile the tensor benchmarks.	2016-01-28 17:06:00 -08:00
Benoit Steiner	a68864b6bc	Updated the benchmarking code to print the number of flops processed instead of the number of bytes.	2016-01-28 16:51:40 -08:00
Benoit Steiner	c8d5f21941	Added extra tensor benchmarks	2016-01-28 16:20:36 -08:00
Yangqing Jia	270c4e1ecd	bugfix	2016-01-28 11:11:45 -08:00
Yangqing Jia	c4e47630b1	benchmark modifications to make it compilable in a standalone fashion.	2016-01-28 10:35:14 -08:00
Gael Guennebaud	4d708457d0	Increase axpy vector size	2015-12-11 23:07:22 +01:00
Gael Guennebaud	274b2272b7	Make bench_gemm compatible with 3.2	2015-12-01 09:57:31 +01:00
Gael Guennebaud	6fcd316f23	Extend superlu cmake script to check version	2015-11-30 14:48:11 +01:00
Gael Guennebaud	0ff127e896	Preserve CMAKE_CXX_FLAGS in BTL	2015-11-27 10:18:39 +01:00
Gael Guennebaud	13294b5152	Unify gemm and lazy_gemm benchmarks	2015-10-07 16:06:48 +02:00
Gael Guennebaud	247259f805	Add a perfromance regression benchmark for lazyProduct	2015-10-07 15:51:06 +02:00
Gael Guennebaud	c6eb17cbe9	Add helper routines to help bypassing some compiler otpimization when benchmarking	2015-10-07 15:50:42 +02:00
Gael Guennebaud	913a61870d	Update utility for experimenting with 3x3 eigenvalues	2015-06-08 15:54:53 +02:00
Gael Guennebaud	acc761cf0c	Merged in FlorianGeorge/eigen_blaze_fork_2 (pull request PR-60) Use trans(X) instead of X.transpose() in Blaze Benchmark	2015-06-04 09:15:22 +02:00
Benoit Jacob	dc04f12967	use unsigned short instead of uint16_t which doesn't exist in c++98	2015-03-17 10:31:45 -04:00
Benoit Jacob	ca5c12587b	Polish lookup tables generation	2015-03-15 18:05:53 -04:00
Benoit Jacob	b6b88c0808	update perf_monitoring/gemm/changesets.txt	2015-03-13 14:57:05 -07:00
Benoit Jacob	d73ccd717e	Add support for dumping blocking sizes tables	2015-03-13 10:36:01 -07:00
Benoit Jacob	f2c3e2b10f	Add --only-cubic-sizes option to analyze-blocking-sizes tool	2015-03-12 13:16:33 -07:00
Benoit Jacob	39228cb224	deserialization assumed benchmarks in same order, but we shuffle them.	2015-03-06 19:29:01 -05:00
Benoit Jacob	a4f956b1da	merge	2015-03-06 19:13:36 -05:00
Benoit Jacob	19bf13aa62	Automatically serialize partial results to disk, reboot, and resume, when timings are getting bad	2015-03-06 19:11:50 -05:00
Gael Guennebaud	4c8eeeaed6	update gemm changeset list	2015-03-06 15:08:20 +01:00
Gael Guennebaud	eedd5063fd	Update gemm performance monitoring tool: - permit to recompute a subset of changesets - update changeset list - add a few more cases	2015-03-06 11:47:13 +01:00
Benoit Jacob	4ab01f7c21	slightly increase tolerance to clock speed variation	2015-03-05 14:41:16 -05:00
Benoit Jacob	5db2baa573	Make benchmark-blocking-sizes detect changes to clock speed and be resilient to that.	2015-03-05 13:44:20 -05:00
Benoit Jacob	2231b3dece	output to cout, not cerr, the actual results	2015-03-04 09:45:12 -05:00
Benoit Jacob	00ea121881	Complete the tool to analyze the efficiency of default sizes.	2015-03-04 09:30:56 -05:00
Benoit Jacob	f64b4480af	Add missing copyright notices	2015-03-03 11:43:56 -05:00
Benoit Jacob	eae8e27b7d	Add a benchmark-default-sizes action to benchmark-blocking-sizes.cpp	2015-03-03 11:41:21 -05:00
Benoit Jacob	9930e9583b	Improve analyze-blocking-sizes, and in particular give it a evaluate-defaults tool that shows the efficiency of Eigen's default blocking sizes choices, using a previously computed table from benchmark-blocking-sizes.	2015-03-02 18:08:38 -05:00
Gael Guennebaud	2e9cb06a87	Update changeset list to be checked by perf_monitoring/gemm.	2015-02-26 16:13:33 +01:00
Gael Guennebaud	a46061ab7b	Make perf_monitoring/gemm script more flexible: - skip existing dataset - add a "-up" option to recompute the dataset (see script header) - allow to specify a filename prefix	2015-02-26 16:12:58 +01:00
Benoit Jacob	488874781b	Add analyze-blocking-sizes program under bench/ to analyze multiple logs generated by benchmark-blocking-sizes.	2015-02-23 14:02:29 -05:00
Benoit Jacob	458cf91cd9	Add benchmark-blocking-sizes.cpp to bench/ per mailing list discussion.	2015-02-20 17:08:04 -05:00
Gael Guennebaud	03ec601ff7	Initial version of a small script to help tracking performance regressions	2015-02-20 19:20:34 +01:00
Gael Guennebaud	333b497383	update bench_gemm	2015-02-20 11:59:49 +01:00
Benoit Steiner	c739102ef9	Pulled the latest changes from the trunk	2015-02-06 05:25:03 -08:00
Benoit Steiner	46fc881e4a	Added a few benchmarks for the tensor code	2015-01-26 17:46:40 -08:00
Benoit Steiner	7acd38d19e	Created some benchmarks for the tensor code	2014-10-17 09:49:03 -07:00
Konstantinos Margaritis	60e093a9dc	Merged eigen/eigen into default	2014-09-21 14:02:51 +03:00
Konstantinos Margaritis	7ff266e3ce	Initial VSX commit	2014-08-29 20:03:49 +00:00
Gael Guennebaud	57f71a5552	Update bench_norm utility	2014-09-11 10:27:46 +02:00
Chen-Pang He	1eefa5a841	Find benchmark opponents also in /usr/lib64	2014-07-07 22:55:28 +08:00
Chen-Pang He	e4b6979334	Find OpenBLAS more aggressively. This made a difference on Fedora 20	2014-07-07 21:32:33 +08:00
Florian George	f56d452c7e	Enable atv in Blaze Benchmark	2014-05-04 17:07:17 +02:00
Florian George	af79b158a1	Use trans(X) instead of X.transpose() in Blaze Benchmark	2014-05-04 17:06:34 +02:00
Gael Guennebaud	2fb64578aa	Add a small benchmark to compare dense solvers for small to large problems.	2014-04-28 16:16:29 +02:00
Gael Guennebaud	c354bd47f7	Make our gemm bench a little more powerful.	2014-04-17 21:03:26 +02:00
Gael Guennebaud	9777a5ca60	Various minor fixes in BTL	2014-04-17 21:01:45 +02:00
Benoit Steiner	aecc78325a	Pulled the latest updates from the eigen trunk.	2014-04-01 22:07:05 -07:00
Florian George	56c4851323	Fixed typo: symmretric -> symmetric	2014-04-01 15:52:25 +02:00
Gael Guennebaud	1221dd90aa	Fix no newline at end of file warning	2014-04-01 11:21:14 +02:00
Gael Guennebaud	93870d95b7	BTL: add blaze	2014-03-31 10:59:55 +02:00
Gael Guennebaud	f603823ef3	BTL: fix warnings and extend to 5k matrices, update GotoBlas to OpenBlas, etc.	2014-03-31 10:58:30 +02:00
Gael Guennebaud	33ca9b4ee6	Add support for OSX in BTL and fix a few warnings	2014-03-07 23:11:38 +01:00
Gael Guennebaud	c0e08e9e4b	fix stable norm benchmark	2014-02-13 15:53:51 +01:00
Gael Guennebaud	05c9be65ce	Fix bug #595 : typo	2013-06-10 13:10:36 +02:00
Christoph Hertzberg	6300e8ca02	replaced compiler specific __attribute__((noinline)) by EIGEN_DONT_INLINE	2012-12-17 16:55:14 +01:00
Jakob Schwendner	22e6741da9	updated geometry benchmark to handle additional cases	2012-12-17 09:33:22 +01:00
Jakob Schwendner	98798e904b	added benchmark for test vectorization in geometry package	2012-12-16 23:30:56 +01:00
Desire NUENTSA	15a9f6b9c1	Doc for sparseLU	2012-09-25 11:48:18 +02:00
Desire NUENTSA	45672e724e	Incomplete Cholesky preconditioner... not yet stable	2012-09-11 12:12:19 +02:00
Desire NUENTSA	2c99d84133	add SparseLU in sparse bench	2012-09-10 12:41:26 +02:00
Desire NUENTSA	fdd0f0c5fc	merge Sparse LU branch	2012-09-07 13:18:16 +02:00
Desire NUENTSA	063705b5be	Add tutorial for sparse solvers	2012-09-07 13:14:57 +02:00
Desire NUENTSA	2280f2490e	Init perf values	2012-09-04 12:21:07 +02:00
Desire NUENTSA	288e6aab14	Insert XSL styles into output XML files	2012-09-03 10:33:39 +02:00
Desire NUENTSA W.	fe9956defe	Read real and complex bench matrices from a unique folder Output and display bench results using XML and XSLT	2012-08-27 22:52:43 +02:00
Desire NUENTSA	4d3b7e2a13	Add support for Metis fill-reducing ordering ; it is generally more efficient than COLAMD ordering	2012-08-06 14:55:02 +02:00
Desire NUENTSA	7dc39b7037	Add unit tests	2012-08-03 13:05:45 +02:00

1 2 3 4 5 ...

383 Commits