Benoit Steiner
|
79b900375f
|
Use index list for the striding benchmarks
|
2016-04-21 11:58:27 -07:00 |
|
Gael Guennebaud
|
d8a3bdaa24
|
remove useless include
|
2016-04-14 15:18:56 +02:00 |
|
Benoit Steiner
|
eaeb6ca93a
|
Enable the benchmarks for algebraic and transcendental fnctions on fp16.
|
2016-04-12 16:29:00 -07:00 |
|
Benoit Steiner
|
53121c0119
|
Turned on the contraction benchmarks for fp16
|
2016-04-12 14:11:52 -07:00 |
|
Benoit Steiner
|
63102ee43d
|
Turn on the coeffWise benchmarks on fp16
|
2016-04-07 23:05:20 -07:00 |
|
Benoit Steiner
|
7c47d3e663
|
Fixed the type casting benchmarks for fp16
|
2016-04-07 22:50:25 -07:00 |
|
Benoit Steiner
|
a6d08be9b2
|
Fixed the benchmarking of fp16 coefficient wise operations
|
2016-04-07 17:13:44 -07:00 |
|
Benoit Steiner
|
0968e925a0
|
Updated the benchmarking code to use Eigen::half instead of half
|
2016-03-24 18:00:33 -07:00 |
|
Benoit Steiner
|
7168afde5e
|
Made the tensor benchmarks compile on MacOS
|
2016-03-23 14:21:04 -07:00 |
|
Christoph Hertzberg
|
b224771f40
|
bug #1178: Simplified modification of the SSE control register for better portability
|
2016-03-20 10:57:08 +01:00 |
|
Benoit Steiner
|
56a3ada670
|
Added benchmarks for full reduction
|
2016-02-29 14:57:52 -08:00 |
|
Benoit Steiner
|
1031b31571
|
Improved the README
|
2016-02-27 20:22:04 +00:00 |
|
Benoit Steiner
|
93485d86bc
|
Added benchmarks for type casting of float16
|
2016-02-26 12:24:58 -08:00 |
|
Benoit Steiner
|
002824e32d
|
Added benchmarks for fp16
|
2016-02-26 12:21:25 -08:00 |
|
Benoit Steiner
|
8cb9bfab87
|
Extended the tensor benchmark suite to support types other than floats
|
2016-02-23 05:28:02 +00:00 |
|
Benoit Steiner
|
f442a5a5b3
|
Updated the tensor benchmarking code to work with compilers that don't support cxx11.
|
2016-02-23 04:15:48 +00:00 |
|
Gael Guennebaud
|
485823b5f5
|
Add COD and BDCSVD in list of benched solvers.
|
2016-02-19 23:00:33 +01:00 |
|
Benoit Steiner
|
4281eb1e2c
|
Added 2 benchmarks to the suite of tensor benchmarks running on GPU
|
2016-01-30 10:20:43 -08:00 |
|
Benoit Steiner
|
e4f83bae5d
|
Fixed the tensor benchmarks on apple devices
|
2016-01-28 21:08:07 -08:00 |
|
Benoit Steiner
|
10bea90c4a
|
Fixed clang related compilation error
|
2016-01-28 20:52:08 -08:00 |
|
Benoit Steiner
|
211d350fc3
|
Fixed a typo
|
2016-01-28 17:13:04 -08:00 |
|
Benoit Steiner
|
bd2e5a788a
|
Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.
|
2016-01-28 17:10:40 -08:00 |
|
Benoit Steiner
|
120e13b1b6
|
Added a readme to explain how to compile the tensor benchmarks.
|
2016-01-28 17:06:00 -08:00 |
|
Benoit Steiner
|
a68864b6bc
|
Updated the benchmarking code to print the number of flops processed instead of the number of bytes.
|
2016-01-28 16:51:40 -08:00 |
|
Benoit Steiner
|
c8d5f21941
|
Added extra tensor benchmarks
|
2016-01-28 16:20:36 -08:00 |
|
Yangqing Jia
|
270c4e1ecd
|
bugfix
|
2016-01-28 11:11:45 -08:00 |
|
Yangqing Jia
|
c4e47630b1
|
benchmark modifications to make it compilable in a standalone fashion.
|
2016-01-28 10:35:14 -08:00 |
|
Gael Guennebaud
|
4d708457d0
|
Increase axpy vector size
|
2015-12-11 23:07:22 +01:00 |
|
Gael Guennebaud
|
274b2272b7
|
Make bench_gemm compatible with 3.2
|
2015-12-01 09:57:31 +01:00 |
|
Gael Guennebaud
|
6fcd316f23
|
Extend superlu cmake script to check version
|
2015-11-30 14:48:11 +01:00 |
|
Gael Guennebaud
|
0ff127e896
|
Preserve CMAKE_CXX_FLAGS in BTL
|
2015-11-27 10:18:39 +01:00 |
|
Gael Guennebaud
|
13294b5152
|
Unify gemm and lazy_gemm benchmarks
|
2015-10-07 16:06:48 +02:00 |
|
Gael Guennebaud
|
247259f805
|
Add a perfromance regression benchmark for lazyProduct
|
2015-10-07 15:51:06 +02:00 |
|
Gael Guennebaud
|
c6eb17cbe9
|
Add helper routines to help bypassing some compiler otpimization when benchmarking
|
2015-10-07 15:50:42 +02:00 |
|
Gael Guennebaud
|
913a61870d
|
Update utility for experimenting with 3x3 eigenvalues
|
2015-06-08 15:54:53 +02:00 |
|
Gael Guennebaud
|
acc761cf0c
|
Merged in FlorianGeorge/eigen_blaze_fork_2 (pull request PR-60)
Use trans(X) instead of X.transpose() in Blaze Benchmark
|
2015-06-04 09:15:22 +02:00 |
|
Benoit Jacob
|
dc04f12967
|
use unsigned short instead of uint16_t which doesn't exist in c++98
|
2015-03-17 10:31:45 -04:00 |
|
Benoit Jacob
|
ca5c12587b
|
Polish lookup tables generation
|
2015-03-15 18:05:53 -04:00 |
|
Benoit Jacob
|
b6b88c0808
|
update perf_monitoring/gemm/changesets.txt
|
2015-03-13 14:57:05 -07:00 |
|
Benoit Jacob
|
d73ccd717e
|
Add support for dumping blocking sizes tables
|
2015-03-13 10:36:01 -07:00 |
|
Benoit Jacob
|
f2c3e2b10f
|
Add --only-cubic-sizes option to analyze-blocking-sizes tool
|
2015-03-12 13:16:33 -07:00 |
|
Benoit Jacob
|
39228cb224
|
deserialization assumed benchmarks in same order, but we shuffle them.
|
2015-03-06 19:29:01 -05:00 |
|
Benoit Jacob
|
a4f956b1da
|
merge
|
2015-03-06 19:13:36 -05:00 |
|
Benoit Jacob
|
19bf13aa62
|
Automatically serialize partial results to disk, reboot, and resume, when timings are getting bad
|
2015-03-06 19:11:50 -05:00 |
|
Gael Guennebaud
|
4c8eeeaed6
|
update gemm changeset list
|
2015-03-06 15:08:20 +01:00 |
|
Gael Guennebaud
|
eedd5063fd
|
Update gemm performance monitoring tool:
- permit to recompute a subset of changesets
- update changeset list
- add a few more cases
|
2015-03-06 11:47:13 +01:00 |
|
Benoit Jacob
|
4ab01f7c21
|
slightly increase tolerance to clock speed variation
|
2015-03-05 14:41:16 -05:00 |
|
Benoit Jacob
|
5db2baa573
|
Make benchmark-blocking-sizes detect changes to clock speed and be resilient to that.
|
2015-03-05 13:44:20 -05:00 |
|
Benoit Jacob
|
2231b3dece
|
output to cout, not cerr, the actual results
|
2015-03-04 09:45:12 -05:00 |
|
Benoit Jacob
|
00ea121881
|
Complete the tool to analyze the efficiency of default sizes.
|
2015-03-04 09:30:56 -05:00 |
|