Gael Guennebaud
|
3855ab472f
|
Cleanup file structure
|
2016-12-07 14:23:49 +01:00 |
|
Gael Guennebaud
|
59a59fa8e7
|
Update perf monitoring scripts to generate html/svg outputs
|
2016-12-07 13:36:56 +01:00 |
|
Gael Guennebaud
|
1b4e085a7f
|
generate png file for web upload
|
2016-12-06 16:46:22 +01:00 |
|
Gael Guennebaud
|
f90c4aebc5
|
Update monitored changeset lists
|
2016-12-06 15:07:46 +01:00 |
|
Gael Guennebaud
|
c68c8631e7
|
fix compilation of BTL's blaze interface
|
2016-12-05 23:02:16 +01:00 |
|
Gael Guennebaud
|
1ff1d4a124
|
Add performance monitoring for LLT
|
2016-12-05 23:01:52 +01:00 |
|
Gael Guennebaud
|
445c015751
|
extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors.
|
2016-12-05 13:36:26 +01:00 |
|
Gael Guennebaud
|
4c0d5f3c01
|
Add perf monitoring for gemv
|
2016-12-02 11:34:12 +01:00 |
|
Gael Guennebaud
|
d2718d662c
|
Re-enable A^T*A action in BTL
|
2016-12-02 11:32:03 +01:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Christoph Hertzberg
|
4b377715d7
|
Do not manually add absolute path to boost-library.
Also set C++ standard for blaze to C++14
|
2016-09-22 00:10:47 +02:00 |
|
Luke Iwanski
|
cb81975714
|
Partial OpenCL support via SYCL compatible with ComputeCpp CE.
|
2016-09-19 12:44:13 +01:00 |
|
Gael Guennebaud
|
5fbe7aa604
|
Update and fix Cholesky mini benchmark
|
2016-07-28 11:26:30 +02:00 |
|
Gael Guennebaud
|
9b76be9d21
|
Update benchmark for dense solver to stress least-squares pb, and to output a HTML table
|
2016-07-21 12:30:53 +02:00 |
|
Gael Guennebaud
|
75e80792cc
|
Update relevent list of changesets.
|
2016-07-04 14:32:34 +02:00 |
|
Gael Guennebaud
|
dacc544b84
|
asm escape was not strong enough to prevent too aggressive compiler optimization let's fallback to no-inline.
|
2016-07-04 14:32:15 +02:00 |
|
Gael Guennebaud
|
b74e45906c
|
Few fixes in perf-monitoring.
|
2016-07-04 14:30:50 +02:00 |
|
Gael Guennebaud
|
e2b3836326
|
Include recent changesets that played with product's kernel
|
2016-06-09 17:13:33 +02:00 |
|
Benoit Steiner
|
457204cb83
|
Updated the README file for the tensor benchmarks
|
2016-05-25 16:13:41 -07:00 |
|
Benoit Steiner
|
034aa3b2c0
|
Improved the performance of tensor padding
|
2016-05-25 11:43:08 -07:00 |
|
Benoit Steiner
|
069a0b04d7
|
Added benchmarks for contraction on CPU.
|
2016-05-13 14:32:17 -07:00 |
|
Benoit Steiner
|
f81e413180
|
Added a benchmark to measure the performance of full reductions of 16 bit floats
|
2016-05-05 14:15:11 -07:00 |
|
Benoit Steiner
|
79b900375f
|
Use index list for the striding benchmarks
|
2016-04-21 11:58:27 -07:00 |
|
Gael Guennebaud
|
d8a3bdaa24
|
remove useless include
|
2016-04-14 15:18:56 +02:00 |
|
Benoit Steiner
|
eaeb6ca93a
|
Enable the benchmarks for algebraic and transcendental fnctions on fp16.
|
2016-04-12 16:29:00 -07:00 |
|
Benoit Steiner
|
53121c0119
|
Turned on the contraction benchmarks for fp16
|
2016-04-12 14:11:52 -07:00 |
|
Benoit Steiner
|
63102ee43d
|
Turn on the coeffWise benchmarks on fp16
|
2016-04-07 23:05:20 -07:00 |
|
Benoit Steiner
|
7c47d3e663
|
Fixed the type casting benchmarks for fp16
|
2016-04-07 22:50:25 -07:00 |
|
Benoit Steiner
|
a6d08be9b2
|
Fixed the benchmarking of fp16 coefficient wise operations
|
2016-04-07 17:13:44 -07:00 |
|
Benoit Steiner
|
0968e925a0
|
Updated the benchmarking code to use Eigen::half instead of half
|
2016-03-24 18:00:33 -07:00 |
|
Benoit Steiner
|
7168afde5e
|
Made the tensor benchmarks compile on MacOS
|
2016-03-23 14:21:04 -07:00 |
|
Christoph Hertzberg
|
b224771f40
|
bug #1178: Simplified modification of the SSE control register for better portability
|
2016-03-20 10:57:08 +01:00 |
|
Benoit Steiner
|
56a3ada670
|
Added benchmarks for full reduction
|
2016-02-29 14:57:52 -08:00 |
|
Benoit Steiner
|
1031b31571
|
Improved the README
|
2016-02-27 20:22:04 +00:00 |
|
Benoit Steiner
|
93485d86bc
|
Added benchmarks for type casting of float16
|
2016-02-26 12:24:58 -08:00 |
|
Benoit Steiner
|
002824e32d
|
Added benchmarks for fp16
|
2016-02-26 12:21:25 -08:00 |
|
Benoit Steiner
|
8cb9bfab87
|
Extended the tensor benchmark suite to support types other than floats
|
2016-02-23 05:28:02 +00:00 |
|
Benoit Steiner
|
f442a5a5b3
|
Updated the tensor benchmarking code to work with compilers that don't support cxx11.
|
2016-02-23 04:15:48 +00:00 |
|
Gael Guennebaud
|
485823b5f5
|
Add COD and BDCSVD in list of benched solvers.
|
2016-02-19 23:00:33 +01:00 |
|
Benoit Steiner
|
4281eb1e2c
|
Added 2 benchmarks to the suite of tensor benchmarks running on GPU
|
2016-01-30 10:20:43 -08:00 |
|
Benoit Steiner
|
e4f83bae5d
|
Fixed the tensor benchmarks on apple devices
|
2016-01-28 21:08:07 -08:00 |
|
Benoit Steiner
|
10bea90c4a
|
Fixed clang related compilation error
|
2016-01-28 20:52:08 -08:00 |
|
Benoit Steiner
|
211d350fc3
|
Fixed a typo
|
2016-01-28 17:13:04 -08:00 |
|
Benoit Steiner
|
bd2e5a788a
|
Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.
|
2016-01-28 17:10:40 -08:00 |
|
Benoit Steiner
|
120e13b1b6
|
Added a readme to explain how to compile the tensor benchmarks.
|
2016-01-28 17:06:00 -08:00 |
|
Benoit Steiner
|
a68864b6bc
|
Updated the benchmarking code to print the number of flops processed instead of the number of bytes.
|
2016-01-28 16:51:40 -08:00 |
|
Benoit Steiner
|
c8d5f21941
|
Added extra tensor benchmarks
|
2016-01-28 16:20:36 -08:00 |
|
Yangqing Jia
|
270c4e1ecd
|
bugfix
|
2016-01-28 11:11:45 -08:00 |
|
Yangqing Jia
|
c4e47630b1
|
benchmark modifications to make it compilable in a standalone fashion.
|
2016-01-28 10:35:14 -08:00 |
|
Gael Guennebaud
|
4d708457d0
|
Increase axpy vector size
|
2015-12-11 23:07:22 +01:00 |
|