Commit Graph

361 Commits

Author SHA1 Message Date
Gael Guennebaud
0f83aeb6b2 Improve cmake scripts for Pastix and BLAS detection. 2017-04-14 10:22:12 +02:00
Mehdi Goli
f499fe9496 Adding synchronisation to convolution kernel for sycl backend. 2017-03-13 09:18:37 +00:00
Mehdi Goli
aadb7405a7 Fixing typo in sycl Benchmark. 2017-03-08 18:20:06 +00:00
Mehdi Goli
5e9a1e7a7a Adding sycl Benchmarks. 2017-03-08 14:17:48 +00:00
Benoit Steiner
fbc39fd02c Merge latest changes from upstream 2017-01-30 15:25:57 -08:00
Gael Guennebaud
45b289505c Add debug output 2017-01-03 11:31:02 +01:00
Gael Guennebaud
5838f078a7 Fix inclusion 2017-01-03 11:30:27 +01:00
Benoit Steiner
3eda02d78d Fixed the sycl benchmarking code 2016-12-22 10:37:05 -08:00
Gael Guennebaud
747202d338 typo 2016-12-08 12:48:15 +01:00
Gael Guennebaud
bb297abb9e make sure we use the right eigen version 2016-12-08 12:00:11 +01:00
Gael Guennebaud
8b4b00d277 fix usage of custom compiler 2016-12-08 11:59:39 +01:00
Gael Guennebaud
7105596899 Add missing include and use -O3 2016-12-07 16:56:08 +01:00
Gael Guennebaud
780f3c1adf Fix call to convert on linux 2016-12-07 16:30:11 +01:00
Gael Guennebaud
3855ab472f Cleanup file structure 2016-12-07 14:23:49 +01:00
Gael Guennebaud
59a59fa8e7 Update perf monitoring scripts to generate html/svg outputs 2016-12-07 13:36:56 +01:00
Gael Guennebaud
1b4e085a7f generate png file for web upload 2016-12-06 16:46:22 +01:00
Gael Guennebaud
f90c4aebc5 Update monitored changeset lists 2016-12-06 15:07:46 +01:00
Gael Guennebaud
c68c8631e7 fix compilation of BTL's blaze interface 2016-12-05 23:02:16 +01:00
Gael Guennebaud
1ff1d4a124 Add performance monitoring for LLT 2016-12-05 23:01:52 +01:00
Gael Guennebaud
445c015751 extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors. 2016-12-05 13:36:26 +01:00
Gael Guennebaud
4c0d5f3c01 Add perf monitoring for gemv 2016-12-02 11:34:12 +01:00
Gael Guennebaud
d2718d662c Re-enable A^T*A action in BTL 2016-12-02 11:32:03 +01:00
Benoit Steiner
ae1385c7e4 Pull the latest updates from trunk 2016-10-05 14:54:36 -07:00
Christoph Hertzberg
4b377715d7 Do not manually add absolute path to boost-library.
Also set C++ standard for blaze to C++14
2016-09-22 00:10:47 +02:00
Luke Iwanski
cb81975714 Partial OpenCL support via SYCL compatible with ComputeCpp CE. 2016-09-19 12:44:13 +01:00
Gael Guennebaud
5fbe7aa604 Update and fix Cholesky mini benchmark 2016-07-28 11:26:30 +02:00
Gael Guennebaud
9b76be9d21 Update benchmark for dense solver to stress least-squares pb, and to output a HTML table 2016-07-21 12:30:53 +02:00
Gael Guennebaud
75e80792cc Update relevent list of changesets. 2016-07-04 14:32:34 +02:00
Gael Guennebaud
dacc544b84 asm escape was not strong enough to prevent too aggressive compiler optimization let's fallback to no-inline. 2016-07-04 14:32:15 +02:00
Gael Guennebaud
b74e45906c Few fixes in perf-monitoring. 2016-07-04 14:30:50 +02:00
Gael Guennebaud
e2b3836326 Include recent changesets that played with product's kernel 2016-06-09 17:13:33 +02:00
Benoit Steiner
457204cb83 Updated the README file for the tensor benchmarks 2016-05-25 16:13:41 -07:00
Benoit Steiner
034aa3b2c0 Improved the performance of tensor padding 2016-05-25 11:43:08 -07:00
Benoit Steiner
069a0b04d7 Added benchmarks for contraction on CPU. 2016-05-13 14:32:17 -07:00
Benoit Steiner
f81e413180 Added a benchmark to measure the performance of full reductions of 16 bit floats 2016-05-05 14:15:11 -07:00
Benoit Steiner
79b900375f Use index list for the striding benchmarks 2016-04-21 11:58:27 -07:00
Gael Guennebaud
d8a3bdaa24 remove useless include 2016-04-14 15:18:56 +02:00
Benoit Steiner
eaeb6ca93a Enable the benchmarks for algebraic and transcendental fnctions on fp16. 2016-04-12 16:29:00 -07:00
Benoit Steiner
53121c0119 Turned on the contraction benchmarks for fp16 2016-04-12 14:11:52 -07:00
Benoit Steiner
63102ee43d Turn on the coeffWise benchmarks on fp16 2016-04-07 23:05:20 -07:00
Benoit Steiner
7c47d3e663 Fixed the type casting benchmarks for fp16 2016-04-07 22:50:25 -07:00
Benoit Steiner
a6d08be9b2 Fixed the benchmarking of fp16 coefficient wise operations 2016-04-07 17:13:44 -07:00
Benoit Steiner
0968e925a0 Updated the benchmarking code to use Eigen::half instead of half 2016-03-24 18:00:33 -07:00
Benoit Steiner
7168afde5e Made the tensor benchmarks compile on MacOS 2016-03-23 14:21:04 -07:00
Christoph Hertzberg
b224771f40 bug #1178: Simplified modification of the SSE control register for better portability 2016-03-20 10:57:08 +01:00
Benoit Steiner
56a3ada670 Added benchmarks for full reduction 2016-02-29 14:57:52 -08:00
Benoit Steiner
1031b31571 Improved the README 2016-02-27 20:22:04 +00:00
Benoit Steiner
93485d86bc Added benchmarks for type casting of float16 2016-02-26 12:24:58 -08:00
Benoit Steiner
002824e32d Added benchmarks for fp16 2016-02-26 12:21:25 -08:00
Benoit Steiner
8cb9bfab87 Extended the tensor benchmark suite to support types other than floats 2016-02-23 05:28:02 +00:00