Mehdi Goli
|
f499fe9496
|
Adding synchronisation to convolution kernel for sycl backend.
|
2017-03-13 09:18:37 +00:00 |
|
Mehdi Goli
|
5e9a1e7a7a
|
Adding sycl Benchmarks.
|
2017-03-08 14:17:48 +00:00 |
|
Benoit Steiner
|
034aa3b2c0
|
Improved the performance of tensor padding
|
2016-05-25 11:43:08 -07:00 |
|
Benoit Steiner
|
f81e413180
|
Added a benchmark to measure the performance of full reductions of 16 bit floats
|
2016-05-05 14:15:11 -07:00 |
|
Benoit Steiner
|
79b900375f
|
Use index list for the striding benchmarks
|
2016-04-21 11:58:27 -07:00 |
|
Benoit Steiner
|
7c47d3e663
|
Fixed the type casting benchmarks for fp16
|
2016-04-07 22:50:25 -07:00 |
|
Benoit Steiner
|
a6d08be9b2
|
Fixed the benchmarking of fp16 coefficient wise operations
|
2016-04-07 17:13:44 -07:00 |
|
Benoit Steiner
|
7168afde5e
|
Made the tensor benchmarks compile on MacOS
|
2016-03-23 14:21:04 -07:00 |
|
Benoit Steiner
|
56a3ada670
|
Added benchmarks for full reduction
|
2016-02-29 14:57:52 -08:00 |
|
Benoit Steiner
|
93485d86bc
|
Added benchmarks for type casting of float16
|
2016-02-26 12:24:58 -08:00 |
|
Benoit Steiner
|
8cb9bfab87
|
Extended the tensor benchmark suite to support types other than floats
|
2016-02-23 05:28:02 +00:00 |
|
Benoit Steiner
|
f442a5a5b3
|
Updated the tensor benchmarking code to work with compilers that don't support cxx11.
|
2016-02-23 04:15:48 +00:00 |
|
Benoit Steiner
|
10bea90c4a
|
Fixed clang related compilation error
|
2016-01-28 20:52:08 -08:00 |
|
Benoit Steiner
|
bd2e5a788a
|
Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.
|
2016-01-28 17:10:40 -08:00 |
|
Benoit Steiner
|
a68864b6bc
|
Updated the benchmarking code to print the number of flops processed instead of the number of bytes.
|
2016-01-28 16:51:40 -08:00 |
|
Benoit Steiner
|
c8d5f21941
|
Added extra tensor benchmarks
|
2016-01-28 16:20:36 -08:00 |
|
Yangqing Jia
|
270c4e1ecd
|
bugfix
|
2016-01-28 11:11:45 -08:00 |
|
Yangqing Jia
|
c4e47630b1
|
benchmark modifications to make it compilable in a standalone fashion.
|
2016-01-28 10:35:14 -08:00 |
|
Benoit Steiner
|
46fc881e4a
|
Added a few benchmarks for the tensor code
|
2015-01-26 17:46:40 -08:00 |
|