Gael Guennebaud
|
0cb4ba98e7
|
update wrt recent changes
|
2019-02-21 17:19:36 +01:00 |
|
Gael Guennebaud
|
902a7793f7
|
Add possibility to bench row-major lhs and rhs
|
2019-02-15 16:52:34 +01:00 |
|
Gael Guennebaud
|
b3c4344a68
|
bug #1676: workaround GCC's bug in c++17 mode.
|
2019-02-07 15:21:35 +01:00 |
|
Gael Guennebaud
|
efe02292a6
|
Add recent gemm related changesets and various cleanups in perf-monitoring
|
2019-01-29 11:53:47 +01:00 |
|
Gael Guennebaud
|
c64d5d3827
|
Bypass inline asm for non compatible compilers.
|
2019-01-23 23:43:13 +01:00 |
|
Gael Guennebaud
|
f20c991679
|
add changesets related to matrix product perf.
|
2018-12-13 10:33:29 +01:00 |
|
luz.paz
|
f67b19a884
|
[PATCH 1/2] Misc. typos
From 68d431b4c14ad60a778ee93c1f59ecc4b931950e Mon Sep 17 00:00:00 2001
Found via `codespell -q 3 -I ../eigen-word-whitelist.txt` where the whitelists consists of:
```
als
ans
cas
dum
lastr
lowd
nd
overfl
pres
preverse
substraction
te
uint
whch
```
---
CMakeLists.txt | 26 +++++++++----------
Eigen/src/Core/GenericPacketMath.h | 2 +-
Eigen/src/SparseLU/SparseLU.h | 2 +-
bench/bench_norm.cpp | 2 +-
doc/HiPerformance.dox | 2 +-
doc/QuickStartGuide.dox | 2 +-
.../Eigen/CXX11/src/Tensor/TensorChipping.h | 6 ++---
.../Eigen/CXX11/src/Tensor/TensorDeviceGpu.h | 2 +-
.../src/Tensor/TensorForwardDeclarations.h | 4 +--
.../src/Tensor/TensorGpuHipCudaDefines.h | 2 +-
.../Eigen/CXX11/src/Tensor/TensorReduction.h | 2 +-
.../CXX11/src/Tensor/TensorReductionGpu.h | 2 +-
.../test/cxx11_tensor_concatenation.cpp | 2 +-
unsupported/test/cxx11_tensor_executor.cpp | 2 +-
14 files changed, 29 insertions(+), 29 deletions(-)
|
2018-09-18 04:15:01 -04:00 |
|
Gael Guennebaud
|
995730fc6c
|
Add option to disable plot generation
|
2018-11-07 00:41:16 +01:00 |
|
Gael Guennebaud
|
8a5955a052
|
Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.
|
2018-07-11 17:16:50 +02:00 |
|
luz.paz
|
e3912f5e63
|
Misc. source and comment typos
Found using `codespell` and `grep` from downstream FreeCAD
|
2018-03-11 10:01:44 -04:00 |
|
Gael Guennebaud
|
31e0bda2e3
|
Fix cmake warning
|
2017-12-14 15:48:27 +01:00 |
|
Gael Guennebaud
|
0f83aeb6b2
|
Improve cmake scripts for Pastix and BLAS detection.
|
2017-04-14 10:22:12 +02:00 |
|
Mehdi Goli
|
f499fe9496
|
Adding synchronisation to convolution kernel for sycl backend.
|
2017-03-13 09:18:37 +00:00 |
|
Mehdi Goli
|
aadb7405a7
|
Fixing typo in sycl Benchmark.
|
2017-03-08 18:20:06 +00:00 |
|
Mehdi Goli
|
5e9a1e7a7a
|
Adding sycl Benchmarks.
|
2017-03-08 14:17:48 +00:00 |
|
Benoit Steiner
|
fbc39fd02c
|
Merge latest changes from upstream
|
2017-01-30 15:25:57 -08:00 |
|
Gael Guennebaud
|
45b289505c
|
Add debug output
|
2017-01-03 11:31:02 +01:00 |
|
Gael Guennebaud
|
5838f078a7
|
Fix inclusion
|
2017-01-03 11:30:27 +01:00 |
|
Benoit Steiner
|
3eda02d78d
|
Fixed the sycl benchmarking code
|
2016-12-22 10:37:05 -08:00 |
|
Gael Guennebaud
|
747202d338
|
typo
|
2016-12-08 12:48:15 +01:00 |
|
Gael Guennebaud
|
bb297abb9e
|
make sure we use the right eigen version
|
2016-12-08 12:00:11 +01:00 |
|
Gael Guennebaud
|
8b4b00d277
|
fix usage of custom compiler
|
2016-12-08 11:59:39 +01:00 |
|
Gael Guennebaud
|
7105596899
|
Add missing include and use -O3
|
2016-12-07 16:56:08 +01:00 |
|
Gael Guennebaud
|
780f3c1adf
|
Fix call to convert on linux
|
2016-12-07 16:30:11 +01:00 |
|
Gael Guennebaud
|
3855ab472f
|
Cleanup file structure
|
2016-12-07 14:23:49 +01:00 |
|
Gael Guennebaud
|
59a59fa8e7
|
Update perf monitoring scripts to generate html/svg outputs
|
2016-12-07 13:36:56 +01:00 |
|
Gael Guennebaud
|
1b4e085a7f
|
generate png file for web upload
|
2016-12-06 16:46:22 +01:00 |
|
Gael Guennebaud
|
f90c4aebc5
|
Update monitored changeset lists
|
2016-12-06 15:07:46 +01:00 |
|
Gael Guennebaud
|
c68c8631e7
|
fix compilation of BTL's blaze interface
|
2016-12-05 23:02:16 +01:00 |
|
Gael Guennebaud
|
1ff1d4a124
|
Add performance monitoring for LLT
|
2016-12-05 23:01:52 +01:00 |
|
Gael Guennebaud
|
445c015751
|
extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors.
|
2016-12-05 13:36:26 +01:00 |
|
Gael Guennebaud
|
4c0d5f3c01
|
Add perf monitoring for gemv
|
2016-12-02 11:34:12 +01:00 |
|
Gael Guennebaud
|
d2718d662c
|
Re-enable A^T*A action in BTL
|
2016-12-02 11:32:03 +01:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Christoph Hertzberg
|
4b377715d7
|
Do not manually add absolute path to boost-library.
Also set C++ standard for blaze to C++14
|
2016-09-22 00:10:47 +02:00 |
|
Luke Iwanski
|
cb81975714
|
Partial OpenCL support via SYCL compatible with ComputeCpp CE.
|
2016-09-19 12:44:13 +01:00 |
|
Gael Guennebaud
|
5fbe7aa604
|
Update and fix Cholesky mini benchmark
|
2016-07-28 11:26:30 +02:00 |
|
Gael Guennebaud
|
9b76be9d21
|
Update benchmark for dense solver to stress least-squares pb, and to output a HTML table
|
2016-07-21 12:30:53 +02:00 |
|
Gael Guennebaud
|
75e80792cc
|
Update relevant list of changesets.
|
2016-07-04 14:32:34 +02:00 |
|
Gael Guennebaud
|
dacc544b84
|
asm escape was not strong enough to prevent too aggressive compiler optimization let's fallback to no-inline.
|
2016-07-04 14:32:15 +02:00 |
|
Gael Guennebaud
|
b74e45906c
|
Few fixes in perf-monitoring.
|
2016-07-04 14:30:50 +02:00 |
|
Gael Guennebaud
|
e2b3836326
|
Include recent changesets that played with product's kernel
|
2016-06-09 17:13:33 +02:00 |
|
Benoit Steiner
|
457204cb83
|
Updated the README file for the tensor benchmarks
|
2016-05-25 16:13:41 -07:00 |
|
Benoit Steiner
|
034aa3b2c0
|
Improved the performance of tensor padding
|
2016-05-25 11:43:08 -07:00 |
|
Benoit Steiner
|
069a0b04d7
|
Added benchmarks for contraction on CPU.
|
2016-05-13 14:32:17 -07:00 |
|
Benoit Steiner
|
f81e413180
|
Added a benchmark to measure the performance of full reductions of 16 bit floats
|
2016-05-05 14:15:11 -07:00 |
|
Benoit Steiner
|
79b900375f
|
Use index list for the striding benchmarks
|
2016-04-21 11:58:27 -07:00 |
|
Gael Guennebaud
|
d8a3bdaa24
|
remove useless include
|
2016-04-14 15:18:56 +02:00 |
|
Benoit Steiner
|
eaeb6ca93a
|
Enable the benchmarks for algebraic and transcendental functions on fp16.
|
2016-04-12 16:29:00 -07:00 |
|
Benoit Steiner
|
53121c0119
|
Turned on the contraction benchmarks for fp16
|
2016-04-12 14:11:52 -07:00 |
|