eigen/bench
2010-03-05 18:11:54 +01:00
..
btl BTL: allow to bench real time 2010-02-26 14:57:49 +01:00
basicbench.cxxlist
basicbenchmark.cpp
basicbenchmark.h
bench_gemm.cpp clean a bit the bench_gemm files 2010-03-05 11:35:43 +01:00
bench_multi_compilers.sh
bench_norm.cpp re-implement stableNorm using a homemade blocky and 2009-07-17 16:22:39 +02:00
bench_reverse.cpp add bench_reverse, draft of a reverse vectorization for AltiVec, make 2009-02-06 13:28:55 +00:00
bench_sum.cpp
bench_unrolling provide default values for CXX, remove duplicate define 2010-02-22 15:39:17 +01:00
benchBlasGemm.cpp update product bench 2009-11-06 11:33:18 +01:00
benchCholesky.cpp s/cholesky/llt in precompiled lib and BTL 2009-02-06 14:01:01 +00:00
benchEigenSolver.cpp
benchFFT.cpp added benchmark for unscaled and half-spectrum FFTs 2010-01-21 21:09:26 -05:00
benchmark_suite provide default values for CXX, remove duplicate define 2010-02-22 15:39:17 +01:00
benchmark.cpp
benchmarkSlice.cpp
benchmarkX.cpp
benchmarkXcwise.cpp
BenchSparseUtil.h extend sparse product benchmark with ublas 2010-02-09 15:55:36 +01:00
BenchTimer.h clean #defined tokens, and use clock_gettime for the real time 2010-03-03 09:41:29 +01:00
BenchUtil.h
benchVecAdd.cpp the big memory changes. the most important changes are: 2009-01-08 15:20:21 +00:00
product_threshold.cpp Fixed line endings. 2010-03-05 18:11:54 +01:00
quat_slerp.cpp add a slerp benchmark (for accuracy and speed)) 2009-12-04 15:02:38 +01:00
README.txt
sparse_cholesky.cpp * the Upper->UpperTriangular change 2008-12-20 13:36:12 +00:00
sparse_dense_product.cpp implement __gnuc_forget_about_setZero_its_over_now 2009-09-18 15:36:05 +02:00
sparse_lu.cpp big huge changes, so i dont remember everything. 2009-10-28 18:19:29 -04:00
sparse_product.cpp extend sparse product benchmark with ublas 2010-02-09 15:55:36 +01:00
sparse_randomsetter.cpp sparse module: 2008-10-21 13:35:04 +00:00
sparse_setter.cpp extend the sparse matrix assembly benchmark 2009-10-07 14:25:53 +02:00
sparse_transpose.cpp
sparse_trisolver.cpp * the Upper->UpperTriangular change 2008-12-20 13:36:12 +00:00
vdw_new.cpp

This folder contains a couple of benchmark utities and Eigen benchmarks.

****************************
* bench_multi_compilers.sh *
****************************

This script allows to run a benchmark on a set of different compilers/compiler options.
It takes two arguments:
 - a file defining the list of the compilers with their options
 - the .cpp file of the benchmark

Examples:

$ ./bench_multi_compilers.sh basicbench.cxxlist basicbenchmark.cpp

    g++-4.1 -O3 -DNDEBUG -finline-limit=10000
    3d-3x3   /   4d-4x4   /   Xd-4x4   /   Xd-20x20   /
    0.271102   0.131416   0.422322   0.198633
    0.201658   0.102436   0.397566   0.207282

    g++-4.2 -O3 -DNDEBUG -finline-limit=10000
    3d-3x3   /   4d-4x4   /   Xd-4x4   /   Xd-20x20   /
    0.107805   0.0890579   0.30265   0.161843
    0.127157   0.0712581   0.278341   0.191029

    g++-4.3 -O3 -DNDEBUG -finline-limit=10000
    3d-3x3   /   4d-4x4   /   Xd-4x4   /   Xd-20x20   /
    0.134318   0.105291   0.3704   0.180966
    0.137703   0.0732472   0.31225   0.202204

    icpc -fast -DNDEBUG -fno-exceptions -no-inline-max-size
    3d-3x3   /   4d-4x4   /   Xd-4x4   /   Xd-20x20   /
    0.226145   0.0941319   0.371873   0.159433
    0.109302   0.0837538   0.328102   0.173891


$ ./bench_multi_compilers.sh ompbench.cxxlist ompbenchmark.cpp

    g++-4.2 -O3 -DNDEBUG -finline-limit=10000 -fopenmp
    double, fixed-size 4x4: 0.00165105s  0.0778739s
    double, 32x32: 0.0654769s 0.075289s  => x0.869674 (2)
    double, 128x128: 0.054148s 0.0419669s  => x1.29025 (2)
    double, 512x512: 0.913799s 0.428533s  => x2.13239 (2)
    double, 1024x1024: 14.5972s 9.3542s  => x1.5605 (2)

    icpc -fast -DNDEBUG -fno-exceptions -no-inline-max-size -openmp
    double, fixed-size 4x4: 0.000589848s  0.019949s
    double, 32x32: 0.0682781s 0.0449722s  => x1.51823 (2)
    double, 128x128: 0.0547509s 0.0435519s  => x1.25714 (2)
    double, 512x512: 0.829436s 0.424438s  => x1.9542 (2)
    double, 1024x1024: 14.5243s 10.7735s  => x1.34815 (2)