Gael Guennebaud
|
0212eec23f
|
simplify and optimize block sizes computation for matrix products. They
are now automatically computed from the L1 and L2 cache sizes which are
themselves automatically determined at runtime.
|
2010-06-21 23:28:50 +02:00 |
|
Gael Guennebaud
|
4cd38b333c
|
make bench_gemm print out the queried cache sizes
|
2010-06-21 12:07:05 +02:00 |
|
Gael Guennebaud
|
6db6e358f5
|
add the possibility to set the cache size at runtime
|
2010-06-18 23:25:57 +02:00 |
|
Gael Guennebaud
|
5b192930b6
|
add runtime API to control multithreading
|
2010-06-10 23:30:15 +02:00 |
|
Gael Guennebaud
|
c442208358
|
clean a bit the bench_gemm files
|
2010-03-05 11:35:43 +01:00 |
|
Gael Guennebaud
|
24ef5fedcd
|
minor cleaning
|
2010-03-05 09:57:04 +01:00 |
|
Gael Guennebaud
|
1710c07f63
|
remove Qt's atomic dependency, I don't know what I was doing wrong...
|
2010-03-01 13:09:47 +01:00 |
|
Gael Guennebaud
|
aeff3ff391
|
make Aron's idea work using Qt's atomic implementation for the synchronisation
|
2010-03-01 10:57:32 +01:00 |
|
Gael Guennebaud
|
ac425090f3
|
BTL: allow to bench real time
|
2010-02-26 14:57:49 +01:00 |
|
Gael Guennebaud
|
3ac2b96a2f
|
implement a smarter parallelization strategy for gemm avoiding multiple
paking of the same data
|
2010-02-26 12:32:00 +01:00 |
|
Gael Guennebaud
|
eb905500b6
|
significant speedup in the matrix-matrix products
|
2010-02-23 13:06:49 +01:00 |
|
Gael Guennebaud
|
d579d4cc37
|
oops
|
2010-02-22 17:57:15 +01:00 |
|
Gael Guennebaud
|
aaaf855a88
|
add a small benchmark to quickly bench/compare SMP support
|
2010-02-22 11:09:57 +01:00 |
|