Benoit Jacob
74a7c5caee
implement the idea that row-vectors have the RowMajorBit and col-vectors don't.
2010-03-09 00:16:07 -05:00
Adolfo Rodriguez Tsouroukdissian
5a36f4a8d1
Propagate all five matrix template parameters to members and temporaries of decomposition classes. One particular advantage of this is that decomposing matrices with max sizes known at compile time will not allocate.
...
NOTE: The ComplexEigenSolver class currently _does_ allocate (line 135 of Eigenvalues/ComplexEigenSolver.h), but the reason appears to be in the implementation of matrix-matrix products, and not in the decomposition itself.
The nomalloc unit test has been extended to verify that decompositions do not allocate when max sizes are specified. There are currently two workarounds to prevent the test from failing (see comments in test/nomalloc.cpp), both of which are related to matrix products that allocate on the stack.
2010-03-08 19:31:27 +01:00
Gael Guennebaud
898762529e
update the product selection logic to use the Max* sizes
2010-03-08 22:55:58 +01:00
Gael Guennebaud
9a3b00c040
add missing cmake directives for arch/Default
2010-03-08 22:15:35 +01:00
Benoit Jacob
89343a38af
* Fix #97 : Householder operations on 1x1 matrices
...
* Fix VectorBlock on 1x1 "vectors"
* remove useless makeTrivialHouseholder function
2010-03-08 12:37:04 -05:00
Benoit Jacob
4293a4d1af
* let a = foo() work when a is a row-vector xpr and foo() returns a ReturnByValue col-vector
...
* remove a few useless resize() in evalTo() implementations
2010-03-08 10:34:59 -05:00
Hauke Heibel
9fe040ad29
Reintroduced the if-clause for MSVC ei_ploadu via _loadu_.
2010-03-07 14:05:26 +01:00
Gael Guennebaud
3130b7a722
bugcount--, this time trmm
2010-03-06 22:57:50 +01:00
Gael Guennebaud
1958b7eccc
stride() => inner/outerStride()
2010-03-06 22:39:15 +01:00
Gael Guennebaud
4402034678
pff I introduced much too many bugs latey, count--
2010-03-06 22:24:50 +01:00
Gael Guennebaud
61ce1de048
fix symm
2010-03-06 22:15:59 +01:00
Gael Guennebaud
a7d199bf9a
fix trsolve
2010-03-06 21:37:14 +01:00
Gael Guennebaud
6f0b96dcf4
fix issue #100 (fix syrk)
2010-03-06 21:16:43 +01:00
Gael Guennebaud
271fc84e47
bugfix in gebp for 32bits x86
2010-03-06 20:52:20 +01:00
Benoit Jacob
c4f8afdf49
#undef minor at the right place
2010-03-06 14:44:57 -05:00
Benoit Jacob
7e2afe7e95
remove the __ARM_NEON__ check there since Konstantinos said he removed it but apparently didn't commit :)
2010-03-06 12:11:08 -05:00
Benoit Jacob
bf0a21a695
* disable static alignment on QCC
...
* remove obsolete #error
2010-03-06 09:28:58 -05:00
Benoit Jacob
2bd31d3fbc
* include Macros.h much earlier: since it takes care of the alignment platform detection, it is needed before we do the vectorization stuff in Eigen/Core !!
...
* kill EIGEN_DONT_ALIGN_HEAP option (one should use EIGEN_DONT_ALIGN)
* rename EIGEN_DONT_ALIGN_STACK to EIGEN_DONT_ALIGN_STATICALLY. hope it's a better name.
2010-03-06 09:05:15 -05:00
Benoit Jacob
f03d95348d
introduce EIGEN_DONT_ALIGN_STACK (disables alignment attributes) and EIGEN_DONT_ALIGN_HEAP (disables aligned malloc)...
...
you can still use EIGEN_DONT_ALIGN to do both at once.
2010-03-06 02:17:37 -05:00
Gael Guennebaud
afd7ee759b
fix copy pasted comment
2010-03-05 21:35:11 +01:00
Konstantinos Margaritis
273b236f72
Altivec brought up to date. Most tests pass and performance is better than before too!
2010-03-05 22:28:49 +02:00
Gael Guennebaud
f2a246c225
add a small program to bench all combinations of small products
2010-03-05 17:16:19 +01:00
Gael Guennebaud
5f172cd01f
add a FIXME
2010-03-05 10:45:29 +01:00
Gael Guennebaud
48d0595c29
* dynamically adjust the number of threads
...
* disbale parallelisation if we already are in a parallel session
2010-03-05 10:44:31 +01:00
Gael Guennebaud
62ac021606
fix openmp version for scalar types different than float
2010-03-05 10:16:25 +01:00
Gael Guennebaud
d13b877014
remove the 1D and 2D parallelizer, keep only the GEMM specialized one
2010-03-05 10:04:17 +01:00
Gael Guennebaud
24ef5fedcd
minor cleaning
2010-03-05 09:57:04 +01:00
Gael Guennebaud
279ad44509
merge
2010-03-05 09:46:58 +01:00
Gael Guennebaud
620bd28480
enable posix_memalign for QNX
2010-03-05 09:44:21 +01:00
Gael Guennebaud
7e2683dc39
merge
2010-03-04 18:59:56 +01:00
Gael Guennebaud
0964810fba
merge
2010-03-04 18:59:03 +01:00
Gael Guennebaud
ea8cad5151
make the number of registers easier to configure per architectures
2010-03-04 18:58:12 +01:00
Gael Guennebaud
cefd9b8888
merge with default branch
2010-03-04 18:47:52 +01:00
Hauke Heibel
1723068694
Moved x()/y()/z() and w() access functions to DenseBase; they are now available for Arrays as well.
2010-03-04 18:33:51 +01:00
Gael Guennebaud
8ed1ef4469
add a minor FIXME
2010-03-04 18:30:28 +01:00
Benoit Jacob
68d94d914e
integer division is vectorizable on no SIMD platform, not just SSE.
2010-03-04 09:03:06 -05:00
Konstantinos Margaritis
710bc073a7
arm_neon.h is a standard header file, fixed
2010-03-03 12:15:34 -06:00
Benoit Jacob
6c89fd4df0
minor cleanup
2010-03-03 13:16:21 -05:00
Gael Guennebaud
7dd81aad74
factorize default performance related settings to a single file
...
included after the architecture specific files such that they
can be adapted by each platform.
2010-03-03 18:47:58 +01:00
Konstantinos Margaritis
112c550b4a
Added initial NEON support, most tests pass however we had to use some hackish workarounds
...
as gcc on ARM (both CodeSourcery 4.4.1 used and experimental 4.5) fail to
ensure proper alignment with __attribute__((aligned(16))). This has to be
fixed upstream to remove the workarounds.
2010-03-03 11:25:41 -06:00
Benoit Jacob
45d19afb18
cleanup/simplification in computation of matrix flags
2010-03-03 09:58:43 -05:00
Benoit Jacob
7dbe806711
merge
2010-03-03 09:55:46 -05:00
Benoit Jacob
6a92168915
Backed out changeset 2f3d685e0c
...
This was implementing deep changes that after discussion on the mailing list seem to need further discussion/thinking.
2010-03-03 09:54:50 -05:00
Hauke Heibel
aa6570c3a3
Added a missing inline hints.
...
Removed a useless Nested temporary.
2010-03-03 15:24:58 +01:00
Gael Guennebaud
2f3d685e0c
a matrix (or array) does not always have the LinearAccessBit!
...
=> fixes in outerStride and matrix flags
2010-03-02 15:31:39 +01:00
Gael Guennebaud
7fd6458fec
selfadjoint: do not reference the imaginary part of the diagonal
2010-03-02 12:43:55 +01:00
Hauke Heibel
32823caa62
Adapted the comment and removed it from the public dox.
2010-03-03 07:52:19 +01:00
Gael Guennebaud
3295c1c3e6
product selector: the symmetric case
2010-03-02 23:18:13 +01:00
Hauke Heibel
afad108b5f
Added a comment to prevent placing an EIGEN_STRONG_INLINE where it makes no sense.
2010-03-02 19:36:21 +01:00
Eamon Nerbonne
3efb3cc828
Changed product type selector to fix perf regression.
2010-03-02 12:08:49 +01:00