Benoit Steiner
|
bfdd9f3ac9
|
Made the blocking computation aware of the l3 cache
Also optimized the blocking parameters to take into account the number of threads used for a computation
|
2014-10-15 15:32:59 -07:00 |
|
Gael Guennebaud
|
c566cfe2ba
|
Make SVD unit test even more tough
|
2014-10-15 23:37:47 +02:00 |
|
Benoit Steiner
|
dba55041ab
|
Added support for promises
Started to improve multithreaded contractions
|
2014-10-15 11:20:36 -07:00 |
|
Gael Guennebaud
|
fd1aaf4772
|
merge
|
2014-10-15 16:33:14 +02:00 |
|
Gael Guennebaud
|
c806009453
|
Extend svd unit tests to stress problems with duplicated singular values.
|
2014-10-15 16:32:16 +02:00 |
|
Gael Guennebaud
|
2cc41dbe83
|
D&C SVD: fix some numerical issues by truly skipping deflated singular values when computing them
|
2014-10-15 15:21:12 +02:00 |
|
Gael Guennebaud
|
c26e8a1af3
|
D&C SVD: fix deflation of repeated singular values, fix sorting of singular values, fix case of complete deflation
|
2014-10-15 11:59:21 +02:00 |
|
Christoph Hertzberg
|
0ec1fc9e11
|
bug #891: Determine sizeof(void*) via CMAKE variable instead of test program
|
2014-10-14 14:14:25 +02:00 |
|
Benoit Steiner
|
99d75235a9
|
Misc improvements and cleanups
|
2014-10-13 17:02:09 -07:00 |
|
Benoit Steiner
|
4c70b0a762
|
Added support for patch extraction
|
2014-10-13 10:04:04 -07:00 |
|
Christoph Hertzberg
|
d3f52debc6
|
Make cuda_basic test compile again by adding lots of EIGEN_DEVICE_FUNC.
Although the test passes now, there might still be some missing.
|
2014-10-13 17:18:26 +02:00 |
|
Benoit Steiner
|
0219f8aed4
|
Added ability to print a tensor using an iostream.
|
2014-10-10 16:17:26 -07:00 |
|
Benoit Steiner
|
2ed1838aeb
|
Added support for tensor chips
|
2014-10-10 16:11:27 -07:00 |
|
Benoit Steiner
|
4b36c3591f
|
Fixed the tensor shuffling test
|
2014-10-10 15:43:21 -07:00 |
|
Benoit Steiner
|
a991f94c0e
|
Fixed the thread pool test
|
2014-10-10 15:20:37 -07:00 |
|
Benoit Steiner
|
498b7eed25
|
Rewrote the TensorBase::random method to support the generation of random number on gpu.
|
2014-10-09 15:39:13 -07:00 |
|
Benoit Steiner
|
767424af18
|
Improved the functors defined for standard reductions
Added a functor to encapsulate the generation of random numbers on cpu and gpu.
|
2014-10-09 15:36:23 -07:00 |
|
Gael Guennebaud
|
a80e17cfe8
|
Remove unused and dangerous CompressedStorage::Map function
|
2014-10-09 23:42:33 +02:00 |
|
Gael Guennebaud
|
349c2c9235
|
bug #367: fix double copies in atWithInsertion, and add respective unit-test
|
2014-10-09 23:35:49 +02:00 |
|
Gael Guennebaud
|
48d537f59f
|
Fix indentation
|
2014-10-09 23:35:26 +02:00 |
|
Gael Guennebaud
|
538c059aa4
|
bug #887: fix CompressedStorage::reallocate wrt memory leaks
|
2014-10-09 23:35:05 +02:00 |
|
Gael Guennebaud
|
a48b82eece
|
Add a scoped_array helper class to handle locally allocated/used arrays
|
2014-10-09 23:34:05 +02:00 |
|
Gael Guennebaud
|
ccd70ba123
|
Various numerical fixes in D&C SVD: I cannot make it fail with double, but still need to tune for single precision, and carefully test with duplicated singular values
|
2014-10-09 23:29:01 +02:00 |
|
Benoit Steiner
|
44beee9d68
|
Removed dead code
|
2014-10-08 14:14:20 -07:00 |
|
Benoit Steiner
|
0a07ac574e
|
Added support for the *= and /* operators to TensorBase
|
2014-10-08 13:32:41 -07:00 |
|
Benoit Steiner
|
6c047d398d
|
Fixed a comment
|
2014-10-08 13:29:36 -07:00 |
|
Gael Guennebaud
|
4b886e6b39
|
bug #889: fix protected typedef
|
2014-10-08 07:48:30 +02:00 |
|
Gael Guennebaud
|
5741349294
|
bug #882: fix various const-correctness issues with *View classes.
|
2014-10-07 18:29:28 +02:00 |
|
Gael Guennebaud
|
118b1113d9
|
Workaround MSVC issue.
|
2014-10-07 09:53:39 +02:00 |
|
Gael Guennebaud
|
503c176d8e
|
Fix missing outer() member in DynamicSparseMatrix
|
2014-10-07 09:53:27 +02:00 |
|
Gael Guennebaud
|
dbdd8b0883
|
D&C SVD: add scaling to avoid overflow, fix handling of fixed size matrices
|
2014-10-06 19:35:57 +02:00 |
|
Gael Guennebaud
|
d44d432baa
|
Re-enable products with triangular views of sparse matrices: we simply have to treat them as a sparse matrix.
|
2014-10-06 16:11:26 +02:00 |
|
Gael Guennebaud
|
893bfcf95f
|
bug #887: use ei_declare_aligned_stack_constructed_variable instead of manual new[]/delete[] pairs in AMD and Paralellizer
|
2014-10-06 11:54:30 +02:00 |
|
Gael Guennebaud
|
fb53ff1eda
|
Fix SparseLU regarding uncompressed inputs and avoid manual new/delete calls.
|
2014-10-06 11:42:31 +02:00 |
|
Gael Guennebaud
|
7a17639953
|
Extend unit tests to check uncompressed sparse inputs in sparse solvers
|
2014-10-06 11:41:50 +02:00 |
|
Benoit Steiner
|
bbce6fa65d
|
define EIGEN_VECTORIZE_CUDA when compiling with nvcc
|
2014-10-03 19:55:35 -07:00 |
|
Benoit Steiner
|
95a430a2ca
|
Vector primitives for CUDA
|
2014-10-03 19:45:19 -07:00 |
|
Benoit Steiner
|
152f3218ac
|
Improved contraction test
|
2014-10-03 19:33:44 -07:00 |
|
Benoit Steiner
|
af2e5995e2
|
Improved support for CUDA devices.
Improved contractions on GPU
|
2014-10-03 19:18:07 -07:00 |
|
Benoit Steiner
|
1269392822
|
Created the IndexPair type to store pair of tensor indices. CUDA doesn't support std::pair so we can't use them when targeting GPUs.
Improved the performance on tensor contractions
|
2014-10-03 10:16:59 -07:00 |
|
Benoit Steiner
|
b7271dffb5
|
Generalized the gebp apis
|
2014-10-02 16:51:57 -07:00 |
|
Benoit Steiner
|
8b2afe33a1
|
Fixes for the forced evaluation of tensor expressions
More tests
|
2014-10-02 10:39:36 -07:00 |
|
Benoit Steiner
|
5cc23199be
|
More tests to validate the const-correctness of the tensor code.
|
2014-10-02 10:30:44 -07:00 |
|
Benoit Steiner
|
7caaf6453b
|
Added support for tensor reductions and concatenations
|
2014-10-01 20:38:22 -07:00 |
|
Benoit Steiner
|
1c236f4c9a
|
Added tests for tensors of const values and tensors of stringswwq::
|
2014-10-01 20:21:42 -07:00 |
|
Christoph Hertzberg
|
1fa6fe2abd
|
template keyword not allowed before non-template function call
|
2014-10-01 14:33:55 +02:00 |
|
Konstantinos Margaritis
|
9d3c69952b
|
fixed to make big-endian VSX work as well
|
2014-10-01 09:43:56 +00:00 |
|
Gael Guennebaud
|
5180bb5e47
|
Add missing default ctor in Rotation2D
|
2014-09-30 16:59:28 +02:00 |
|
Christoph Hertzberg
|
0187504912
|
Avoid `unneeded-internal-declaration' warning
|
2014-09-30 16:43:52 +02:00 |
|
Christoph Hertzberg
|
6d26deb894
|
Missing outerStride in AlignedVector3 resulted in infinite recursion
|
2014-09-30 16:43:19 +02:00 |
|