Benoit Steiner
|
c94174b4fe
|
Improved tensor references
|
2015-01-14 10:13:08 -08:00 |
|
Benoit Steiner
|
91dd53e54d
|
Created some documentation
|
2015-01-13 16:07:51 -08:00 |
|
Benoit Steiner
|
9f98650d0a
|
Ensured that contractions that can be reduced to a matrix vector product work correctly even when the input coefficients aren't aligned.
|
2015-01-06 09:29:13 -08:00 |
|
Benoit Steiner
|
509e4ddc02
|
Added reduction packet primitives for CUDA
|
2014-11-19 10:34:11 -08:00 |
|
Benoit Steiner
|
b33cf92878
|
Fixed the evaluation of expressions involving tensors of 2 or 3 elements on CUDA devices.
|
2014-11-18 14:32:41 -08:00 |
|
Benoit Steiner
|
1d3c8306f8
|
Fixed compilation errors with clang.
|
2014-11-13 19:13:17 -08:00 |
|
Benoit Steiner
|
ec785b0180
|
Added support for extraction of patches from images
|
2014-11-13 09:28:54 -08:00 |
|
Benoit Steiner
|
eeabf7975e
|
Optimized broadcasting
|
2014-11-12 22:35:44 -08:00 |
|
Benoit Steiner
|
c2d1074932
|
Added support for static list of indices
|
2014-11-12 22:25:38 -08:00 |
|
Benoit Steiner
|
cb37f818ca
|
Fixed a compilation error triggered by some operations on fixed-size tensors
|
2014-11-05 23:25:11 -08:00 |
|
Benoit Steiner
|
9a06a71627
|
Fixed a test
|
2014-11-05 07:49:51 -08:00 |
|
Benoit Steiner
|
9ea09179b5
|
Fixed the return type of the coefficient-wise tensor operations.
|
2014-11-04 10:24:42 -08:00 |
|
Benoit Steiner
|
b1789c112b
|
Improved handling of 1d tensors
|
2014-11-03 08:51:33 -08:00 |
|
Benoit Steiner
|
2dde63499c
|
Generalized the matrix vector product code.
|
2014-10-31 16:33:51 -07:00 |
|
Benoit Steiner
|
7f2c6ed2fa
|
Fixed a compilation warning
|
2014-10-31 11:45:21 -07:00 |
|
Benoit Steiner
|
85c3389b28
|
Fixed a test
|
2014-10-31 00:04:13 -07:00 |
|
Benoit Steiner
|
67fcf47ecb
|
Merged from trunk
|
2014-10-30 21:59:22 -07:00 |
|
Benoit Steiner
|
fcecafde3a
|
Fixed a compilation error with clang
|
2014-10-30 21:58:14 -07:00 |
|
Benoit Steiner
|
d62bfe73a9
|
Use the proper index type in the padding code
|
2014-10-30 18:15:05 -07:00 |
|
Benoit Steiner
|
bc99c5f7db
|
Fixed some potential alignment issues.
|
2014-10-30 18:09:53 -07:00 |
|
Benoit Steiner
|
1946cc4478
|
Added missing packet primitives for CUDA.
|
2014-10-30 17:52:32 -07:00 |
|
Benoit Steiner
|
5e62427e22
|
Use the proper index type
|
2014-10-30 17:49:39 -07:00 |
|
Benoit Steiner
|
debc97821c
|
Added support for tensor references
|
2014-10-28 23:10:13 -07:00 |
|
Benoit Steiner
|
f786897e4b
|
Added access to the underlying raw data of a tensor slice/chip whenever possible
|
2014-10-17 15:33:27 -07:00 |
|
Benoit Steiner
|
7acd38d19e
|
Created some benchmarks for the tensor code
|
2014-10-17 09:49:03 -07:00 |
|
Benoit Steiner
|
65af852b54
|
Silenced one last warning
|
2014-10-16 15:02:30 -07:00 |
|
Benoit Steiner
|
ae697b471c
|
Silenced a few compilation warnings
Generalized a TensorMap constructor
|
2014-10-16 14:52:50 -07:00 |
|
Benoit Steiner
|
94e47798f4
|
Fixed the return types of unary and binary expressions to properly handle the case where it is different from the input type (e.g. abs(complex<float>))
|
2014-10-16 10:41:07 -07:00 |
|
Benoit Steiner
|
d853adffdb
|
Avoid calling get_future() more than once on a given promise.
|
2014-10-16 10:10:04 -07:00 |
|
Benoit Steiner
|
bfdd9f3ac9
|
Made the blocking computation aware of the L3 cache
Also optimized the blocking parameters to take into account the number of threads used for a computation
|
2014-10-15 15:32:59 -07:00 |
|
Benoit Steiner
|
dba55041ab
|
Added support for promises
Started to improve multithreaded contractions
|
2014-10-15 11:20:36 -07:00 |
|
Benoit Steiner
|
99d75235a9
|
Misc improvements and cleanups
|
2014-10-13 17:02:09 -07:00 |
|
Benoit Steiner
|
4c70b0a762
|
Added support for patch extraction
|
2014-10-13 10:04:04 -07:00 |
|
Benoit Steiner
|
0219f8aed4
|
Added ability to print a tensor using an iostream.
|
2014-10-10 16:17:26 -07:00 |
|
Benoit Steiner
|
2ed1838aeb
|
Added support for tensor chips
|
2014-10-10 16:11:27 -07:00 |
|
Benoit Steiner
|
4b36c3591f
|
Fixed the tensor shuffling test
|
2014-10-10 15:43:21 -07:00 |
|
Benoit Steiner
|
a991f94c0e
|
Fixed the thread pool test
|
2014-10-10 15:20:37 -07:00 |
|
Benoit Steiner
|
498b7eed25
|
Rewrote the TensorBase::random method to support the generation of random numbers on GPU.
|
2014-10-09 15:39:13 -07:00 |
|
Benoit Steiner
|
767424af18
|
Improved the functors defined for standard reductions
Added a functor to encapsulate the generation of random numbers on CPU and GPU.
|
2014-10-09 15:36:23 -07:00 |
|
Benoit Steiner
|
44beee9d68
|
Removed dead code
|
2014-10-08 14:14:20 -07:00 |
|
Benoit Steiner
|
0a07ac574e
|
Added support for the *= and /= operators to TensorBase
|
2014-10-08 13:32:41 -07:00 |
|
Benoit Steiner
|
6c047d398d
|
Fixed a comment
|
2014-10-08 13:29:36 -07:00 |
|
Benoit Steiner
|
bbce6fa65d
|
Define EIGEN_VECTORIZE_CUDA when compiling with nvcc
|
2014-10-03 19:55:35 -07:00 |
|
Benoit Steiner
|
95a430a2ca
|
Vector primitives for CUDA
|
2014-10-03 19:45:19 -07:00 |
|
Benoit Steiner
|
152f3218ac
|
Improved contraction test
|
2014-10-03 19:33:44 -07:00 |
|
Benoit Steiner
|
af2e5995e2
|
Improved support for CUDA devices.
Improved contractions on GPU
|
2014-10-03 19:18:07 -07:00 |
|
Benoit Steiner
|
1269392822
|
Created the IndexPair type to store pairs of tensor indices. CUDA doesn't support std::pair, so we can't use it when targeting GPUs.
Improved the performance of tensor contractions
|
2014-10-03 10:16:59 -07:00 |
|
Benoit Steiner
|
b7271dffb5
|
Generalized the gebp apis
|
2014-10-02 16:51:57 -07:00 |
|
Benoit Steiner
|
8b2afe33a1
|
Fixes for the forced evaluation of tensor expressions
More tests
|
2014-10-02 10:39:36 -07:00 |
|
Benoit Steiner
|
5cc23199be
|
More tests to validate the const-correctness of the tensor code.
|
2014-10-02 10:30:44 -07:00 |
|