Benoit Steiner
|
dc413dbe8a
|
Merged in ville-k/eigen/explicit_long_constructors (pull request PR-158)
Add constructor for long types.
|
2016-02-02 20:58:06 -08:00 |
|
Ville Kallioniemi
|
783018d8f6
|
Use EIGEN_STATIC_ASSERT for backward compatibility.
|
2016-02-02 16:45:12 -07:00 |
|
Benoit Steiner
|
99cde88341
|
Don't try to use direct offsets when computing a tensor product, since the required stride isn't available.
|
2016-02-02 11:06:53 -08:00 |
|
Ville Kallioniemi
|
ff0a83aaf8
|
Use single template constructor to avoid overload resolution issues.
|
2016-02-02 00:33:25 -07:00 |
|
Ville Kallioniemi
|
aedea349aa
|
Replace separate low word constructors with a single templated constructor.
|
2016-02-01 20:25:02 -07:00 |
|
Ville Kallioniemi
|
f0fdefa96f
|
Rebase to latest.
|
2016-02-01 19:32:31 -07:00 |
|
Benoit Steiner
|
64ce78c2ec
|
Cleaned up a tensor contraction test
|
2016-02-01 13:57:41 -08:00 |
|
Benoit Steiner
|
0ce5d32be5
|
Sharded the cxx11_tensor_contract_cuda test
|
2016-02-01 13:33:23 -08:00 |
|
Benoit Steiner
|
922b5f527b
|
Silenced a few compilation warnings
|
2016-02-01 13:30:49 -08:00 |
|
Benoit Steiner
|
6b5dff875e
|
Made it possible to limit the number of blocks that will be used to evaluate a tensor expression on a CUDA device. This makesit possible to set aside streaming multiprocessors for other computations.
|
2016-02-01 12:46:32 -08:00 |
|
Benoit Steiner
|
264f8141f8
|
Shared the tensor reduction test
|
2016-02-01 07:44:31 -08:00 |
|
Benoit Steiner
|
11bb71c8fc
|
Sharded the tensor device test
|
2016-02-01 07:34:59 -08:00 |
|
Gael Guennebaud
|
ff1157bcbf
|
bug #694: document that SparseQR::matrixR is not sorted.
|
2016-02-01 16:09:34 +01:00 |
|
Gael Guennebaud
|
ec469700dc
|
bug #557: make InnerIterator of sparse storage types more versatile by adding default-ctor, copy-ctor/assignment
|
2016-02-01 15:04:33 +01:00 |
|
Gael Guennebaud
|
6e0a86194c
|
Fix integer path for num_steps==1
|
2016-02-01 15:00:04 +01:00 |
|
Gael Guennebaud
|
e1d219e5c9
|
bug #698: fix linspaced for integer types.
|
2016-02-01 14:25:34 +01:00 |
|
Gael Guennebaud
|
2c3224924b
|
Fix warning and replace min/max macros by calls to mini/maxi
|
2016-02-01 10:23:45 +01:00 |
|
Benoit Steiner
|
e80ed948e1
|
Fixed a number of compilation warnings generated by the cuda tests
|
2016-01-31 20:09:41 -08:00 |
|
Benoit Steiner
|
6720b38fbf
|
Fixed a few compilation warnings
|
2016-01-31 16:48:50 -08:00 |
|
Benoit Steiner
|
3f1ee45833
|
Fixed compilation errors triggered by duplicate inline declaration
|
2016-01-31 10:48:49 -08:00 |
|
Benoit Steiner
|
70be6f6531
|
Pulled latest changes from trunk
|
2016-01-31 10:44:45 -08:00 |
|
Benoit Steiner
|
4a2ddfb81d
|
Sharded the CUDA argmax tensor test
|
2016-01-31 10:44:15 -08:00 |
|
Gael Guennebaud
|
d142165942
|
bug #667: declare several critical functions as FORECE_INLINE to make ICC happier.
<g.gael@free.fr> HG: branch 'default' HG: changed Eigen/src/Core/ArrayBase.h HG: changed Eigen/src/Core/AssignEvaluator.h HG: changed
Eigen/src/Core/CoreEvaluators.h HG: changed Eigen/src/Core/CwiseUnaryOp.h HG: changed Eigen/src/Core/DenseBase.h HG: changed Eigen/src/Core/MatrixBase.h
|
2016-01-31 16:34:10 +01:00 |
|
Gael Guennebaud
|
a4e4542b89
|
Avoid overflow in unit test.
|
2016-01-30 22:26:17 +01:00 |
|
Gael Guennebaud
|
3ba8a3ab1a
|
Disable underflow unit test on the i387 FPU.
|
2016-01-30 22:14:04 +01:00 |
|
Benoit Steiner
|
483082ef6e
|
Fixed a few memory leaks in the cuda tests
|
2016-01-30 11:59:22 -08:00 |
|
Benoit Steiner
|
bd21aba181
|
Sharded the cxx11_tensor_cuda test and fixed a memory leak
|
2016-01-30 11:47:09 -08:00 |
|
Benoit Steiner
|
9de155d153
|
Added a test to cover threaded tensor shuffling
|
2016-01-30 10:56:47 -08:00 |
|
Benoit Steiner
|
32088c06a1
|
Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.
|
2016-01-30 10:51:14 -08:00 |
|
Benoit Steiner
|
2053478c56
|
Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test
|
2016-01-30 10:46:36 -08:00 |
|
Benoit Steiner
|
d0db95f730
|
Sharded the tensor thread pool test
|
2016-01-30 10:43:57 -08:00 |
|
Benoit Steiner
|
ba27c8a7de
|
Made the CUDA contract test more robust to numerical noise.
|
2016-01-30 10:28:43 -08:00 |
|
Benoit Steiner
|
4281eb1e2c
|
Added 2 benchmarks to the suite of tensor benchmarks running on GPU
|
2016-01-30 10:20:43 -08:00 |
|
Gael Guennebaud
|
102fa96a96
|
Extend doc on dense+sparse
|
2016-01-30 14:58:21 +01:00 |
|
Gael Guennebaud
|
1bc207c528
|
backout changeset d4a9e61569
: the extended SparseView is not needed anymore
|
2016-01-30 14:43:21 +01:00 |
|
Gael Guennebaud
|
8ed1553d20
|
bug #632: implement general coefficient-wise "dense op sparse" operations through specialized evaluators instead of using SparseView.
This permits to deal with arbitrary storage order, and to by-pass the more complex iterator of the sparse-sparse case.
|
2016-01-30 14:39:50 +01:00 |
|
Gael Guennebaud
|
699634890a
|
bug #946: generalize Cholmod::solve to handle any rhs expression
|
2016-01-29 23:02:22 +01:00 |
|
Gael Guennebaud
|
15084cf1ac
|
bug #632: add support for "dense +/- sparse" operations. The current implementation is based on SparseView to make the dense subexpression compatible with the sparse one.
|
2016-01-29 22:09:45 +01:00 |
|
Gael Guennebaud
|
d4a9e61569
|
Extend SparseView to allow keeping explicit zeros. This is equivalent to sparseView(1,-1) but faster because the test is removed at compile-time.
|
2016-01-29 22:07:56 +01:00 |
|
Gael Guennebaud
|
d8d37349c3
|
bug #696: enable zero-sized block at compile-time by relaxing the respective assertion
|
2016-01-29 12:44:49 +01:00 |
|
Gael Guennebaud
|
e8ccc06fe5
|
merge
|
2016-01-29 09:40:38 +01:00 |
|
Benoit Steiner
|
963f2d2a8f
|
Marked several methods EIGEN_DEVICE_FUNC
|
2016-01-28 23:37:48 -08:00 |
|
Benoit Steiner
|
c5d25bf1d0
|
Fixed a couple of compilation warnings.
|
2016-01-28 23:15:45 -08:00 |
|
Benoit Steiner
|
e4f83bae5d
|
Fixed the tensor benchmarks on apple devices
|
2016-01-28 21:08:07 -08:00 |
|
Benoit Steiner
|
10bea90c4a
|
Fixed clang related compilation error
|
2016-01-28 20:52:08 -08:00 |
|
Benoit Steiner
|
d3f533b395
|
Fixed compilation warning
|
2016-01-28 20:09:45 -08:00 |
|
Abhijit Kundu
|
3fde202215
|
Making ceil() functor generic w.r.t packet type
|
2016-01-28 21:27:00 -05:00 |
|
Benoit Steiner
|
211d350fc3
|
Fixed a typo
|
2016-01-28 17:13:04 -08:00 |
|
Benoit Steiner
|
bd2e5a788a
|
Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.
|
2016-01-28 17:10:40 -08:00 |
|
Benoit Steiner
|
120e13b1b6
|
Added a readme to explain how to compile the tensor benchmarks.
|
2016-01-28 17:06:00 -08:00 |
|