Benoit Steiner
|
ba27c8a7de
|
Made the CUDA contract test more robust to numerical noise.
|
2016-01-30 10:28:43 -08:00 |
|
Benoit Steiner
|
4281eb1e2c
|
Added 2 benchmarks to the suite of tensor benchmarks running on GPU
|
2016-01-30 10:20:43 -08:00 |
|
Gael Guennebaud
|
102fa96a96
|
Extend doc on dense+sparse
|
2016-01-30 14:58:21 +01:00 |
|
Gael Guennebaud
|
1bc207c528
|
backout changeset d4a9e61569
: the extended SparseView is not needed anymore
|
2016-01-30 14:43:21 +01:00 |
|
Gael Guennebaud
|
8ed1553d20
|
bug #632: implement general coefficient-wise "dense op sparse" operations through specialized evaluators instead of using SparseView.
This permits to deal with arbitrary storage order, and to by-pass the more complex iterator of the sparse-sparse case.
|
2016-01-30 14:39:50 +01:00 |
|
Gael Guennebaud
|
699634890a
|
bug #946: generalize Cholmod::solve to handle any rhs expression
|
2016-01-29 23:02:22 +01:00 |
|
Gael Guennebaud
|
15084cf1ac
|
bug #632: add support for "dense +/- sparse" operations. The current implementation is based on SparseView to make the dense subexpression compatible with the sparse one.
|
2016-01-29 22:09:45 +01:00 |
|
Gael Guennebaud
|
d4a9e61569
|
Extend SparseView to allow keeping explicit zeros. This is equivalent to sparseView(1,-1) but faster because the test is removed at compile-time.
|
2016-01-29 22:07:56 +01:00 |
|
Gael Guennebaud
|
d8d37349c3
|
bug #696: enable zero-sized block at compile-time by relaxing the respective assertion
|
2016-01-29 12:44:49 +01:00 |
|
Gael Guennebaud
|
e8ccc06fe5
|
merge
|
2016-01-29 09:40:38 +01:00 |
|
Benoit Steiner
|
963f2d2a8f
|
Marked several methods EIGEN_DEVICE_FUNC
|
2016-01-28 23:37:48 -08:00 |
|
Benoit Steiner
|
c5d25bf1d0
|
Fixed a couple of compilation warnings.
|
2016-01-28 23:15:45 -08:00 |
|
Benoit Steiner
|
e4f83bae5d
|
Fixed the tensor benchmarks on apple devices
|
2016-01-28 21:08:07 -08:00 |
|
Benoit Steiner
|
10bea90c4a
|
Fixed clang related compilation error
|
2016-01-28 20:52:08 -08:00 |
|
Benoit Steiner
|
d3f533b395
|
Fixed compilation warning
|
2016-01-28 20:09:45 -08:00 |
|
Abhijit Kundu
|
3fde202215
|
Making ceil() functor generic w.r.t packet type
|
2016-01-28 21:27:00 -05:00 |
|
Benoit Steiner
|
211d350fc3
|
Fixed a typo
|
2016-01-28 17:13:04 -08:00 |
|
Benoit Steiner
|
bd2e5a788a
|
Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows.
|
2016-01-28 17:10:40 -08:00 |
|
Benoit Steiner
|
120e13b1b6
|
Added a readme to explain how to compile the tensor benchmarks.
|
2016-01-28 17:06:00 -08:00 |
|
Benoit Steiner
|
a68864b6bc
|
Updated the benchmarking code to print the number of flops processed instead of the number of bytes.
|
2016-01-28 16:51:40 -08:00 |
|
Benoit Steiner
|
8217281ae4
|
Merge latest updates from trunk
|
2016-01-28 16:20:53 -08:00 |
|
Benoit Steiner
|
c8d5f21941
|
Added extra tensor benchmarks
|
2016-01-28 16:20:36 -08:00 |
|
Benoit Steiner
|
7b3044d086
|
Made sure to call nvcc with the relaxed-constexpr flag.
|
2016-01-28 15:36:34 -08:00 |
|
Gael Guennebaud
|
b908e071a8
|
bug #178: get rid of some const_cast in SparseCore
|
2016-01-28 22:11:18 +01:00 |
|
Gael Guennebaud
|
c1d900af61
|
bug #178: remove additional const on nested expression, and remove several const_cast.
|
2016-01-28 21:43:20 +01:00 |
|
Benoit Steiner
|
12f8bd12a2
|
Merged in jiayq/eigen (pull request PR-159)
Modifications to the tensor benchmarks to allow compilation in a standalone fashion.
|
2016-01-28 11:28:55 -08:00 |
|
Yangqing Jia
|
270c4e1ecd
|
bugfix
|
2016-01-28 11:11:45 -08:00 |
|
Yangqing Jia
|
c4e47630b1
|
benchmark modifications to make it compilable in a standalone fashion.
|
2016-01-28 10:35:14 -08:00 |
|
Gael Guennebaud
|
f50bb1e6f3
|
Fix compilation with gcc
|
2016-01-28 13:25:26 +01:00 |
|
Gael Guennebaud
|
ddf64babde
|
merge
|
2016-01-28 13:21:48 +01:00 |
|
Gael Guennebaud
|
df15fbc452
|
bug #1158: PartialReduxExpr is a vector expression, and it thus must expose the LinearAccessBit flag
|
2016-01-28 13:16:30 +01:00 |
|
Gael Guennebaud
|
9bcadb7fd1
|
Disable stupid MSVC warning
|
2016-01-28 12:14:16 +01:00 |
|
Gael Guennebaud
|
b4d87fff4a
|
Fix MSVC warning.
|
2016-01-28 12:12:30 +01:00 |
|
Gael Guennebaud
|
2bad3e78d9
|
bug #96, bug #1006: fix by value argument in result_of.
|
2016-01-28 12:12:06 +01:00 |
|
Gael Guennebaud
|
7802a6bb1c
|
Fix unit test filename.
|
2016-01-28 09:35:37 +01:00 |
|
Benoit Steiner
|
4bf9eaf77a
|
Deleted an invalid assertion that prevented the assignment of empty tensors.
|
2016-01-27 17:09:30 -08:00 |
|
Benoit Steiner
|
291069e885
|
Fixed some compilation problems with nvcc + clang
|
2016-01-27 15:37:03 -08:00 |
|
Benoit Steiner
|
47ca9dc809
|
Fixed the tensor_cuda test
|
2016-01-27 14:58:48 -08:00 |
|
Benoit Steiner
|
55a5204319
|
Fixed the flags passed to nvcc to compile the tensor code.
|
2016-01-27 14:46:34 -08:00 |
|
Gael Guennebaud
|
4865e1e732
|
Update link to suitesparse.
|
2016-01-27 22:48:40 +01:00 |
|
Benoit Steiner
|
9dfbd4fe8d
|
Made the cuda tests compile using make check
|
2016-01-27 12:22:17 -08:00 |
|
Benoit Steiner
|
5973bcf939
|
Properly specify the namespace when calling cout/endl
|
2016-01-27 12:04:42 -08:00 |
|
Eugene Brevdo
|
c8d94ae944
|
digamma special function: merge shared code.
Moved type-specific code into a helper class digamma_impl_maybe_poly<Scalar>.
|
2016-01-27 09:52:29 -08:00 |
|
Gael Guennebaud
|
9c8f7dfe94
|
bug #1156: fix several function declarations whose arguments were passed by value instead of being passed by reference
|
2016-01-27 18:34:42 +01:00 |
|
Gael Guennebaud
|
9aa6fae123
|
bug #1154: move to dynamic scheduling for spmv products.
|
2016-01-27 18:03:51 +01:00 |
|
Gael Guennebaud
|
9ac8e8c6a1
|
Extend mixing type unit test with trmv, and the following not yet supported products: trmm, symv, symm
|
2016-01-27 17:29:53 +01:00 |
|
Gael Guennebaud
|
6da5d87f92
|
add nomalloc unit test for rank2 updates
|
2016-01-27 17:26:48 +01:00 |
|
Gael Guennebaud
|
9801c959e6
|
Fix tri = complex * real product, and add respective unit test.
|
2016-01-27 17:12:25 +01:00 |
|
Gael Guennebaud
|
21b5345782
|
Add meta_least_common_multiple helper.
|
2016-01-27 17:11:39 +01:00 |
|
Gael Guennebaud
|
fecea26d93
|
Extend doc on shifting strategy
|
2016-01-27 15:55:15 +01:00 |
|