Benoit Steiner
|
c0d56a543e
|
Added several missing EIGEN_DEVICE_FUNC qualifiers
|
2016-09-14 14:06:21 -07:00 |
|
Benoit Steiner
|
488ad7dd1b
|
Added missing EIGEN_DEVICE_FUNC qualifiers
|
2016-09-14 13:35:00 -07:00 |
|
Benoit Steiner
|
779faaaeba
|
Fixed compilation warnings generated by nvcc 6.5 (and below) when compiling the EIGEN_THROW macro
|
2016-09-14 09:56:11 -07:00 |
|
Gael Guennebaud
|
1c8347e554
|
Fix product for custom complex type. (conjugation was ignored)
|
2016-09-14 18:28:49 +02:00 |
|
Benoit Steiner
|
ff47717f25
|
Suppress warnings 2527 and 2529, which correspond to the "calling a __host__ function from a __host__ __device__ function is not allowed" message in nvcc 6.5.
|
2016-09-13 12:49:40 -07:00 |
|
Benoit Steiner
|
309190cf02
|
Suppress message 1222 when compiling with nvcc: this ensures that we don't get warnings about unknown warning messages when compiling with older versions of nvcc
|
2016-09-13 12:42:13 -07:00 |
|
Gael Guennebaud
|
c10620b2b0
|
Fix typo in doc.
|
2016-09-13 09:25:07 +02:00 |
|
Gael Guennebaud
|
73c8f2f697
|
bug #1285: fix regression introduced in changeset 00c29c2cae
|
2016-09-13 07:58:39 +02:00 |
|
Benoit Steiner
|
e4d4d15588
|
Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= 3.0) since the test instantiates contractions that require a modern gpu.
|
2016-09-12 19:01:52 -07:00 |
|
Benoit Steiner
|
4dfd888c92
|
CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures.
|
2016-09-12 18:49:01 -07:00 |
|
Benoit Steiner
|
028e299577
|
Fixed a bug impacting some outer reductions on GPU
|
2016-09-12 18:36:52 -07:00 |
|
Benoit Steiner
|
5f50f12d2c
|
Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.
|
2016-09-12 13:46:13 -07:00 |
|
Benoit Steiner
|
8321dcce76
|
Merged latest updates from trunk
|
2016-09-12 10:33:05 -07:00 |
|
Benoit Steiner
|
eb6ba00cc8
|
Properly size the list of waiters
|
2016-09-12 10:31:55 -07:00 |
|
Benoit Steiner
|
a618094b62
|
Added a resize method to MaxSizeVector
|
2016-09-12 10:30:53 -07:00 |
|
Gael Guennebaud
|
228ae29591
|
Fix compilation on 32-bit systems.
|
2016-09-09 22:34:38 +02:00 |
|
Gael Guennebaud
|
471eac5399
|
bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with some specializations in arch/SSE and arch/AVX)
|
2016-09-08 08:36:27 +02:00 |
|
Gael Guennebaud
|
d780983f59
|
Doc: explain minimal requirements on nullary functors
|
2016-09-06 23:14:52 +02:00 |
|
Gael Guennebaud
|
85fb517eaf
|
Generalize ScalarBinaryOpTraits to any complex-real combination as defined by NumTraits (instead of supporting std::complex only).
|
2016-09-06 17:23:15 +02:00 |
|
Gael Guennebaud
|
447f269561
|
Disable previous workaround.
|
2016-09-06 15:49:02 +02:00 |
|
Gael Guennebaud
|
b046a3f87d
|
Workaround MSVC instantiation failure of has_*ary_operator at the level of traits<Ref>::match so that the has_*ary_operator are really properly instantiated throughout the compilation unit.
|
2016-09-06 15:47:04 +02:00 |
|
Gael Guennebaud
|
3cb914f332
|
bug #1266: remove CUDA guards on MatrixBase::<decomposition> definitions. (those used to break old nvcc versions that we probably don't care about anymore)
|
2016-09-06 09:55:50 +02:00 |
|
Gael Guennebaud
|
e1642f485c
|
bug #1288: fix memory leak in arpack wrapper.
|
2016-09-05 18:01:30 +02:00 |
|
Gael Guennebaud
|
19a95b3309
|
Fix shadowing wrt Eigen::Index
|
2016-09-05 17:19:47 +02:00 |
|
Gael Guennebaud
|
dabc81751f
|
Fix compilation when cuda_fp16.h does not exist.
|
2016-09-05 17:14:20 +02:00 |
|
Gael Guennebaud
|
e13071dd13
|
Workaround a weird msvc 2012 compilation error.
|
2016-09-05 15:50:41 +02:00 |
|
Gael Guennebaud
|
d123717e21
|
Fix for msvc 2012 and older
|
2016-09-05 15:26:56 +02:00 |
|
Benoit Steiner
|
87a8a1975e
|
Fixed a regression test
|
2016-09-02 19:29:33 -07:00 |
|
Benoit Steiner
|
13df3441ae
|
Use MaxSizeVector instead of std::vector: xcode sometimes assumes that std::vector allocates aligned memory and therefore issues aligned instructions to initialize it. This can result in random crashes when compiling with AVX instructions enabled.
|
2016-09-02 19:25:47 -07:00 |
|
Benoit Steiner
|
373c340b71
|
Fixed a typo
|
2016-09-02 15:41:17 -07:00 |
|
Benoit Steiner
|
cadd124d73
|
Pulled latest update from trunk
|
2016-09-02 15:30:02 -07:00 |
|
Benoit Steiner
|
05b0518077
|
Made the index type an explicit template parameter to help some compilers compile the code.
|
2016-09-02 15:29:34 -07:00 |
|
Benoit Steiner
|
adf864fec0
|
Merged in rmlarsen/eigen (pull request PR-222)
Fix CUDA build broken by changes to min and max reduction.
|
2016-09-02 14:11:20 -07:00 |
|
Benoit Steiner
|
5a6be66cef
|
Turned the Index type used by the nullary wrapper into a template parameter.
|
2016-09-02 14:10:29 -07:00 |
|
Rasmus Munk Larsen
|
13e93ca8b7
|
Fix CUDA build broken by changes to min and max reduction.
|
2016-09-02 13:41:36 -07:00 |
|
Benoit Steiner
|
6c05c3dd49
|
Fix the cxx11_tensor_cuda.cu test on 32bit platforms.
|
2016-09-02 11:12:16 -07:00 |
|
Gael Guennebaud
|
49c0390ce0
|
merge
|
2016-09-02 15:24:14 +02:00 |
|
Gael Guennebaud
|
d6c8366d84
|
Fix compilation with MSVC 2012
|
2016-09-02 15:23:32 +02:00 |
|
Benoit Steiner
|
039e225f7f
|
Added a test for nullary expressions on CUDA
Also check that we can mix 64 and 32 bit indices in the same compilation unit
|
2016-09-01 13:28:12 -07:00 |
|
Benoit Steiner
|
c53f783705
|
Updated the contraction code to support constant inputs.
|
2016-09-01 11:41:27 -07:00 |
|
Gael Guennebaud
|
ef54723dbe
|
One more msvc fix iteration, the previous one was over-simplified for visual
|
2016-09-01 15:04:53 +02:00 |
|
Gael Guennebaud
|
46475eff9a
|
Adjust Tensor module wrt recent change in nullary functor
|
2016-09-01 13:40:45 +02:00 |
|
Gael Guennebaud
|
72a4d49315
|
Fix compilation with CUDA 8
|
2016-09-01 13:39:33 +02:00 |
|
Gael Guennebaud
|
f9f32e9e2d
|
Fix compilation with nvcc
|
2016-09-01 13:06:14 +02:00 |
|
Gael Guennebaud
|
3d946e42b3
|
Fix compilation with visual studio
|
2016-09-01 12:59:32 +02:00 |
|
Benoit Steiner
|
221f619bea
|
Merged in rmlarsen/eigen (pull request PR-221)
Fix bugs to make min- and max reducers work correctly with IEEE infinities.
|
2016-08-31 15:10:10 -07:00 |
|
Rasmus Munk Larsen
|
a1e092d1e8
|
Fix bugs to make min- and max reducers work correctly with IEEE infinities.
|
2016-08-31 15:04:16 -07:00 |
|
Gael Guennebaud
|
836fa25a82
|
Make sure sizeof is truly needed, thus improving SFINAE portability.
|
2016-08-31 23:40:18 +02:00 |
|
Gael Guennebaud
|
84cf6e42ca
|
minor tweaks in has_* helpers
|
2016-08-31 23:04:14 +02:00 |
|
Gael Guennebaud
|
7ae819123c
|
Simplify CwiseNullaryOp example.
|
2016-08-31 15:46:04 +02:00 |
|