Benoit Steiner
|
048c4d6efd
|
Made half floats usable on hardware that doesn't support them natively.
|
2016-03-11 17:21:42 -08:00 |
|
Benoit Steiner
|
456e038a4e
|
Fixed the +=, -=, *= and /= operators to return a reference
|
2016-03-10 15:17:44 -08:00 |
|
Benoit Steiner
|
05bbca079a
|
Turn on some of the cxx11 features when compiling with visual studio 2015
|
2016-03-05 10:52:08 -08:00 |
|
Benoit Steiner
|
1032441c6f
|
Enable partial support for half floats on Kepler GPUs.
|
2016-03-03 10:34:20 -08:00 |
|
Benoit Steiner
|
1da10a7358
|
Enable the conversion between floats and half floats on older GPUs that support it.
|
2016-03-03 10:33:20 -08:00 |
|
Benoit Steiner
|
2de8cc9122
|
Merged in ebrevdo/eigen (pull request PR-167)
Add infinity() support to numext::numeric_limits, use it in lgamma.
I tested the code on my gtx-titan-black gpu, and it appears to work as expected.
|
2016-03-03 09:42:12 -08:00 |
|
Eugene Brevdo
|
ab3dc0b0fe
|
Small bugfix to numeric_limits for CUDA.
|
2016-03-02 21:48:46 -08:00 |
|
Eugene Brevdo
|
6afea46838
|
Add infinity() support to numext::numeric_limits, use it in lgamma.
This makes the infinity access a __device__ function, removing
nvcc warnings.
|
2016-03-02 21:35:48 -08:00 |
|
Gael Guennebaud
|
3fccef6f50
|
bug #537: fix compilation with Apples's compiler
|
2016-03-02 13:22:46 +01:00 |
|
Gael Guennebaud
|
dfa80b2060
|
Compilation fix
|
2016-03-01 12:48:56 +01:00 |
|
Gael Guennebaud
|
bee9efc203
|
Compilation fix
|
2016-03-01 12:47:27 +01:00 |
|
Gael Guennebaud
|
e9bea614ec
|
Fix shortcoming in fixed-value deduction of startRow/startCol
|
2016-02-29 10:31:27 +01:00 |
|
Gael Guennebaud
|
8e6faab51e
|
bug #1172: make valuePtr and innderIndexPtr properly return null for empty matrices.
|
2016-02-27 14:55:40 +01:00 |
|
Gael Guennebaud
|
91e1375ba9
|
merge
|
2016-02-23 11:09:05 +01:00 |
|
Gael Guennebaud
|
055000a424
|
Fix startRow()/startCol() for dense Block with direct access:
the initial implementation failed for empty rows/columns for which are ambiguous.
|
2016-02-23 11:07:59 +01:00 |
|
Benoit Steiner
|
6270d851e3
|
Declare the half float type as arithmetic.
|
2016-02-22 13:59:33 -08:00 |
|
Benoit Steiner
|
584832cb3c
|
Implemented the ptranspose function on half floats
|
2016-02-21 12:44:53 -08:00 |
|
Benoit Steiner
|
95fceb6452
|
Added the ability to compute the absolute value of a half float
|
2016-02-21 20:24:11 +00:00 |
|
Benoit Steiner
|
9ff269a1d3
|
Moved some of the fp16 operators outside the Eigen namespace to workaround some nvcc limitations.
|
2016-02-20 07:47:23 +00:00 |
|
Gael Guennebaud
|
d90a2dac5e
|
merge
|
2016-02-19 23:01:27 +01:00 |
|
Gael Guennebaud
|
6fa35bbd28
|
bug #1170: skip calls to memcpy/memmove for empty imput.
|
2016-02-19 22:58:52 +01:00 |
|
Gael Guennebaud
|
6f0992c05b
|
Fix nesting type and complete reflection methods of Block expressions.
|
2016-02-19 22:21:02 +01:00 |
|
Gael Guennebaud
|
f3643eec57
|
Add typedefs for the return type of all block methods.
|
2016-02-19 22:15:01 +01:00 |
|
Benoit Steiner
|
180156ba1a
|
Added support for tensor reductions on half floats
|
2016-02-19 10:05:59 -08:00 |
|
Benoit Steiner
|
5c4901b83a
|
Implemented the scalar division of 2 half floats
|
2016-02-19 10:03:19 -08:00 |
|
Benoit Steiner
|
f7cb755299
|
Added support for operators +=, -=, *= and /= on CUDA half floats
|
2016-02-19 15:57:26 +00:00 |
|
Benoit Steiner
|
dc26459b99
|
Implemented protate() for CUDA
|
2016-02-19 15:16:54 +00:00 |
|
Benoit Steiner
|
ac5d706a94
|
Added support for simple coefficient wise tensor expression using half floats on CUDA devices
|
2016-02-19 08:19:12 +00:00 |
|
Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
7151bd8768
|
Reverted unintended changes introduced by a bad merge
|
2016-02-19 06:20:50 +00:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Benoit Steiner
|
8ce46f9d89
|
Improved implementation of ptanh for SSE and AVX
|
2016-02-18 13:24:34 -08:00 |
|
Eugene Brevdo
|
832380c455
|
Merged eigen/eigen into default
|
2016-02-17 14:44:06 -08:00 |
|
Eugene Brevdo
|
06a2bc7c9c
|
Tiny bugfix in SpecialFunctions: some compilers don't like doubles
implicitly downcast to floats in an array constructor.
|
2016-02-17 14:41:59 -08:00 |
|
Gael Guennebaud
|
f6f057bb7d
|
bug #1166: fix shortcomming in gemv when the destination is not a vector at compile-time.
|
2016-02-15 21:43:07 +01:00 |
|
Gael Guennebaud
|
4252af6897
|
Remove dead code.
|
2016-02-12 16:13:35 +01:00 |
|
Gael Guennebaud
|
2f5f56a820
|
Fix usage of evaluator in sparse * permutation products.
|
2016-02-12 16:13:16 +01:00 |
|
Gael Guennebaud
|
0a537cb2d8
|
bug #901: fix triangular-view with unit diagonal of sparse rectangular matrices.
|
2016-02-12 15:58:31 +01:00 |
|
Benoit Steiner
|
17e93ba148
|
Pulled latest updates from trunk
|
2016-02-11 15:05:38 -08:00 |
|
Benoit Steiner
|
3628f7655d
|
Made it possible to run the scalar_binary_pow_op functor on GPU
|
2016-02-11 15:05:03 -08:00 |
|
Hauke Heibel
|
eeac46f980
|
bug #774: re-added comment referencing equations in the original paper
|
2016-02-11 19:38:37 +01:00 |
|
Benoit Steiner
|
c569cfe12a
|
Inline the +=, -=, *= and /= operators consistently between DenseBase.h and SelfCwiseBinaryOp.h
|
2016-02-11 09:33:32 -08:00 |
|
Gael Guennebaud
|
8cc9232b9a
|
bug #774: fix a numerical issue producing unwanted reflections.
|
2016-02-11 15:32:56 +01:00 |
|
Gael Guennebaud
|
2d35c0cb5f
|
Merged in rmlarsen/eigen (pull request PR-163)
Implement complete orthogonal decomposition in Eigen.
|
2016-02-11 15:12:34 +01:00 |
|
Benoit Steiner
|
33e2373f01
|
Merged in nnyby/eigen/nnyby/doc-grammar-fix-linearly-space-linearly-1443742971203 (pull request PR-138)
[doc] grammar fix: "linearly space" -> "linearly spaced"
|
2016-02-10 23:29:59 -08:00 |
|
Benoit Steiner
|
6d8b1dce06
|
Avoid implicit cast from double to float.
|
2016-02-10 18:07:11 -08:00 |
|
Rasmus Munk Larsen
|
b6fdf7468c
|
Rename inverse -> pseudoInverse.
|
2016-02-10 13:03:07 -08:00 |
|
Benoit Jacob
|
9d6f1ad398
|
I'm told to use __EMSCRIPTEN__ by an Emscripten dev.
|
2016-02-10 12:48:34 -05:00 |
|
Benoit Steiner
|
bfb3fcd94f
|
Optimized implementation of the tanh function for SSE
|
2016-02-10 08:52:30 -08:00 |
|
Benoit Steiner
|
2d523332b3
|
Optimized implementation of the hyperbolic tangent function for AVX
|
2016-02-10 08:48:05 -08:00 |
|