Benoit Steiner
|
180156ba1a
|
Added support for tensor reductions on half floats
|
2016-02-19 10:05:59 -08:00 |
|
Benoit Steiner
|
5c4901b83a
|
Implemented the scalar division of 2 half floats
|
2016-02-19 10:03:19 -08:00 |
|
Benoit Steiner
|
f268db1c4b
|
Added the ability to query the minor version of a cuda device
|
2016-02-19 16:31:04 +00:00 |
|
Benoit Steiner
|
a08d2ff0c9
|
Started to work on contractions and reductions using half floats
|
2016-02-19 15:59:59 +00:00 |
|
Benoit Steiner
|
f3352e0fb0
|
Don't make the array constructors explicit
|
2016-02-19 15:58:57 +00:00 |
|
Benoit Steiner
|
f7cb755299
|
Added support for operators +=, -=, *= and /= on CUDA half floats
|
2016-02-19 15:57:26 +00:00 |
|
Benoit Steiner
|
dc26459b99
|
Implemented protate() for CUDA
|
2016-02-19 15:16:54 +00:00 |
|
Benoit Steiner
|
cd042dbbfd
|
Fixed a bug in the tensor type converter
|
2016-02-19 15:03:26 +00:00 |
|
Benoit Steiner
|
ac5d706a94
|
Added support for simple coefficient wise tensor expression using half floats on CUDA devices
|
2016-02-19 08:19:12 +00:00 |
|
Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
f36c0c2c65
|
Added regression test for float16
|
2016-02-19 06:23:28 +00:00 |
|
Benoit Steiner
|
7151bd8768
|
Reverted unintended changes introduced by a bad merge
|
2016-02-19 06:20:50 +00:00 |
|
Benoit Steiner
|
1304e1fb5e
|
Pulled latest updates from trunk
|
2016-02-19 06:17:02 +00:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Benoit Steiner
|
8ce46f9d89
|
Improved implementation of ptanh for SSE and AVX
|
2016-02-18 13:24:34 -08:00 |
|
Eugene Brevdo
|
832380c455
|
Merged eigen/eigen into default
|
2016-02-17 14:44:06 -08:00 |
|
Eugene Brevdo
|
06a2bc7c9c
|
Tiny bugfix in SpecialFunctions: some compilers don't like doubles
implicitly downcast to floats in an array constructor.
|
2016-02-17 14:41:59 -08:00 |
|
Gael Guennebaud
|
f6f057bb7d
|
bug #1166: fix shortcomming in gemv when the destination is not a vector at compile-time.
|
2016-02-15 21:43:07 +01:00 |
|
Gael Guennebaud
|
8e1f1ba6a6
|
Import wiki's paragraph: "I disabled vectorization, but I'm still getting annoyed about alignment issues"
|
2016-02-12 22:16:59 +01:00 |
|
Gael Guennebaud
|
c8b4c4b48a
|
bug #795: mention allocate_shared as a condidate for aligned_allocator.
|
2016-02-12 22:09:16 +01:00 |
|
Gael Guennebaud
|
6eff3e5185
|
Fix triangularView versus triangularPart.
|
2016-02-12 17:09:28 +01:00 |
|
Gael Guennebaud
|
4252af6897
|
Remove dead code.
|
2016-02-12 16:13:35 +01:00 |
|
Gael Guennebaud
|
2f5f56a820
|
Fix usage of evaluator in sparse * permutation products.
|
2016-02-12 16:13:16 +01:00 |
|
Gael Guennebaud
|
0a537cb2d8
|
bug #901: fix triangular-view with unit diagonal of sparse rectangular matrices.
|
2016-02-12 15:58:31 +01:00 |
|
Gael Guennebaud
|
b35d1a122e
|
Fix unit test: accessing elements in a deque by offsetting a pointer to another element causes undefined behavior.
|
2016-02-12 15:31:16 +01:00 |
|
Benoit Steiner
|
9e3f3a2d27
|
Deleted outdated comment
|
2016-02-11 17:27:35 -08:00 |
|
Benoit Steiner
|
de345eff2e
|
Added a method to conjugate the content of a tensor or the result of a tensor expression.
|
2016-02-11 16:34:07 -08:00 |
|
Benoit Steiner
|
17e93ba148
|
Pulled latest updates from trunk
|
2016-02-11 15:05:38 -08:00 |
|
Benoit Steiner
|
3628f7655d
|
Made it possible to run the scalar_binary_pow_op functor on GPU
|
2016-02-11 15:05:03 -08:00 |
|
Hauke Heibel
|
eeac46f980
|
bug #774: re-added comment referencing equations in the original paper
|
2016-02-11 19:38:37 +01:00 |
|
Benoit Steiner
|
c569cfe12a
|
Inline the +=, -=, *= and /= operators consistently between DenseBase.h and SelfCwiseBinaryOp.h
|
2016-02-11 09:33:32 -08:00 |
|
Gael Guennebaud
|
8cc9232b9a
|
bug #774: fix a numerical issue producing unwanted reflections.
|
2016-02-11 15:32:56 +01:00 |
|
Gael Guennebaud
|
2d35c0cb5f
|
Merged in rmlarsen/eigen (pull request PR-163)
Implement complete orthogonal decomposition in Eigen.
|
2016-02-11 15:12:34 +01:00 |
|
Benoit Steiner
|
33e2373f01
|
Merged in nnyby/eigen/nnyby/doc-grammar-fix-linearly-space-linearly-1443742971203 (pull request PR-138)
[doc] grammar fix: "linearly space" -> "linearly spaced"
|
2016-02-10 23:29:59 -08:00 |
|
Benoit Steiner
|
6d8b1dce06
|
Avoid implicit cast from double to float.
|
2016-02-10 18:07:11 -08:00 |
|
Benoit Steiner
|
1dfaafe28a
|
Added a regression test for tanh
|
2016-02-10 17:41:47 -08:00 |
|
Rasmus Munk Larsen
|
b6fdf7468c
|
Rename inverse -> pseudoInverse.
|
2016-02-10 13:03:07 -08:00 |
|
Benoit Jacob
|
9d6f1ad398
|
I'm told to use __EMSCRIPTEN__ by an Emscripten dev.
|
2016-02-10 12:48:34 -05:00 |
|
Benoit Steiner
|
bfb3fcd94f
|
Optimized implementation of the tanh function for SSE
|
2016-02-10 08:52:30 -08:00 |
|
Benoit Steiner
|
2d523332b3
|
Optimized implementation of the hyperbolic tangent function for AVX
|
2016-02-10 08:48:05 -08:00 |
|
Benoit Jacob
|
e6ee18d6b4
|
Make the GCC workaround for sqrt GCC-only; detect Emscripten as non-GCC
|
2016-02-10 11:11:49 -05:00 |
|
Benoit Steiner
|
2ac59e5d36
|
Pulled latest updates from trunk
|
2016-02-10 08:03:02 -08:00 |
|
Benoit Steiner
|
9a21b38ccc
|
Worked around a few clang compilation warnings
|
2016-02-10 08:02:04 -08:00 |
|
Benoit Jacob
|
964a95bf5e
|
Work around Emscripten bug - https://github.com/kripken/emscripten/issues/4088
|
2016-02-10 10:37:22 -05:00 |
|
Benoit Steiner
|
72ab7879f7
|
Fixed clang comilation warnings
|
2016-02-10 06:48:28 -08:00 |
|
Benoit Steiner
|
e88535634d
|
Fixed some clang compilation warnings
|
2016-02-09 23:32:41 -08:00 |
|
Benoit Steiner
|
970751ece3
|
Disabling the nvcc warnings in addition to the clang warnings when clang is used as a frontend for nvcc
|
2016-02-09 20:55:50 -08:00 |
|
Benoit Steiner
|
6323851ea9
|
Fixed compilation warning
|
2016-02-09 20:43:41 -08:00 |
|
Rasmus Munk Larsen
|
bb8811c655
|
Enable inverse() method for computing pseudo-inverse.
|
2016-02-09 20:35:20 -08:00 |
|
Benoit Steiner
|
5cc0dd5f44
|
Fixed the code that disables the use of variadic templates when compiling with nvcc on ARM devices.
|
2016-02-09 10:32:01 -08:00 |
|