Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
f36c0c2c65
|
Added regression test for float16
|
2016-02-19 06:23:28 +00:00 |
|
Benoit Steiner
|
7151bd8768
|
Reverted unintended changes introduced by a bad merge
|
2016-02-19 06:20:50 +00:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Benoit Steiner
|
9e3f3a2d27
|
Deleted outdated comment
|
2016-02-11 17:27:35 -08:00 |
|
Benoit Steiner
|
de345eff2e
|
Added a method to conjugate the content of a tensor or the result of a tensor expression.
|
2016-02-11 16:34:07 -08:00 |
|
Benoit Steiner
|
6323851ea9
|
Fixed compilation warning
|
2016-02-09 20:43:41 -08:00 |
|
Benoit Steiner
|
4d4211c04e
|
Avoid unecessary type conversions
|
2016-02-05 18:19:41 -08:00 |
|
Benoit Steiner
|
d2cba52015
|
Only enable the cxx11_tensor_uint128 test on 64 bit machines since 32 bit systems don't support the __uin128_t type
|
2016-02-05 18:14:23 -08:00 |
|
Benoit Steiner
|
fb00a4af2b
|
Made the tensor fft test compile on tegra x1
|
2016-02-06 01:42:14 +00:00 |
|
Benoit Steiner
|
5d82e47ef6
|
Properly disable nvcc warning messages in user code.
|
2016-02-03 14:10:06 -08:00 |
|
Benoit Steiner
|
af8436b196
|
Silenced the "calling a __host__ function from a __host__ __device__ function is not allowed" messages
|
2016-02-03 13:48:36 -08:00 |
|
Ville Kallioniemi
|
aedea349aa
|
Replace separate low word constructors with a single templated constructor.
|
2016-02-01 20:25:02 -07:00 |
|
Benoit Steiner
|
64ce78c2ec
|
Cleaned up a tensor contraction test
|
2016-02-01 13:57:41 -08:00 |
|
Benoit Steiner
|
0ce5d32be5
|
Sharded the cxx11_tensor_contract_cuda test
|
2016-02-01 13:33:23 -08:00 |
|
Benoit Steiner
|
922b5f527b
|
Silenced a few compilation warnings
|
2016-02-01 13:30:49 -08:00 |
|
Benoit Steiner
|
264f8141f8
|
Shared the tensor reduction test
|
2016-02-01 07:44:31 -08:00 |
|
Benoit Steiner
|
11bb71c8fc
|
Sharded the tensor device test
|
2016-02-01 07:34:59 -08:00 |
|
Benoit Steiner
|
6720b38fbf
|
Fixed a few compilation warnings
|
2016-01-31 16:48:50 -08:00 |
|
Benoit Steiner
|
4a2ddfb81d
|
Sharded the CUDA argmax tensor test
|
2016-01-31 10:44:15 -08:00 |
|
Benoit Steiner
|
483082ef6e
|
Fixed a few memory leaks in the cuda tests
|
2016-01-30 11:59:22 -08:00 |
|
Benoit Steiner
|
bd21aba181
|
Sharded the cxx11_tensor_cuda test and fixed a memory leak
|
2016-01-30 11:47:09 -08:00 |
|
Benoit Steiner
|
9de155d153
|
Added a test to cover threaded tensor shuffling
|
2016-01-30 10:56:47 -08:00 |
|
Benoit Steiner
|
32088c06a1
|
Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.
|
2016-01-30 10:51:14 -08:00 |
|
Benoit Steiner
|
2053478c56
|
Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test
|
2016-01-30 10:46:36 -08:00 |
|
Benoit Steiner
|
d0db95f730
|
Sharded the tensor thread pool test
|
2016-01-30 10:43:57 -08:00 |
|
Benoit Steiner
|
ba27c8a7de
|
Made the CUDA contract test more robust to numerical noise.
|
2016-01-30 10:28:43 -08:00 |
|
Benoit Steiner
|
7b3044d086
|
Made sure to call nvcc with the relaxed-constexpr flag.
|
2016-01-28 15:36:34 -08:00 |
|
Gael Guennebaud
|
7802a6bb1c
|
Fix unit test filename.
|
2016-01-28 09:35:37 +01:00 |
|
Benoit Steiner
|
4bf9eaf77a
|
Deleted an invalid assertion that prevented the assignment of empty tensors.
|
2016-01-27 17:09:30 -08:00 |
|
Benoit Steiner
|
47ca9dc809
|
Fixed the tensor_cuda test
|
2016-01-27 14:58:48 -08:00 |
|
Benoit Steiner
|
55a5204319
|
Fixed the flags passed to nvcc to compile the tensor code.
|
2016-01-27 14:46:34 -08:00 |
|
Benoit Steiner
|
9dfbd4fe8d
|
Made the cuda tests compile using make check
|
2016-01-27 12:22:17 -08:00 |
|
Benoit Steiner
|
5973bcf939
|
Properly specify the namespace when calling cout/endl
|
2016-01-27 12:04:42 -08:00 |
|
Gael Guennebaud
|
6a44ccb58b
|
Backout changeset 690bc950f7
|
2016-01-22 15:03:53 +01:00 |
|
Jan Prach
|
690bc950f7
|
fix clang warnings
"braces around scalar initializer"
|
2016-01-20 19:35:59 -08:00 |
|
Christoph Hertzberg
|
92655e7215
|
bug #1136: Protect isinf for Intel compilers. Also don't distinguish GCC from ICC and don't rely on EIGEN_NOT_A_MACRO, which might not be defined when including this.
|
2015-12-15 11:34:52 +01:00 |
|
Gael Guennebaud
|
ca39b1546e
|
Merged in ebrevdo/eigen (pull request PR-148)
Add special functions to eigen: lgamma, erf, erfc.
|
2015-12-11 11:52:09 +01:00 |
|
Benoit Steiner
|
2d8f2e4042
|
Made 2 tests compile without cxx11.
HdG: --
|
2015-12-10 23:20:04 -08:00 |
|
Benoit Steiner
|
8d28a161b2
|
Use the proper accessor to refer to the value of a scalar tensor
|
2015-12-10 22:53:56 -08:00 |
|
Benoit Steiner
|
9db8316c93
|
Updated the cxx11_tensor_custom_op to not require cxx11.
|
2015-12-10 20:53:44 -08:00 |
|
Benoit Steiner
|
4e324ca6ae
|
Updated the cxx11_tensor_assign test to make it compile without support for cxx11
|
2015-12-10 20:47:25 -08:00 |
|
Eugene Brevdo
|
fa4f933c0f
|
Add special functions to Eigen: lgamma, erf, erfc.
Includes CUDA support and unit tests.
|
2015-12-07 15:24:49 -08:00 |
|
Gael Guennebaud
|
fd727249ad
|
Update ADOL-C support.
|
2015-11-30 16:00:22 +01:00 |
|
Benoit Steiner
|
0ad7c7b1ad
|
Fixed another clang compilation warning
|
2015-11-19 15:52:51 -08:00 |
|
Benoit Steiner
|
66ff9b2c6c
|
Fixed compilation warning generated by clang
|
2015-11-19 15:40:32 -08:00 |
|
Benoit Steiner
|
04f1284f9a
|
Shard the uint128 test
|
2015-11-19 14:08:08 -08:00 |
|
Benoit Steiner
|
e2859c6b71
|
Cleanup the integer division test
|
2015-11-19 14:07:50 -08:00 |
|
Benoit Steiner
|
f8df393165
|
Added support for 128bit integers on CUDA devices.
|
2015-11-19 13:57:27 -08:00 |
|
Benoit Steiner
|
1dd444ea71
|
Avoid using the version of TensorIntDiv optimized for 32-bit integers when the divisor can be equal to one since it isn't supported.
|
2015-11-18 11:37:58 -08:00 |
|