Benoit Steiner
|
667fcc2b53
|
Fixed syntax error
|
2016-03-04 14:37:51 -08:00 |
|
Benoit Steiner
|
4416a5dcff
|
Added missing include
|
2016-03-04 14:35:43 -08:00 |
|
Benoit Steiner
|
174edf976b
|
Made the contraction test more portable
|
2016-03-04 14:11:13 -08:00 |
|
Benoit Steiner
|
deea866bbd
|
Added tests to cover the new rounding, flooring and ceiling tensor operations.
|
2016-03-03 12:38:02 -08:00 |
|
Benoit Steiner
|
dac58d7c35
|
Added a test to validate the conversion of half floats into floats on Kepler GPUs.
Restricted the testing of the random number generation code to GPU architecture greater than or equal to 3.5.
|
2016-03-03 10:37:25 -08:00 |
|
Benoit Steiner
|
af199b4658
|
Made the CUDA architecture level a build setting.
|
2016-02-25 09:06:18 -08:00 |
|
Benoit Steiner
|
e644f60907
|
Pulled latest updates from trunk
|
2016-02-21 20:24:59 +00:00 |
|
Benoit Steiner
|
95fceb6452
|
Added the ability to compute the absolute value of a half float
|
2016-02-21 20:24:11 +00:00 |
|
Benoit Steiner
|
ed69cbeef0
|
Added some debugging information to the test to figure out why it fails sometimes
|
2016-02-21 11:20:20 -08:00 |
|
Benoit Steiner
|
1e6fe6f046
|
Fixed the float16 tensor test.
|
2016-02-20 07:44:17 +00:00 |
|
Benoit Steiner
|
180156ba1a
|
Added support for tensor reductions on half floats
|
2016-02-19 10:05:59 -08:00 |
|
Benoit Steiner
|
a08d2ff0c9
|
Started to work on contractions and reductions using half floats
|
2016-02-19 15:59:59 +00:00 |
|
Benoit Steiner
|
ac5d706a94
|
Added support for simple coefficient-wise tensor expressions using half floats on CUDA devices
|
2016-02-19 08:19:12 +00:00 |
|
Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA is only available starting with CUDA 7.5. Disable it when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
f36c0c2c65
|
Added regression test for float16
|
2016-02-19 06:23:28 +00:00 |
|
Benoit Steiner
|
7151bd8768
|
Reverted unintended changes introduced by a bad merge
|
2016-02-19 06:20:50 +00:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPUs. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Benoit Steiner
|
9e3f3a2d27
|
Deleted outdated comment
|
2016-02-11 17:27:35 -08:00 |
|
Benoit Steiner
|
de345eff2e
|
Added a method to conjugate the content of a tensor or the result of a tensor expression.
|
2016-02-11 16:34:07 -08:00 |
|
Benoit Steiner
|
6323851ea9
|
Fixed compilation warning
|
2016-02-09 20:43:41 -08:00 |
|
Benoit Steiner
|
4d4211c04e
|
Avoid unnecessary type conversions
|
2016-02-05 18:19:41 -08:00 |
|
Benoit Steiner
|
d2cba52015
|
Only enable the cxx11_tensor_uint128 test on 64-bit machines since 32-bit systems don't support the __uint128_t type
|
2016-02-05 18:14:23 -08:00 |
|
Benoit Steiner
|
fb00a4af2b
|
Made the tensor fft test compile on tegra x1
|
2016-02-06 01:42:14 +00:00 |
|
Benoit Steiner
|
5d82e47ef6
|
Properly disable nvcc warning messages in user code.
|
2016-02-03 14:10:06 -08:00 |
|
Benoit Steiner
|
af8436b196
|
Silenced the "calling a __host__ function from a __host__ __device__ function is not allowed" messages
|
2016-02-03 13:48:36 -08:00 |
|
Ville Kallioniemi
|
aedea349aa
|
Replace separate low word constructors with a single templated constructor.
|
2016-02-01 20:25:02 -07:00 |
|
Benoit Steiner
|
64ce78c2ec
|
Cleaned up a tensor contraction test
|
2016-02-01 13:57:41 -08:00 |
|
Benoit Steiner
|
0ce5d32be5
|
Sharded the cxx11_tensor_contract_cuda test
|
2016-02-01 13:33:23 -08:00 |
|
Benoit Steiner
|
922b5f527b
|
Silenced a few compilation warnings
|
2016-02-01 13:30:49 -08:00 |
|
Benoit Steiner
|
264f8141f8
|
Sharded the tensor reduction test
|
2016-02-01 07:44:31 -08:00 |
|
Benoit Steiner
|
11bb71c8fc
|
Sharded the tensor device test
|
2016-02-01 07:34:59 -08:00 |
|
Benoit Steiner
|
6720b38fbf
|
Fixed a few compilation warnings
|
2016-01-31 16:48:50 -08:00 |
|
Benoit Steiner
|
4a2ddfb81d
|
Sharded the CUDA argmax tensor test
|
2016-01-31 10:44:15 -08:00 |
|
Benoit Steiner
|
483082ef6e
|
Fixed a few memory leaks in the cuda tests
|
2016-01-30 11:59:22 -08:00 |
|
Benoit Steiner
|
bd21aba181
|
Sharded the cxx11_tensor_cuda test and fixed a memory leak
|
2016-01-30 11:47:09 -08:00 |
|
Benoit Steiner
|
9de155d153
|
Added a test to cover threaded tensor shuffling
|
2016-01-30 10:56:47 -08:00 |
|
Benoit Steiner
|
32088c06a1
|
Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.
|
2016-01-30 10:51:14 -08:00 |
|
Benoit Steiner
|
2053478c56
|
Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test
|
2016-01-30 10:46:36 -08:00 |
|
Benoit Steiner
|
d0db95f730
|
Sharded the tensor thread pool test
|
2016-01-30 10:43:57 -08:00 |
|
Benoit Steiner
|
ba27c8a7de
|
Made the CUDA contract test more robust to numerical noise.
|
2016-01-30 10:28:43 -08:00 |
|
Benoit Steiner
|
7b3044d086
|
Made sure to call nvcc with the relaxed-constexpr flag.
|
2016-01-28 15:36:34 -08:00 |
|
Gael Guennebaud
|
7802a6bb1c
|
Fix unit test filename.
|
2016-01-28 09:35:37 +01:00 |
|
Benoit Steiner
|
4bf9eaf77a
|
Deleted an invalid assertion that prevented the assignment of empty tensors.
|
2016-01-27 17:09:30 -08:00 |
|
Benoit Steiner
|
47ca9dc809
|
Fixed the tensor_cuda test
|
2016-01-27 14:58:48 -08:00 |
|
Benoit Steiner
|
55a5204319
|
Fixed the flags passed to nvcc to compile the tensor code.
|
2016-01-27 14:46:34 -08:00 |
|
Benoit Steiner
|
9dfbd4fe8d
|
Made the cuda tests compile using make check
|
2016-01-27 12:22:17 -08:00 |
|
Benoit Steiner
|
5973bcf939
|
Properly specify the namespace when calling cout/endl
|
2016-01-27 12:04:42 -08:00 |
|
Gael Guennebaud
|
6a44ccb58b
|
Backout changeset 690bc950f7
|
2016-01-22 15:03:53 +01:00 |
|
Jan Prach
|
690bc950f7
|
fix clang warnings
"braces around scalar initializer"
|
2016-01-20 19:35:59 -08:00 |
|
Christoph Hertzberg
|
92655e7215
|
bug #1136: Protect isinf for Intel compilers. Also don't distinguish GCC from ICC and don't rely on EIGEN_NOT_A_MACRO, which might not be defined when including this.
|
2015-12-15 11:34:52 +01:00 |
|