Benoit Steiner
|
af199b4658
|
Made the CUDA architecture level a build setting.
|
2016-02-25 09:06:18 -08:00 |
|
Benoit Steiner
|
e644f60907
|
Pulled latest updates from trunk
|
2016-02-21 20:24:59 +00:00 |
|
Benoit Steiner
|
95fceb6452
|
Added the ability to compute the absolute value of a half float
|
2016-02-21 20:24:11 +00:00 |
|
Benoit Steiner
|
ed69cbeef0
|
Added some debugging information to the test to figure out why it fails sometimes
|
2016-02-21 11:20:20 -08:00 |
|
Benoit Steiner
|
1e6fe6f046
|
Fixed the float16 tensor test.
|
2016-02-20 07:44:17 +00:00 |
|
Benoit Steiner
|
180156ba1a
|
Added support for tensor reductions on half floats
|
2016-02-19 10:05:59 -08:00 |
|
Benoit Steiner
|
a08d2ff0c9
|
Started to work on contractions and reductions using half floats
|
2016-02-19 15:59:59 +00:00 |
|
Benoit Steiner
|
ac5d706a94
|
Added support for simple coefficient wise tensor expression using half floats on CUDA devices
|
2016-02-19 08:19:12 +00:00 |
|
Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA is only available starting with CUDA 7.5. Disable it when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
f36c0c2c65
|
Added regression test for float16
|
2016-02-19 06:23:28 +00:00 |
|
Benoit Steiner
|
7151bd8768
|
Reverted unintended changes introduced by a bad merge
|
2016-02-19 06:20:50 +00:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Benoit Steiner
|
9e3f3a2d27
|
Deleted outdated comment
|
2016-02-11 17:27:35 -08:00 |
|
Benoit Steiner
|
de345eff2e
|
Added a method to conjugate the content of a tensor or the result of a tensor expression.
|
2016-02-11 16:34:07 -08:00 |
|
Benoit Steiner
|
6323851ea9
|
Fixed compilation warning
|
2016-02-09 20:43:41 -08:00 |
|
Benoit Steiner
|
4d4211c04e
|
Avoid unnecessary type conversions
|
2016-02-05 18:19:41 -08:00 |
|
Benoit Steiner
|
d2cba52015
|
Only enable the cxx11_tensor_uint128 test on 64-bit machines since 32-bit systems don't support the __uint128_t type
|
2016-02-05 18:14:23 -08:00 |
|
Benoit Steiner
|
fb00a4af2b
|
Made the tensor fft test compile on tegra x1
|
2016-02-06 01:42:14 +00:00 |
|
Benoit Steiner
|
5d82e47ef6
|
Properly disable nvcc warning messages in user code.
|
2016-02-03 14:10:06 -08:00 |
|
Benoit Steiner
|
af8436b196
|
Silenced the "calling a __host__ function from a __host__ __device__ function is not allowed" messages
|
2016-02-03 13:48:36 -08:00 |
|
Ville Kallioniemi
|
aedea349aa
|
Replace separate low word constructors with a single templated constructor.
|
2016-02-01 20:25:02 -07:00 |
|
Benoit Steiner
|
64ce78c2ec
|
Cleaned up a tensor contraction test
|
2016-02-01 13:57:41 -08:00 |
|
Benoit Steiner
|
0ce5d32be5
|
Sharded the cxx11_tensor_contract_cuda test
|
2016-02-01 13:33:23 -08:00 |
|
Benoit Steiner
|
922b5f527b
|
Silenced a few compilation warnings
|
2016-02-01 13:30:49 -08:00 |
|
Benoit Steiner
|
264f8141f8
|
Sharded the tensor reduction test
|
2016-02-01 07:44:31 -08:00 |
|
Benoit Steiner
|
11bb71c8fc
|
Sharded the tensor device test
|
2016-02-01 07:34:59 -08:00 |
|
Benoit Steiner
|
6720b38fbf
|
Fixed a few compilation warnings
|
2016-01-31 16:48:50 -08:00 |
|
Benoit Steiner
|
4a2ddfb81d
|
Sharded the CUDA argmax tensor test
|
2016-01-31 10:44:15 -08:00 |
|
Benoit Steiner
|
483082ef6e
|
Fixed a few memory leaks in the cuda tests
|
2016-01-30 11:59:22 -08:00 |
|
Benoit Steiner
|
bd21aba181
|
Sharded the cxx11_tensor_cuda test and fixed a memory leak
|
2016-01-30 11:47:09 -08:00 |
|
Benoit Steiner
|
9de155d153
|
Added a test to cover threaded tensor shuffling
|
2016-01-30 10:56:47 -08:00 |
|
Benoit Steiner
|
32088c06a1
|
Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.
|
2016-01-30 10:51:14 -08:00 |
|
Benoit Steiner
|
2053478c56
|
Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test
|
2016-01-30 10:46:36 -08:00 |
|
Benoit Steiner
|
d0db95f730
|
Sharded the tensor thread pool test
|
2016-01-30 10:43:57 -08:00 |
|
Benoit Steiner
|
ba27c8a7de
|
Made the CUDA contract test more robust to numerical noise.
|
2016-01-30 10:28:43 -08:00 |
|
Benoit Steiner
|
7b3044d086
|
Made sure to call nvcc with the relaxed-constexpr flag.
|
2016-01-28 15:36:34 -08:00 |
|
Gael Guennebaud
|
7802a6bb1c
|
Fix unit test filename.
|
2016-01-28 09:35:37 +01:00 |
|
Benoit Steiner
|
4bf9eaf77a
|
Deleted an invalid assertion that prevented the assignment of empty tensors.
|
2016-01-27 17:09:30 -08:00 |
|
Benoit Steiner
|
47ca9dc809
|
Fixed the tensor_cuda test
|
2016-01-27 14:58:48 -08:00 |
|
Benoit Steiner
|
55a5204319
|
Fixed the flags passed to nvcc to compile the tensor code.
|
2016-01-27 14:46:34 -08:00 |
|
Benoit Steiner
|
9dfbd4fe8d
|
Made the cuda tests compile using make check
|
2016-01-27 12:22:17 -08:00 |
|
Benoit Steiner
|
5973bcf939
|
Properly specify the namespace when calling cout/endl
|
2016-01-27 12:04:42 -08:00 |
|
Gael Guennebaud
|
6a44ccb58b
|
Backout changeset 690bc950f7
|
2016-01-22 15:03:53 +01:00 |
|
Jan Prach
|
690bc950f7
|
fix clang warnings
"braces around scalar initializer"
|
2016-01-20 19:35:59 -08:00 |
|
Christoph Hertzberg
|
92655e7215
|
bug #1136: Protect isinf for Intel compilers. Also don't distinguish GCC from ICC and don't rely on EIGEN_NOT_A_MACRO, which might not be defined when including this.
|
2015-12-15 11:34:52 +01:00 |
|
Gael Guennebaud
|
ca39b1546e
|
Merged in ebrevdo/eigen (pull request PR-148)
Add special functions to eigen: lgamma, erf, erfc.
|
2015-12-11 11:52:09 +01:00 |
|
Benoit Steiner
|
2d8f2e4042
|
Made 2 tests compile without cxx11.
|
2015-12-10 23:20:04 -08:00 |
|
Benoit Steiner
|
8d28a161b2
|
Use the proper accessor to refer to the value of a scalar tensor
|
2015-12-10 22:53:56 -08:00 |
|
Benoit Steiner
|
9db8316c93
|
Updated the cxx11_tensor_custom_op to not require cxx11.
|
2015-12-10 20:53:44 -08:00 |
|
Benoit Steiner
|
4e324ca6ae
|
Updated the cxx11_tensor_assign test to make it compile without support for cxx11
|
2015-12-10 20:47:25 -08:00 |
|