Benoit Steiner
|
7781f865cb
|
Renamed the EIGEN_TEST_NVCC cmake option into EIGEN_TEST_CUDA per the discussion in bug #1173.
|
2016-04-06 09:35:23 -07:00 |
|
Benoit Steiner
|
72abfa11dd
|
Added support for isfinite on fp16
|
2016-04-06 09:07:30 -07:00 |
|
Konstantinos Margaritis
|
2bba4ee2cf
|
Merged kmargar/eigen/tip into default
|
2016-04-05 22:22:08 +03:00 |
|
Konstantinos Margaritis
|
317384b397
|
complete the port, remove float support
|
2016-04-05 14:56:45 -04:00 |
|
Gael Guennebaud
|
4d7e230d2f
|
bug #1189: fix pow/atan2 compilation for AutoDiffScalar
|
2016-04-05 14:49:41 +02:00 |
|
Konstantinos Margaritis
|
bc0ad363c6
|
add remaining includes
|
2016-04-05 06:01:17 -04:00 |
|
Konstantinos Margaritis
|
2d41dc9622
|
complete int/double specialized traits for ZVector
|
2016-04-05 06:00:51 -04:00 |
|
Konstantinos Margaritis
|
644d0f91d2
|
enable all tests again
|
2016-04-05 05:59:54 -04:00 |
|
Konstantinos Margaritis
|
988344daf1
|
enable the other includes as well
|
2016-04-05 05:59:30 -04:00 |
|
Benoit Jacob
|
158fea0f5e
|
bug #1190 - Don't trust __ARM_FEATURE_FMA on Clang/ARM
|
2016-04-04 16:42:40 -04:00 |
|
Benoit Jacob
|
03f2997a11
|
bug #1191 - Prevent Clang/ARM from rewriting VMLA into VMUL+VADD
|
2016-04-04 16:41:47 -04:00 |
|
Benoit Steiner
|
c4179dd470
|
Updated the scalar_abs_op struct to make it compatible with cuda devices.
|
2016-04-04 11:11:51 -07:00 |
|
Benoit Steiner
|
1108b4f218
|
Fixed the signature of numext::abs to make it compatible with complex numbers
|
2016-04-04 11:09:25 -07:00 |
|
Gael Guennebaud
|
2b457f8e5e
|
Fix cross-compiling windows version detection
|
2016-04-04 11:47:46 +02:00 |
|
Benoit Steiner
|
3da495e6b9
|
Relaxed the condition used to gate the fft code.
|
2016-03-31 18:11:51 -07:00 |
|
Benoit Steiner
|
0ea7ab4f62
|
Hashing was only officially introduced in c++11. Therefore only define an implementation of the hash function for float16 if c++11 is enabled.
|
2016-03-31 14:44:55 -07:00 |
|
Benoit Steiner
|
92b7f7b650
|
Improved code formating
|
2016-03-31 13:09:58 -07:00 |
|
Benoit Steiner
|
f197813f37
|
Added the ability to hash a fp16
|
2016-03-31 13:09:23 -07:00 |
|
Benoit Steiner
|
0f5cc504fe
|
Properly gate the fft code
|
2016-03-31 12:59:39 -07:00 |
|
Benoit Steiner
|
4c859181da
|
Made it possible to use the NumTraits for complex and Array in a cuda kernel.
|
2016-03-31 12:48:38 -07:00 |
|
Benoit Steiner
|
c36ab19902
|
Added __ldg primitive for fp16.
|
2016-03-31 10:55:03 -07:00 |
|
Benoit Steiner
|
b575fb1d02
|
Added NumTraits for half floats
|
2016-03-31 10:43:59 -07:00 |
|
Benoit Steiner
|
8c8a79cec1
|
Fixed a typo
|
2016-03-31 10:33:32 -07:00 |
|
Benoit Steiner
|
af4ef540bf
|
Fixed a off-by-one bug in a debug assertion
|
2016-03-30 18:37:19 -07:00 |
|
Benoit Steiner
|
791e5cfb69
|
Added NumTraits for type2index.
|
2016-03-30 18:36:36 -07:00 |
|
Benoit Steiner
|
4f1a7e51c1
|
Pull math functions from the global namespace only when compiling cuda code with nvcc. When compiling with clang, we want to use the std namespace.
|
2016-03-30 17:59:49 -07:00 |
|
Benoit Steiner
|
bc68fc2fe7
|
Enable constant expressions when compiling cuda code with clang.
|
2016-03-30 17:58:32 -07:00 |
|
Benoit Steiner
|
483aaad10a
|
Fixed compilation warning
|
2016-03-30 17:08:13 -07:00 |
|
Benoit Steiner
|
1b40abbf99
|
Added missing assignment operator to the TensorUInt128 class, and made misc small improvements
|
2016-03-30 13:17:03 -07:00 |
|
Benoit Jacob
|
01b5333e44
|
bug #1186 - vreinterpretq_u64_f64 fails to build on Android/Aarch64/Clang toolchain
|
2016-03-30 11:02:33 -04:00 |
|
Benoit Steiner
|
aa45ad2aac
|
Fixed the formatting of the README.
|
2016-03-29 15:06:13 -07:00 |
|
Benoit Steiner
|
56df5ef1d7
|
Attempt to fix the formatting of the README
|
2016-03-29 15:03:38 -07:00 |
|
Benoit Steiner
|
1bcd82e31b
|
Pulled latest updates from trunk
|
2016-03-29 13:36:18 -07:00 |
|
Gael Guennebaud
|
09ad31aa85
|
Add regression test for nesting type handling in blas_traits
|
2016-03-29 22:33:57 +02:00 |
|
Benoit Steiner
|
1841d6d4c3
|
Added missing cuda template specializations for numext::ceil
|
2016-03-29 13:29:34 -07:00 |
|
Benoit Steiner
|
7b7d2a9fa5
|
Use false instead of 0 as the expected value of a boolean
|
2016-03-29 11:50:17 -07:00 |
|
Benoit Steiner
|
e02b784ec3
|
Added support for standard mathematical functions and trancendentals(such as exp, log, abs, ...) on fp16
|
2016-03-29 09:20:36 -07:00 |
|
Benoit Steiner
|
c38295f0a0
|
Added support for fmod
|
2016-03-28 15:53:02 -07:00 |
|
Benoit Steiner
|
6772f653c3
|
Made it possible to customize the threadpool
|
2016-03-28 10:01:04 -07:00 |
|
Benoit Steiner
|
1bc81f7889
|
Fixed compilation warnings on arm
|
2016-03-28 09:21:04 -07:00 |
|
Benoit Steiner
|
78f83d6f6a
|
Prevent potential overflow.
|
2016-03-28 09:18:04 -07:00 |
|
Konstantinos Margaritis
|
01e7298fe6
|
actually include ZVector files, passes most basic tests (float still fails)
|
2016-03-28 10:58:02 -04:00 |
|
Konstantinos Margaritis
|
f48011119e
|
Merged eigen/eigen into default
|
2016-03-28 01:48:45 +03:00 |
|
Konstantinos Margaritis
|
ed6b9d08f1
|
some primitives ported, but missing intrinsics and crash with asm() are a problem
|
2016-03-27 18:47:49 -04:00 |
|
Benoit Steiner
|
74f91ed06c
|
Improved support for integer modulo
|
2016-03-25 17:21:56 -07:00 |
|
Benoit Steiner
|
65716e99a5
|
Improved the cost estimate of the quotient op
|
2016-03-25 11:13:53 -07:00 |
|
Benoit Steiner
|
d94f6ba965
|
Started to model the cost of divisions more accurately.
|
2016-03-25 11:02:56 -07:00 |
|
Benoit Steiner
|
a86c9f037b
|
Fixed compilation error on windows
|
2016-03-24 18:54:31 -07:00 |
|
Benoit Steiner
|
0968e925a0
|
Updated the benchmarking code to use Eigen::half instead of half
|
2016-03-24 18:00:33 -07:00 |
|
Benoit Steiner
|
044efea965
|
Made sure that the cxx11_tensor_cuda test can be compiled even without support for cxx11.
|
2016-03-23 20:02:11 -07:00 |
|