Mehdi Goli
|
0ebe3808ca
|
Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size;
|
2016-11-04 18:18:19 +00:00 |
|
Benoit Steiner
|
0585b2965d
|
Disable vectorization on device only when compiling for sycl
|
2016-11-02 11:44:27 -07:00 |
|
Mehdi Goli
|
51af6ae971
|
Fixed the ambiguity in callig make_tuple for sycl backend.
|
2016-10-31 16:35:51 +00:00 |
|
Benoit Steiner
|
0a9ad6fc72
|
Worked around Visual Studio compilation errors
|
2016-10-28 07:54:27 -07:00 |
|
Benoit Steiner
|
b0c5bfdf78
|
Added missing template parameters
|
2016-10-28 03:43:41 +00:00 |
|
Gael Guennebaud
|
530f20c21a
|
Workaround MSVC issue.
|
2016-10-27 21:51:37 +02:00 |
|
Benoit Steiner
|
0a4c4d40b4
|
Removed a template parameter for fixed sized tensors
|
2016-10-26 18:47:37 -07:00 |
|
Benoit Steiner
|
5f2dd503ff
|
Replaced tabs with spaces
|
2016-10-25 20:40:58 -07:00 |
|
Benoit Steiner
|
1644bafe29
|
Code cleanup
|
2016-10-25 20:36:14 -07:00 |
|
Benoit Steiner
|
cf20b30d65
|
Merge latest updates from trunk
|
2016-10-20 09:42:05 -07:00 |
|
Luke Iwanski
|
03b63e182c
|
Added SYCL include in Tensor.
|
2016-10-20 15:32:44 +01:00 |
|
Benoit Steiner
|
d3943cd50c
|
Fixed a few typos in the ternary tensor expressions types
|
2016-10-19 12:56:12 -07:00 |
|
Mehdi Goli
|
e36cb91c99
|
Fixing the code indentation in the TensorReduction.h file.
|
2016-10-14 18:03:00 +01:00 |
|
Luke Iwanski
|
e742da8b28
|
Merged ComputeCpp into default.
|
2016-10-14 13:36:51 +01:00 |
|
Mehdi Goli
|
524fa4c46f
|
Reducing the code by generalising sycl backend functions/structs.
|
2016-10-14 12:09:55 +01:00 |
|
Benoit Steiner
|
7e4a6754b2
|
Merged eigen/eigen into default
|
2016-10-12 22:42:33 -07:00 |
|
Benoit Steiner
|
7f0599b6eb
|
Manually define int16_t and uint16_t when compiling with Visual Studio
|
2016-10-08 22:56:32 -07:00 |
|
Benoit Steiner
|
5c68051cd7
|
Merge the content of the ComputeCpp branch into the default branch
|
2016-10-07 11:04:16 -07:00 |
|
RJ Ryan
|
e2e9cdd169
|
Fully support complex types in SumReducer and MeanReducer when building for CUDA by using scalar_sum_op and scalar_product_op instead of operator+ and operator*.
|
2016-10-06 10:49:48 -07:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Benoit Steiner
|
c84084c0c0
|
Fixed compilation warning
|
2016-10-05 14:15:41 -07:00 |
|
Benoit Steiner
|
8b69d5d730
|
::rand() returns a signed integer on win32
|
2016-10-05 08:55:02 -07:00 |
|
Benoit Steiner
|
ed7a220b04
|
Fixed a typo that impacts windows builds
|
2016-10-05 08:51:31 -07:00 |
|
Benoit Steiner
|
ceee1c008b
|
Silenced compilation warning
|
2016-10-04 18:47:53 -07:00 |
|
Benoit Steiner
|
6af5ac7e27
|
Cleanup the cuda executor code.
|
2016-10-04 08:52:13 -07:00 |
|
Benoit Steiner
|
2f6d1607c8
|
Cleaned up the random number generation code.
|
2016-10-04 08:38:23 -07:00 |
|
Benoit Steiner
|
2bda1b0d93
|
Updated the tensor sum and mean reducer to enable them to process complex numbers on cuda gpus.
|
2016-09-28 17:08:41 -07:00 |
|
Mehdi Goli
|
dd602e62c8
|
Converting alias template to nested struct in order to be compatible with CXX-03
|
2016-09-27 16:21:19 +01:00 |
|
Benoit Steiner
|
6565f8d60f
|
Made the initialization of a CUDA device thread safe.
|
2016-09-26 11:00:32 -07:00 |
|
Benoit Steiner
|
f6ac51a054
|
Made TensorEvalTo compatible with c++0x again.
|
2016-09-23 16:45:17 -07:00 |
|
Benoit Steiner
|
00d4e65f00
|
Deleted unused TensorMap data member
|
2016-09-23 16:44:45 -07:00 |
|
Benoit Steiner
|
1301d744f8
|
Made the gaussian generator usable on GPU
|
2016-09-22 19:04:44 -07:00 |
|
Benoit Steiner
|
c3ca9b1e76
|
Deleted some unecessary and confusing EIGEN_DEVICE_FUNC
|
2016-09-19 11:33:39 -07:00 |
|
Luke Iwanski
|
b91e021172
|
Merged with default.
|
2016-09-19 14:03:54 +01:00 |
|
Luke Iwanski
|
cb81975714
|
Partial OpenCL support via SYCL compatible with ComputeCpp CE.
|
2016-09-19 12:44:13 +01:00 |
|
Gael Guennebaud
|
18f6e47815
|
Fix order of "static inline".
|
2016-09-16 11:32:54 +02:00 |
|
Benoit Steiner
|
488ad7dd1b
|
Added missing EIGEN_DEVICE_FUNC qualifiers
|
2016-09-14 13:35:00 -07:00 |
|
Benoit Steiner
|
028e299577
|
Fixed a bug impacting some outer reductions on GPU
|
2016-09-12 18:36:52 -07:00 |
|
Benoit Steiner
|
8321dcce76
|
Merged latest updates from trunk
|
2016-09-12 10:33:05 -07:00 |
|
Benoit Steiner
|
eb6ba00cc8
|
Properly size the list of waiters
|
2016-09-12 10:31:55 -07:00 |
|
Benoit Steiner
|
a618094b62
|
Added a resize method to MaxSizeVector
|
2016-09-12 10:30:53 -07:00 |
|
Gael Guennebaud
|
471eac5399
|
bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with some specializations in arch/SSE and arch/AVX)
|
2016-09-08 08:36:27 +02:00 |
|
Benoit Steiner
|
13df3441ae
|
Use MaxSizeVector instead of std::vector: xcode sometimes assumes that std::vector allocates aligned memory and therefore issues aligned instruction to initialize it. This can result in random crashes when compiling with AVX instructions enabled.
|
2016-09-02 19:25:47 -07:00 |
|
Benoit Steiner
|
cadd124d73
|
Pulled latest update from trunk
|
2016-09-02 15:30:02 -07:00 |
|
Benoit Steiner
|
05b0518077
|
Made the index type an explicit template parameter to help some compilers compile the code.
|
2016-09-02 15:29:34 -07:00 |
|
Benoit Steiner
|
adf864fec0
|
Merged in rmlarsen/eigen (pull request PR-222)
Fix CUDA build broken by changes to min and max reduction.
|
2016-09-02 14:11:20 -07:00 |
|
Rasmus Munk Larsen
|
13e93ca8b7
|
Fix CUDA build broken by changes to min and max reduction.
|
2016-09-02 13:41:36 -07:00 |
|
Benoit Steiner
|
c53f783705
|
Updated the contraction code to support constant inputs.
|
2016-09-01 11:41:27 -07:00 |
|
Gael Guennebaud
|
46475eff9a
|
Adjust Tensor module wrt recent change in nullary functor
|
2016-09-01 13:40:45 +02:00 |
|
Rasmus Munk Larsen
|
a1e092d1e8
|
Fix bugs to make min- and max reducers with correctly with IEEE infinities.
|
2016-08-31 15:04:16 -07:00 |
|