Benoit Steiner
|
0657228569
|
Simplified the way we link libxsmm
|
2016-12-21 14:40:08 -08:00 |
|
Benoit Steiner
|
c19fe5e9ed
|
Added support for libxsmm in the eigen makefiles
|
2016-12-21 10:43:40 -08:00 |
|
Mehdi Goli
|
35bae513a0
|
Converting all parallel-for lambdas to functors in order to prevent duplicate kernel name errors; adding TensorConcatenationOp backend for sycl.
|
2016-12-16 19:46:45 +00:00 |
|
Mehdi Goli
|
2d4a091beb
|
Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h
|
2016-12-14 15:30:37 +00:00 |
|
Mehdi Goli
|
79aa2b784e
|
Adding sycl backend for TensorPadding.h; disabling __uint128 for sycl in TensorIntDiv.h; disabling cache size for sycl in TensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
|
2016-12-01 13:02:27 +00:00 |
|
Mehdi Goli
|
577ce78085
|
Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.
|
2016-11-29 15:30:42 +00:00 |
|
Luke Iwanski
|
c5130dedbe
|
Specialised basic math functions for SYCL device.
|
2016-11-17 11:47:13 +00:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixedSize; adding sycl device memcpy; adding initial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Mehdi Goli
|
0ebe3808ca
|
Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size.
|
2016-11-04 18:18:19 +00:00 |
|
Mehdi Goli
|
524fa4c46f
|
Reducing the code by generalising sycl backend functions/structs.
|
2016-10-14 12:09:55 +01:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Benoit Steiner
|
616a7a1912
|
Improved support for compiling CUDA code with clang as the host compiler
|
2016-10-03 17:09:33 -07:00 |
|
Benoit Steiner
|
422530946f
|
Renamed the SYCL tests to follow the standard naming convention.
|
2016-09-30 08:22:10 -07:00 |
|
RJ Ryan
|
b2c6dc48d9
|
Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.
|
2016-09-20 07:18:20 -07:00 |
|
Luke Iwanski
|
b91e021172
|
Merged with default.
|
2016-09-19 14:03:54 +01:00 |
|
Luke Iwanski
|
cb81975714
|
Partial OpenCL support via SYCL compatible with ComputeCpp CE.
|
2016-09-19 12:44:13 +01:00 |
|
Benoit Steiner
|
e4d4d15588
|
Register the cxx11_tensor_device test only for recent cuda architectures (i.e. >= 3.0) since the test instantiates contractions that require a modern gpu.
|
2016-09-12 19:01:52 -07:00 |
|
Benoit Steiner
|
4dfd888c92
|
CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures.
|
2016-09-12 18:49:01 -07:00 |
|
Benoit Steiner
|
5f50f12d2c
|
Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.
|
2016-09-12 13:46:13 -07:00 |
|
Gael Guennebaud
|
1f84f0d33a
|
merge EulerAngles module
|
2016-08-30 10:01:53 +02:00 |
|
Benoit Steiner
|
fad9828769
|
Deleted redundant regression test.
|
2016-08-03 16:08:37 -07:00 |
|
Benoit Steiner
|
81099ef482
|
Added a test for fp16
|
2016-08-03 11:41:17 -07:00 |
|
Gael Guennebaud
|
d075d122ea
|
Move half unit test from unsupported to main tests
|
2016-07-22 14:34:19 +02:00 |
|
Gael Guennebaud
|
c98bac2966
|
Manually add -std=c++11 to nvcc for old cmake versions
|
2016-07-12 09:29:18 +02:00 |
|
Benoit Steiner
|
40eb97516c
|
Reverted unintended change.
|
2016-07-11 14:28:03 -07:00 |
|
Benoit Steiner
|
03b71c273e
|
Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides.
|
2016-07-11 13:50:24 -07:00 |
|
Gael Guennebaud
|
fd60966310
|
merge
|
2016-07-11 18:11:47 +02:00 |
|
Gael Guennebaud
|
7d636349dc
|
Fix configuration of CUDA:
- preserve user defined CUDA_NVCC_FLAGS
- remove the -ansi flag that conflicts with -std=c++11
- do not add -std=c++11 if already there
|
2016-07-11 18:09:04 +02:00 |
|
Gael Guennebaud
|
2f7e2614e7
|
bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/.
|
2016-07-08 11:13:55 +02:00 |
|
Igor Babuschkin
|
85699850d9
|
Add missing CUDA kernel to tensor scan op
The TensorScanOp implementation was missing a CUDA kernel launch.
This adds a simple placeholder implementation.
|
2016-06-29 11:54:35 +01:00 |
|
Benoit Steiner
|
1a9f92e781
|
Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults.
|
2016-06-27 16:02:52 -07:00 |
|
Benoit Steiner
|
02db4e1a82
|
Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code
|
2016-06-04 08:21:17 -07:00 |
|
Tal Hadad
|
52e4cbf539
|
Merged eigen/eigen into default
|
2016-06-02 22:15:20 +03:00 |
|
Igor Babuschkin
|
fbd7ed6ff7
|
Add tensor scan op
This is the initial implementation of a generic scan operation.
Based on this, cumsum and cumprod methods have been added to TensorBase.
|
2016-06-02 13:35:47 +01:00 |
|
Benoit Steiner
|
5707537592
|
Fixed the "option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr'" warning generated by nvcc 7.5
|
2016-05-27 10:47:53 -07:00 |
|
Benoit Steiner
|
6bf8273bc0
|
Added a test to validate the new non blocking thread pool
|
2016-05-10 10:49:34 -07:00 |
|
Benoit Steiner
|
d14105f158
|
Made several tensor tests compatible with cxx03
|
2016-04-29 17:22:37 -07:00 |
|
Benoit Steiner
|
c0882ef4d9
|
Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section
|
2016-04-29 17:13:51 -07:00 |
|
Benoit Steiner
|
4f53178e62
|
Made a couple of tensor tests compile without requiring c++11 support.
|
2016-04-29 16:09:54 -07:00 |
|
Benoit Steiner
|
bebb89acfa
|
Enabled the new threadpool tests
|
2016-04-14 16:44:10 -07:00 |
|
Benoit Steiner
|
995f202cea
|
Disabled the use of half2 on cuda devices of compute capability < 5.3
|
2016-04-08 14:43:36 -07:00 |
|
Benoit Steiner
|
0d2a532fc3
|
Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc
|
2016-04-08 13:16:08 -07:00 |
|
Benoit Steiner
|
d962fe6a99
|
Renamed float16 into cxx11_float16 since the test relies on c++11 features
|
2016-04-07 20:28:32 -07:00 |
|
Benoit Steiner
|
dc45aaeb93
|
Added tests for float16
|
2016-04-07 11:18:05 -07:00 |
|
Benoit Steiner
|
7781f865cb
|
Renamed the EIGEN_TEST_NVCC cmake option into EIGEN_TEST_CUDA per the discussion in bug #1173.
|
2016-04-06 09:35:23 -07:00 |
|
Benoit Steiner
|
2062ee2d26
|
Added a test to verify that notifications are working properly
|
2016-03-23 13:39:00 -07:00 |
|
Benoit Steiner
|
e7a468c5b7
|
Filter some compilation flags that nvcc warns about.
|
2016-03-22 14:26:50 -07:00 |
|
Benoit Steiner
|
bb0e73c191
|
Gate all the CUDA tests under the EIGEN_TEST_NVCC option
|
2016-03-18 12:17:37 -07:00 |
|
Benoit Steiner
|
53d498ef06
|
Fixed compilation warnings in the cuda tests
|
2016-03-18 07:04:54 -07:00 |
|
Eugene Brevdo
|
5e7de771e3
|
Properly fix merge issues.
|
2016-03-08 17:35:05 -08:00 |
|