Benoit Steiner
|
068cc09708
|
Preserve file naming conventions
|
2017-04-04 10:09:10 -07:00 |
|
Mehdi Goli
|
f84963ed95
|
Adding TensorIndexTuple and TensorTupleReduceOP backend (ArgMax/Min) for sycl; fixing the address space issue for const TensorMap; converting all discard_write to write due to data mismatch.
|
2017-03-07 14:27:10 +00:00 |
|
Mehdi Goli
|
8296b87d7b
|
Adding sycl backend for TensorCustomOp; fixing the partial lhs modification issue on sycl when the rhs is TensorContraction, reduction or convolution; Fixing the partial modification for memset when sycl backend is used.
|
2017-02-28 17:16:14 +00:00 |
|
Mehdi Goli
|
2fa2b617a9
|
Adding TensorVolumePatchOP.h for sycl
|
2017-02-24 19:16:24 +00:00 |
|
Mehdi Goli
|
89dfd51fae
|
Adding Sycl Backend for TensorGenerator.h.
|
2017-02-22 16:36:24 +00:00 |
|
Mehdi Goli
|
79ebc8f761
|
Adding Sycl backend for TensorImagePatchOP.h; adding Sycl backend for TensorInflation.h.
|
2017-02-20 12:11:05 +00:00 |
|
Mehdi Goli
|
91982b91c0
|
Adding TensorLayoutSwapOp for sycl.
|
2017-02-15 16:28:12 +00:00 |
|
Mehdi Goli
|
b1e312edd6
|
Adding TensorPatch.h for sycl backend.
|
2017-02-15 10:13:01 +00:00 |
|
Mehdi Goli
|
0d153ded29
|
Adding TensorChippingOP for sycl backend; fixing the index value in the verification operation for cxx11_tensorChipping.cpp test
|
2017-02-13 17:25:12 +00:00 |
|
Benoit Steiner
|
fbc39fd02c
|
Merge latest changes from upstream
|
2017-01-30 15:25:57 -08:00 |
|
Mehdi Goli
|
6bdd15f572
|
Adding non-dereferenceable pointer track for ComputeCpp backend; adding TensorConvolutionOp for ComputeCpp; fixing typos; modifying TensorDeviceSycl to use the LegacyPointer class.
|
2017-01-19 11:30:59 +00:00 |
|
Mehdi Goli
|
e46e722381
|
Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; modifying Tensor Contractsycl so that it can be located anywhere in the expression tree.
|
2017-01-16 13:58:49 +00:00 |
|
Benoit Steiner
|
0657228569
|
Simplified the way we link libxsmm
|
2016-12-21 14:40:08 -08:00 |
|
Benoit Steiner
|
c19fe5e9ed
|
Added support for libxsmm in the eigen makefiles
|
2016-12-21 10:43:40 -08:00 |
|
Mehdi Goli
|
35bae513a0
|
Converting all parallel for lambdas to functors in order to prevent duplicate kernel name errors; adding tensorConcatenationOp backend for sycl.
|
2016-12-16 19:46:45 +00:00 |
|
Mehdi Goli
|
2d4a091beb
|
Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h
|
2016-12-14 15:30:37 +00:00 |
|
Mehdi Goli
|
79aa2b784e
|
Adding sycl backend for TensorPadding.h; disabling __uint128 for sycl in TensorIntDiv.h; disabling cache size for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
|
2016-12-01 13:02:27 +00:00 |
|
Mehdi Goli
|
577ce78085
|
Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.
|
2016-11-29 15:30:42 +00:00 |
|
Luke Iwanski
|
c5130dedbe
|
Specialised basic math functions for SYCL device.
|
2016-11-17 11:47:13 +00:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Mehdi Goli
|
0ebe3808ca
|
Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size.
|
2016-11-04 18:18:19 +00:00 |
|
Mehdi Goli
|
524fa4c46f
|
Reducing the code by generalising sycl backend functions/structs.
|
2016-10-14 12:09:55 +01:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Benoit Steiner
|
616a7a1912
|
Improved support for compiling CUDA code with clang as the host compiler
|
2016-10-03 17:09:33 -07:00 |
|
Benoit Steiner
|
422530946f
|
Renamed the SYCL tests to follow the standard naming convention.
|
2016-09-30 08:22:10 -07:00 |
|
RJ Ryan
|
b2c6dc48d9
|
Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.
|
2016-09-20 07:18:20 -07:00 |
|
Luke Iwanski
|
b91e021172
|
Merged with default.
|
2016-09-19 14:03:54 +01:00 |
|
Luke Iwanski
|
cb81975714
|
Partial OpenCL support via SYCL compatible with ComputeCpp CE.
|
2016-09-19 12:44:13 +01:00 |
|
Benoit Steiner
|
e4d4d15588
|
Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= 3.0) since the test instantiates contractions that require a modern gpu.
|
2016-09-12 19:01:52 -07:00 |
|
Benoit Steiner
|
4dfd888c92
|
CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures.
|
2016-09-12 18:49:01 -07:00 |
|
Benoit Steiner
|
5f50f12d2c
|
Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.
|
2016-09-12 13:46:13 -07:00 |
|
Gael Guennebaud
|
1f84f0d33a
|
merge EulerAngles module
|
2016-08-30 10:01:53 +02:00 |
|
Benoit Steiner
|
fad9828769
|
Deleted redundant regression test.
|
2016-08-03 16:08:37 -07:00 |
|
Benoit Steiner
|
81099ef482
|
Added a test for fp16
|
2016-08-03 11:41:17 -07:00 |
|
Gael Guennebaud
|
d075d122ea
|
Move half unit test from unsupported to main tests
|
2016-07-22 14:34:19 +02:00 |
|
Gael Guennebaud
|
c98bac2966
|
Manually add -std=c++11 to nvcc for old cmake versions
|
2016-07-12 09:29:18 +02:00 |
|
Benoit Steiner
|
40eb97516c
|
reverted unintended change.
|
2016-07-11 14:28:03 -07:00 |
|
Benoit Steiner
|
03b71c273e
|
Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides.
|
2016-07-11 13:50:24 -07:00 |
|
Gael Guennebaud
|
fd60966310
|
merge
|
2016-07-11 18:11:47 +02:00 |
|
Gael Guennebaud
|
7d636349dc
|
Fix configuration of CUDA:
- preserve user defined CUDA_NVCC_FLAGS
- remove the -ansi flag that conflicts with -std=c++11
- do not add -std=c++11 if already there
|
2016-07-11 18:09:04 +02:00 |
|
Gael Guennebaud
|
2f7e2614e7
|
bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/.
|
2016-07-08 11:13:55 +02:00 |
|
Igor Babuschkin
|
85699850d9
|
Add missing CUDA kernel to tensor scan op
The TensorScanOp implementation was missing a CUDA kernel launch.
This adds a simple placeholder implementation.
|
2016-06-29 11:54:35 +01:00 |
|
Benoit Steiner
|
1a9f92e781
|
Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults.
|
2016-06-27 16:02:52 -07:00 |
|
Benoit Steiner
|
02db4e1a82
|
Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code
|
2016-06-04 08:21:17 -07:00 |
|
Tal Hadad
|
52e4cbf539
|
Merged eigen/eigen into default
|
2016-06-02 22:15:20 +03:00 |
|
Igor Babuschkin
|
fbd7ed6ff7
|
Add tensor scan op
This is the initial implementation of a generic scan operation.
Based on this, cumsum and cumprod methods have been added to TensorBase.
|
2016-06-02 13:35:47 +01:00 |
|
Benoit Steiner
|
5707537592
|
Fixed the "option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr'" warning generated by nvcc 7.5
|
2016-05-27 10:47:53 -07:00 |
|
Benoit Steiner
|
6bf8273bc0
|
Added a test to validate the new non blocking thread pool
|
2016-05-10 10:49:34 -07:00 |
|
Benoit Steiner
|
d14105f158
|
Made several tensor tests compatible with cxx03
|
2016-04-29 17:22:37 -07:00 |
|
Benoit Steiner
|
c0882ef4d9
|
Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section
|
2016-04-29 17:13:51 -07:00 |
|