eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-21 07:19:46 +08:00

Author	SHA1	Message	Date
Benoit Steiner	068cc09708	Preserve file naming conventions	2017-04-04 10:09:10 -07:00
Mehdi Goli	f84963ed95	Adding TensorIndexTuple and TensorTupleReduceOP backend (ArgMax/Min) for sycl; fixing the address space issue for const TensorMap; converting all discard_write to write due to data missmatch.	2017-03-07 14:27:10 +00:00
Mehdi Goli	8296b87d7b	Adding sycl backend for TensorCustomOp; fixing the partial lhs modification issue on sycl when the rhs is TensorContraction, reduction or convolution; Fixing the partial modification for memset when sycl backend is used.	2017-02-28 17:16:14 +00:00
Mehdi Goli	2fa2b617a9	Adding TensorVolumePatchOP.h for sycl	2017-02-24 19:16:24 +00:00
Mehdi Goli	89dfd51fae	Adding Sycl Backend for TensorGenerator.h.	2017-02-22 16:36:24 +00:00
Mehdi Goli	79ebc8f761	Adding Sycl backend for TensorImagePatchOP.h; adding Sycl backend for TensorInflation.h.	2017-02-20 12:11:05 +00:00
Mehdi Goli	91982b91c0	Adding TensorLayoutSwapOp for sycl.	2017-02-15 16:28:12 +00:00
Mehdi Goli	b1e312edd6	Adding TensorPatch.h for sycl backend.	2017-02-15 10:13:01 +00:00
Mehdi Goli	0d153ded29	Adding TensorChippingOP for sycl backend; fixing the index value in the verification operation for cxx11_tensorChipping.cpp test	2017-02-13 17:25:12 +00:00
Benoit Steiner	fbc39fd02c	Merge latest changes from upstream	2017-01-30 15:25:57 -08:00
Mehdi Goli	6bdd15f572	Adding non-deferrenciable pointer track for ComputeCpp backend; Adding TensorConvolutionOp for ComputeCpp; fixing typos. modifying TensorDeviceSycl to use the LegacyPointer class.	2017-01-19 11:30:59 +00:00
Mehdi Goli	e46e722381	Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying Tensor Contractsycl to be located in any place in the expression tree.	2017-01-16 13:58:49 +00:00
Benoit Steiner	0657228569	Simplified the way we link libxsmm	2016-12-21 14:40:08 -08:00
Benoit Steiner	c19fe5e9ed	Added support for libxsmm in the eigen makefiles	2016-12-21 10:43:40 -08:00
Mehdi Goli	35bae513a0	Converting all parallel for lambda to functor in order to prevent kernel duplication name error; adding tensorConcatinationOp backend for sycl.	2016-12-16 19:46:45 +00:00
Mehdi Goli	2d4a091beb	Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h	2016-12-14 15:30:37 +00:00
Mehdi Goli	79aa2b784e	Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.	2016-12-01 13:02:27 +00:00
Mehdi Goli	577ce78085	Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.	2016-11-29 15:30:42 +00:00
Luke Iwanski	c5130dedbe	Specialised basic math functions for SYCL device.	2016-11-17 11:47:13 +00:00
Mehdi Goli	f8ca893976	Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing.	2016-11-14 17:51:57 +00:00
Mehdi Goli	0ebe3808ca	Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size;	2016-11-04 18:18:19 +00:00
Mehdi Goli	524fa4c46f	Reducing the code by generalising sycl backend functions/structs.	2016-10-14 12:09:55 +01:00
Benoit Steiner	ae1385c7e4	Pull the latest updates from trunk	2016-10-05 14:54:36 -07:00
Benoit Steiner	616a7a1912	Improved support for compiling CUDA code with clang as the host compiler	2016-10-03 17:09:33 -07:00
Benoit Steiner	422530946f	Renamed the SYCL tests to follow the standard naming convention.	2016-09-30 08:22:10 -07:00
RJ Ryan	b2c6dc48d9	Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.	2016-09-20 07:18:20 -07:00
Luke Iwanski	b91e021172	Merged with default.	2016-09-19 14:03:54 +01:00
Luke Iwanski	cb81975714	Partial OpenCL support via SYCL compatible with ComputeCpp CE.	2016-09-19 12:44:13 +01:00
Benoit Steiner	e4d4d15588	Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= 3.0) since the test instantiate contractions that require a modern gpu.	2016-09-12 19:01:52 -07:00
Benoit Steiner	4dfd888c92	CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures.	2016-09-12 18:49:01 -07:00
Benoit Steiner	5f50f12d2c	Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.	2016-09-12 13:46:13 -07:00
Gael Guennebaud	1f84f0d33a	merge EulerAngles module	2016-08-30 10:01:53 +02:00
Benoit Steiner	fad9828769	Deleted redundant regression test.	2016-08-03 16:08:37 -07:00
Benoit Steiner	81099ef482	Added a test for fp16	2016-08-03 11:41:17 -07:00
Gael Guennebaud	d075d122ea	Move half unit test from unsupported to main tests	2016-07-22 14:34:19 +02:00
Gael Guennebaud	c98bac2966	Manually add -stdd=c++11 to nvcc for old cmake versions	2016-07-12 09:29:18 +02:00
Benoit Steiner	40eb97516c	reverted unintended change.	2016-07-11 14:28:03 -07:00
Benoit Steiner	03b71c273e	Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides.	2016-07-11 13:50:24 -07:00
Gael Guennebaud	fd60966310	merge	2016-07-11 18:11:47 +02:00
Gael Guennebaud	7d636349dc	Fix configuration of CUDA: - preserve user defined CUDA_NVCC_FLAGS - remove the -ansi flag that conflicts with -std=c++11 - do not add -std=c++11 if already there	2016-07-11 18:09:04 +02:00
Gael Guennebaud	2f7e2614e7	bug #1232 : refactor special functions as a new SpecialFunctions module, currently in unsupported/.	2016-07-08 11:13:55 +02:00
Igor Babuschkin	85699850d9	Add missing CUDA kernel to tensor scan op. The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.	2016-06-29 11:54:35 +01:00
Benoit Steiner	1a9f92e781	Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults.	2016-06-27 16:02:52 -07:00
Benoit Steiner	02db4e1a82	Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code	2016-06-04 08:21:17 -07:00
Tal Hadad	52e4cbf539	Merged eigen/eigen into default	2016-06-02 22:15:20 +03:00
Igor Babuschkin	fbd7ed6ff7	Add tensor scan op. This is the initial implementation of a generic scan operation. Based on this, cumsum and cumprod methods have been added to TensorBase.	2016-06-02 13:35:47 +01:00
Benoit Steiner	5707537592	Fixed option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr' warning generated by nvcc 7.5	2016-05-27 10:47:53 -07:00
Benoit Steiner	6bf8273bc0	Added a test to validate the new non blocking thread pool	2016-05-10 10:49:34 -07:00
Benoit Steiner	d14105f158	Made several tensor tests compatible with cxx03	2016-04-29 17:22:37 -07:00
Benoit Steiner	c0882ef4d9	Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section	2016-04-29 17:13:51 -07:00

215 Commits