eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Benoit Steiner	fbc39fd02c	Merge latest changes from upstream	2017-01-30 15:25:57 -08:00
Mehdi Goli	6bdd15f572	Adding non-deferrenciable pointer track for ComputeCpp backend; Adding TensorConvolutionOp for ComputeCpp; fixing typos. modifying TensorDeviceSycl to use the LegacyPointer class.	2017-01-19 11:30:59 +00:00
Mehdi Goli	e46e722381	Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying Tensor Contractsycl to be located in any place in the expression tree.	2017-01-16 13:58:49 +00:00
Benoit Steiner	0657228569	Simplified the way we link libxsmm	2016-12-21 14:40:08 -08:00
Benoit Steiner	c19fe5e9ed	Added support for libxsmm in the eigen makefiles	2016-12-21 10:43:40 -08:00
Mehdi Goli	35bae513a0	Converting all parallel for lambda to functor in order to prevent kernel duplication name error; adding tensorConcatinationOp backend for sycl.	2016-12-16 19:46:45 +00:00
Mehdi Goli	2d4a091beb	Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h	2016-12-14 15:30:37 +00:00
Mehdi Goli	79aa2b784e	Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.	2016-12-01 13:02:27 +00:00
Mehdi Goli	577ce78085	Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.	2016-11-29 15:30:42 +00:00
Luke Iwanski	c5130dedbe	Specialised basic math functions for SYCL device.	2016-11-17 11:47:13 +00:00
Mehdi Goli	f8ca893976	Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing.	2016-11-14 17:51:57 +00:00
Mehdi Goli	0ebe3808ca	Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size;	2016-11-04 18:18:19 +00:00
Mehdi Goli	524fa4c46f	Reducing the code by generalising sycl backend functions/structs.	2016-10-14 12:09:55 +01:00
Benoit Steiner	ae1385c7e4	Pull the latest updates from trunk	2016-10-05 14:54:36 -07:00
Benoit Steiner	616a7a1912	Improved support for compiling CUDA code with clang as the host compiler	2016-10-03 17:09:33 -07:00
Benoit Steiner	422530946f	Renamed the SYCL tests to follow the standard naming convention.	2016-09-30 08:22:10 -07:00
RJ Ryan	b2c6dc48d9	Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.	2016-09-20 07:18:20 -07:00
Luke Iwanski	b91e021172	Merged with default.	2016-09-19 14:03:54 +01:00
Luke Iwanski	cb81975714	Partial OpenCL support via SYCL compatible with ComputeCpp CE.	2016-09-19 12:44:13 +01:00
Benoit Steiner	e4d4d15588	Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= 3.0) since the test instantiate contractions that require a modern gpu.	2016-09-12 19:01:52 -07:00
Benoit Steiner	4dfd888c92	CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures.	2016-09-12 18:49:01 -07:00
Benoit Steiner	5f50f12d2c	Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.	2016-09-12 13:46:13 -07:00
Gael Guennebaud	1f84f0d33a	merge EulerAngles module	2016-08-30 10:01:53 +02:00
Benoit Steiner	fad9828769	Deleted redundant regression test.	2016-08-03 16:08:37 -07:00
Benoit Steiner	81099ef482	Added a test for fp16	2016-08-03 11:41:17 -07:00
Gael Guennebaud	d075d122ea	Move half unit test from unsupported to main tests	2016-07-22 14:34:19 +02:00
Gael Guennebaud	c98bac2966	Manually add -stdd=c++11 to nvcc for old cmake versions	2016-07-12 09:29:18 +02:00
Benoit Steiner	40eb97516c	reverted unintended change.	2016-07-11 14:28:03 -07:00
Benoit Steiner	03b71c273e	Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides.	2016-07-11 13:50:24 -07:00
Gael Guennebaud	fd60966310	merge	2016-07-11 18:11:47 +02:00
Gael Guennebaud	7d636349dc	Fix configuration of CUDA: - preserve user defined CUDA_NVCC_FLAGS - remove the -ansi flag that conflicts with -std=c++11 - do not add -std=c++11 if already there	2016-07-11 18:09:04 +02:00
Gael Guennebaud	2f7e2614e7	bug #1232 : refactor special functions as a new SpecialFunctions module, currently in unsupported/.	2016-07-08 11:13:55 +02:00
Igor Babuschkin	85699850d9	Add missing CUDA kernel to tensor scan op The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.	2016-06-29 11:54:35 +01:00
Benoit Steiner	1a9f92e781	Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults.	2016-06-27 16:02:52 -07:00
Benoit Steiner	02db4e1a82	Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code	2016-06-04 08:21:17 -07:00
Tal Hadad	52e4cbf539	Merged eigen/eigen into default	2016-06-02 22:15:20 +03:00
Igor Babuschkin	fbd7ed6ff7	Add tensor scan op This is the initial implementation a generic scan operation. Based on this, cumsum and cumprod method have been added to TensorBase.	2016-06-02 13:35:47 +01:00
Benoit Steiner	5707537592	Fixed option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr' warning generated by nvcc 7.5	2016-05-27 10:47:53 -07:00
Benoit Steiner	6bf8273bc0	Added a test to validate the new non blocking thread pool	2016-05-10 10:49:34 -07:00
Benoit Steiner	d14105f158	Made several tensor tests compatible with cxx03	2016-04-29 17:22:37 -07:00
Benoit Steiner	c0882ef4d9	Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section	2016-04-29 17:13:51 -07:00
Benoit Steiner	4f53178e62	Made a coupe of tensor tests compile without requiring c++11 support.	2016-04-29 16:09:54 -07:00
Benoit Steiner	bebb89acfa	Enabled the new threadpool tests	2016-04-14 16:44:10 -07:00
Benoit Steiner	995f202cea	Disabled the use of half2 on cuda devices of compute capability < 5.3	2016-04-08 14:43:36 -07:00
Benoit Steiner	0d2a532fc3	Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc	2016-04-08 13:16:08 -07:00
Benoit Steiner	d962fe6a99	Renamed float16 into cxx11_float16 since the test relies on c++11 features	2016-04-07 20:28:32 -07:00
Benoit Steiner	dc45aaeb93	Added tests for float16	2016-04-07 11:18:05 -07:00
Benoit Steiner	7781f865cb	Renamed the EIGEN_TEST_NVCC cmake option into EIGEN_TEST_CUDA per the discussion in bug #1173 .	2016-04-06 09:35:23 -07:00
Benoit Steiner	2062ee2d26	Added a test to verify that notifications are working properly	2016-03-23 13:39:00 -07:00
Benoit Steiner	e7a468c5b7	Filter some compilation flags that nvcc warns about.	2016-03-22 14:26:50 -07:00
Benoit Steiner	bb0e73c191	Gate all the CUDA tests under the EIGEN_TEST_NVCC option	2016-03-18 12:17:37 -07:00
Benoit Steiner	53d498ef06	Fixed compilation warnings in the cuda tests	2016-03-18 07:04:54 -07:00
Eugene Brevdo	5e7de771e3	Properly fix merge issues.	2016-03-08 17:35:05 -08:00
Eugene Brevdo	5707004d6b	Fix Eigen's building of sharded tests that use CUDA & more igamma/igammac bugfixes. 0. Prior to this PR, not a single sharded CUDA test was actually being run. Fixed that. GPU tests are still failing for igamma/igammac. 1. Add calls for igamma/igammac to TensorBase 2. Fix up CUDA-specific calls of igamma/igammac 3. Add unit tests for digamma, igamma, igammac in CUDA.	2016-03-07 14:08:56 -08:00
Benoit Steiner	c23e0be18f	Use the CMAKE_CXX_STANDARD variable to turn on cxx11	2016-03-04 20:18:01 -08:00
Benoit Steiner	deea866bbd	Added tests to cover the new rounding, flooring and ceiling tensor operations.	2016-03-03 12:38:02 -08:00
Benoit Steiner	dac58d7c35	Added a test to validate the conversion of half floats into floats on Kepler GPUs. Restricted the testing of the random number generation code to GPU architecture greater than or equal to 3.5.	2016-03-03 10:37:25 -08:00
Benoit Steiner	af199b4658	Made the CUDA architecture level a build setting.	2016-02-25 09:06:18 -08:00
Benoit Steiner	7151bd8768	Reverted unintended changes introduced by a bad merge	2016-02-19 06:20:50 +00:00
Benoit Steiner	17b9fbed34	Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa	2016-02-19 06:16:07 +00:00
Benoit Steiner	d2cba52015	Only enable the cxx11_tensor_uint128 test on 64 bit machines since 32 bit systems don't support the __uin128_t type	2016-02-05 18:14:23 -08:00
Benoit Steiner	5d82e47ef6	Properly disable nvcc warning messages in user code.	2016-02-03 14:10:06 -08:00
Benoit Steiner	af8436b196	Silenced the "calling a __host__ function from a __host__ __device__ function is not allowed" messages	2016-02-03 13:48:36 -08:00
Benoit Steiner	7b3044d086	Made sure to call nvcc with the relaxed-constexpr flag.	2016-01-28 15:36:34 -08:00
Gael Guennebaud	7802a6bb1c	Fix unit test filename.	2016-01-28 09:35:37 +01:00
Benoit Steiner	4bf9eaf77a	Deleted an invalid assertion that prevented the assignment of empty tensors.	2016-01-27 17:09:30 -08:00
Benoit Steiner	55a5204319	Fixed the flags passed to nvcc to compile the tensor code.	2016-01-27 14:46:34 -08:00
Benoit Steiner	9dfbd4fe8d	Made the cuda tests compile using make check	2016-01-27 12:22:17 -08:00
Tal Hadad	fabd8474ff	Merged eigen/eigen into default	2015-12-20 12:50:07 +02:00
Benoit Steiner	f8df393165	Added support for 128bit integers on CUDA devices.	2015-11-19 13:57:27 -08:00
Gael Guennebaud	d4cf436cb1	Enable mpreal unit test for C++11 compiler only	2015-10-27 17:35:54 +01:00
Tal Hadad	5e0a178df2	Initial fork of unsupported module EulerAngles.	2015-09-27 16:51:24 +03:00
Benoit Steiner	2495e2479f	Added tests for the fft code	2015-10-22 16:52:55 -07:00
Benoit Steiner	b178cc3479	Added some syntactic sugar to make it simpler to compare a tensor to a scalar.	2015-10-21 11:28:28 -07:00
Benoit Steiner	6585efc553	Tightened the definition of isOfNormalIndex to take into account integer types in addition to arrays of indices Only compile the custom index code when EIGEN_HAS_SFINAE is defined. For the time beeing, EIGEN_HAS_SFINAE is a synonym for EIGEN_HAS_VARIADIC_TEMPLATES, but this might evolve in the future. Moved some code around.	2015-10-14 09:31:37 -07:00
Gabriel Nützi	6edae2d30d	added CustomIndex capability only to Tensor and not yet to TensorBase. using Sfinae and is_base_of to select correct template which converts to array<Index,NumIndices> user: Gabriel Nützi <gnuetzi@gmx.ch> branch 'default' added unsupported/Eigen/CXX11/src/Tensor/TensorMetaMacros.h added unsupported/test/cxx11_tensor_customIndex.cpp changed unsupported/Eigen/CXX11/Tensor changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h changed unsupported/Eigen/CXX11/src/Tensor/TensorMeta.h changed unsupported/test/CMakeLists.txt	2015-10-09 18:52:48 +02:00
Gael Guennebaud	1b148d9e2e	Move IncompleteCholesky to official modules	2015-10-08 11:32:46 +02:00
Christoph Hertzberg	5ad7981f73	Use full packet size for Dynamic-sized objects (otherwise, the unalignedcount unit test fails with AVX enabled)	2015-09-02 22:51:43 +02:00
Benoit Steiner	f41831e445	Added support for argmax/argmin	2015-08-31 08:18:53 -07:00
Gael Guennebaud	9a4713e505	Add a unit test for IncompleteCholesky	2015-08-04 16:14:06 +02:00
Benoit Steiner	06a22ca5bd	Added support for sigmoid function to the tensor module	2015-07-17 09:29:00 -07:00
Benoit Steiner	05787f8367	Added support for tensor inflation.	2015-07-16 09:04:05 -07:00
Benoit Steiner	e6297741c9	Added support for generation of random complex numbers on CUDA devices	2015-07-07 17:40:49 -07:00
Benoit Steiner	f1f480b116	Added support for user defined custom tensor op.	2015-06-30 15:36:29 -07:00
Benoit Steiner	dc31fcb9ba	Added support for 3D patch extraction	2015-06-30 14:48:26 -07:00
Benoit Steiner	fffe63045c	Added a test for full reductions on GPU	2015-06-29 14:10:32 -07:00
Benoit Steiner	91359e1d0a	Added the ability to generate a tensor from a custom user defined 'generator'. This simplifies the creation of constant tensors initialized using specific regular patterns. Created a gaussian window generator as a first use case.	2015-04-22 11:14:58 -07:00
Benoit Steiner	d3f7915aeb	Pulled latest update from the eigen main codebase	2015-03-24 13:12:14 -07:00
Benoit Steiner	2386fc8528	Added support for 32bit index on a per tensor/tensor expression. This enables us to use 32bit indices to evaluate expressions on GPU faster while keeping the ability to use 64 bit indices to manipulate large tensors on CPU in the same binary.	2015-02-27 12:57:13 -08:00
Gael Guennebaud	3594451ee0	Remove EIGEN_TEST_C++0x option and let EIGEN_TEST_CXX11 adds the -std=c++11 flag	2015-02-20 09:31:27 +01:00
Gael Guennebaud	b10cd3afd2	Re-enbale detection of min/max parentheses protection, and re-enable mpreal_support unit test.	2015-02-27 22:38:00 +01:00
Benoit Steiner	c739102ef9	Pulled the latest changes from the trunk	2015-02-06 05:25:03 -08:00
Benoit Steiner	b5124e7cfd	Created many additional tests	2015-01-14 15:46:04 -08:00
Abhijit Kundu	48db34a7b9	Adding missing OPENGL_LIBRARIES for openglsupport test. Also adding OpenGL include directories as a better pratice even though these are system include directories in most systems.	2014-12-04 01:18:47 -05:00
Benoit Steiner	ec785b0180	Added support for extraction of patches from images	2014-11-13 09:28:54 -08:00
Benoit Steiner	c2d1074932	Added support for static list of indices	2014-11-12 22:25:38 -08:00
Benoit Steiner	cb37f818ca	Fixed a compilation error triggered by some operations on fixed sized tensors	2014-11-05 23:25:11 -08:00
Benoit Steiner	9a06a71627	Fixed a test	2014-11-05 07:49:51 -08:00
Benoit Steiner	85c3389b28	Fixed a test	2014-10-31 00:04:13 -07:00
Gael Guennebaud	21c0a2ce0c	Move D&C SVD to official SVD module.	2014-10-29 11:29:33 +01:00

1 2 3 4 5 ...

256 Commits