eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-01-06 14:14:46 +08:00

Author	SHA1	Message	Date
Benoit Steiner	8d06c02ffd	Allow vectorized padding on GPU. This helps speed things up a little. Before: BM_padding/10 5000000 460 217.03 MFlops/s BM_padding/80 5000000 460 13899.40 MFlops/s BM_padding/640 5000000 461 888421.17 MFlops/s BM_padding/4K 5000000 460 54316322.55 MFlops/s After: BM_padding/10 5000000 454 220.20 MFlops/s BM_padding/80 5000000 455 14039.86 MFlops/s BM_padding/640 5000000 452 904968.83 MFlops/s BM_padding/4K 5000000 411 60750049.21 MFlops/s	2016-05-17 09:13:27 -07:00
Benoit Steiner	92fc6add43	Don't rely on c++11 extension when we don't have to.	2016-05-17 07:21:22 -07:00
Gael Guennebaud	1fbfab27a9	bug #1223 : fix compilation of AutoDiffScalar's min/max operators, and add regression unit test.	2016-05-18 16:26:26 +02:00
Gael Guennebaud	448d9d943c	bug #1222 : fix compilation in AutoDiffScalar and add respective unit test	2016-05-18 16:00:11 +02:00
Benoit Steiner	2a54b70d45	Fixed potential race condition in the non blocking thread pool	2016-05-12 11:45:48 -07:00
Benoit Steiner	fae0493f98	Fixed a couple of bugs related to the Pascalfamily of GPUs H: Enter commit message. Lines beginning with 'HG:' are removed.	2016-05-11 23:02:26 -07:00
Benoit Steiner	886445ce4d	Avoid unnecessary conversions between floats and doubles	2016-05-11 23:00:03 -07:00
Benoit Steiner	595e890391	Added more tests for half floats	2016-05-11 21:27:15 -07:00
Christoph Hertzberg	2150f13d65	fixed some double-promotion and sign-compare warnings	2016-05-11 23:02:26 +02:00
Benoit Steiner	217d984abc	Fixed a typo in my previous commit	2016-05-11 10:22:15 -07:00
Benoit Steiner	cbb14ed47e	Added a few tests to validate the generation of random tensors on GPU.	2016-05-11 10:05:56 -07:00
Benoit Steiner	6bf8273bc0	Added a test to validate the new non blocking thread pool	2016-05-10 10:49:34 -07:00
Benoit Steiner	75bd2bd32d	Fixed compilation warning	2016-05-09 19:24:41 -07:00
Benoit Steiner	dc7dbc2df7	Optimized the non blocking thread pool: * Use a pseudo-random permutation of queue indices during random stealing. This ensures that all the queues are considered. * Directly pop from a non-empty queue when we are waiting for work, instead of first noticing that there is a non-empty queue and then doing another round of random stealing to re-discover the non-empty queue. * Steal only 1 task from a remote queue instead of half of tasks.	2016-05-09 10:17:17 -07:00
Benoit Steiner	691614bd2c	Worked around a bug in nvcc on tegra x1	2016-05-07 13:28:53 -07:00
Benoit Steiner	69a8a4e1f3	Added a test to validate full reduction on tensor of half floats	2016-05-05 16:52:50 -07:00
Benoit Steiner	678a17ba79	Made the testing of contractions on fp16 more robust	2016-05-05 16:36:39 -07:00
Benoit Steiner	e3d053e14e	Refined the testing of log and exp on fp16	2016-05-05 16:24:15 -07:00
Benoit Steiner	9a48688d37	Further improved the testing of fp16	2016-05-05 15:58:05 -07:00
Benoit Steiner	2aba40d208	Avoid unecessary type promotion	2016-05-05 09:26:57 -07:00
Benoit Steiner	7875437ca0	Avoided unecessary type promotion	2016-05-05 09:08:42 -07:00
Benoit Steiner	f363e533aa	Added tests for full contractions using thread pools and gpu devices. Fixed a couple of issues in the corresponding code.	2016-05-05 09:05:45 -07:00
Benoit Steiner	06d774bf58	Updated the contraction code to ensure that full contraction return a tensor of rank 0	2016-05-05 08:37:47 -07:00
Christoph Hertzberg	b300a84989	Fixed some singed/unsigned comparison warnings	2016-05-05 13:36:28 +02:00
Christoph Hertzberg	dacb469bc9	Enable and fix -Wdouble-conversion warnings	2016-05-05 13:35:45 +02:00
Benoit Steiner	62b710072e	Reduced the memory footprint of the cxx11_tensor_image_patch test	2016-05-04 21:08:22 -07:00
Benoit Steiner	2c5568a757	Added a test to validate the computation of exp and log on 16bit floats	2016-05-03 12:06:07 -07:00
Benoit Steiner	2b890ae618	Fixed compilation errors generated by clang	2016-04-29 18:30:40 -07:00
Benoit Steiner	d217217842	Added a few tests to ensure that the dimensions of rank 0 tensors are correctly computed	2016-04-29 18:15:34 -07:00
Benoit Steiner	d14105f158	Made several tensor tests compatible with cxx03	2016-04-29 17:22:37 -07:00
Benoit Steiner	c0882ef4d9	Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section	2016-04-29 17:13:51 -07:00
Benoit Steiner	9d1dbd1ec0	Fixed teh cxx11_tensor_empty test to compile without requiring cxx11 support	2016-04-29 16:53:55 -07:00
Benoit Steiner	4f53178e62	Made a coupe of tensor tests compile without requiring c++11 support.	2016-04-29 16:09:54 -07:00
Benoit Steiner	1131a984a6	Made the cxx11_tensor_forced_eval compile without c++11.	2016-04-29 15:48:59 -07:00
Benoit Steiner	c07404f6a1	Restore Tensor support for non c++11 compilers	2016-04-29 15:19:19 -07:00
Benoit Steiner	a524a26fdc	Fixed a few memory leaks	2016-04-28 18:55:53 -07:00
Gael Guennebaud	3dddd34133	Refactor the unsupported CXX11/Core module to internal headers only.	2016-04-26 11:20:25 +02:00
Benoit Steiner	4a164d2c46	Fixed the partial evaluation of non vectorizable tensor subexpressions	2016-04-25 10:43:03 -07:00
Benoit Steiner	f670613e4b	Fixed several compilation warnings	2016-04-21 11:03:02 -07:00
Benoit Steiner	32ffce04fc	Use EIGEN_THREAD_YIELD instead of std::this_thread::yield to make the code more portable.	2016-04-21 08:47:28 -07:00
Benoit Steiner	a792cd357d	Added more tests	2016-04-20 17:33:58 -07:00
Benoit Steiner	04f954956d	Fixed a few typos	2016-04-19 15:27:09 -07:00
Benoit Steiner	f953c60705	Fixed 2 recent regression tests	2016-04-19 12:57:39 -07:00
Benoit Steiner	84543c8be2	Worked around the lack of a rand_r function on windows systems	2016-04-17 19:29:27 -07:00
Benoit Steiner	5fbcfe5eb4	Worked around the lack of a rand_r function on windows systems	2016-04-17 18:42:31 -07:00
Benoit Steiner	40c9923a8a	Fixed compilation errors with msvc	2016-04-15 11:27:52 -07:00
Benoit Steiner	bebb89acfa	Enabled the new threadpool tests	2016-04-14 16:44:10 -07:00
Benoit Steiner	a8e8837ba7	Added tests for the non blocking thread pool	2016-04-14 15:23:49 -07:00
Benoit Steiner	2b6e3de02f	Added tests to validate flooring and ceiling of fp16	2016-04-14 11:39:18 -07:00
Benoit Steiner	6f23e945f6	Added simple test for numext::sqrt and numext::pow on fp16	2016-04-14 10:32:52 -07:00
Benoit Steiner	72510c80e1	Added basic test for trigonometric functions on fp16	2016-04-14 10:27:24 -07:00
Benoit Steiner	f6003f0873	Made the test msvc friendly	2016-04-14 09:47:26 -07:00
Gael Guennebaud	7d1391d049	Turn a converge check to a warning	2016-04-13 22:50:54 +02:00
Benoit Steiner	e9b12cc1f7	Fixed compilation warnings generated by clang	2016-04-12 20:53:18 -07:00
Benoit Steiner	e3a184785c	Fixed the zeta test	2016-04-12 11:12:36 -07:00
Gael Guennebaud	af2161cdb4	bug #1197 : fix/relax some LM unit tests	2016-04-09 11:14:02 +02:00
Gael Guennebaud	a05a683d83	bug #1160 : fix and relax some lm unit tests by turning faillures to warnings	2016-04-09 10:49:19 +02:00
Benoit Steiner	995f202cea	Disabled the use of half2 on cuda devices of compute capability < 5.3	2016-04-08 14:43:36 -07:00
Benoit Steiner	0d2a532fc3	Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc	2016-04-08 13:16:08 -07:00
Benoit Steiner	2d072b38c1	Don't test the division by 0 on float16 when compiling with msvc since msvc detects and errors out on divisions by 0.	2016-04-08 12:50:25 -07:00
Benoit Steiner	d962fe6a99	Renamed float16 into cxx11_float16 since the test relies on c++11 features	2016-04-07 20:28:32 -07:00
Benoit Steiner	a02ec09511	Worked around numerical noise in the test for the zeta function.	2016-04-07 12:11:02 -07:00
Benoit Steiner	c912b1d28c	Fixed a typo in the polygamma test.	2016-04-07 11:51:07 -07:00
Benoit Steiner	dc45aaeb93	Added tests for float16	2016-04-07 11:18:05 -07:00
Benoit Steiner	8db269e055	Fixed a typo in a test	2016-04-07 10:41:51 -07:00
Benoit Steiner	48308ed801	Added support for isinf, isnan, and isfinite checks to the tensor api	2016-04-07 09:48:36 -07:00
Benoit Steiner	165150e896	Fixed the tests for the zeta and polygamma functions	2016-04-06 14:31:01 -07:00
Benoit Steiner	10bdd8e378	Merged in tillahoffmann/eigen (pull request PR-173) Added zeta function of two arguments and polygamma function	2016-04-06 09:40:17 -07:00
Benoit Steiner	7781f865cb	Renamed the EIGEN_TEST_NVCC cmake option into EIGEN_TEST_CUDA per the discussion in bug #1173 .	2016-04-06 09:35:23 -07:00
tillahoffmann	726bd5f077	Merged eigen/eigen into default	2016-04-05 18:21:05 +01:00
Gael Guennebaud	4d7e230d2f	bug #1189 : fix pow/atan2 compilation for AutoDiffScalar	2016-04-05 14:49:41 +02:00
Till Hoffmann	eb0ae602bd	Added CUDA tests.	2016-04-01 18:17:45 +01:00
Benoit Steiner	1b40abbf99	Added missing assignment operator to the TensorUInt128 class, and made misc small improvements	2016-03-30 13:17:03 -07:00
Benoit Steiner	7b7d2a9fa5	Use false instead of 0 as the expected value of a boolean	2016-03-29 11:50:17 -07:00
Benoit Steiner	a86c9f037b	Fixed compilation error on windows	2016-03-24 18:54:31 -07:00
Benoit Steiner	044efea965	Made sure that the cxx11_tensor_cuda test can be compiled even without support for cxx11.	2016-03-23 20:02:11 -07:00
Benoit Steiner	9bc9396e88	Use portable includes	2016-03-23 16:30:06 -07:00
Benoit Steiner	7a570e50ef	Fixed contractions of fp16	2016-03-23 16:00:06 -07:00
Benoit Steiner	2062ee2d26	Added a test to verify that notifications are working properly	2016-03-23 13:39:00 -07:00
Christoph Hertzberg	9642fd7a93	Replace all M_PI by EIGEN_PI and add a check to the testsuite.	2016-03-23 15:37:45 +01:00
Benoit Steiner	28e02996df	Merged patch 672 from Justin Lebar: Don't use long doubles with cuda	2016-03-22 16:53:57 -07:00
Benoit Steiner	e7a468c5b7	Filter some compilation flags that nvcc warns about.	2016-03-22 14:26:50 -07:00
Benoit Steiner	bb0e73c191	Gate all the CUDA tests under the EIGEN_TEST_NVCC option	2016-03-18 12:17:37 -07:00
Benoit Steiner	2db4a04827	Fixed a typo	2016-03-18 12:08:01 -07:00
Benoit Steiner	dd514de8a9	Added a test to validate the fallback path for half floats	2016-03-18 12:02:39 -07:00
Benoit Steiner	53d498ef06	Fixed compilation warnings in the cuda tests	2016-03-18 07:04:54 -07:00
Benoit Steiner	f7329619da	Fix bug in tensor contraction. The code assumes that contraction axis indices for the LHS (after possibly swapping to ColMajor!) is increasing. Explicitly sort the contraction axis pairs to make it so.	2016-03-17 15:08:02 -07:00
Christoph Hertzberg	46aa9772fc	Merged in ebrevdo/eigen (pull request PR-169) Bugfixes to cuda tests, igamma & igammac implemented, & tests for digamma, igamma, igammac on CPU & GPU.	2016-03-16 21:59:08 +01:00
Benoit Steiner	ab9b749b45	Improved a test	2016-03-14 20:03:13 -07:00
Benoit Steiner	048c4d6efd	Made half floats usable on hardware that doesn't support them natively.	2016-03-11 17:21:42 -08:00
Benoit Steiner	c5b98a58b8	Updated the cxx11_meta test to work on the Eigen::array class when std::array isn't available.	2016-03-11 11:53:38 -08:00
Eugene Brevdo	5e7de771e3	Properly fix merge issues.	2016-03-08 17:35:05 -08:00
Eugene Brevdo	73220d2bb0	Resolve bad merge.	2016-03-08 17:28:21 -08:00
Eugene Brevdo	0bb5de05a1	Finishing touches on igamma/igammac for GPU. Tests now pass.	2016-03-07 15:35:09 -08:00
Eugene Brevdo	5707004d6b	Fix Eigen's building of sharded tests that use CUDA & more igamma/igammac bugfixes. 0. Prior to this PR, not a single sharded CUDA test was actually being run. Fixed that. GPU tests are still failing for igamma/igammac. 1. Add calls for igamma/igammac to TensorBase 2. Fix up CUDA-specific calls of igamma/igammac 3. Add unit tests for digamma, igamma, igammac in CUDA.	2016-03-07 14:08:56 -08:00
Benoit Steiner	e5f25622e2	Added a test to validate the behavior of some of the tensor syntactic sugar.	2016-03-07 09:04:27 -08:00
Benoit Steiner	9f5740cbc1	Added missing include	2016-03-06 22:03:18 -08:00
Benoit Steiner	5238e03fe1	Don't try to compile the uint128 test with compilers that don't support uint127	2016-03-06 21:59:40 -08:00
Benoit Steiner	6093eb9ff5	Don't test our 128bit emulation code when compiling with msvc	2016-03-05 10:37:11 -08:00
Benoit Steiner	57b263c5b9	Avoid using initializer lists in test since not all version of msvc support them	2016-03-05 08:35:26 -08:00
Benoit Steiner	c23e0be18f	Use the CMAKE_CXX_STANDARD variable to turn on cxx11	2016-03-04 20:18:01 -08:00
Benoit Steiner	4e49fd5eb9	MSVC uses __uint128 while other compilers use __uint128_t to encode 128bit unsigned integers. Make the cxx11_tensor_uint128.cpp test work in both cases.	2016-03-04 14:49:18 -08:00
Benoit Steiner	667fcc2b53	Fixed syntax error	2016-03-04 14:37:51 -08:00
Benoit Steiner	4416a5dcff	Added missing include	2016-03-04 14:35:43 -08:00
Benoit Steiner	174edf976b	Made the contraction test more portable	2016-03-04 14:11:13 -08:00
Benoit Steiner	deea866bbd	Added tests to cover the new rounding, flooring and ceiling tensor operations.	2016-03-03 12:38:02 -08:00
Benoit Steiner	dac58d7c35	Added a test to validate the conversion of half floats into floats on Kepler GPUs. Restricted the testing of the random number generation code to GPU architecture greater than or equal to 3.5.	2016-03-03 10:37:25 -08:00
Benoit Steiner	af199b4658	Made the CUDA architecture level a build setting.	2016-02-25 09:06:18 -08:00
Benoit Steiner	e644f60907	Pulled latest updates from trunk	2016-02-21 20:24:59 +00:00
Benoit Steiner	95fceb6452	Added the ability to compute the absolute value of a half float	2016-02-21 20:24:11 +00:00
Benoit Steiner	ed69cbeef0	Added some debugging information to the test to figure out why it fails sometimes	2016-02-21 11:20:20 -08:00
Benoit Steiner	1e6fe6f046	Fixed the float16 tensor test.	2016-02-20 07:44:17 +00:00
Benoit Steiner	180156ba1a	Added support for tensor reductions on half floats	2016-02-19 10:05:59 -08:00
Benoit Steiner	a08d2ff0c9	Started to work on contractions and reductions using half floats	2016-02-19 15:59:59 +00:00
Benoit Steiner	ac5d706a94	Added support for simple coefficient wise tensor expression using half floats on CUDA devices	2016-02-19 08:19:12 +00:00
Benoit Steiner	0606a0a39b	FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA	2016-02-18 23:15:23 -08:00
Benoit Steiner	f36c0c2c65	Added regression test for float16	2016-02-19 06:23:28 +00:00
Benoit Steiner	7151bd8768	Reverted unintended changes introduced by a bad merge	2016-02-19 06:20:50 +00:00
Benoit Steiner	17b9fbed34	Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa	2016-02-19 06:16:07 +00:00
Benoit Steiner	9e3f3a2d27	Deleted outdated comment	2016-02-11 17:27:35 -08:00
Benoit Steiner	de345eff2e	Added a method to conjugate the content of a tensor or the result of a tensor expression.	2016-02-11 16:34:07 -08:00
Benoit Steiner	6323851ea9	Fixed compilation warning	2016-02-09 20:43:41 -08:00
Benoit Steiner	4d4211c04e	Avoid unecessary type conversions	2016-02-05 18:19:41 -08:00
Benoit Steiner	d2cba52015	Only enable the cxx11_tensor_uint128 test on 64 bit machines since 32 bit systems don't support the __uin128_t type	2016-02-05 18:14:23 -08:00
Benoit Steiner	fb00a4af2b	Made the tensor fft test compile on tegra x1	2016-02-06 01:42:14 +00:00
Benoit Steiner	5d82e47ef6	Properly disable nvcc warning messages in user code.	2016-02-03 14:10:06 -08:00
Benoit Steiner	af8436b196	Silenced the "calling a __host__ function from a __host__ __device__ function is not allowed" messages	2016-02-03 13:48:36 -08:00
Ville Kallioniemi	aedea349aa	Replace separate low word constructors with a single templated constructor.	2016-02-01 20:25:02 -07:00
Benoit Steiner	64ce78c2ec	Cleaned up a tensor contraction test	2016-02-01 13:57:41 -08:00
Benoit Steiner	0ce5d32be5	Sharded the cxx11_tensor_contract_cuda test	2016-02-01 13:33:23 -08:00
Benoit Steiner	922b5f527b	Silenced a few compilation warnings	2016-02-01 13:30:49 -08:00
Benoit Steiner	264f8141f8	Shared the tensor reduction test	2016-02-01 07:44:31 -08:00
Benoit Steiner	11bb71c8fc	Sharded the tensor device test	2016-02-01 07:34:59 -08:00
Benoit Steiner	6720b38fbf	Fixed a few compilation warnings	2016-01-31 16:48:50 -08:00
Benoit Steiner	4a2ddfb81d	Sharded the CUDA argmax tensor test	2016-01-31 10:44:15 -08:00
Benoit Steiner	483082ef6e	Fixed a few memory leaks in the cuda tests	2016-01-30 11:59:22 -08:00
Benoit Steiner	bd21aba181	Sharded the cxx11_tensor_cuda test and fixed a memory leak	2016-01-30 11:47:09 -08:00
Benoit Steiner	9de155d153	Added a test to cover threaded tensor shuffling	2016-01-30 10:56:47 -08:00
Benoit Steiner	32088c06a1	Made the comparison between single and multithreaded contraction results more resistant to numerical noise to prevent spurious test failures.	2016-01-30 10:51:14 -08:00
Benoit Steiner	2053478c56	Made sure to use a tensor of rank 0 to store the result of a full reduction in the tensor thread pool test	2016-01-30 10:46:36 -08:00
Benoit Steiner	d0db95f730	Sharded the tensor thread pool test	2016-01-30 10:43:57 -08:00
Benoit Steiner	ba27c8a7de	Made the CUDA contract test more robust to numerical noise.	2016-01-30 10:28:43 -08:00
Benoit Steiner	7b3044d086	Made sure to call nvcc with the relaxed-constexpr flag.	2016-01-28 15:36:34 -08:00
Gael Guennebaud	7802a6bb1c	Fix unit test filename.	2016-01-28 09:35:37 +01:00
Benoit Steiner	4bf9eaf77a	Deleted an invalid assertion that prevented the assignment of empty tensors.	2016-01-27 17:09:30 -08:00
Benoit Steiner	47ca9dc809	Fixed the tensor_cuda test	2016-01-27 14:58:48 -08:00
Benoit Steiner	55a5204319	Fixed the flags passed to nvcc to compile the tensor code.	2016-01-27 14:46:34 -08:00
Benoit Steiner	9dfbd4fe8d	Made the cuda tests compile using make check	2016-01-27 12:22:17 -08:00
Benoit Steiner	5973bcf939	Properly specify the namespace when calling cout/endl	2016-01-27 12:04:42 -08:00
Gael Guennebaud	6a44ccb58b	Backout changeset `690bc950f7`	2016-01-22 15:03:53 +01:00

1 2 3 4 5 ...

842 Commits