eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-01-24 14:45:14 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	32d95e86c9	merge	2016-07-22 16:43:12 +02:00
Gael Guennebaud	d7a0e52478	Fix testing of log nearby 1	2016-07-22 15:44:26 +02:00
Gael Guennebaud	7acf23c14c	Truely split unit test.	2016-07-22 15:41:23 +02:00
Gael Guennebaud	d075d122ea	Move half unit test from unsupported to main tests	2016-07-22 14:34:19 +02:00
Gael Guennebaud	82798162c0	Extend unit testing of half with ADL and arrays.	2016-07-21 15:47:21 +02:00
Gael Guennebaud	c98bac2966	Manually add -stdd=c++11 to nvcc for old cmake versions	2016-07-12 09:29:18 +02:00
Benoit Steiner	40eb97516c	reverted unintended change.	2016-07-11 14:28:03 -07:00
Benoit Steiner	03b71c273e	Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides.	2016-07-11 13:50:24 -07:00
Gael Guennebaud	fd60966310	merge	2016-07-11 18:11:47 +02:00
Gael Guennebaud	7d636349dc	Fix configuration of CUDA: - preserve user defined CUDA_NVCC_FLAGS - remove the -ansi flag that conflicts with -std=c++11 - do not add -std=c++11 if already there	2016-07-11 18:09:04 +02:00
Gael Guennebaud	131ee4bb8e	Split test_slice_in_expr which seems to be huge for visual	2016-07-11 11:46:55 +02:00
Gael Guennebaud	544935101a	Fix warnings	2016-07-08 11:38:52 +02:00
Gael Guennebaud	59bf2774a3	Fix warnings	2016-07-08 11:38:11 +02:00
Gael Guennebaud	2f7e2614e7	bug #1232 : refactor special functions as a new SpecialFunctions module, currently in unsupported/.	2016-07-08 11:13:55 +02:00
Gael Guennebaud	8b7431d8fd	fix compilation with c++11	2016-07-07 15:18:23 +02:00
Gael Guennebaud	69378eed0b	Split huge unit test	2016-07-07 15:18:04 +02:00
Gael Guennebaud	5d2dada197	Fix warnings	2016-07-07 09:05:15 +02:00
Gael Guennebaud	f5e780fb05	split huge unit test	2016-07-07 08:59:59 +02:00
Igor Babuschkin	85699850d9	Add missing CUDA kernel to tensor scan op The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.	2016-06-29 11:54:35 +01:00
Benoit Steiner	1a9f92e781	Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults.	2016-06-27 16:02:52 -07:00
Igor Babuschkin	0425118e2a	Merge upstream changes	2016-08-05 14:34:57 +01:00
Igor Babuschkin	eeb0d880ee	Enable efficient Tensor reduction for doubles	2016-07-01 19:08:26 +01:00
Gael Guennebaud	3852351793	merge pull request 198	2016-06-24 11:48:17 +02:00
Rasmus Munk Larsen	a9c1e4d7b7	Return -1 from CurrentThreadId when called by thread outside the pool.	2016-06-23 16:40:07 -07:00
Rasmus Munk Larsen	d39df320d2	Resolve merge.	2016-06-23 15:08:03 -07:00
Gael Guennebaud	361dbd246d	Add unit test for printing empty tensors	2016-06-23 18:54:30 +02:00
Benoit Steiner	de32f8d656	Fixed the printing of rank-0 tensors	2016-06-20 10:46:45 -07:00
Geoffrey Lalonde	72c95383e0	Add autodiff coverage for standard library hyperbolic functions, and tests. * * * Corrected tanh derivatived, moved test definitions. * * * Added more test cases, removed lingering lines	2016-06-15 23:33:19 -07:00
Igor Babuschkin	c4d10e921f	Implement exclusive scan option	2016-06-14 19:44:07 +01:00
Rasmus Munk Larsen	f1f2ff8208	size_t -> int	2016-06-03 18:06:37 -07:00
Rasmus Munk Larsen	76308e7fd2	Add CurrentThreadId and NumThreads methods to Eigen threadpools and TensorDeviceThreadPool.	2016-06-03 16:28:58 -07:00
Eugene Brevdo	39baff850c	Add TernaryFunctors and the betainc SpecialFunction. TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.	2016-06-02 17:04:19 -07:00
Benoit Steiner	02db4e1a82	Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code	2016-06-04 08:21:17 -07:00
Rasmus Munk Larsen	811aadbe00	Add syntactic sugar to Eigen tensors to allow more natural syntax. Specifically, this enables expressions involving: scalar + tensor scalar * tensor scalar / tensor scalar - tensor	2016-06-02 12:41:28 -07:00
Tal Hadad	52e4cbf539	Merged eigen/eigen into default	2016-06-02 22:15:20 +03:00
Tal Hadad	2aaaf22623	Fix Gael reports (except documention) - "Scalar angle(int) const" should be "const Vector& angles() const" - then method "coeffs" could be removed. - avoid one letter names like h, p, r -> use alpha(), beta(), gamma() ;) - about the "fromRotation" methods: - replace the ones which are not static by operator= (as in Quaternion) - the others are actually static methods: use a capital F: FromRotation - method "invert" should be removed. - use a macro to define both float and double EulerAnglesXYZ* typedefs - AddConstIf -> not used - no needs for NegateIfXor, compilers are extremely good at optimizing away branches based on compile time constants: if(IsHeadingOpposite-=IsEven) res.alpha() = -res.alpha();	2016-06-02 22:12:57 +03:00
Igor Babuschkin	fbd7ed6ff7	Add tensor scan op This is the initial implementation a generic scan operation. Based on this, cumsum and cumprod method have been added to TensorBase.	2016-06-02 13:35:47 +01:00
Benoit Steiner	c3cada38e2	Speedup a test	2016-06-01 21:13:00 -07:00
Benoit Steiner	abc815798b	Added a new operation to enable more powerful tensorindexing.	2016-05-27 12:22:25 -07:00
Benoit Steiner	5707537592	Fixed option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr' warning generated by nvcc 7.5	2016-05-27 10:47:53 -07:00
Benoit Steiner	36369ab63c	Resolved merge conflicts	2016-05-26 13:39:39 -07:00
Benoit Steiner	28fcb5ca2a	Merged latest reduction improvements	2016-05-26 12:19:33 -07:00
Benoit Steiner	c1c7f06c35	Improved the performance of inner reductions.	2016-05-26 11:53:59 -07:00
Benoit Steiner	22d02c9855	Improved the coverage of the fp16 reduction tests	2016-05-26 11:12:16 -07:00
Benoit Steiner	58026905ae	Added support for statically known lists of pairs of indices	2016-05-25 11:04:14 -07:00
Christoph Hertzberg	718521d5cf	Silenced several double-promotion warnings	2016-05-22 18:17:04 +02:00
Christoph Hertzberg	b5a7603822	fixed macro name	2016-05-22 16:49:29 +02:00
Gael Guennebaud	ccaace03c9	Make EIGEN_HAS_CONSTEXPR user configurable	2016-05-20 15:10:08 +02:00
Gael Guennebaud	c3410804cd	Make EIGEN_HAS_VARIADIC_TEMPLATES user configurable	2016-05-20 15:05:38 +02:00
Benoit Steiner	a910bcee43	Merged latest updates from trunk	2016-05-17 09:14:22 -07:00
Benoit Steiner	8d06c02ffd	Allow vectorized padding on GPU. This helps speed things up a little. Before: BM_padding/10 5000000 460 217.03 MFlops/s BM_padding/80 5000000 460 13899.40 MFlops/s BM_padding/640 5000000 461 888421.17 MFlops/s BM_padding/4K 5000000 460 54316322.55 MFlops/s After: BM_padding/10 5000000 454 220.20 MFlops/s BM_padding/80 5000000 455 14039.86 MFlops/s BM_padding/640 5000000 452 904968.83 MFlops/s BM_padding/4K 5000000 411 60750049.21 MFlops/s	2016-05-17 09:13:27 -07:00
Benoit Steiner	92fc6add43	Don't rely on c++11 extension when we don't have to.	2016-05-17 07:21:22 -07:00
Gael Guennebaud	1fbfab27a9	bug #1223 : fix compilation of AutoDiffScalar's min/max operators, and add regression unit test.	2016-05-18 16:26:26 +02:00
Gael Guennebaud	448d9d943c	bug #1222 : fix compilation in AutoDiffScalar and add respective unit test	2016-05-18 16:00:11 +02:00
Benoit Steiner	2a54b70d45	Fixed potential race condition in the non blocking thread pool	2016-05-12 11:45:48 -07:00
Benoit Steiner	fae0493f98	Fixed a couple of bugs related to the Pascalfamily of GPUs H: Enter commit message. Lines beginning with 'HG:' are removed.	2016-05-11 23:02:26 -07:00
Benoit Steiner	886445ce4d	Avoid unnecessary conversions between floats and doubles	2016-05-11 23:00:03 -07:00
Benoit Steiner	595e890391	Added more tests for half floats	2016-05-11 21:27:15 -07:00
Christoph Hertzberg	2150f13d65	fixed some double-promotion and sign-compare warnings	2016-05-11 23:02:26 +02:00
Benoit Steiner	217d984abc	Fixed a typo in my previous commit	2016-05-11 10:22:15 -07:00
Benoit Steiner	cbb14ed47e	Added a few tests to validate the generation of random tensors on GPU.	2016-05-11 10:05:56 -07:00
Benoit Steiner	6bf8273bc0	Added a test to validate the new non blocking thread pool	2016-05-10 10:49:34 -07:00
Benoit Steiner	75bd2bd32d	Fixed compilation warning	2016-05-09 19:24:41 -07:00
Benoit Steiner	dc7dbc2df7	Optimized the non blocking thread pool: * Use a pseudo-random permutation of queue indices during random stealing. This ensures that all the queues are considered. * Directly pop from a non-empty queue when we are waiting for work, instead of first noticing that there is a non-empty queue and then doing another round of random stealing to re-discover the non-empty queue. * Steal only 1 task from a remote queue instead of half of tasks.	2016-05-09 10:17:17 -07:00
Benoit Steiner	691614bd2c	Worked around a bug in nvcc on tegra x1	2016-05-07 13:28:53 -07:00
Benoit Steiner	69a8a4e1f3	Added a test to validate full reduction on tensor of half floats	2016-05-05 16:52:50 -07:00
Benoit Steiner	678a17ba79	Made the testing of contractions on fp16 more robust	2016-05-05 16:36:39 -07:00
Benoit Steiner	e3d053e14e	Refined the testing of log and exp on fp16	2016-05-05 16:24:15 -07:00
Benoit Steiner	9a48688d37	Further improved the testing of fp16	2016-05-05 15:58:05 -07:00
Benoit Steiner	2aba40d208	Avoid unecessary type promotion	2016-05-05 09:26:57 -07:00
Benoit Steiner	7875437ca0	Avoided unecessary type promotion	2016-05-05 09:08:42 -07:00
Benoit Steiner	f363e533aa	Added tests for full contractions using thread pools and gpu devices. Fixed a couple of issues in the corresponding code.	2016-05-05 09:05:45 -07:00
Benoit Steiner	06d774bf58	Updated the contraction code to ensure that full contraction return a tensor of rank 0	2016-05-05 08:37:47 -07:00
Christoph Hertzberg	b300a84989	Fixed some singed/unsigned comparison warnings	2016-05-05 13:36:28 +02:00
Christoph Hertzberg	dacb469bc9	Enable and fix -Wdouble-conversion warnings	2016-05-05 13:35:45 +02:00
Benoit Steiner	62b710072e	Reduced the memory footprint of the cxx11_tensor_image_patch test	2016-05-04 21:08:22 -07:00
Benoit Steiner	2c5568a757	Added a test to validate the computation of exp and log on 16bit floats	2016-05-03 12:06:07 -07:00
Benoit Steiner	2b890ae618	Fixed compilation errors generated by clang	2016-04-29 18:30:40 -07:00
Benoit Steiner	d217217842	Added a few tests to ensure that the dimensions of rank 0 tensors are correctly computed	2016-04-29 18:15:34 -07:00
Benoit Steiner	d14105f158	Made several tensor tests compatible with cxx03	2016-04-29 17:22:37 -07:00
Benoit Steiner	c0882ef4d9	Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section	2016-04-29 17:13:51 -07:00
Benoit Steiner	9d1dbd1ec0	Fixed teh cxx11_tensor_empty test to compile without requiring cxx11 support	2016-04-29 16:53:55 -07:00
Benoit Steiner	4f53178e62	Made a coupe of tensor tests compile without requiring c++11 support.	2016-04-29 16:09:54 -07:00
Benoit Steiner	1131a984a6	Made the cxx11_tensor_forced_eval compile without c++11.	2016-04-29 15:48:59 -07:00
Benoit Steiner	c07404f6a1	Restore Tensor support for non c++11 compilers	2016-04-29 15:19:19 -07:00
Benoit Steiner	a524a26fdc	Fixed a few memory leaks	2016-04-28 18:55:53 -07:00
Gael Guennebaud	3dddd34133	Refactor the unsupported CXX11/Core module to internal headers only.	2016-04-26 11:20:25 +02:00
Benoit Steiner	4a164d2c46	Fixed the partial evaluation of non vectorizable tensor subexpressions	2016-04-25 10:43:03 -07:00
Benoit Steiner	f670613e4b	Fixed several compilation warnings	2016-04-21 11:03:02 -07:00
Benoit Steiner	32ffce04fc	Use EIGEN_THREAD_YIELD instead of std::this_thread::yield to make the code more portable.	2016-04-21 08:47:28 -07:00
Benoit Steiner	a792cd357d	Added more tests	2016-04-20 17:33:58 -07:00
Benoit Steiner	04f954956d	Fixed a few typos	2016-04-19 15:27:09 -07:00
Benoit Steiner	f953c60705	Fixed 2 recent regression tests	2016-04-19 12:57:39 -07:00
Benoit Steiner	84543c8be2	Worked around the lack of a rand_r function on windows systems	2016-04-17 19:29:27 -07:00
Benoit Steiner	5fbcfe5eb4	Worked around the lack of a rand_r function on windows systems	2016-04-17 18:42:31 -07:00
Benoit Steiner	40c9923a8a	Fixed compilation errors with msvc	2016-04-15 11:27:52 -07:00
Benoit Steiner	bebb89acfa	Enabled the new threadpool tests	2016-04-14 16:44:10 -07:00
Benoit Steiner	a8e8837ba7	Added tests for the non blocking thread pool	2016-04-14 15:23:49 -07:00
Benoit Steiner	2b6e3de02f	Added tests to validate flooring and ceiling of fp16	2016-04-14 11:39:18 -07:00
Benoit Steiner	6f23e945f6	Added simple test for numext::sqrt and numext::pow on fp16	2016-04-14 10:32:52 -07:00

1 2 3 4 5 ...

842 Commits