eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Benoit Steiner	4df8b5a75e	Avoid making an unecessary copy of the tensor expression when evaluating it on a GPU device	2015-03-25 14:36:07 -07:00
Benoit Steiner	b3343bfdae	Fixed the vectorized implementation of the Tensor select() method	2015-03-25 13:25:53 -07:00
Benoit Steiner	ccf290a65c	Cleaned up the TensorDevice code a little bit.	2015-03-25 12:37:38 -07:00
Benoit Steiner	d3f7915aeb	Pulled latest update from the eigen main codebase	2015-03-24 13:12:14 -07:00
Benoit Steiner	abdbe8562e	Fixed the CUDA packet primitives	2015-03-24 10:45:46 -07:00
Gael Guennebaud	29eaa2b0f1	Make MatrixBase::is* methods aware of nested_eval.	2015-03-24 13:42:42 +01:00
Gael Guennebaud	f42b105f73	Add the possibility to make VERIFY* checks to output a warning instead of abording.	2015-03-24 13:39:14 +01:00
Gael Guennebaud	d27968eb7e	D&C SVD: directly falls back to JacobiSVD for very small problems (by-pass upper-bidiagonalization)	2015-03-24 13:38:07 +01:00
Gael Guennebaud	4472f3e578	Avoid SVD: consider denormalized small numbers as zero when computing the rank of the matrix	2015-03-23 09:40:21 +01:00
Gael Guennebaud	4e2b18d909	Update approx. minimum ordering method to push and keep structural empty diagonal elements to the bottom-right part of the matrix	2015-03-20 16:33:48 +01:00
Gael Guennebaud	8d9bfb3a7b	fix loadMarket wrt Index versus int	2015-03-20 16:00:10 +01:00
Benoit Steiner	a6a628ca6b	Added the -= operator to the device classes	2015-03-19 23:22:19 -07:00
Benoit Steiner	e134226a03	Fixed a bug in the handling of packets by the MeanReducer	2015-03-19 23:11:42 -07:00
Gael Guennebaud	9ee62fdcd5	Fix random unit test for 32bits systems.	2015-03-19 21:39:37 +01:00
Gael Guennebaud	d6b2f300db	Fix MSVC compilation: aligned type must be passed by reference	2015-03-19 17:28:32 +01:00
Gael Guennebaud	61c45d7cfd	Fix comparison warning	2015-03-19 17:13:22 +01:00
Gael Guennebaud	d7698c18b7	Split sparse_basic unit test	2015-03-19 15:11:05 +01:00
Gael Guennebaud	f329d0908a	Improve random number generation for integer and add unit test	2015-03-19 15:10:36 +01:00
Deanna Hood	2ab4922431	Make html directory before generating output image there	2015-03-18 07:24:13 +10:00
Benoit Steiner	cc0f89eb3b	Changed the way lvalue operations are declared in TensorBase: this fixes constness isses that prevented some expressions mixing lvalues and rvalues from compiling.	2015-03-17 09:57:20 -07:00
Benoit Jacob	dc04f12967	use unsigned short instead of uint16_t which doesn't exist in c++98	2015-03-17 10:31:45 -04:00
Benoit Jacob	364cfd529d	Similar to cset `3589a9c115` , also in 2px4 kernel: actual_panel_rows computation should always be resilient to parameters not consistent with the known L1 cache size, see comment	2015-03-16 16:28:44 -04:00
Benoit Steiner	25664afacd	Pulled latest updates from trunk	2015-03-16 13:25:45 -07:00
Benoit Jacob	577056aa94	Include stdint.h. Not going for cstdint because it is a C++11 addition. Needed for uint16_t at least, in lookup-table code.	2015-03-16 16:21:50 -04:00
Benoit Steiner	5144f66728	Fixed compilation warning	2015-03-16 13:17:52 -07:00
Benoit Steiner	0fd6d52724	Fixed compilation error with clang	2015-03-16 13:16:12 -07:00
Benoit Jacob	eb6929cb19	fix bug in maxsize calculation, which would cause products of size > 2048 to address the lookup table out of bounds	2015-03-16 16:15:47 -04:00
Benoit Steiner	f218c0181d	Fixes the Lvalue computation by actually setting the LvalueBit properly when instantiating tensors of const T. Added a test to check the fix.	2015-03-16 13:05:00 -07:00
Benoit Jacob	35c3a8bb84	Update Nexus 5 lookup table from combining now 2 runs of the benchmark, using the analyze-blocking-sizes partition tool. Gives better worst-case performance.	2015-03-16 11:05:51 -04:00
Benoit Jacob	e274607d7f	fix compilation with GCC 4.8	2015-03-16 10:48:27 -04:00
Benoit Jacob	151b8b95c6	Fix bug in case where EIGEN_TEST_SPECIFIC_BLOCKING_SIZE is defined but false	2015-03-15 19:10:51 -04:00
Benoit Jacob	02babb9c0f	Provide a empirical lookup table for blocking sizes measured on a Nexus 5. Only for float, only for Android on ARM 32bit for now.	2015-03-15 18:13:12 -04:00
Benoit Jacob	3589a9c115	actual_panel_rows computation should always be resilient to parameters not consistent with the known L1 cache size, see comment	2015-03-15 18:12:18 -04:00
Benoit Jacob	1dd3d89818	Fix a unused-var warning	2015-03-15 18:07:19 -04:00
Benoit Jacob	ca5c12587b	Polish lookup tables generation	2015-03-15 18:05:53 -04:00
Benoit Jacob	e56aabf205	Refactor computeProductBlockingSizes to make room for the possibility of using lookup tables	2015-03-15 18:05:12 -04:00
Benoit Jacob	b6b88c0808	update perf_monitoring/gemm/changesets.txt	2015-03-13 14:57:05 -07:00
Benoit Jacob	488c15615a	organize a little our default cache sizes, and use a saner default L1 outside of x86 (10% faster on Nexus 5)	2015-03-13 14:51:26 -07:00
Gael Guennebaud	9f58524cbd	merge	2015-03-13 21:16:39 +01:00
Gael Guennebaud	1330f8bbd1	bug #973 , improve AVX support by enabling vectorization of Vector4i-like types, and enforcing alignement of Vector4f/Vector2d-like types to preserve compatibility with SSE and future Eigen versions that will vectorize them with AVX enabled.	2015-03-13 21:15:50 +01:00
Gael Guennebaud	d99ab35f9e	Fix internal::random(x,y) for integer types. The previous implementation could return y+1. The new implementation uses rejection sampling to get an unbiased behabior.	2015-03-13 21:12:46 +01:00
Gael Guennebaud	8580eb6808	bug #949 : add static assertion for incompatible scalar types in dense end-user decompositions.	2015-03-13 21:06:20 +01:00
Gael Guennebaud	a9df28c95b	SparseMatrix::insert: switch to a fully uncompressed mode if sequential insertion is not possible (otherwise an arbitrary large amount of memory was preallocated in some cases)	2015-03-13 21:00:21 +01:00
Gael Guennebaud	5ffe29cb9f	Bound pre-allocation to the maximal size representable by StorageIndex and throw bad_alloc if that's not possible.	2015-03-13 20:57:33 +01:00
Benoit Jacob	d73ccd717e	Add support for dumping blocking sizes tables	2015-03-13 10:36:01 -07:00
Gael Guennebaud	2f6f8bf31c	Add missing coeff/coeffRef members to Block<sparse>, and extend unit tests.	2015-03-13 16:24:40 +01:00
Benoit Jacob	f2c3e2b10f	Add --only-cubic-sizes option to analyze-blocking-sizes tool	2015-03-12 13:16:33 -07:00
Gael Guennebaud	fd78874888	Fix compilation of iterative solvers with dense matrices	2015-03-09 21:31:03 +01:00
Gael Guennebaud	d4317a85e8	Add typedefs for return types of SparseMatrixBase::selfadjointView	2015-03-09 21:29:46 +01:00
Gael Guennebaud	9e885fb766	Add unit tests for CG and sparse-LLT for long int as storage-index	2015-03-09 14:33:15 +01:00

1 2 3 4 5 ...

6372 Commits