Gael Guennebaud
|
f329d0908a
|
Improve random number generation for integer and add unit test
|
2015-03-19 15:10:36 +01:00 |
|
Deanna Hood
|
2ab4922431
|
Make html directory before generating output image there
|
2015-03-18 07:24:13 +10:00 |
|
Deanna Hood
|
41b717de25
|
More extensive unit tests for recent array-wise functors
|
2015-03-18 03:11:03 +10:00 |
|
Benoit Steiner
|
cc0f89eb3b
|
Changed the way lvalue operations are declared in TensorBase: this fixes constness issues that prevented some expressions mixing lvalues and rvalues from compiling.
|
2015-03-17 09:57:20 -07:00 |
|
Benoit Jacob
|
dc04f12967
|
use unsigned short instead of uint16_t which doesn't exist in c++98
|
2015-03-17 10:31:45 -04:00 |
|
Deanna Hood
|
8878e1c1de
|
Remove ambiguity with recent numext methods isNaN and isInf
|
2015-03-17 22:39:51 +10:00 |
|
Deanna Hood
|
596be3cd86
|
Use std::arg for real numbers when c++11 is used, custom implementation otherwise
|
2015-03-17 15:28:12 +10:00 |
|
Deanna Hood
|
e26134ed62
|
Use std::round when c++11 is used, custom implementation otherwise
|
2015-03-17 14:55:14 +10:00 |
|
Deanna Hood
|
e21e29a088
|
Update cost of arg call to depend on if the scalar is complex or not
|
2015-03-17 14:04:33 +10:00 |
|
Deanna Hood
|
447a5a6b01
|
Fix VML declarations to only be for real/complex as appropriate
|
2015-03-17 13:33:31 +10:00 |
|
Deanna Hood
|
f52b78491c
|
Remove packet isNaN, isInf, isFinite
|
2015-03-17 09:26:24 +10:00 |
|
Deanna Hood
|
1c78d6f2a6
|
Add boolean not operator (!) array support
|
2015-03-17 08:29:57 +10:00 |
|
Deanna Hood
|
85da0c2281
|
Remove test of now-missing floor, ceil, round complex implementations
|
2015-03-17 06:56:47 +10:00 |
|
Benoit Jacob
|
364cfd529d
|
Similar to cset 3589a9c115, also in 2px4 kernel: actual_panel_rows computation should always be resilient to parameters not consistent with the known L1 cache size, see comment
|
2015-03-16 16:28:44 -04:00 |
|
Benoit Steiner
|
25664afacd
|
Pulled latest updates from trunk
|
2015-03-16 13:25:45 -07:00 |
|
Deanna Hood
|
e1d6e6c972
|
Make cube, inverse and abs2 free-functions
|
2015-03-17 06:25:24 +10:00 |
|
Benoit Jacob
|
577056aa94
|
Include stdint.h. Not going for cstdint because it is a C++11 addition. Needed for uint16_t at least, in lookup-table code.
|
2015-03-16 16:21:50 -04:00 |
|
Benoit Steiner
|
5144f66728
|
Fixed compilation warning
|
2015-03-16 13:17:52 -07:00 |
|
Benoit Steiner
|
0fd6d52724
|
Fixed compilation error with clang
|
2015-03-16 13:16:12 -07:00 |
|
Benoit Jacob
|
eb6929cb19
|
fix bug in maxsize calculation, which would cause products of size > 2048 to address the lookup table out of bounds
|
2015-03-16 16:15:47 -04:00 |
|
Benoit Steiner
|
f218c0181d
|
Fixes the Lvalue computation by actually setting the LvalueBit properly when instantiating tensors of const T. Added a test to check the fix.
|
2015-03-16 13:05:00 -07:00 |
|
Deanna Hood
|
fef4e071d7
|
Rename isinf to isInf
|
2015-03-17 05:58:47 +10:00 |
|
Deanna Hood
|
46cf9cda32
|
Add isfinite array support as isFinite
|
2015-03-17 04:33:12 +10:00 |
|
Deanna Hood
|
7b829940d1
|
Add code snippets for new methods
|
2015-03-17 03:40:28 +10:00 |
|
Deanna Hood
|
1d76ceab55
|
Remove floor, ceil, round for complex numbers
|
2015-03-17 02:36:07 +10:00 |
|
Deanna Hood
|
717b7954ce
|
Update cost of coeff-wise arg call
|
2015-03-17 02:11:57 +10:00 |
|
Deanna Hood
|
fb68b149cb
|
Rename isnan to isNaN
|
2015-03-17 02:04:42 +10:00 |
|
Benoit Jacob
|
35c3a8bb84
|
Update Nexus 5 lookup table from combining now 2 runs of the benchmark, using the analyze-blocking-sizes partition tool. Gives better worst-case performance.
|
2015-03-16 11:05:51 -04:00 |
|
Benoit Jacob
|
e274607d7f
|
fix compilation with GCC 4.8
|
2015-03-16 10:48:27 -04:00 |
|
Benoit Jacob
|
151b8b95c6
|
Fix bug in case where EIGEN_TEST_SPECIFIC_BLOCKING_SIZE is defined but false
|
2015-03-15 19:10:51 -04:00 |
|
Benoit Jacob
|
02babb9c0f
|
Provide an empirical lookup table for blocking sizes measured on a Nexus 5. Only for float, only for Android on ARM 32bit for now.
|
2015-03-15 18:13:12 -04:00 |
|
Benoit Jacob
|
3589a9c115
|
actual_panel_rows computation should always be resilient to parameters not consistent with the known L1 cache size, see comment
|
2015-03-15 18:12:18 -04:00 |
|
Benoit Jacob
|
1dd3d89818
|
Fix an unused-var warning
|
2015-03-15 18:07:19 -04:00 |
|
Benoit Jacob
|
ca5c12587b
|
Polish lookup tables generation
|
2015-03-15 18:05:53 -04:00 |
|
Benoit Jacob
|
e56aabf205
|
Refactor computeProductBlockingSizes to make room for the possibility of using lookup tables
|
2015-03-15 18:05:12 -04:00 |
|
Benoit Jacob
|
b6b88c0808
|
update perf_monitoring/gemm/changesets.txt
|
2015-03-13 14:57:05 -07:00 |
|
Benoit Jacob
|
488c15615a
|
organize a little our default cache sizes, and use a saner default L1 outside of x86 (10% faster on Nexus 5)
|
2015-03-13 14:51:26 -07:00 |
|
Gael Guennebaud
|
9f58524cbd
|
merge
|
2015-03-13 21:16:39 +01:00 |
|
Gael Guennebaud
|
1330f8bbd1
|
bug #973, improve AVX support by enabling vectorization of Vector4i-like types, and enforcing alignment of Vector4f/Vector2d-like types to preserve compatibility with SSE and future Eigen versions that will vectorize them with AVX enabled.
|
2015-03-13 21:15:50 +01:00 |
|
Gael Guennebaud
|
d99ab35f9e
|
Fix internal::random(x,y) for integer types. The previous implementation could return y+1. The new implementation uses rejection sampling to get unbiased behavior.
|
2015-03-13 21:12:46 +01:00 |
|
Gael Guennebaud
|
8580eb6808
|
bug #949: add static assertion for incompatible scalar types in dense end-user decompositions.
|
2015-03-13 21:06:20 +01:00 |
|
Gael Guennebaud
|
a9df28c95b
|
SparseMatrix::insert: switch to a fully uncompressed mode if sequential insertion is not possible (otherwise an arbitrarily large amount of memory was preallocated in some cases)
|
2015-03-13 21:00:21 +01:00 |
|
Gael Guennebaud
|
5ffe29cb9f
|
Bound pre-allocation to the maximal size representable by StorageIndex and throw bad_alloc if that's not possible.
|
2015-03-13 20:57:33 +01:00 |
|
Benoit Jacob
|
d73ccd717e
|
Add support for dumping blocking sizes tables
|
2015-03-13 10:36:01 -07:00 |
|
Gael Guennebaud
|
2f6f8bf31c
|
Add missing coeff/coeffRef members to Block<sparse>, and extend unit tests.
|
2015-03-13 16:24:40 +01:00 |
|
Benoit Jacob
|
f2c3e2b10f
|
Add --only-cubic-sizes option to analyze-blocking-sizes tool
|
2015-03-12 13:16:33 -07:00 |
|
Doug Kwan
|
657407227e
|
Fix bug in pdiv<Packet1cd> which swaps 32-bit halves of a pair of doubles instead of swapping the doubles.
|
2015-03-11 15:13:37 -07:00 |
|
Deanna Hood
|
f89fcefa79
|
Add hyperbolic trigonometric functions from std array support
|
2015-03-11 13:13:30 +10:00 |
|
Deanna Hood
|
a5e49976f5
|
Add log10 array support
|
2015-03-11 08:56:42 +10:00 |
|
Deanna Hood
|
19a71056ae
|
Allow calling of square(array) in addition to array.square()
|
2015-03-11 06:59:28 +10:00 |
|