Antonio Sánchez
|
bb51d9f4fa
|
Fix ODR violations.
|
2022-07-09 04:56:36 +00:00 |
|
Rohit Santhanam
|
06a458a13d
|
Enable subtests which use device side malloc since this has been fixed in ROCm 5.2.
|
2022-06-29 17:09:43 +00:00 |
|
Chip Kerchner
|
84cf3ff18d
|
Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial.
|
2022-06-27 19:18:00 +00:00 |
|
Chip Kerchner
|
c603275dc9
|
Better performance for Power10 using more load and store vector pairs for GEMV
|
2022-06-27 18:11:55 +00:00 |
|
Antonio Sanchez
|
0e18714167
|
Fix clang-tidy warnings about function definitions in headers.
|
2022-06-24 15:10:58 +00:00 |
|
Antonio Sánchez
|
8ed3b9dcd6
|
Skip f16/bf16 bessel specializations on AVX512 if unavailable.
|
2022-06-24 15:10:36 +00:00 |
|
Antonio Sánchez
|
bc2ab81634
|
Eliminate undef warnings when not compiling for AVX512.
|
2022-06-24 15:10:10 +00:00 |
|
Antonio Sánchez
|
0e083b172e
|
Use numext::sqrt in Householder.h.
|
2022-06-21 16:29:59 +00:00 |
|
b-shi
|
37673ca1bc
|
AVX512 TRSM kernels use alloca if EIGEN_NO_MALLOC requested
|
2022-06-17 18:05:26 +00:00 |
|
Chip Kerchner
|
4d1c16eab8
|
Fix tanh and erf to use vectorized version for EIGEN_FAST_MATH in VSX.
|
2022-06-15 16:06:43 +00:00 |
|
Mehdi Goli
|
7ea823e824
|
[SYCL-Spec] According to [SYCL-2020 spec](...
|
2022-06-13 15:52:29 +00:00 |
|
Arthur
|
ba4d7304e2
|
Document DiagonalBase
|
2022-06-08 17:46:32 +00:00 |
|
Binhao Qin
|
95463b59bc
|
Mark index_remap as EIGEN_DEVICE_FUNC in src/Core/Reshaped.h (Fixes #2493)
|
2022-06-07 20:10:47 +00:00 |
|
Shi, Brian
|
28812d2ebb
|
AVX512 TRSM Kernels respect EIGEN_NO_MALLOC
|
2022-06-07 11:28:42 -07:00 |
|
sfalmo
|
9960a30422
|
Fix row vs column vector typo in Matrix class tutorial
|
2022-06-07 17:28:19 +00:00 |
|
Antonio Sánchez
|
8c2e0e3cb8
|
Fix ambiguous comparisons for c++20 (again again)
|
2022-06-07 17:06:17 +00:00 |
|
Arthur
|
14aae29470
|
Provide DiagonalMatrix Product and Initializers
|
2022-06-06 21:43:22 +00:00 |
|
Antonio Sánchez
|
76cf6204f3
|
Revert "Fix c++20 ambiguity of comparisons."
This reverts commit 4f6354128f
|
2022-06-04 02:32:10 +00:00 |
|
aaraujom
|
8fbb76a043
|
Fix build issues with MSVC for AVX512
|
2022-06-03 14:55:40 +00:00 |
|
Antonio Sánchez
|
4f6354128f
|
Fix c++20 ambiguity of comparisons.
|
2022-06-03 05:11:07 +00:00 |
|
Oleg Shirokobrod
|
f542b0a71f
|
Adding an MKL adapter in FFT module.
|
2022-06-02 18:10:43 +00:00 |
|
aaraujom
|
d49ede4dc4
|
Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
|
2022-05-28 02:00:21 +00:00 |
|
Rasmus Munk Larsen
|
510f6b9f15
|
Fix integer shortening warnings in visitor tests.
|
2022-05-27 18:51:37 +00:00 |
|
Arthur
|
705ae70646
|
Add R-Bidiagonalization step to BDCSVD
|
2022-05-27 02:00:24 +00:00 |
|
Mario Rincon-Nigro
|
e99163e732
|
fix: issue 2481: LDLT produces wrong results with AutoDiffScalar
|
2022-05-25 15:26:10 +00:00 |
|
Antonio Sánchez
|
477eb7f630
|
Revert "Avoid ambiguous Tensor comparison operators for C++20 compatibility"
This reverts commit 5c2179b6c3
|
2022-05-24 16:09:59 +00:00 |
|
Mehdi Goli
|
c5a5ac680c
|
[SYCL] SYCL-2020 range does not have default constructor.
|
2022-05-24 03:11:46 +00:00 |
|
Benjamin Kramer
|
5c2179b6c3
|
Avoid ambiguous Tensor comparison operators for C++20 compatibility
|
2022-05-23 17:36:03 +00:00 |
|
Chip Kerchner
|
aa8b7e2c37
|
Add subMappers to Power GEMM packing - simplifies the address calculations (10% faster)
|
2022-05-23 15:18:29 +00:00 |
|
Antonio Sánchez
|
32348091ba
|
Avoid signed integer overflow in adjoint test.
|
2022-05-23 14:46:16 +00:00 |
|
Mehdi Goli
|
cbe03f3531
|
[SYCL] Extending SYCL queue interface extension.
|
2022-05-23 14:45:27 +00:00 |
|
Guoqiang QI
|
32a3f9ac33
|
Improve plogical_shift_* implementations and fix typo in SVE/PacketMath.h
|
2022-05-23 09:33:49 +00:00 |
|
Eisuke Kawashima
|
ac5c83a3f5
|
unset executable flag
|
2022-05-22 22:47:43 +09:00 |
|
Antonio Sanchez
|
481a4a8c31
|
Fix BDCSVD condition for failing with numerical issue.
|
2022-05-20 08:18:31 -07:00 |
|
Tobias Wood
|
a9868bd5be
|
Add arg() to tensor
|
2022-05-20 03:33:01 +00:00 |
|
Antonio Sánchez
|
028ab12586
|
Prevent BDCSVD crash caused by index out of bounds.
|
2022-05-19 22:29:48 +00:00 |
|
Rohan Ghige
|
798fc1c577
|
Fix 'Incorrect reference code in STL_interface.hh for ata_product' eigen/issues/2425
|
2022-05-18 14:42:57 +00:00 |
|
Antonio Sánchez
|
9b9496ad98
|
Revert "Add AVX512 optimizations for matrix multiply"
This reverts commit 25db0b4a82
|
2022-05-13 18:50:33 +00:00 |
|
aaraujom
|
25db0b4a82
|
Add AVX512 optimizations for matrix multiply
|
2022-05-12 23:41:19 +00:00 |
|
Guoqiang QI
|
00b75375e7
|
Adding PocketFFT support in FFT module since kissfft has some flaws in accuracy and performance
|
2022-05-11 17:44:22 +00:00 |
|
Rasmus Munk Larsen
|
73d65dbc43
|
Update README.md. Remove obsolete comment about RowMajor not being fully supported.
|
2022-05-06 18:19:35 +00:00 |
|
Francesco Romano
|
68e03ab240
|
Add uninstall target only if not already defined.
|
2022-05-05 17:43:08 +00:00 |
|
Alex_M
|
2c055f8633
|
make diagonal matrix cols() and rows() methods constexpr
|
2022-05-03 10:13:37 +02:00 |
|
Chip Kerchner
|
c2f15edc43
|
Add load vector_pairs for RHS of GEMM MMA. Improved predux GEMV.
|
2022-04-25 16:23:01 +00:00 |
|
John Mather
|
9e026e5e28
|
Removed need to supply the Symmetric flag to UpLo argument for Accelerate LLT and LDLT
|
2022-04-21 20:02:10 +00:00 |
|
Chip Kerchner
|
44ba7a0da3
|
Fix compiler bugs for GCC 10 & 11 for Power GEMM
|
2022-04-20 15:59:00 +00:00 |
|
Chip Kerchner
|
b02c384ef4
|
Add fused multiply functions for PowerPC - pmsub, pnmadd and pnmsub
|
2022-04-18 16:16:32 +00:00 |
|
Rohit Santhanam
|
3de96caeaa
|
Fix HouseholderSequence.h
|
2022-04-17 02:46:56 +00:00 |
|
Antonio Sánchez
|
f845a8bb1a
|
Fix cwise NaN propagation for scalar input.
|
2022-04-16 05:07:44 +00:00 |
|
Charles Schlosser
|
a4bb513b99
|
Update HouseholderSequence.h
|
2022-04-15 16:56:17 +00:00 |
|