Commit Graph

  • 33f5f59614 Vectorize cbrt for float and double. master Rasmus Munk Larsen 2025-04-17 23:31:20 +00:00
  • 5330960900 Enable packet segment in partial redux nightly Charles Schlosser 2025-04-14 17:44:53 +00:00
  • 6266d430cc packet segment: also check DiagonalWrapper Charles Schlosser 2025-04-12 19:34:11 +00:00
  • e39ad8badc fix constexpr in CoreEvaluators.h Charles Schlosser 2025-04-12 18:54:09 +00:00
  • 7aefb9f4d9 fix memset optimization for std::complex types Charles Schlosser 2025-04-12 16:20:09 +00:00
  • 73ca849a68 fix packetSegment for ArrayWrapper / MatrixWrapper Charles Schlosser 2025-04-12 12:12:48 +00:00
  • 28c3b26d53 masked load/store framework Charles Schlosser 2025-04-12 00:31:10 +00:00
  • cebe09110c Fix a potential deadlock because of Eigen thread pool Eugene Zhulenev 2025-04-11 23:43:14 +00:00
  • 11fd34cc1c Fix the typing of the Tasks in ForkJoin.h William Kong 2025-04-09 17:21:36 +00:00
  • 2cd47d743e Fix Conversion Warning in Parallelizer Hunter Belanger 2025-04-08 07:39:01 +00:00
  • b860042263 Add postream for ostream-ing packets more reliably. Antonio Sánchez 2025-04-01 22:12:00 +00:00
  • 02d9e1138a Add missing pmadd for Packet16bf. Antonio Sánchez 2025-03-31 04:17:17 +00:00
  • 9cc9209b9b Fix cmake warning and default to j0. Antonio Sánchez 2025-03-29 16:09:40 +00:00
  • e0c99a8dd6 By default, run ctests on all available cores in parallel. Rasmus Munk Larsen 2025-03-27 13:16:44 -07:00
  • 63a40ffb95 Use fma<float> for fma<half> and fma<bfloat16> if native fma is not available on the platform. Rasmus Munk Larsen 2025-03-28 04:26:04 +00:00
  • 44fb6422be Allow triggering full CI if MR label contains all-tests Antonio Sanchez 2025-03-27 06:53:52 -07:00
  • 3866cbfbe8 Fix test for TensorRef of trace. Rasmus Munk Larsen 2025-03-25 23:01:46 +00:00
  • 6579e36eb4 Allow Tensor trace to be passed to a TensorRef. Antonio Sanchez 2025-03-25 08:26:23 -07:00
  • 8e32cbf7da Reduce flakiness of test for Eigen::half. Antonio Sanchez 2025-03-23 22:31:25 -07:00
  • d935916ac6 Add numext::fma and missing pmadd implementations. Antonio Sánchez 2025-03-23 01:05:53 +00:00
  • 754bd24f5e fix 2828 Charles Schlosser 2025-03-22 17:19:44 +00:00
  • ac2165c11f fix allFinite Charles Schlosser 2025-03-20 16:04:46 +00:00
  • 3143968195 Generalize the Eigen ForkJoin scheduler to use any ThreadPool interface. William Kong 2025-03-19 18:44:03 +00:00
  • 70f2aead9a Use native _Float16 for AVX512FP16 and update vectorization. Antonio Sánchez 2025-03-19 19:55:26 +00:00
  • 3580a38298 Use native _Float16 for AVX512FP16 and update vectorization. avx512fp16 Antonio Sanchez 2025-03-16 20:58:59 -07:00
  • 0259a52b0e Use more .noalias() Markus Vieth 2025-03-17 19:32:43 +01:00
  • 14f845a1a8 Fix givens rotation. Antonio Sánchez 2025-03-14 17:15:57 +00:00
  • 5e8edd2186 Fix undefined behavior in PPC load. 3.3 Antonio Sanchez 2025-03-13 22:23:45 -07:00
  • 0ac1fc52dd Fix CUDA clang again with new C++11 usages Antonio Sanchez 2025-03-13 20:47:31 -07:00
  • 33b04fe518 CMake: add install-doc target Guilhem Saurel 2024-12-13 21:46:20 +01:00
  • 6aa0143851 Judge unitary-ness relative to scaling. Antonio Sánchez 2024-04-30 22:28:46 +00:00
  • c7f6f8315f Update CUDA testing infra to match master branch. Antonio Sanchez 2025-03-13 21:49:24 +00:00
  • b0448fc6e0 Fix cxx03 testing job configuration Antonio Sanchez 2025-03-13 11:13:35 -07:00
  • 3b8644da50 Better rand to fix MSVC random tests Antonio Sanchez 2025-03-13 08:45:50 -07:00
  • 414c42bfcf Fix cuda clang builds Antonio Sanchez 2025-03-11 19:20:03 -07:00
  • 10e62ccd22 Fix x86 complex vectorized fma Charles Schlosser 2025-03-12 17:06:32 +00:00
  • 952eda443b Fix GPU build failures. Antonio Sanchez 2025-03-09 17:04:41 -07:00
  • 6a4a0b66bd Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC. Chip Kerchner 2023-02-16 23:35:42 +00:00
  • 079de53fa5 Adjust tolerance of matrix_power test for MSVC. Antonio Sanchez 2022-03-01 23:31:15 +00:00
  • ce950ca2db Patch PPC PacketMath from 3.4. Antonio Sanchez 2025-03-07 20:59:30 -08:00
  • 464c1d0978 Format TensorDeviceThreadPool.h & use if constexpr for c++20. Rasmus Munk Larsen 2025-03-08 01:09:36 +00:00
  • 21223f6bb6 Fix addition of different enum types. Rasmus Munk Larsen 2025-03-07 22:18:00 +00:00
  • 350544eb01 Clean up TensorDeviceThreadPool.h Rasmus Munk Larsen 2025-03-07 18:14:17 +00:00
  • 43810fc1be Fix extra semicolon in DeviceWrapper Kevin 2025-03-07 01:07:23 +00:00
  • d28041ed5a refactor AssignmentFunctors.h, unify with existing scalar_op Charles Schlosser 2025-03-06 01:28:39 +00:00
  • 49bd503308 Fix merge conflict error Antonio Sanchez 2025-03-03 15:21:02 -08:00
  • 5b20d9f326 Fix arm32 float division and related bugs Charles Schlosser 2023-08-29 00:36:07 +00:00
  • 5f8f69020b Remove poor non-convergence checks in NonLinearOptimization. Antonio Sánchez 2022-03-02 19:31:20 +00:00
  • dc9325848a Fix arm32 issues. Antonio Sánchez 2024-01-23 22:04:55 +00:00
  • 9df4c76bb8 Fix emulated builds cmake configuration Antonio Sanchez 2025-03-03 07:26:26 -08:00
  • 0071c2e8a8 Fix more hard-coded magic bounds. Antonio Sánchez 2024-04-29 21:21:11 +00:00
  • 03727bdf55 Slightly adjust error bound for nonlinear tests. Antonio Sánchez 2024-04-25 18:04:49 +00:00
  • 5e39ba6642 Fix emulated tests. Antonio Sánchez 2025-01-31 19:30:42 +00:00
  • d2ce4faa5a Fix cuda 9+ builds Antonio Sanchez 2025-03-02 07:38:47 -08:00
  • 43b7aa2412 Don't check for build type Antonio Sanchez 2025-02-28 22:13:46 -08:00
  • 23b1682723 Fix cuda device warnings Antonio Sanchez 2025-02-28 15:21:23 -08:00
  • 9a86214039 Optimize division operations in TensorVolumePatch.h Gopinath Vasalamarri 2025-02-28 22:34:13 +00:00
  • c53002f5fb Fix failing tests on arm/ppc Antonio Sanchez 2025-02-28 13:15:33 -08:00
  • be5147b090 Fix STL feature detection for c++20. Antonio Sánchez 2025-02-28 19:52:37 +00:00
  • ea37d9e73e Remove private access of std::deque::_M_impl. Antonio Sanchez 2020-12-10 14:59:34 -08:00
  • 179a49684a Fix CMake BOOST warning Antonio Sanchez 2025-02-28 07:33:26 -08:00
  • dd56367554 Fix docs job for nightlies Antonio Sanchez 2025-02-26 08:00:41 -08:00
  • ece7cec604 Fix parsing of command-line arguments when already specified as a cmake list. Antonio Sánchez 2023-03-16 22:47:38 +00:00
  • 2e708d48ca Merge CI from 3.4 Antonio Sanchez 2025-02-25 21:23:42 -08:00
  • 109935bfce Fix Tensor docs C. Antonio Sanchez 2025-02-15 22:46:57 -08:00
  • 339d7188ed Fix up all doxygen warnings. Antonio Sanchez 2025-02-25 21:05:40 -08:00
  • d79bac0d3c Fix boolean scatter and random generation for tensors. Antonio Sánchez 2025-02-25 21:37:09 +00:00
  • 9935396b15 Specify constructor template arguments for ConstexprTest struct Tyler Veness 2025-02-25 19:38:47 +00:00
  • 72adf891d5 Slightly simplify ForkJoin code, and make sure the test is actually run. Rasmus Munk Larsen 2025-02-25 17:22:43 +00:00
  • 6aebfa9acc Build docs on push, and don't expire Antonio Sanchez 2025-02-24 08:29:00 -08:00
  • 68f4e58cfa Don't expire docs pages job 3.4 Antonio Sanchez 2025-02-24 08:23:58 -08:00
  • bddaa99e15 Fix bitwise operation error when compiling as C++26 Markus Vieth 2025-02-21 20:48:31 +01:00
  • e42dceb3a1 Fix implicit copy-constructor warning in TensorRef. C. Antonio Sanchez 2025-02-22 08:36:05 -08:00
  • 5fc6fc9881 Initialize matrix in bicgstab test Antonio Sanchez 2025-02-21 10:27:29 -08:00
  • 0ae7b59018 Make assignment constexpr Tyler Veness 2025-02-21 18:16:46 +00:00
  • 4dda5b927a fix Warray-bounds in inner product Charles Schlosser 2025-02-20 22:40:55 +00:00
  • 66f7f51b7e Disable fno-check-new on clang. C. Antonio Sanchez 2025-02-18 21:24:47 -08:00
  • 151f6127df Fix Warray-bounds warning for fixed-size assignments Charles Schlosser 2025-02-18 19:23:14 +00:00
  • 0e607fd350 Fix c++03 build and tests Antonio Sanchez 2025-02-18 10:41:23 -08:00
  • 13507d1efd Remove nightly tag deploy on non-default branches C. Antonio Sanchez 2025-02-17 18:48:09 -08:00
  • 85ffda9539 Fix arm32 packetmath tests C. Antonio Sanchez 2025-02-17 17:49:08 -08:00
  • 72f77ccb3e Fix arm32 float division and related bugs Charles Schlosser 2023-08-29 00:36:07 +00:00
  • 526a6328e2 Default eigen_packet_wrapper constructor. Antonio Sanchez 2021-09-14 10:57:22 -07:00
  • 7b378c2d91 Fix cherry-pick bug for NEON make_packet C. Antonio Sanchez 2025-02-17 12:59:57 -08:00
  • 129e003cdf Disable FP16 arithmetic for arm32. Antonio Sánchez 2023-06-26 18:39:42 +00:00
  • 6161ce5cde Fix arm builds. Antonio Sánchez 2023-03-20 16:59:38 +00:00
  • be62728876 More NEON packetmath fixes. Antonio Sánchez 2023-02-14 21:45:25 +00:00
  • 1426855b68 Fix NEON make_packet2f. Antonio Sánchez 2023-02-14 16:52:07 +00:00
  • b2deb94e4a Fix MSVC arm build. Antonio Sánchez 2023-02-08 21:46:37 +00:00
  • c23abcf25c Fix arm32 issues. Antonio Sánchez 2024-01-23 22:04:55 +00:00
  • f23b8c0d78 Fix more hard-coded magic bounds. Antonio Sánchez 2024-04-29 21:21:11 +00:00
  • d60c3a3341 Slightly adjust error bound for nonlinear tests. Antonio Sánchez 2024-04-25 18:04:49 +00:00
  • 57c8d7c93f Fix failing builds and update CI on push. C. Antonio Sanchez 2025-02-16 22:13:29 -08:00
  • ab92609cad Add missing ci scripts C. Antonio Sanchez 2025-02-16 15:06:16 -08:00
  • 551e95a409 Run pipelines on push C. Antonio Sanchez 2025-02-16 14:50:58 -08:00
  • 2924f58188 Remove deprecated check in meta test C. Antonio Sanchez 2025-02-16 14:42:15 -08:00
  • f1922b6dac Update cmake configuration from master C. Antonio Sanchez 2025-02-16 11:22:08 -08:00
  • 052d91349a Split bdcsvd tests C. Antonio Sanchez 2025-02-16 11:21:42 -08:00
  • 72e38684c1 Disable deprecated warnings for SVD tests on MSVC. Antonio Sánchez 2022-02-24 21:20:49 +00:00
  • bb1dbb4df6 Disable deprecated warnings in SVD tests. Antonio Sánchez 2022-02-23 18:32:00 +00:00