William Kong
|
5d866a7a78
|
Fix potential data race on spin_count_ NonBlockingThreadPool member variable
|
2025-01-28 17:22:15 +00:00 |
|
William Kong
|
bc67025ba7
|
Clean up and fix the documentation of ForkJoin.h
|
2025-01-27 23:12:17 +00:00 |
|
Antonio Sánchez
|
dc1126e762
|
Fix threadpool for c++14.
|
2025-01-27 21:57:23 +00:00 |
|
Rasmus Munk Larsen
|
cd511a09aa
|
Fix initialization order and remove unused variables in NonBlockingThreadPool.h.
|
2025-01-27 19:35:49 +00:00 |
|
Johannes Zipfel
|
f679843dc2
|
Block doc non square
|
2025-01-25 17:14:21 +00:00 |
|
William Kong
|
f9705adabb
|
Fix typo introduced in the refactor of NonBlockingThreadPool
|
2025-01-25 17:13:24 +00:00 |
|
Antonio Sánchez
|
b75895a8b6
|
Try to fix loongarch
|
2025-01-25 16:38:41 +00:00 |
|
William Kong
|
4a6ac97d13
|
Add a ForkJoin-based ParallelFor algorithm to the ThreadPool module
|
2025-01-24 22:12:05 +00:00 |
|
Pengzhou0810
|
e986838464
|
Add LoongArch64 architecture LSX support.(build/test )
|
2025-01-20 18:37:44 +00:00 |
|
Markus Vieth
|
c486af5ad3
|
Change Eigen::aligned_allocator to not inherit from std::allocator
|
2025-01-20 16:04:43 +00:00 |
|
Antonio Sánchez
|
abac563f5d
|
Update documentation to clarify cross product for complex numbers.
|
2025-01-16 00:52:40 +00:00 |
|
Antonio Sanchez
|
2e76277bd0
|
Zero-initialize test arrays to avoid uninitialized reads.
|
2025-01-14 09:15:43 -08:00 |
|
Antonio Sánchez
|
ad13df7ea4
|
Fix std::fill_n reference.
|
2025-01-14 00:43:00 +00:00 |
|
Frédéric Simonis
|
9836e8d035
|
Fix read of uninitialized threshold in SparseQR
|
2025-01-08 23:40:58 +00:00 |
|
Charles Schlosser
|
7bb23b1e36
|
CI: don't add ToolChain PPA
|
2024-12-31 14:04:01 +00:00 |
|
xsjk
|
7bb8c58e7c
|
Fix the missing CUDA device qualifier
|
2024-12-28 15:17:55 +00:00 |
|
Joerg Buchwald
|
24e0c2a125
|
use omp_get_max_threads if setNbThreads is not set
|
2024-12-20 21:16:15 +00:00 |
|
Jordan Rupprecht
|
a32db43966
|
Add missing #include <new>
|
2024-12-19 11:06:08 +00:00 |
|
Charles Schlosser
|
c01ff45312
|
Enable fill_n and memset optimizations for construction and assignment
|
2024-12-14 14:25:04 +00:00 |
|
Antonio Sánchez
|
af59ada0ac
|
Use alpine for deploying nightly tag.
|
2024-12-10 22:48:29 +00:00 |
|
Charles Schlosser
|
4a9e32ae0b
|
matrix equality operator
|
2024-12-10 12:40:39 +00:00 |
|
Antonio Sanchez
|
00776d1ba4
|
Remove branch name from nightly tag job.
|
2024-12-09 20:18:18 -08:00 |
|
Antonio Sanchez
|
7f23778593
|
Add tag to commit instead of branch
|
2024-12-09 07:47:48 -08:00 |
|
Antonio Sánchez
|
c30b35a310
|
Force tag to update to latest head.
|
2024-12-08 04:48:21 +00:00 |
|
Antonio Sánchez
|
a26ba67349
|
Add LICENSE file in correct place so it is picked up by gitlab.
|
2024-12-08 03:26:43 +00:00 |
|
Charles Schlosser
|
08c31c3ba6
|
try alpine for formatting
|
2024-12-08 01:09:33 +00:00 |
|
Antonio Sanchez
|
1ac1af62ef
|
Update deploy job
|
2024-12-07 09:19:21 -08:00 |
|
Antonio Sánchez
|
7b6623af30
|
Fix special packetmath erfc flushing for ARM32.
|
2024-12-07 01:42:30 +00:00 |
|
Antonio Sánchez
|
fd48fbb260
|
Update rocm docker again again.
|
2024-12-06 22:13:53 +00:00 |
|
Antonio Sánchez
|
a885340ba5
|
Update rocm docker again.
|
2024-12-06 17:19:31 +00:00 |
|
Antonio Sanchez
|
45a8478d09
|
Update rocm docker image in CI.
|
2024-12-06 07:14:59 -08:00 |
|
Antonio Sánchez
|
de4afcf414
|
Add a deploy phase to the CI that tags the latest nightly pipeline if it passes.
|
2024-12-05 15:28:18 +00:00 |
|
Charles Schlosser
|
5e8916050b
|
move constructor / move assignment doc strings
|
2024-12-04 17:42:20 +00:00 |
|
Charles Schlosser
|
77a073aaa8
|
fix checkformat ci stage
|
2024-12-04 02:45:52 +00:00 |
|
Charles Schlosser
|
41e46ed243
|
fix IOFormat alignment
|
2024-12-04 01:13:48 +00:00 |
|
Charles Schlosser
|
a0d32e40d9
|
fix map fill logic
|
2024-11-30 13:39:02 +00:00 |
|
Charles Schlosser
|
d34b100c13
|
Fix UB in setZero
|
2024-11-27 19:32:14 +00:00 |
|
Rasmus Munk Larsen
|
f19a6803c8
|
Refactor special case handling in pow(x,y) and revert to repeated squaring for <float,int>
|
2024-11-27 00:24:21 +00:00 |
|
Rasmus Munk Larsen
|
5064cb7d5e
|
Add test for using pcast on scalars.
|
2024-11-25 22:27:26 -08:00 |
|
Rasmus Munk Larsen
|
1ea61a5d26
|
Improve pow(x,y): 25% speedup, increase accuracy for integer exponents.
|
2024-11-26 06:13:48 +00:00 |
|
Charles Schlosser
|
8ad4344ca7
|
optimize setConstant, setZero
|
2024-11-22 03:39:19 +00:00 |
|
Rasmus Munk Larsen
|
5610a13b77
|
Simplify and speed up pow() by 5-6%
|
2024-11-20 12:45:00 +00:00 |
|
Rasmus Munk Larsen
|
6c6ce9d06b
|
Enable vectorized erf<double>(x) for SSE and AVX, which was accidentally removed in merge request 1750.
|
2024-11-19 22:14:29 +00:00 |
|
Rasmus Munk Larsen
|
e7c799b7c9
|
Prevent premature overflow to infinity in exp(x). The changes also provide a 3-4% speedup.
|
2024-11-19 13:08:18 -08:00 |
|
Rasmus Munk Larsen
|
00af47102d
|
Revert 040180078d
|
2024-11-19 10:25:16 -08:00 |
|
Rasmus Munk Larsen
|
8ee6f8475a
|
Speed up exp(x).
|
2024-11-19 17:50:34 +00:00 |
|
Charles Schlosser
|
93ec5450cb
|
disable fill_n optimization for msvc
|
2024-11-19 01:38:48 +00:00 |
|
Rasmus Munk Larsen
|
0af6ab4b76
|
Remove unnecessary check for HasBlend trait.
|
2024-11-18 21:16:45 +00:00 |
|
Rasmus Munk Larsen
|
d5eec781b7
|
Get rid of redundant computation for large arguments to erf(x).
|
2024-11-18 10:51:58 -08:00 |
|
Tyler Veness
|
2fc63808e4
|
Fix C++20 constexpr test compilation failures
|
2024-11-18 01:56:55 +00:00 |
|