mirror of
git://sourceware.org/git/glibc.git
synced 2024-11-27 03:41:23 +08:00
d846f4c12d
The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance to the generic lgammaf. The code was adapted to glibc style, to use the definition of math_config.h, to remove errno handling, to use math_narrow_eval on overflow usage, and to adapt to make it reentrant. Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 86.5609 70.3278 18.75% x86_64v2 78.3030 69.9709 10.64% x86_64v3 74.7470 59.8457 19.94% i686 387.355 229.761 40.68% aarch64 40.8341 33.7563 17.33% power10 26.5520 16.1672 39.11% powerpc 28.3145 17.0625 39.74% reciprocal-throughput master patched improvement x86_64 68.0461 48.3098 29.00% x86_64v2 55.3256 47.2476 14.60% x86_64v3 52.3015 38.9028 25.62% i686 340.848 195.707 42.58% aarch64 36.8000 30.5234 17.06% power10 20.4043 12.6268 38.12% powerpc 22.6588 13.8866 38.71% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com> |
||
---|---|---|
.. | ||
bits | ||
coldfire | ||
fpu | ||
m680x0 | ||
nptl | ||
sys | ||
__longjmp.c | ||
abort-instr.h | ||
asm-syntax.h | ||
backtrace.c | ||
bsd-_setjmp.c | ||
bsd-setjmp.c | ||
configure | ||
configure.ac | ||
crti.S | ||
crtn.S | ||
dl-machine.h | ||
dl-tls.h | ||
dl-trampoline.S | ||
elf-initfini.h | ||
fpu_control.h | ||
gccframe.h | ||
Implies | ||
jmpbuf-unwind.h | ||
ldsodefs.h | ||
libc-tls.c | ||
Makefile | ||
math-use-builtins-ffs.h | ||
memchr.S | ||
memcopy.h | ||
preconfigure | ||
preconfigure.ac | ||
rawmemchr.S | ||
setjmp.c | ||
shlib-versions | ||
sotruss-lib.c | ||
stackinfo.h | ||
start.S | ||
strchr.S | ||
strchrnul.S | ||
symbol-hacks.h | ||
sysdep.h | ||
tst-audit.h | ||
unwind-arch.h | ||
utmp-size.h | ||
Versions | ||
wcpcpy_chk.c | ||
wordcopy.c |