Go to file
Adhemerval Zanella Netto 34b9f8bc17 math: Improve fmod
This uses a new algorithm similar to already proposed earlier [1].
With x = mx * 2^ex and y = my * 2^ey (mx, my, ex, ey being integers),
the simplest implementation is:

   mx * 2^ex == 2 * mx * 2^(ex - 1)

   while (ex > ey)
     {
       mx *= 2;
       --ex;
       mx %= my;
     }

With mx/my being mantissa of double floating pointer, on each step the
argument reduction can be improved 11 (which is sizeo of uint64_t minus
MANTISSA_WIDTH plus the signal bit):

   while (ex > ey)
     {
       mx << 11;
       ex -= 11;
       mx %= my;
     }  */

The implementation uses builtin clz and ctz, along with shifts to
convert hx/hy back to doubles.  Different than the original patch,
this path assume modulo/divide operation is slow, so use multiplication
with invert values.

I see the following performance improvements using fmod benchtests
(result only show the 'mean' result):

  Architecture     | Input           | master   | patch
  -----------------|-----------------|----------|--------
  x86_64 (Ryzen 9) | subnormals      | 19.1584  | 12.5049
  x86_64 (Ryzen 9) | normal          | 1016.51  | 296.939
  x86_64 (Ryzen 9) | close-exponents | 18.4428  | 16.0244
  aarch64 (N1)     | subnormal       | 11.153   | 6.81778
  aarch64 (N1)     | normal          | 528.649  | 155.62
  aarch64 (N1)     | close-exponents | 11.4517  | 8.21306

I also see similar improvements on arm-linux-gnueabihf when running on
the N1 aarch64 chips, where it a lot of soft-fp implementation (for
modulo, clz, ctz, and multiplication):

  Architecture     | Input           | master   | patch
  -----------------|-----------------|----------|--------
  armhf (N1)       | subnormal       | 15.908   | 15.1083
  armhf (N1)       | normal          | 837.525  | 244.833
  armhf (N1)       | close-exponents | 16.2111  | 21.8182

Instead of using the math_private.h definitions, I used the
math_config.h instead which is used on newer math implementations.

Co-authored-by: kirill <kirill.okhotnikov@gmail.com>

[1] https://sourceware.org/pipermail/libc-alpha/2020-November/119794.html
Reviewed-by: Wilco Dijkstra  <Wilco.Dijkstra@arm.com>
2023-04-03 16:36:24 -03:00
argp Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
assert Use __builtin_FILE instead of __FILE__ in assert in C++. 2023-02-10 17:12:40 +00:00
benchtests benchtests: Add fmodf benchmark 2023-04-03 16:13:55 -03:00
bits [hurd] Add MTU_DISCOVER values 2023-02-15 15:14:06 +01:00
catgets Update copyright dates not handled by scripts/update-copyrights 2023-01-06 21:45:36 +00:00
ChangeLog.old Create ChangeLog.old/ChangeLog.26. 2023-01-31 22:27:45 -05:00
conform Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
crypt Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
csu elf: Fix GL(dl_phdr) and GL(dl_phnum) for static builds [BZ #29864] 2023-01-12 13:54:34 -03:00
ctype Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
debug stdio-common: Handle -1 buffer size in __sprintf_chk & co (bug 30039) 2023-01-25 08:01:00 +01:00
dirent Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
dlfcn Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
elf Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
gmon Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
gnulib Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
grp Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
gshadow Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
hesiod Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
htl Fix typos in comments 2023-02-12 16:34:28 +01:00
hurd hurd: Microoptimize _hurd_self_sigstate () 2023-04-03 01:25:57 +02:00
iconv Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
iconvdata Remove --with-default-link configure option 2023-03-27 13:57:55 -03:00
include Remove set-hooks.h from generic includes 2023-03-27 13:57:55 -03:00
inet Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
intl Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
io Replace rawmemchr (s, '\0') with strchr 2023-02-06 16:16:19 +00:00
libio system: Add "--" after "-c" for sh (BZ #28519) 2023-03-28 10:12:30 -03:00
locale Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
localedata localedata: de_DE should not use Fräulein 2023-02-27 16:54:22 +01:00
login Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
mach mach: Use PAGE_SIZE 2023-02-20 00:46:36 +01:00
malloc memalign: Support scanning for aligned chunks. 2023-03-29 16:36:03 -04:00
manual manual: Document __wur usage under _FORTIFY_SOURCE 2023-04-03 10:20:04 -04:00
math math: Improve fmod 2023-04-03 16:36:24 -03:00
mathvec Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
misc Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
nis nis: Fix stringop-truncation warning with -O3 in nis_local_host. 2023-03-02 14:22:54 +01:00
nptl Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
nptl_db Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
nscd Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
nss Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
po Update all PO files in preparation for release. 2023-01-31 17:51:40 -05:00
posix posix: Fix some crashes in wordexp [BZ #18096] 2023-03-28 10:12:12 -03:00
pwd Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
resolv Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
resource Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
rt Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
scripts Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
setjmp Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
shadow Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
signal Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
socket Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
soft-fp Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
stdio-common stdio-common: Fix building when !IS_IN (libc) 2023-04-03 01:01:11 +02:00
stdlib system: Add "--" after "-c" for sh (BZ #28519) 2023-03-28 10:12:30 -03:00
string Fix stringop-overflow warning in test-strncat. 2023-03-02 14:25:34 +01:00
sunrpc Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions 2023-03-27 13:57:55 -03:00
support system: Add "--" after "-c" for sh (BZ #28519) 2023-03-28 10:12:30 -03:00
sysdeps math: Improve fmod 2023-04-03 16:36:24 -03:00
sysvipc Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
termios Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
time time: Fix strftime(3) API regarding nullability 2023-03-31 10:31:14 -03:00
timezone Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
wcsmbs Declare wcstofN, wcstofNx for C2x 2023-03-14 18:11:27 +00:00
wctype Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
.clang-format Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
.gitattributes
.gitignore
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4 configure: Move nm, objdump, and readelf to LIBC_PROG_BINUTILS 2023-01-12 09:05:09 -03:00
config.h.in Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
config.make.in Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
configure Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
configure.ac Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
CONTRIBUTED-BY Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
COPYING
COPYING.LIB
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk Improve gen-locales.mk and gen-locale.sh to make test files with @ options work 2018-02-27 17:01:57 +01:00
INSTALL Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
libc-abis riscv: support GNU indirect function 2021-01-10 21:25:13 -05:00
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES arc4random: simplify design for better safety 2022-07-27 08:58:27 -03:00
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
Makefile Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
Makefile.help Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
Makefile.in
Makerules Remove --with-default-link configure option 2023-03-27 13:57:55 -03:00
NEWS Remove --enable-tunables configure option 2023-03-29 14:33:06 -03:00
o-iterator.mk
README LoongArch: Update NEWS and README for the LoongArch port. 2022-07-26 12:35:12 -03:00
Rules libio: Do not autogenerate stdio_lim.h 2023-03-27 13:57:55 -03:00
SHARED-FILES Mention today's regex merge in SHARED-FILES 2021-09-21 18:00:10 -07:00
shlib-versions nss: Do not mention NSS test modules in <gnu/lib-names.h> 2022-03-11 08:24:04 +01:00
test-skeleton.c Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
version.h Open master branch for glibc 2.38 development 2023-01-31 22:39:21 -05:00

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arc*-*-linux-gnu
	arm-*-linux-gnueabi
	csky-*-linux-gnuabiv2
	hppa-*-linux-gnu
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	loongarch64-*-linux-gnu Hardware floating point, LE only.
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	or1k-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	riscv32-*-linux-gnu
	riscv64-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see https://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at https://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see https://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.