Go to file
Naohiro Tamura 4f26956d5b aarch64: Added optimized memset for A64FX
This patch optimizes the performance of memset for A64FX [1] which
implements ARMv8-A SVE and has L1 64KB cache per core and L2 8MB cache
per NUMA node.

The performance optimization makes use of Scalable Vector Register
with several techniques such as loop unrolling, memory access
alignment, cache zero fill and prefetch.

SVE assembler code for memset is implemented as Vector Length Agnostic
code so theoretically it can be run on any SOC which supports ARMv8-A
SVE standard.

We confirmed that all testcases have been passed by running 'make
check' and 'make xcheck' not only on A64FX but also on ThunderX2.

And also we confirmed that the SVE 512 bit vector register performance
is roughly 4 times better than Advanced SIMD 128 bit register and 8
times better than scalar 64 bit register by running 'make bench'.

[1] https://github.com/fujitsu/A64FX

Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>
Reviewed-by: Szabolcs Nagy <Szabolcs.Nagy@arm.com>
2021-05-27 09:47:53 +01:00
argp
assert
benchtests benchtests: Fixed bench-memcpy-random: buf1: mprotect failed 2021-05-26 12:01:06 +01:00
bits
catgets
ChangeLog.old
conform
crypt
csu elf: Introduce __tls_pre_init_tp 2021-05-10 10:31:41 +02:00
ctype
debug Remove all usage of @BASH@ or ${BASH} in installed files, and hardcode /bin/bash instead 2021-05-12 07:47:11 +05:30
dirent
dlfcn elf: Partially initialize ld.so after static dlopen (bug 20802) 2021-05-17 10:06:57 +02:00
elf elf: Use custom NODELETE DSO for tst-dlopenfail, tst-dlopenfail-2 2021-05-21 22:35:00 +02:00
gmon
gnulib
grp
gshadow
hesiod
htl htl: Add __libpthread_freeres 2021-05-18 17:46:12 +00:00
hurd hurd: Export _hurd_libc_proc_init 2021-04-12 00:23:36 +02:00
iconv charmap_conversion: Free conversion table on exit 2021-05-18 09:25:40 +05:30
iconvdata localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
include Add cast_to_pointer to cast an integer to void * pointer 2021-05-22 05:09:15 -07:00
inet inet: Free result from getaddrinfo 2021-05-13 08:05:19 +05:30
intl
io Annotate additional APIs with GCC attribute access. 2021-05-06 11:01:05 -06:00
libio Enable support for GCC 11 -Wmismatched-dealloc. 2021-05-16 15:21:18 -06:00
locale show_archive_content: Fix trivial memory leak 2021-05-18 09:07:06 +05:30
localedata localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
login
mach
malloc tst-mallinfo2.c: Use correct multiple for total variable 2021-05-25 16:47:01 -04:00
manual aarch64: Added optimized memcpy and memmove for A64FX 2021-05-27 09:47:53 +01:00
math
mathvec
misc Enable support for GCC 11 -Wmismatched-dealloc. 2021-05-16 15:21:18 -06:00
nis
nptl Linux: Remove remaining references to $(shared-thread-library) 2021-05-25 11:30:23 +02:00
nptl_db nptl: Move pthread_create, thrd_create into libc 2021-05-21 22:35:00 +02:00
nscd
nss Use a #pragma to suppress a bogus GCC 10 warning instead of an assert [BZ 27832]. 2021-05-10 14:30:09 -06:00
po
posix Fix stringop-overflow warning in bug-regex19.c. 2021-05-18 10:07:30 +02:00
pwd Annotate additional APIs with GCC attribute access. 2021-05-06 11:01:05 -06:00
resolv
resource
rt nptl: Consolidate async cancel enable/disable implementation in libc 2021-05-05 17:19:32 +02:00
scripts scripts/versions.awk: Add strings and hashes to <first-versions.h> 2021-05-10 10:31:41 +02:00
setjmp nptl: Move __pthread_unwind_next into libc 2021-04-21 19:49:50 +02:00
shadow
signal
socket
soft-fp
stdio-common linux: Move funlockfile/_IO_funlockfile into libc 2021-05-10 23:35:44 -03:00
stdlib Enable support for GCC 11 -Wmismatched-dealloc. 2021-05-16 15:21:18 -06:00
string x86: Expand bench-memcmp.c and test-memcmp.c 2021-05-18 22:57:39 -04:00
sunrpc linux: Normalize and return timeout on select (BZ #27651) 2021-04-12 18:38:37 -03:00
support support: Free gdb_script_name 2021-05-13 08:07:23 +05:30
sysdeps aarch64: Added optimized memset for A64FX 2021-05-27 09:47:53 +01:00
sysvipc
termios
time Do not declare asctime_r and ctime_r for C2X 2021-05-18 19:47:49 +00:00
timezone Remove all usage of @BASH@ or ${BASH} in installed files, and hardcode /bin/bash instead 2021-05-12 07:47:11 +05:30
wcsmbs Enable support for GCC 11 -Wmismatched-dealloc. 2021-05-16 15:21:18 -06:00
wctype
.gitattributes
.gitignore
abi-tags
aclocal.m4
config.h.in config: Added HAVE_AARCH64_SVE_ASM for aarch64 2021-05-26 12:01:06 +01:00
config.make.in Add pthread-in-libc, libpthread-routines-var, librt-routines-var 2021-05-03 08:13:32 +02:00
configure Remove --enable-stackguard-randomization (BZ #27872) 2021-05-19 10:22:19 -03:00
configure.ac Remove --enable-stackguard-randomization (BZ #27872) 2021-05-19 10:22:19 -03:00
COPYING
COPYING.LIB
extra-lib.mk
gen-locales.mk
INSTALL aarch64: Added Vector Length Set test helper script 2021-05-26 12:01:06 +01:00
libc-abis
libof-iterator.mk
LICENSES
MAINTAINERS
Makeconfig Add pthread-in-libc, libpthread-routines-var, librt-routines-var 2021-05-03 08:13:32 +02:00
Makefile testrun.sh: Improve --help message 2021-05-25 10:14:19 +05:30
Makefile.help
Makefile.in
Makerules
NEWS Add C2X timespec_getres 2021-05-17 20:55:21 +00:00
o-iterator.mk
README
Rules
shlib-versions
test-skeleton.c
version.h

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arc*-*-linux-gnu
	arm-*-linux-gnueabi
	csky-*-linux-gnuabiv2
	hppa-*-linux-gnu
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	riscv32-*-linux-gnu
	riscv64-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see https://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at https://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see https://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.