glibc/sysdeps/aarch64
Wilco Dijkstra 4a733bf375 AArch64: Add optimized Q-register memcpy
Add a new memcpy using 128-bit Q registers - this is faster on modern
cores and reduces codesize.  Similar to the generic memcpy, small cases
include copies up to 32 bytes.  64-128 byte copies are split into two
cases to improve performance of 64-96 byte copies.  Large copies align
the source rather than the destination.

bench-memcpy-random is ~9% faster than memcpy_falkor on Neoverse N1,
so make this memcpy the default on N1 (on Centriq it is 15% faster than
memcpy_falkor).

Passes GLIBC regression tests.

Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>
2020-07-15 16:55:07 +01:00
..
bits
fpu aarch64: Remove fpu Makefile 2020-06-22 11:09:50 -03:00
multiarch AArch64: Add optimized Q-register memcpy 2020-07-15 16:55:07 +01:00
nptl
sys
__longjmp.S
abort-instr.h
atomic-machine.h
bsd-_setjmp.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
bsd-setjmp.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
configure aarch64: configure check for pac-ret code generation 2020-07-08 15:02:38 +01:00
configure.ac aarch64: configure check for pac-ret code generation 2020-07-08 15:02:38 +01:00
crti.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
crtn.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-bti.c aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
dl-dtprocnum.h
dl-irel.h
dl-link.sym
dl-machine.h aarch64: fix RTLD_START for BTI 2020-07-08 15:02:37 +01:00
dl-prop.h aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
dl-sysdep.h
dl-tls.h
dl-tlsdesc.h
dl-tlsdesc.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-trampoline.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-tunables.list
e_sqrtl.c
elf-initfini.h
Implies
jmpbuf-offsets.h
jmpbuf-unwind.h
ldsodefs.h
libc-tls.c
libm-test-ulps Update aarch64 libm-test-ulps 2020-04-08 13:52:44 -03:00
libm-test-ulps-name
linkmap.h aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
machine-gmon.h aarch64: fix pac-ret support in _mcount 2020-07-08 15:02:38 +01:00
Makefile aarch64: ensure objects are BTI compatible 2020-07-08 15:02:38 +01:00
math-tests-trap.h
mcount.c
memchr.S aarch64: MTE compatible memchr 2020-06-23 17:55:39 +01:00
memcmp.S
memcpy.S
memmove.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
memrchr.S aarch64: MTE compatible memrchr 2020-06-23 17:55:39 +01:00
memset-reg.h
memset.S
memusage.h
preconfigure
rawmemchr.S
setjmp.S
sfp-machine.h
sotruss-lib.c
stackinfo.h
start.S aarch64: Add BTI support to assembly files 2020-07-08 15:02:37 +01:00
stpcpy.S
strchr.S aarch64: MTE compatible strchr 2020-06-09 09:20:27 +01:00
strchrnul.S aarch64: MTE compatible strchrnul 2020-06-09 09:20:27 +01:00
strcmp.S aarch64: MTE compatible strcmp 2020-06-23 17:55:39 +01:00
strcpy.S aarch64: MTE compatible strcpy 2020-06-23 17:55:39 +01:00
string_private.h
strlen.S aarch64: MTE compatible strlen 2020-06-09 09:21:11 +01:00
strncmp.S aarch64: MTE compatible strncmp 2020-06-23 17:55:39 +01:00
strnlen.S aarch64: fix strcpy and strnlen for big-endian [BZ #25824] 2020-05-15 12:15:56 +01:00
strrchr.S aarch64: MTE compatible strrchr 2020-06-23 17:55:39 +01:00
sysdep.h AArch64: Align ENTRY to a cacheline 2020-07-15 16:50:02 +01:00
tls-macros.h
tlsdesc.c
tlsdesc.sym
tst-audit.h
tst-ifunc-arg-1.c
tst-ifunc-arg-2.c
Versions