glibc/sysdeps/aarch64
Siddhesh Poyarekar 30a81dae5b aarch64: Optimized memcmp for medium to large sizes
This improved memcmp provides a fast path for compares of up to 16 bytes
and then compares 16 bytes at a time, thus optimizing loads from both
sources.  In the glibc memcmp microbenchmark, performance is unchanged
(within an error of ~1ns) for smaller compare sizes, and execution time
drops by up to 31% for compares up to 4K on the APM Mustang.  On Qualcomm
Falkor the reduction reaches almost 48%, i.e. almost a 2x improvement
for sizes of 2K and above.

	* sysdeps/aarch64/memcmp.S: Widen comparison to 16 bytes at a
	time.
2018-03-06 19:22:40 +05:30
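
The approach described in the commit message (a fast path for short compares, then 16 bytes per iteration) can be illustrated in portable C. This is only a minimal sketch under stated assumptions, not the assembly in sysdeps/aarch64/memcmp.S: the names sketch_memcmp and bytewise_cmp are hypothetical, the 16-byte step is modeled as two 8-byte loads per source, and a mismatching chunk is resolved with a simple byte loop, which may differ from what the real routine does.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: byte-wise compare.  Used for short inputs, for the
   tail, and to locate the first differing byte in a mismatching chunk.  */
static int
bytewise_cmp (const unsigned char *a, const unsigned char *b, size_t n)
{
  for (size_t i = 0; i < n; i++)
    if (a[i] != b[i])
      return a[i] < b[i] ? -1 : 1;
  return 0;
}

/* Hypothetical sketch of the technique, not the glibc routine.  */
int
sketch_memcmp (const void *s1, const void *s2, size_t n)
{
  const unsigned char *a = s1;
  const unsigned char *b = s2;

  /* Fast path: compares of up to 16 bytes.  */
  if (n <= 16)
    return bytewise_cmp (a, b, n);

  /* Main loop: 16 bytes per iteration, read as two 8-byte words from
     each source.  memcpy keeps the loads safe for unaligned pointers.  */
  while (n >= 16)
    {
      uint64_t a0, a1, b0, b1;
      memcpy (&a0, a, 8);
      memcpy (&a1, a + 8, 8);
      memcpy (&b0, b, 8);
      memcpy (&b1, b + 8, 8);

      /* Equality of whole words is independent of byte order; only the
         ordering result needs a byte-wise look at the chunk.  */
      if (a0 != b0 || a1 != b1)
        return bytewise_cmp (a, b, 16);

      a += 16;
      b += 16;
      n -= 16;
    }

  /* Tail: fewer than 16 bytes remain.  */
  return bytewise_cmp (a, b, n);
}
```

The sketch returns a negative, zero, or positive value like memcmp, but it says nothing about the performance figures quoted above, which apply to the hand-written AArch64 implementation.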
bits
fpu [AArch64] Fix include. 2018-02-15 12:41:06 +00:00
multiarch IFUNC for Cavium ThunderX2 2018-02-22 08:38:47 -08:00
nptl
soft-fp
__longjmp.S
abort-instr.h
atomic-machine.h
backtrace.c
bsd-_setjmp.S
bsd-setjmp.S
configure
configure.ac
crti.S
crtn.S
dl-irel.h
dl-link.sym
dl-machine.h
dl-sysdep.h
dl-tls.h
dl-tlsdesc.h
dl-tlsdesc.S
dl-trampoline.S
dl-tunables.list
Implies
jmpbuf-offsets.h
jmpbuf-unwind.h
ldsodefs.h
libc-tls.c
libm-test-ulps Remove slow paths from pow 2018-02-12 10:47:09 +00:00
libm-test-ulps-name
linkmap.h
machine-gmon.h
Makefile
math-tests.h
mcount.c
memchr.S
memcmp.S aarch64: Optimized memcmp for medium to large sizes 2018-03-06 19:22:40 +05:30
memcpy.S
memmove.S
memset-reg.h
memset.S
memusage.h
preconfigure
rawmemchr.S
setjmp.S
sotruss-lib.c
stackinfo.h
start.S
stpcpy.S
strchr.S
strchrnul.S
strcmp.S aarch64/strcmp: fix misaligned loop jump target 2018-02-22 23:48:14 +05:30
strcpy.S
string_private.h
strlen.S
strncmp.S
strnlen.S
strrchr.S
sysdep.h
tls-macros.h
tlsdesc.c
tlsdesc.sym
tst-audit.h
Versions