glibc/sysdeps
H.J. Lu cf2c57526b x86: Set rep_movsb_threshold to 2112 on processors with FSRM
The glibc memcpy benchmark on Intel Core i7-1065G7 (Ice Lake) showed
that REP MOVSB became faster after 2112 bytes:

                                      Vector Move       REP MOVSB
length=2112, align1=0, align2=0:        24.20             24.40
length=2112, align1=1, align2=0:        26.07             23.13
length=2112, align1=0, align2=1:        27.18             28.13
length=2112, align1=1, align2=1:        26.23             25.16
length=2176, align1=0, align2=0:        23.18             22.52
length=2176, align1=2, align2=0:        25.45             22.52
length=2176, align1=0, align2=2:        27.14             27.82
length=2176, align1=2, align2=2:        22.73             25.56
length=2240, align1=0, align2=0:        24.62             24.25
length=2240, align1=3, align2=0:        29.77             27.15
length=2240, align1=0, align2=3:        35.55             29.93
length=2240, align1=3, align2=3:        34.49             25.15
length=2304, align1=0, align2=0:        34.75             26.64
length=2304, align1=4, align2=0:        32.09             22.63
length=2304, align1=0, align2=4:        28.43             31.24

Use REP MOVSB for data size > 2112 bytes in memcpy on processors with
fast short REP MOVSB (FSRM).

	* sysdeps/x86/dl-cacheinfo.h (dl_init_cacheinfo): Set
	rep_movsb_threshold to 2112 on processors with fast short REP
	MOVSB (FSRM).
2021-05-03 05:08:22 -07:00
..
aarch64
alpha
arc
arm
csky
generic
gnu
hppa
htl
hurd
i386
ia64
ieee754
m68k
mach
microblaze
mips
nios2
nptl nptl: Move pthread_yield into libc, as a compatibility symbol 2021-05-03 09:23:44 +02:00
posix
powerpc
pthread nptl: Move tss_set into libc 2021-05-03 08:18:08 +02:00
riscv
s390
sh
sparc
unix nptl: Move pthread_yield into libc, as a compatibility symbol 2021-05-03 09:23:44 +02:00
wordsize-32
wordsize-64
x86 x86: Set rep_movsb_threshold to 2112 on processors with FSRM 2021-05-03 05:08:22 -07:00
x86_64