glibc/sysdeps/powerpc
Adhemerval Zanella 71ae86478e PowerPC: memset optimization for POWER8/PPC64
This patch adds an optimized memset implementation for POWER8.  For
sizes from 0 to 255 bytes, a word/doubleword algorithm similar to
POWER7 optimized one is used.

For size higher than 255 two strategies are used:

1. If the constant is different than 0, the memory is written with
   altivec vector instruction;

2. If constant is 0, dbcz instructions are used.  The loop is unrolled
   to clear 512 byte at time.

Using vector instructions increases throughput considerable, with a
double performance for sizes larger than 1024.  The dcbz loops unrolls
also shows performance improvement, by doubling throughput for sizes
larger than 8192 bytes.
2014-09-10 07:39:46 -04:00
..
bits PowerPC: Guard CALL_ELF check for ppc64 only in link.h 2014-07-07 14:46:22 -05:00
fpu Update powerpc-fpu ULPs. 2014-06-30 17:38:43 -04:00
nofpu Fix powerpc-nofpu __fe_enabled_env and __fe_nonieee_env (bug 17261). 2014-08-12 20:31:54 +00:00
nptl
power4
power5+/fpu
power6
power7/fpu
powerpc32 Define __GI_fegetenv for e500 libm 2014-09-02 21:39:04 +00:00
powerpc64 PowerPC: memset optimization for POWER8/PPC64 2014-09-10 07:39:46 -04:00
soft-fp
sys/platform
abort-instr.h
configure
configure.ac
dl-procinfo.c
dl-procinfo.h
dl-tls.h
ffs.c
fpu_control.h
gccframe.h
ifunc-sel.h
Implies
jmpbuf-offsets.h
jmpbuf-unwind.h
ldsodefs.h
locale-defines.sym
longjmp.c
machine-gmon.h
Makefile
math-tests.h
memusage.h
mp_clz_tab.c
novmx-longjmp.c
novmx-sigjmp.c
novmxsetjmp.h
preconfigure
rtld-global-offsets.sym
sched_cpucount.c
sigjmp.c
sotruss-lib.c
stackinfo.h
strcat.c PowerPC: strcat optimization for PPC64/POWER7 2014-07-02 14:04:21 -05:00
sysdep.h
test-arith.c
test-arithf.c
test-gettimebase.c Fix missing newline in test output 2014-07-09 11:07:24 +02:00
tls-macros.h
tst-stack-align.h
Versions