Go to file
H.J. Lu dc485ceb2a x86-64: Optimize strlen/strnlen/wcslen/wcsnlen with AVX2
Optimize strlen/strnlen/wcslen/wcsnlen with AVX2 to check 32 bytes with
a single vector compare instruction.  It is as fast as SSE2 versions for
size <= 16 bytes and up to 1X faster for or size > 16 bytes on Haswell.
Select AVX2 version on AVX2 machines where vzeroupper is preferred and
AVX unaligned load is fast.

NB: It uses TZCNT instead of BSF since TZCNT produces the same result
as BSF for non-zero input.  TZCNT is faster than BSF and is executed
as BSF if machine doesn't support TZCNT.

	* sysdeps/x86_64/multiarch/Makefile (sysdep_routines): Add
	strlen-sse2, strnlen-sse2, strlen-avx2, strnlen-avx2,
	wcslen-sse2, wcslen-avx2 and wcsnlen-avx2.
	* sysdeps/x86_64/multiarch/ifunc-impl-list.c
	(__libc_ifunc_impl_list): Add tests for __strlen_avx2,
	__strlen_sse2, __strnlen_avx2, __strnlen_sse2, __wcslen_avx2,
	__wcslen_sse2 and __wcsnlen_avx2.
	* sysdeps/x86_64/multiarch/strlen-avx2.S: New file.
	* sysdeps/x86_64/multiarch/strlen-sse2.S: Likewise.
	* sysdeps/x86_64/multiarch/strlen.c: Likewise.
	* sysdeps/x86_64/multiarch/strnlen-avx2.S: Likewise.
	* sysdeps/x86_64/multiarch/strnlen-sse2.S: Likewise.
	* sysdeps/x86_64/multiarch/strnlen.c: Likewise.
	* sysdeps/x86_64/multiarch/wcslen-avx2.S: Likewise.
	* sysdeps/x86_64/multiarch/wcslen-sse2.S: Likewise.
	* sysdeps/x86_64/multiarch/wcslen.c: Likewise.
	* sysdeps/x86_64/multiarch/wcsnlen-avx2.S: Likewise.
	* sysdeps/x86_64/multiarch/wcsnlen.c (OPTIMIZE (avx2)): New.
	(IFUNC_SELECTOR): Return OPTIMIZE (avx2) on AVX2 machines where
	vzeroupper is preferred and AVX unaligned load is fast.
2017-06-09 05:18:18 -07:00
argp Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
assert Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
benchtests benchtests: Add more tests for memrchr 2017-06-04 09:45:09 -07:00
bits Define SIG_HOLD for XPG4 (bug 21538). 2017-06-05 10:19:03 +00:00
catgets Update copyright dates not handled by scripts/update-copyrights. 2017-01-01 00:26:24 +00:00
conform conformtest: Correct sys/wait.h expectations for XPG4. 2017-06-08 22:34:58 +00:00
crypt Add missing header files throughout the testsuite. 2017-02-16 17:33:18 -05:00
csu Delay initialization of CPU features struct in static binaries 2017-05-31 06:38:33 +05:30
ctype Remove C++ namespace handling from glibc headers. 2017-03-16 13:31:57 +00:00
debug Fix struct sigaltstack namespace (bug 21517). 2017-06-05 10:17:46 +00:00
dirent support: Prevent multiple deletion of temporary files 2017-05-08 16:20:40 +02:00
dlfcn Miscellaneous low-risk changes preparing for _ISOMAC testsuite. 2017-03-01 20:32:50 -05:00
elf ld.so: Consolidate 2 strtouls into _dl_strtoul [BZ #21528] 2017-06-08 12:52:42 -07:00
gmon Assume that O_NOFOLLOW is always defined 2017-04-13 21:28:18 +02:00
gnulib Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
grp Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
gshadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
hesiod Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
hurd Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
iconv Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
iconvdata Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
include Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
inet Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
intl Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
io Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
libidn Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
libio Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
locale Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
localedata Bug 20686: Add el_GR@euro support. 2017-05-03 15:37:04 -04:00
login Remove check for NULL buffer passed to `ptsname_r' 2017-06-07 17:37:59 +02:00
mach Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
malloc malloc: Remove tst-dynarray, tst-dynarray-fail from test-srcs 2017-06-09 14:08:57 +02:00
manual tunables: Add LD_HWCAP_MASK to tunables 2017-06-07 11:11:37 +05:30
math float128: Add wrappers to override ldbl-128 as float128. 2017-05-25 09:01:37 -03:00
mathvec Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
misc Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
nis Include shlib-compat.h in many sunrpc/nis source files. 2017-06-04 11:31:28 -04:00
nptl Optimize generic spinlock code and use C11 like atomic macros. 2017-06-06 09:41:56 +02:00
nptl_db Narrowing the visibility of libc-internal.h even further. 2017-03-01 20:33:46 -05:00
nscd Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
nss Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
po Add target to incorporate translations from translations.org 2017-01-20 12:32:46 +05:30
posix Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
pwd Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
resolv Include shlib-compat.h in many sunrpc/nis source files. 2017-06-04 11:31:28 -04:00
resource Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
rt Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
scripts tunables: Add LD_HWCAP_MASK to tunables 2017-06-07 11:11:37 +05:30
setjmp Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
shadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
signal Fix struct sigaltstack namespace (bug 21517). 2017-06-05 10:17:46 +00:00
socket Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
soft-fp Narrowing the visibility of libc-internal.h even further. 2017-03-01 20:33:46 -05:00
stdio-common Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
stdlib float128: Add strfromf128 2017-06-07 17:08:21 -03:00
streams Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
string Add more tests for memchr 2017-06-08 09:56:01 -07:00
sunrpc Include shlib-compat.h in many sunrpc/nis source files. 2017-06-04 11:31:28 -04:00
support support: Expose TEST_VERIFY_EXIT behavior to GCC optimizers 2017-06-09 14:08:13 +02:00
sysdeps x86-64: Optimize strlen/strnlen/wcslen/wcsnlen with AVX2 2017-06-09 05:18:18 -07:00
sysvipc Fix test-sysvsem on some platforms 2017-01-02 18:53:50 -02:00
termios Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
time Remove C++ namespace handling from glibc headers. 2017-03-16 13:31:57 +00:00
timezone Fix tst-timezone race (bug 14096). 2017-06-07 17:14:28 +00:00
wcsmbs Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
wctype Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
.gitattributes
.gitignore
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4
BUGS
ChangeLog x86-64: Optimize strlen/strnlen/wcslen/wcsnlen with AVX2 2017-06-09 05:18:18 -07:00
ChangeLog.1
ChangeLog.2
ChangeLog.3
ChangeLog.4
ChangeLog.5
ChangeLog.6
ChangeLog.7
ChangeLog.8
ChangeLog.9
ChangeLog.10
ChangeLog.11
ChangeLog.12
ChangeLog.13
ChangeLog.14
ChangeLog.15
ChangeLog.16
ChangeLog.17
ChangeLog.old-ports
ChangeLog.old-ports-aarch64
ChangeLog.old-ports-aix
ChangeLog.old-ports-alpha ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-am33
ChangeLog.old-ports-arm
ChangeLog.old-ports-cris
ChangeLog.old-ports-hppa ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-ia64
ChangeLog.old-ports-linux-generic
ChangeLog.old-ports-m68k
ChangeLog.old-ports-microblaze
ChangeLog.old-ports-mips ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-powerpc
ChangeLog.old-ports-tile
config.h.in Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
config.make.in Deprecate libnsl by default (only shared library will be 2017-03-21 15:14:27 +01:00
configure Deprecate libnsl by default (only shared library will be 2017-03-21 15:14:27 +01:00
configure.ac Deprecate libnsl by default (only shared library will be 2017-03-21 15:14:27 +01:00
CONFORMANCE
COPYING
COPYING.LIB
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk
INSTALL Assume that accept4 is always available and works 2017-04-19 07:44:48 +02:00
libc-abis
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig Support dl-tunables.list in subdirectories 2017-05-25 05:41:18 -07:00
Makefile Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
Makefile.in
Makerules Also create and use ldbl-compat-choose.h. 2017-05-19 11:30:26 +00:00
NAMESPACE
NEWS Optimize generic spinlock code and use C11 like atomic macros. 2017-06-06 09:41:56 +02:00
o-iterator.mk
README Require Linux kernel 3.2 or later on x86 / x86_64. 2017-05-08 10:45:20 +00:00
README.pretty-printers Fix mutex pretty printer test and pretty printer output. 2017-01-20 14:56:39 +01:00
README.tunables tunables: Clean up hooks to get and set tunables 2017-06-07 11:11:36 +05:30
Rules Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
shlib-versions
test-skeleton.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
version.h Open master for development 2017-02-05 21:27:52 +05:30
WUR-REPORT

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.  The current
GNU/Hurd support requires out-of-tree patches that will eventually be
incorporated into an official GNU C Library release.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arm-*-linux-gnueabi
	hppa-*-linux-gnu	Not currently functional without patches.
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu
	tilegx-*-linux-gnu
	tilepro-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see http://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at http://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see http://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.