Roger Sayle 0b2c1369d0 PR target/107548: Handle vec_select in STV on x86.
This patch enhances x86's STV pass to handle VEC_SELECT during general
scalar chain conversion, performing SImode scalar extraction from V4SI
and DImode scalar extraction from V2DI in vector registers.

The motivating test case from bugzilla is:

typedef unsigned int v4si __attribute__((vector_size(16)));

unsigned int f (v4si a, v4si b)
{
  a[0] += b[0];
  return a[0] + a[1];
}

currently with -O2 -march=znver2 this generates:

	vpextrd	$1, %xmm0, %edx
	vmovd	%xmm0, %eax
	addl	%edx, %eax
	vmovd	%xmm1, %edx
	addl	%edx, %eax
	ret

which performs three transfers from the vector unit to the scalar unit,
and performs the two additions there.  With this patch, we now generate:

	vmovdqa	%xmm0, %xmm2
	vpshufd	$85, %xmm0, %xmm0
	vpaddd	%xmm0, %xmm2, %xmm0
	vpaddd	%xmm1, %xmm0, %xmm0
	vmovd	%xmm0, %eax
	ret

which performs the two additions in the vector unit, and then transfers
the result to the scalar unit.  Technically the (cheap) movdqa isn't
needed with better register allocation (or this could be cleaned up
during peephole2), but even so this transform is still a win.

2022-12-23  Roger Sayle  <roger@nextmovesoftware.com>

gcc/ChangeLog
	PR target/107548
	* config/i386/i386-features.cc (scalar_chain::add_insn): The
	operands of a VEC_SELECT don't need to added to the scalar chain.
	(general_scalar_chain::compute_convert_gain) <case VEC_SELECT>:
	Provide gains for performing STV on a VEC_SELECT.
	(general_scalar_chain::convert_insn): Convert VEC_SELECT to pshufd,
	psrldq or no-op.
	(general_scalar_to_vector_candidate_p): Handle VEC_SELECT of a
	single element from a vector register to a scalar register.

gcc/testsuite/ChangeLog
	PR target/107548
	* gcc.target/i386/pr107548-1.c: New test V4SI case.
	* gcc.target/i386/pr107548-2.c: New test V2DI case.
2022-12-23 09:58:13 +00:00
2022-11-24 00:17:47 +00:00
2022-11-15 08:32:29 +00:00
2022-12-22 19:44:07 -05:00
2022-11-24 00:17:47 +00:00
2022-09-01 00:17:39 +00:00
2022-08-31 00:16:45 +00:00
2022-12-22 00:17:29 +00:00
2022-11-24 00:17:47 +00:00
2022-08-26 00:16:21 +00:00
2022-11-17 00:16:52 +00:00
2022-12-18 00:16:57 +00:00
2022-11-02 00:17:38 +00:00
2022-11-24 00:17:47 +00:00
2022-12-08 00:17:45 +00:00
2022-11-24 00:17:47 +00:00
2022-12-22 00:17:29 +00:00
2022-12-17 00:17:56 +00:00
2022-12-16 00:17:46 +00:00
2022-12-22 00:17:29 +00:00
2022-11-24 00:17:47 +00:00
2022-12-12 00:22:21 +00:00
2022-12-20 00:17:00 +00:00
2022-10-13 00:17:37 +00:00
2022-11-24 00:17:47 +00:00
2022-12-01 00:17:51 +00:00
2022-11-24 00:17:47 +00:00
2022-07-19 17:07:04 +03:00
2022-12-22 00:17:29 +00:00
2022-12-09 11:08:55 +01:00

This directory contains the GNU Compiler Collection (GCC).

The GNU Compiler Collection is free software.  See the files whose
names start with COPYING for copying permission.  The manuals, and
some of the runtime libraries, are under different terms; see the
individual source files for details.

The directory INSTALL contains copies of the installation information
as HTML and plain text.  The source of this information is
gcc/doc/install.texi.  The installation information includes details
of what is included in the GCC sources and what files GCC installs.

See the file gcc/doc/gcc.texi (together with other files that it
includes) for usage and porting information.  An online readable
version of the manual is in the files gcc/doc/gcc.info*.

See http://gcc.gnu.org/bugs/ for how to report bugs usefully.

Copyright years on GCC source files may be listed using range
notation, e.g., 1987-2012, indicating that every year in the range,
inclusive, is a copyrightable year that could otherwise be listed
individually.
Description
No description provided
Readme 2.1 GiB
Languages
C++ 31.9%
C 31.3%
Ada 12%
D 6.5%
Go 6.4%
Other 11.5%