binutils-gdb/ld
Nick Alcock 211bcd0133 bfd, ld, libctf: skip zero-refcount strings in CTF string reporting
This is a tricky one.  BFD, on the linker's behalf, reports symbols to
libctf via the ctf_new_symbol and ctf_new_dynsym callbacks, which
ultimately call ctf_link_add_linker_symbol.  But while this happens
after strtab offsets are finalized, it happens before the .dynstr is
actually laid out, so we can't iterate over it at this stage and
it is not clear what the reported symbols are actually called.  So
a second callback, examine_strtab, is called after the .dynstr is
finalized, which calls ctf_link_add_strtab and ultimately leads
to ldelf_ctf_strtab_iter_cb being called back repeatedly until the
offsets of every string in the .dynstr is passed to libctf.

libctf can then use this to get symbol names out of the input (which
usually stores symbol types in the form of a name -> type mapping at
this stage) and extract the types of those symbols, feeding them back
into their final form as a 1:1 association with the real symtab's
STT_OBJ and STT_FUNC symbols (with a few skipped, see
ctf_symtab_skippable).

This representation is compact, but has one problem: if libctf somehow
gets confused about the st_type of a symbol, it'll stick an entry into
the function symtypetab when it should put it into the object
symtypetab, or vice versa, and *every symbol from that one on* will have
the wrong CTF type because it's actually looking up the type for a
different symbol.

And we have just such a bug.  ctf_link_add_strtab was not taking the
refcounts of strings into consideration, so even strings that had been
eliminated from the strtab by virtue of being in objects eliminated via
--as-needed etc were being reported.  This is harmful because it can
lead to multiple strings with the same apparent offset, and if the last
duplicate to be reported relates to an eliminated symbol, we look up the
wrong symbol from the input and gets its type wrong: if it's unlucky and
the eliminated symbol is also of the wrong st_type, we will end up with
a corrupted symtypetab.

Thankfully the wrong-st_type case is already diagnosed by a
this-can-never-happen paranoid warning:

  CTF warning: Symbol 61a added to CTF as a function but is of type 1

or the converse

 * CTF warning: Symbol a3 added to CTF as a data object but is of type 2

so at least we can tell when the corruption has spread to more than one
symbol's type.

Skipping zero-refcounted strings is easy: teach _bfd_elf_strtab_str to
skip them, and ldelf_ctf_strtab_iter_cb to loop over skipped strings
until it falls off the end or finds one that isn't skipped.

bfd/ChangeLog
2021-03-02  Nick Alcock  <nick.alcock@oracle.com>

	* elf-strtab.c (_bfd_elf_strtab_str): Skip strings with zero refcount.

ld/ChangeLog
2021-03-02  Nick Alcock  <nick.alcock@oracle.com>

	* ldelfgen.c (ldelf_ctf_strtab_iter_cb): Skip zero-refcount strings.

libctf/ChangeLog
2021-03-02  Nick Alcock  <nick.alcock@oracle.com>

	* ctf-create.c (symtypetab_density): Report the symbol name as
	well as index in the name != object error; note the likely
	consequences.
	* ctf-link.c (ctf_link_shuffle_syms): Report the symbol index
	as well as name.
2021-03-02 15:10:10 +00:00
..
emulparams Remove arm-symbianelf 2021-02-09 23:36:16 +10:30
emultempl PR27451, -z start_stop_gc 2021-03-01 17:28:03 +10:30
po Remove arm-symbianelf 2021-02-09 23:36:16 +10:30
scripttempl Add DWARF-5 section names to PE and PEP linker scripts. 2021-03-01 16:25:06 +00:00
testsuite PR27451, -z start_stop_gc for powerpc64 2021-03-02 21:49:56 +10:30
.gitignore
aclocal.m4 Implement a workaround for GNU mak jobserver 2021-01-12 05:45:44 -08:00
ChangeLog bfd, ld, libctf: skip zero-refcount strings in CTF string reporting 2021-03-02 15:10:10 +00:00
ChangeLog-0001
ChangeLog-0203
ChangeLog-2004
ChangeLog-2005
ChangeLog-2006
ChangeLog-2007
ChangeLog-2008
ChangeLog-2009
ChangeLog-2010
ChangeLog-2011
ChangeLog-2012
ChangeLog-2013
ChangeLog-2014
ChangeLog-2015
ChangeLog-2016
ChangeLog-2017
ChangeLog-2018
ChangeLog-2019 ChangeLog rotation 2020-01-01 18:12:08 +10:30
ChangeLog-2020 ChangeLog rotation 2021-01-01 10:31:02 +10:30
ChangeLog-9197
ChangeLog-9899 PR27116, Spelling errors found by Debian style checker 2021-01-01 14:36:35 +10:30
config.in Add a new option to the linker: --error-handling-script=<NAME>. Run the script <NAME> if an undefined symbol or unfound library error is encountered. 2020-10-16 11:37:26 +01:00
configure Implement a workaround for GNU mak jobserver 2021-01-12 05:45:44 -08:00
configure.ac Implement a workaround for GNU mak jobserver 2021-01-12 05:45:44 -08:00
configure.host Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
configure.tgt Remove arm-symbianelf 2021-02-09 23:36:16 +10:30
deffile.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
deffilep.y Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
dep-in.sed
elf-hints-local.h
fdl.texi
gen-doc.texi Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
genscrba.sh
genscripts.sh Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
h8-doc.texi Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ld.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ld.texi PR27451, -z start_stop_gc 2021-03-01 17:28:03 +10:30
ldbuildid.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldbuildid.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldcref.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldctor.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldctor.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldelf.c PR27259, SHF_LINK_ORDER self-link 2021-01-28 18:53:30 +10:30
ldelf.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldelfgen.c bfd, ld, libctf: skip zero-refcount strings in CTF string reporting 2021-03-02 15:10:10 +00:00
ldelfgen.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldemul.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldemul.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldexp.c Warn when a script redefines a symbol 2021-02-21 14:28:16 +10:30
ldexp.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldfile.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldfile.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldgram.y ld --defsym 2021-02-02 01:27:12 +10:30
ldint.texi Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldlang.c Weak references to __start_/__stop_ symbols 2021-03-01 14:26:39 +10:30
ldlang.h SHF_LINK_ORDER fixup_link_order in ld 2021-01-13 22:06:02 +10:30
ldlex-wrapper.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldlex.h ld --defsym 2021-02-02 01:27:12 +10:30
ldlex.l ld --defsym 2021-02-02 01:27:12 +10:30
ldmain.c PR27451, -z start_stop_gc 2021-03-01 17:28:03 +10:30
ldmain.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldmisc.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldmisc.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldver.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldver.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldwrite.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
ldwrite.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
lexsup.c PR27451, -z start_stop_gc 2021-03-01 17:28:03 +10:30
libdep_plugin.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
MAINTAINERS Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
Makefile.am Remove arm-symbianelf 2021-02-09 23:36:16 +10:30
Makefile.in Remove arm-symbianelf 2021-02-09 23:36:16 +10:30
mri.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
mri.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
NEWS PR27451, -z start_stop_gc 2021-03-01 17:28:03 +10:30
pe-dll.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
pe-dll.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
pep-dll.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
pep-dll.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
plugin.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
plugin.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
README Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
stamp-h.in
sysdep.h Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
testplug2.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
testplug3.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
testplug4.c Update year range in copyright notice of binutils files 2021-01-01 10:31:05 +10:30
testplug.c ld: remove stray debug fprintf 2021-02-18 10:36:39 +00:00
TODO

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

		README for LD

This is the GNU linker.  It is distributed with other "binary
utilities" which should be in ../binutils.  See ../binutils/README for
more general notes, including where to send bug reports.

There are many features of the linker:

* The linker uses a Binary File Descriptor library (../bfd)
  that it uses to read and write object files.  This helps
  insulate the linker itself from the format of object files.

* The linker supports a number of different object file
  formats.  It can even handle multiple formats at once:
  Read two input formats and write a third.

* The linker can be configured for cross-linking.

* The linker supports a control language.

* There is a user manual (ld.texi), as well as the
  beginnings of an internals manual (ldint.texi).

Installation
============

See ../binutils/README.

If you want to make a cross-linker, you may want to specify
a different search path of -lfoo libraries than the default.
You can do this by setting the LIB_PATH variable in ./Makefile
or using the --with-lib-path configure switch.

To build just the linker, make the target all-ld from the top level
directory (one directory above this one).

Porting to a new target
=======================

See the ldint.texi manual.

Reporting bugs etc
===========================

See ../binutils/README.

Known problems
==============

The Solaris linker normally exports all dynamic symbols from an
executable.  The GNU linker does not do this by default.  This is
because the GNU linker tries to present the same interface for all
similar targets (in this case, all native ELF targets).  This does not
matter for normal programs, but it can make a difference for programs
which try to dlopen an executable, such as PERL or Tcl.  You can make
the GNU linker export all dynamic symbols with the -E or
--export-dynamic command line option.

HP/UX 9.01 has a shell bug that causes the linker scripts to be
generated incorrectly.  The symptom of this appears to be "fatal error
- scanner input buffer overflow" error messages.  There are various
workarounds to this:
  * Build and install bash, and build with "make SHELL=bash".
  * Update to a version of HP/UX with a working shell (e.g., 9.05).
  * Replace "(. ${srcdir}/scripttempl/${SCRIPT_NAME}.sc)" in
    genscripts.sh with "sh ${srcdir}..." (no parens) and make sure the
    emulparams script used exports any shell variables it sets.

Copyright (C) 2012-2021 Free Software Foundation, Inc.

Copying and distribution of this file, with or without modification,
are permitted in any medium without royalty provided the copyright
notice and this notice are preserved.