glibc/manual
Adhemerval Zanella 6a785f1dcf elf: Add support to memory sealing
The new Linux mseal syscall allows seal memory mappings to avoid
further changes such as memory protection or remap.  The sealing
is done in multiple places where the memory is supposed to
be immutable over program execution:

  * All shared library dependencies from the binary, including the
    read-only segments after PT_GNU_RELRO setup.

  * The binary itself, including dynamic and static links.  In both
    It is up either to binary or the loader to set up the sealing.

  * Any preload libraries.

  * Any library loaded with dlopen with RTLD_NODELETE flag (including
    libgcc.so loaded to enable unwind and/or thread cancellation).

  * Audit modules.

  * The loader bump allocator.

For binary dependencies, the RTLD_NODELETE signals the
link_map should be sealed.  It also makes dlopen objects with the
flag sealed as well.

The sealing is controlled by a new tunable, glibc.rtld.seal, with
three different states:

  0. Disabled, where no memory sealing is done.

  1. Enabled, where the loader will issue the mseal syscall on the
     memory mappings but any failure will be ignored.  This is
     the default.

  2. Enforce, similar to Enabled but any failure from the mseal
     will terminate the process.

Checked on x86_64-linux-gnu and aarch64-linux-gnu.
2024-07-31 17:02:05 -03:00
..
examples Minor code improvement to timespec_subtract example 2024-06-15 08:53:50 -07:00
argp.texi stdlib: Reinstate stable mergesort implementation on qsort 2024-01-15 15:58:35 -03:00
arith.texi manual: significand() uses FLT_RADIX, not 2 2024-04-03 09:16:22 -03:00
charset.texi
check-safety.sh
conf.texi Modernize and fix doc’s “Date and Time” (BZ 31876) 2024-06-15 08:53:50 -07:00
contrib.texi contrib.texi: Fix format of MIPS and RISC-V 2024-07-21 14:12:54 +02:00
creature.texi Always define __USE_TIME_BITS64 when 64 bit time_t is used 2024-04-02 15:28:36 -03:00
crypt.texi
ctype.texi
debug.texi
dir
dynlink.texi manual: Do not mention STATIC_TLS in dynamic linker hardening recommendations 2024-07-24 12:50:17 +02:00
errno.texi
fdl-1.3.texi
filesys.texi
freemanuals.texi
getopt.texi
header.texi
install-plain.texi
install.texi install.texi: bump "latest verified" versions 2024-07-21 00:27:35 +02:00
intro.texi manual: add syscalls 2024-07-09 11:54:29 +02:00
io.texi manual/io: Fix swapped reading and writing phrase. 2024-01-30 20:10:38 +01:00
ipc.texi
job.texi
lang.texi manual: clarify defintions of floating point exponent bounds (bug 31518) 2024-05-21 10:25:46 +02:00
lgpl-2.1.texi
libc-texinfo.sh
libc.texinfo
libcbook.texi
llio.texi manual: add syscalls 2024-07-09 11:54:29 +02:00
locale.texi stdlib: Reinstate stable mergesort implementation on qsort 2024-01-15 15:58:35 -03:00
macros.texi manual: add syscalls 2024-07-09 11:54:29 +02:00
maint.texi Always define __USE_TIME_BITS64 when 64 bit time_t is used 2024-04-02 15:28:36 -03:00
Makefile manual: add syscalls 2024-07-09 11:54:29 +02:00
math.texi Implement C23 exp2m1, exp10m1 2024-06-17 16:31:49 +00:00
memory.texi linux: Add mseal syscall support 2024-07-31 17:02:04 -03:00
message.texi
nss.texi
nsswitch.texi
pattern.texi
pipe.texi
platform.texi
probes.texi
process.texi
README.pretty-printers
README.tunables elf: Remove LD_HWCAP_MASK / tunable glibc.cpu.hwcap_mask 2024-06-18 10:45:36 +02:00
resource.texi manual: make setrlimit() description less ambiguous 2024-07-29 16:03:45 +02:00
search.texi Fix bsearch, qsort doc to match POSIX better 2024-04-06 10:10:32 -07:00
setjmp.texi
signal.texi
socket.texi manual: add syscalls 2024-07-09 11:54:29 +02:00
startup.texi manual: add syscalls 2024-07-09 11:54:29 +02:00
stdbit.texi manual: Fix up stdbit.texi 2024-02-01 16:36:55 +01:00
stdio-fp.c
stdio.texi manual/stdio: Clarify putc and putwc 2024-07-29 14:55:10 +02:00
string.texi Fix strnlen doc re array size 2024-06-26 16:16:05 +01:00
summary.pl
sysinfo.texi
syslog.texi
terminal.texi
texinfo.tex
texis.awk
threads.texi Linux: Make __rseq_size useful for feature detection (bug 31965) 2024-07-09 19:33:37 +02:00
time.texi Modernize and fix doc’s “Date and Time” (BZ 31876) 2024-06-15 08:53:50 -07:00
tsort.awk
tunables.texi elf: Add support to memory sealing 2024-07-31 17:02:05 -03:00
users.texi
xtract-typefun.awk

			TUNABLE FRAMEWORK
			=================

Tunables is a feature in the GNU C Library that allows application authors and
distribution maintainers to alter the runtime library behaviour to match their
workload.

The tunable framework allows modules within glibc to register variables that
may be tweaked through an environment variable.  It aims to enforce a strict
namespace rule to bring consistency to naming of these tunable environment
variables across the project.  This document is a guide for glibc developers to
add tunables to the framework.

ADDING A NEW TUNABLE
--------------------

The TOP_NAMESPACE macro is defined by default as 'glibc'.  If distributions
intend to add their own tunables, they should do so in a different top
namespace by overriding the TOP_NAMESPACE macro for that tunable.  Downstream
implementations are discouraged from using the 'glibc' top namespace for
tunables they don't already have consensus to push upstream.

There are three steps to adding a tunable:

1. Add a tunable to the list and fully specify its properties:

For each tunable you want to add, make an entry in elf/dl-tunables.list.  The
format of the file is as follows:

TOP_NAMESPACE {
  NAMESPACE1 {
    TUNABLE1 {
      # tunable attributes, one per line
    }
    # A tunable with default attributes, i.e. string variable.
    TUNABLE2
    TUNABLE3 {
      # its attributes
    }
  }
  NAMESPACE2 {
    ...
  }
}

The list of allowed attributes are:

- type:			Data type.  Defaults to STRING.  Allowed types are:
			INT_32, UINT_64, SIZE_T and STRING.  Numeric types may
			be in octal or hexadecimal format too.

- minval:		Optional minimum acceptable value.  For a string type
			this is the minimum length of the value.

- maxval:		Optional maximum acceptable value.  For a string type
			this is the maximum length of the value.

- default:		Specify an optional default value for the tunable.

- env_alias:		An alias environment variable

2. Use TUNABLE_GET/TUNABLE_SET/TUNABLE_SET_WITH_BOUNDS to get and set tunables.

3. OPTIONAL: If tunables in a namespace are being used multiple times within a
   specific module, set the TUNABLE_NAMESPACE macro to reduce the amount of
   typing.

GETTING AND SETTING TUNABLES
----------------------------

When the TUNABLE_NAMESPACE macro is defined, one may get tunables in that
module using the TUNABLE_GET macro as follows:

  val = TUNABLE_GET (check, int32_t, TUNABLE_CALLBACK (check_callback))

where 'check' is the tunable name, 'int32_t' is the C type of the tunable and
'check_callback' is the function to call if the tunable got initialized to a
non-default value.  The macro returns the value as type 'int32_t'.

The callback function should be defined as follows:

  void
  TUNABLE_CALLBACK (check_callback) (int32_t *valp)
  {
  ...
  }

where it can expect the tunable value to be passed in VALP.

Tunables in the module can be updated using:

  TUNABLE_SET (check, val)

where 'check' is the tunable name and 'val' is a value of same type.

To get and set tunables in a different namespace from that module, use the full
form of the macros as follows:

  val = TUNABLE_GET_FULL (glibc, malloc, mmap_max, int32_t, NULL)

  TUNABLE_SET_FULL (glibc, malloc, mmap_max, val)

where 'glibc' is the top namespace, 'malloc' is the tunable namespace and the
remaining arguments are the same as the short form macros.

The minimum and maximum values can updated together with the tunable value
using:

  TUNABLE_SET_WITH_BOUNDS (check, val, min, max)

where 'check' is the tunable name, 'val' is a value of same type, 'min' and
'max' are the minimum and maximum values of the tunable.

To set the minimum and maximum values of tunables in a different namespace
from that module, use the full form of the macros as follows:

  val = TUNABLE_GET_FULL (glibc, malloc, mmap_max, int32_t, NULL)

  TUNABLE_SET_WITH_BOUNDS_FULL (glibc, malloc, mmap_max, val, min, max)

where 'glibc' is the top namespace, 'malloc' is the tunable namespace and the
remaining arguments are the same as the short form macros.

When TUNABLE_NAMESPACE is not defined in a module, TUNABLE_GET is equivalent to
TUNABLE_GET_FULL, so you will need to provide full namespace information for
both macros.  Likewise for TUNABLE_SET, TUNABLE_SET_FULL,
TUNABLE_SET_WITH_BOUNDS and TUNABLE_SET_WITH_BOUNDS_FULL.

** IMPORTANT NOTE **

The tunable list is set as read-only after the dynamic linker relocates itself,
so setting tunable values must be limited only to tunables within the dynamic
linker, that too before relocation.

FUTURE WORK
-----------

The framework currently only allows a one-time initialization of variables
through environment variables and in some cases, modification of variables via
an API call.  A future goals for this project include:

- Setting system-wide and user-wide defaults for tunables through some
  mechanism like a configuration file.

- Allow tweaking of some tunables at runtime