glibc/localedata/charmaps
Mike FABIAN a7b5eb821d Update to Unicode 16.0.0 [BZ #32168]
Unicode 16.0.0 Support: Character encoding, character type info, and
transliteration tables are all updated to Unicode 16.0.0, using
the generator scripts contributed by Mike FABIAN (Red Hat).

Changes in CHARMAP and WIDTH:

    Total added characters in newly generated CHARMAP: 5185
    Total removed characters in newly generated WIDTH: 1
    Total added characters in newly generated WIDTH: 170

The removed character from WIDTH is U+1171E AHOM CONSONANT SIGN MEDIAL RA.
It changed like this:

UnicodeData.txt 15.1.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mn;0;NSM;;;;;N;;;;;
UnicodeData.txt 16.0.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mc;0;L;;;;;N;;;;;

EastAsianWidth.txt 15.1.0: 1171D..1171F   ; N  # Mn     [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
EastAsianWidth.txt 16.0.0: 1171E          ; N  # Mc         AHOM CONSONANT SIGN MEDIAL RA

I.e it changed from Mn (Mark Nonspacing) to Mc (Mark Spacing
combining). So it should now have width 1 instead of 0, therefore it
is OK that it was removed from WIDTH, characters not in WIDTH get
width 1 by default.

Nothing suspicious when browsing the list of the 170 added characters.

Changes in ctype:

    alpha: Added 4452 characters in new ctype which were not in old ctype
    combining: Added 51 characters in new ctype which were not in old ctype
    combining_level3: Added 43 characters in new ctype which were not in old ctype
    graph: Added 5185 characters in new ctype which were not in old ctype
    lower: Added 25 characters in new ctype which were not in old ctype
    print: Added 5185 characters in new ctype which were not in old ctype
    punct: Missing 33 characters of old ctype in new ctype
    punct: Added 766 characters in new ctype which were not in old ctype
    tolower: Added 27 characters in new ctype which were not in old ctype
    totitle: Added 27 characters in new ctype which were not in old ctype
    toupper: Added 27 characters in new ctype which were not in old ctype
    upper: Added 27 characters in new ctype which were not in old ctype

Nothing suspicous in the additions.

About the 33 characters removed from `punct`:

U+0363 - U+036F are identical in UnicodeData.txt. Difference in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: not there.
DerivedCoreProperties.txt 16.0.0: 0363..036F    ; Alphabetic # Mn  [13] COMBINING LATIN SMALL LETTER A..COMBINING LATIN SMALL LETTER X

So that’s the reason why they are added to `alpha` and removed from `punct`.

Same for U+1DD3 - U+1DE6, they are identical in UnicodeData.txt but there is a difference in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: 1DE7..1DF4    ; Alphabetic # Mn  [14] COMBINING LATIN SMALL LETTER ALPHA..COMBINING LATIN SMALL LETTER U WITH DIAERESIS
DerivedCoreProperties.txt 16.0.0: 1DD3..1DF4    ; Alphabetic # Mn  [34] COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE..COMBINING LATIN SMALL LETTER U WITH DIAERESIS

So they became `Alphabetic` and were thus added to `alpha` and removed from `punct`.

Resolves: BZ #32168

Reviewed-by: Carlos O'Donell <carlos@redhat.com>
2024-09-27 14:43:38 +02:00
..
ANSI_X3.4-1968 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
ANSI_X3.110-1983 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
ARMSCII-8 Update. 2002-04-19 07:49:16 +00:00
ASMO_449
BIG5 Remove trailing whitespace from localedata. 2013-06-07 14:56:03 +00:00
BIG5-HKSCS Update BIG5-HKSCS charmap to HKSCS-2008 2013-06-11 17:02:59 +02:00
BRF * iconvdata/brf.c: New file. 2007-01-20 00:23:01 +00:00
BS_4730 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
BS_VIEWDATA locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP737 Update. 2000-07-31 23:14:10 +00:00
CP770 Add support for CP770, CP771, CP772, CP773, and CP774 2011-05-09 23:15:39 -04:00
CP771 Add support for CP770, CP771, CP772, CP773, and CP774 2011-05-09 23:15:39 -04:00
CP772 Add support for CP770, CP771, CP772, CP773, and CP774 2011-05-09 23:15:39 -04:00
CP773 Add support for CP770, CP771, CP772, CP773, and CP774 2011-05-09 23:15:39 -04:00
CP774 Add support for CP770, CP771, CP772, CP773, and CP774 2011-05-09 23:15:39 -04:00
CP775 Update. 2000-07-31 23:14:10 +00:00
CP949 * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
CP1125 2002-12-03 Roland McGrath <roland@redhat.com> 2002-12-03 20:56:06 +00:00
CP1250 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1251 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1252 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1253 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1254 Bug 21399: Fix CP1254 comment for U+00EC 2017-04-19 08:10:35 -04:00
CP1255 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1256 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CP1257
CP1258
CP10007 localedata: change M$ to Microsoft 2016-08-10 00:49:14 +08:00
CSA_Z243.4-1985-1
CSA_Z243.4-1985-2
CSA_Z243.4-1985-GR
CSN_369103 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
CWI
DEC-MCS
DIN_66003
DS_2089
EBCDIC-AT-DE
EBCDIC-AT-DE-A
EBCDIC-CA-FR
EBCDIC-DK-NO locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
EBCDIC-DK-NO-A
EBCDIC-ES
EBCDIC-ES-A
EBCDIC-ES-S
EBCDIC-FI-SE
EBCDIC-FI-SE-A
EBCDIC-FR
EBCDIC-IS-FRISS
EBCDIC-IT
EBCDIC-PT
EBCDIC-UK
EBCDIC-US
ECMA-CYRILLIC
ES
ES2
EUC-JISX0213 Update. 2004-08-02 22:33:57 +00:00
EUC-JP * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
EUC-JP-MS Update. 2003-08-14 21:48:29 +00:00
EUC-KR [BZ #3954] 2007-02-16 01:25:28 +00:00
EUC-TW Update. 2002-04-20 07:58:48 +00:00
GB2312 Update. 2000-09-14 20:43:43 +00:00
GB18030 add GB18030-2022 charmap and test the entire GB18030 charmap [BZ #30243] 2023-08-29 19:02:30 +02:00
GB_1988-80 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
GBK localedata: GBK: add mapping for 0x80->Euro sign [BZ #20864] 2016-11-26 17:20:22 -05:00
GEORGIAN-ACADEMY Update. 2001-08-04 08:05:56 +00:00
GEORGIAN-PS Update. 2001-08-04 08:05:56 +00:00
GOST_19768-74
GREEK7 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
GREEK7-OLD
GREEK-CCITT locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
HP-GREEK8 [BZ #5465] 2007-12-12 08:06:15 +00:00
HP-ROMAN8
HP-ROMAN9 [BZ #5427] 2007-12-08 01:38:23 +00:00
HP-THAI8 [BZ #5465] 2007-12-12 08:06:15 +00:00
HP-TURKISH8 [BZ #5465] 2007-12-12 08:06:15 +00:00
IBM037
IBM038
IBM256 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM273 localedata: Make IBM273 compatible with ISO-8859-1 [BZ #23290] 2018-06-14 22:34:10 +02:00
IBM274
IBM275
IBM277 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM278 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM280 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM281
IBM284 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM285
IBM290
IBM297 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM420
IBM423
IBM424 localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882] 2021-05-18 07:21:45 +02:00
IBM437
IBM500
IBM850
IBM851
IBM852
IBM855
IBM856 Update. 2002-04-19 07:49:16 +00:00
IBM857
IBM858 Add new codepage charmaps/IBM858 [BZ #21084] 2017-09-14 15:50:57 +02:00
IBM860
IBM861
IBM862
IBM863
IBM864
IBM865
IBM866
IBM866NAV Remove trailing whitespace from localedata. 2013-06-07 14:56:03 +00:00
IBM868
IBM869
IBM870 Add correct mapping for <U017E>, <U017A>, <U017D> and <U0179>. Add missing mappings. 2002-08-29 09:45:29 +00:00
IBM871
IBM874 Update. 2004-07-20 07:39:10 +00:00
IBM875 charmaps: IBM875: fix mapping of iota/upsilon variants [BZ #18453] 2016-05-07 19:55:55 -04:00
IBM880
IBM891
IBM903
IBM904
IBM905
IBM918
IBM922 Update. 2002-04-19 07:49:16 +00:00
IBM1004
IBM1026
IBM1047 Update. 2003-06-16 17:18:06 +00:00
IBM1124 Update. 2002-04-19 07:49:16 +00:00
IBM1129 Update. 2002-04-19 07:49:16 +00:00
IBM1132 Update. 2002-04-19 07:49:16 +00:00
IBM1133 Update. 2002-04-19 07:49:16 +00:00
IBM1160 Update. 2002-04-19 07:49:16 +00:00
IBM1161 Update. 2002-04-19 07:49:16 +00:00
IBM1162 Update. 2002-04-19 07:49:16 +00:00
IBM1163 Update. 2002-04-19 07:49:16 +00:00
IBM1164 Update. 2002-04-19 07:49:16 +00:00
IEC_P27-1
INIS
INIS-8 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
INIS-CYRILLIC
INVARIANT
ISIRI-3342 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
ISO_646.BASIC
ISO_646.IRV
ISO_2033-1983 locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
ISO_5427
ISO_5427-EXT locale: Remove obsolete repertoire map references 2015-07-21 03:54:02 -04:00
ISO_5428 2002-10-18 Bruno Haible <bruno@clisp.org> 2002-11-04 19:22:53 +00:00
ISO_6937
ISO_6937-2-25
ISO_6937-2-ADD
ISO_8859-1,GL Remove trailing whitespace from localedata. 2013-06-07 14:56:03 +00:00
ISO_8859-SUPP
ISO_10367-BOX
ISO_10646
ISO_11548-1 * iconvdata/Makefile (modules): Add ISO_11548-1. 2005-11-16 08:16:43 +00:00
ISO-8859-1 Update. 2000-06-24 18:53:49 +00:00
ISO-8859-2
ISO-8859-3
ISO-8859-4
ISO-8859-5
ISO-8859-6 Update. 2000-10-01 19:54:15 +00:00
ISO-8859-7 Update. 2004-08-02 22:33:57 +00:00
ISO-8859-8 Update. 2000-10-01 19:54:15 +00:00
ISO-8859-9
ISO-8859-9E * iconvdata/Makefile (modules): Add ISO8859-9E. 2007-09-30 04:00:26 +00:00
ISO-8859-10
ISO-8859-11 Update. 2004-08-02 22:33:57 +00:00
ISO-8859-13
ISO-8859-14
ISO-8859-15 Update. 2003-06-16 17:18:06 +00:00
ISO-8859-16 Update. 2003-06-16 17:18:06 +00:00
ISO-IR-90
ISO-IR-197 Update. 2000-10-01 18:59:42 +00:00
ISO-IR-209 Fix the Unicode values for LETTER H WITH CARON. 2002-04-24 20:14:49 +00:00
IT
JIS_C6220-1969-JP
JIS_C6220-1969-RO
JIS_C6229-1984-A
JIS_C6229-1984-B
JIS_C6229-1984-B-ADD
JIS_C6229-1984-HAND
JIS_C6229-1984-HAND-ADD
JIS_C6229-1984-KANA
JIS_X0201
JOHAB [BZ #3954] 2007-02-16 01:25:28 +00:00
JUS_I.B1.002
JUS_I.B1.003-MAC
JUS_I.B1.003-SERB
KOI8-R
KOI8-RU * iconvdata/Makefile (modules): Add KOI8-RU. 2007-09-29 04:35:54 +00:00
KOI8-T * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
KOI8-U * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
KOI-8
KSC5636
LATIN-GREEK
LATIN-GREEK-1
MAC-CENTRALEUROPE [BZ #4972] 2007-09-23 17:21:22 +00:00
MAC-CYRILLIC Update. 2001-05-24 08:50:10 +00:00
MAC-IS
MAC-SAMI * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
MAC-UK
MACINTOSH Update. 2002-01-18 06:26:02 +00:00
MIK [BZ #2625] 2006-05-01 14:11:50 +00:00
MSZ_7795.3
NATS-DANO
NATS-DANO-ADD
NATS-SEFI
NATS-SEFI-ADD
NC_NC00-10
NEXTSTEP
NF_Z_62-010
NF_Z_62-010_1973
NS_4551-1
NS_4551-2
PT
PT2
PT154 Update. 2004-04-17 22:34:18 +00:00
RK1048 Update. 2004-04-17 22:34:18 +00:00
SAMI
SAMI-WS2 Update. 2001-10-01 00:14:14 +00:00
SEN_850200_B
SEN_850200_C
SHIFT_JIS Update. 2001-06-17 22:44:18 +00:00
SHIFT_JISX0213 Update. 2004-08-02 22:33:57 +00:00
T.61-7BIT
T.61-8BIT
T.101-G2
TCVN5712-1 Update. 2001-12-10 01:37:56 +00:00
TIS-620 * sysdeps/unix/bsd/bsd4.4/freebsd/bits/typesizes.h: New file. 2002-11-03 01:21:26 +00:00
TSCII Update. 2002-09-24 04:19:03 +00:00
UTF-8 Update to Unicode 16.0.0 [BZ #32168] 2024-09-27 14:43:38 +02:00
VIDEOTEX-SUPPL
VISCII Update. 2000-10-27 21:07:08 +00:00
WINDOWS-31J * sunrpc/xdr_stdio.c (xdrstdio_getlong, xdrstdio_putlong): 2005-03-22 09:14:36 +00:00