1 # Copyright (c) 2000 Unicode, Inc. All Rights reserved.
3 # Name: GSM 03.38 to Unicode
7 # Authors: Ken Whistler
11 # Source: http://www.unicode.org/Public/MAPPINGS/ETSI/GSM0338.TXT
12 # See there for the license and for a description of the charset.
13 # Formatted into ICU .ucm format by Markus Scherer on 2006-nov-02.
14 # Commented-out mappings are turned into fallbacks (|1), all others are turned
15 # into round-trips (|0).
16 # Multi-byte mappings are preserved as multi-single-byte character mappings,
17 # using ICU's m:n conversion capability.
19 # The substitution character is not documented in the Unicode file.
20 # \x3F is chosen here because \x1A is a graphic character.
22 # Other deviations from the Unicode file:
24 # The GSM standard specifies that one or two ESC bytes (\x1B), if not followed
25 # by a recognized final byte, be mapped to spaces (that is, reverse fallbacks
27 # The Unicode file round-trips a single \x1B to U+00A0 (NBSP) and has no mapping
29 # (Reverse fallbacks to U+00A0 would result in Unicode text that cannot be
30 # converted back to GSM 03.38. A roundtrip for U+00A0 adds a character that is
31 # not mappable in the standard.)
33 # See the ietf-charsets list email "Re: GSM 03.38 substitution character?"
34 # at http://mail.apps.ietf.org/ietf/charsets/msg01696.html
37 # The GSM standard maps U+00C7 capital C-cedilla to \x09 but the Unicode file
38 # contains and documents a "fix" to map U+00E7 small c-cedilla instead, based on
39 # an interpretation of the intent of the standard. Prevailing implementations
40 # in mobile phones follow the standard.
42 # This file follows the GSM standard.
44 # See the GSM standard at
45 # http://www.3gpp.org/ftp/Specs/archive/03_series/03.38/0338-720.zip
47 # For problems with the table format please submit a bug
48 # at http://www.icu-project.org/ .
49 # For issues with the mappings please contact Unicode
50 # at http://www.unicode.org/reporting.html
52 <code_set_name> "gsm-03.38-2000"
53 <char_name_mask> "AXXXX"
59 <icu:charsetFamily> "ASCII"