1 # Copyright (C) 2016 and later: Unicode, Inc. and others.
2 # License & terms of use: http://www.unicode.org/copyright.html
3 # Copyright (C) 2010, International Business Machines
4 # Corporation and others. All Rights Reserved.
6 # file name: testnorm.txt
8 # tab size: 8 (not used)
11 # created on: 2010feb15
12 # created by: Markus W. Scherer
14 # Normalization test data, for improving code coverage.
16 # Selection of Canonical_Combining_Class (ccc) values
47 # ICU 63 normalization with UCPTrie requires inert surrogate code points.
48 # D802:2 # surrogates with non-zero combining classes
54 # Some interesting mappings
62 # ICU 63 normalization with UCPTrie requires inert surrogate code points.
63 # D800>D7FF # surrogates with mappings, and mappings to empty strings
68 E001=61 338 # composition with trail<=33FF and composite>7FFF
69 E002=E001 308 # recursive mapping needs reordering
70 E003>62 307 327 337 # mapping needs reordering
71 E011=E010 F0011 # composition of BMP+supplementary, and F0011 is maybe & combines-fwd
72 E111>1101 # mapping ends in Jamo L
73 E112>1102 62 # mapping starts with Jamo L
83 F0010=F0011 E012 # composition of supplementary+BMP