2 # Date: 2010-08-18, 17:25:00 PDT [KW]
4 # Line Break Properties
6 # This file is a normative contributory data file in the
7 # Unicode Character Database.
8 # It contains both normative and informative data.
10 # Copyright (c) 1991-2010 Unicode, Inc.
11 # For terms of use, see http://www.unicode.org/terms_of_use.html
13 # The format is two fields separated by a semicolon.
14 # Field 0: Unicode value
15 # Field 1: LineBreak property, consisting of one of the following values:
17 # "BK", "CR", "LF", "CM", "SG", "GL", "CB", "SP", "ZW",
18 # "NL", "WJ", "JL", "JV", "JT", "H2", "H3"
20 # "XX", "OP", "CL", "CP", "QU", "NS", "EX", "SY",
21 # "IS", "PR", "PO", "NU", "AL", "ID", "IN", "HY",
22 # "BB", "BA", "SA", "AI", "B2"
23 # - All code points, assigned and unassigned, that are not listed
24 # explicitly are given the value "XX".
25 # The unassigned code points that default to "ID" include ranges in the
27 # CJK Unified Ideographs Extension A: U+3400..U+4DBF
28 # CJK Unified Ideographs: U+4E00..U+9FFF
29 # CJK Compatibility Ideographs: U+F900..U+FAFF
30 # CJK Unified Ideographs Extension B: U+20000..U+2A6DF
31 # CJK Unified Ideographs Extension C: U+2A700..U+2B73F
32 # CJK Unified Ideographs Extension D: U+2B740..U+2B81F
33 # CJK Compatibility Ideographs Supplement: U+2F800..U+2FA1F
34 # and any other reserved code points on
35 # Planes 2 and 3: U+20000..U+2FFFD
37 # - Characters ranges are specified as for other property files in
38 # the Unicode Character Database.
40 # The Unicode name of each character is provided in a comment for help
41 # in identifying the characters.
43 # See UAX #14: Unicode Line Breaking Algorithm, for more information
45 # @missing: 0000..10FFFF; XX