Commit | Line | Data |
---|---|---|
1 | .\" Copyright (c) 2002, 2003 Tim J. Robbins | |
2 | .\" All rights reserved. | |
3 | .\" | |
4 | .\" Redistribution and use in source and binary forms, with or without | |
5 | .\" modification, are permitted provided that the following conditions | |
6 | .\" are met: | |
7 | .\" 1. Redistributions of source code must retain the above copyright | |
8 | .\" notice, this list of conditions and the following disclaimer. | |
9 | .\" 2. Redistributions in binary form must reproduce the above copyright | |
10 | .\" notice, this list of conditions and the following disclaimer in the | |
11 | .\" documentation and/or other materials provided with the distribution. | |
12 | .\" | |
13 | .\" THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND | |
14 | .\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE | |
15 | .\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE | |
16 | .\" ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE | |
17 | .\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL | |
18 | .\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS | |
19 | .\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) | |
20 | .\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT | |
21 | .\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY | |
22 | .\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF | |
23 | .\" SUCH DAMAGE. | |
24 | .\" | |
25 | .\" $FreeBSD: src/lib/libc/locale/gb18030.5,v 1.6 2004/07/05 06:36:36 ru Exp $ | |
26 | .\" | |
27 | .Dd August 10, 2003 | |
28 | .Dt GB18030 5 | |
29 | .Os | |
30 | .Sh NAME | |
31 | .Nm gb18030 | |
32 | .Nd "GB 18030 encoding method for Chinese text" | |
33 | .Sh SYNOPSIS | |
34 | .Nm ENCODING | |
35 | .Qq GB18030 | |
36 | .Sh DESCRIPTION | |
37 | The | |
38 | .Nm GB18030 | |
39 | encoding implements GB 18030-2000, a PRC national standard for the encoding of | |
40 | Chinese characters. | |
41 | It is a superset of the older GB\ 2312-1980 and GBK encodings, | |
42 | and incorporates Unicode's Unihan Extension A completely. | |
43 | It also provides code space for all Unicode 3.0 code points. | |
44 | .Pp | |
45 | Multibyte characters in the | |
46 | .Nm GB18030 | |
47 | encoding can be one byte, two bytes, or | |
48 | four bytes long. | |
49 | The byte ranges described below provide just over 1.6 million code positions (1,611,668). | |
50 | .Pp | |
51 | .No GB\ 11383-1981 Pq Tn ASCII | |
52 | characters are represented by single bytes in the range 0x00 to 0x7F. | |
53 | .Pp | |
54 | Chinese characters are represented as either two bytes or four bytes. | |
55 | Characters that are represented by two bytes begin with a byte in the range | |
56 | 0x81-0xFE and end with a byte in either the range 0x40-0x7E or the range 0x80-0xFE. | |
57 | .Pp | |
58 | Characters that are represented by four bytes begin with a byte in the range | |
59 | 0x81-0xFE, have a second byte in the range 0x30-0x39, a third byte in the range | |
60 | 0x81-0xFE and a fourth byte in the range 0x30-0x39. | |
61 | .Sh SEE ALSO | |
62 | .Xr euc 5 , | |
63 | .Xr gb2312 5 , | |
64 | .Xr gbk 5 , | |
65 | .Xr utf8 5 | |
66 | .Rs | |
67 | .%T "Chinese National Standard GB 18030-2000: Information Technology -- Chinese ideograms coded character set for information interchange -- Extension for the basic set" | |
68 | .%D "March 2000" | |
69 | .Re | |
70 | .Rs | |
71 | .%Q "The Unicode Consortium" | |
72 | .%T "The Unicode Standard, Version 3.0" | |
73 | .%D "2000" | |
74 | .Re | |
75 | .Sh STANDARDS | |
76 | The | |
77 | .Nm GB18030 | |
78 | encoding is believed to be compatible with GB 18030-2000. |
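
The byte ranges given in the DESCRIPTION section can be checked mechanically. The following is a minimal sketch in C, not the libc implementation: the names gb18030_seq_len and gb18030_code_positions are hypothetical, and the return-value convention (0 for "need more bytes", -1 for "invalid") is an assumption made for illustration. It only tests whether the bytes fall in the documented ranges; it does not check whether a character is actually assigned at that position.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical helper (not a libc interface): classify the GB18030
 * sequence that starts at s, examining at most n bytes.
 * Returns 1, 2 or 4 for a sequence whose bytes lie in the documented
 * ranges, 0 if more bytes are needed to decide, and -1 if the bytes
 * cannot begin a valid sequence.
 */
static int
gb18030_seq_len(const unsigned char *s, size_t n)
{
	assert(s != NULL || n == 0);

	if (n == 0)
		return (0);
	if (s[0] <= 0x7F)			/* single-byte (ASCII) range */
		return (1);
	if (s[0] < 0x81 || s[0] > 0xFE)		/* 0x80 and 0xFF never lead */
		return (-1);
	if (n < 2)
		return (0);
	/* Two-byte form: trail byte in 0x40-0x7E or 0x80-0xFE. */
	if ((s[1] >= 0x40 && s[1] <= 0x7E) || (s[1] >= 0x80 && s[1] <= 0xFE))
		return (2);
	/* Otherwise only the four-byte form remains: second byte 0x30-0x39. */
	if (s[1] < 0x30 || s[1] > 0x39)
		return (-1);
	if (n < 3)
		return (0);
	if (s[2] < 0x81 || s[2] > 0xFE)
		return (-1);
	if (n < 4)
		return (0);
	if (s[3] < 0x30 || s[3] > 0x39)
		return (-1);
	return (4);
}

/*
 * Hypothetical helper: count the code positions implied by the
 * documented byte ranges (one-, two- and four-byte forms combined).
 */
static unsigned long
gb18030_code_positions(void)
{
	unsigned long single = 0x7F - 0x00 + 1;			/* 128 */
	unsigned long lead = 0xFE - 0x81 + 1;			/* 126 */
	unsigned long trail = (0x7E - 0x40 + 1) + (0xFE - 0x80 + 1); /* 190 */
	unsigned long digit = 0x39 - 0x30 + 1;			/* 10 */

	return (single + lead * trail + lead * digit * lead * digit);
}
```

Because the second byte of a two-byte character (0x40-0x7E, 0x80-0xFE) never overlaps the second byte of a four-byte character (0x30-0x39), the length of a sequence is known after at most two bytes; the remaining checks only validate the third and fourth bytes.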