# © 2016 and later: Unicode, Inc. and others.
# License & terms of use: http://www.unicode.org/copyright.html#License
#
# File: xh_xh_FONIPA.txt
# Generated from CLDR
#

# Pronunciation rules for isiXhosa.
#
# Author: mjansche@google.com (Martin Jansche)
#
# These rules transcribe isiXhosa into the phoneme inventory used within the
# NCHLT Speech Corpus (https://sites.google.com/site/nchltspeechcorpus/home).
#
# The rules were tested using the NCHLT-inlang isiXhosa pronunciation dictionary
# (http://rma.nwu.ac.za/index.php/resource-catalogue/nchlt-inlang-dictionaries.html).
# They correctly account for 14,999 out of 15,000 entries in the dictionary.
#
# The NCHLT 2013 phone set does not distinguish short and long vowels and does
# not indicate tone in any way. Transcription of tone is out of scope without a
# dictionary, since tone is generally not indicated in the orthography. Nasal
# clicks are not treated as separate phonemes in the NCHLT 2013 phone set and
# are transcribed as a sequence of nasal plus click instead.
#
# One minor notational deviation from the NCHLT 2013 phone set is that we use a
# tie bar within the complex (slack voiced) clicks, e.g. ɡ\u0361ǀ instead of ɡǀ, to
# avoid ambiguity and make the phoneme inventory uniquely decodable.
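#
# Illustrative traces (added for readers; not part of the CLDR-generated rules,
# and derived only by reading the rules below, so treat them as a sketch rather
# than verified output). Rules are tried in order at each cursor position, which
# is why multi-letter rules precede the single-letter ones:
#   isiXhosa → isiǁʰɔsa    (::Lower first; xh → ǁʰ wins over the later x → ǁ)
#   ingca    → iŋɡ\u0361ǀa  (n } g → ŋ consumes only n; gc → ɡ\u0361ǀ then applies)
#   ntsha    → nt\u0361ʃʼa  (n { tsh selects the ejective when n precedes)
#   mhla     → mɬa         (mh } [^l] does not match before l; m, then hl → ɬ)
#   oo       → ɔ           (oo → | o rewrites to o and rescans it; o → ɔ)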
::Lower;
nyh → ɲʰ;
n { tsh → t\u0361ʃʼ;
tsh → t\u0361ʃʰ;
tyh → cʰ;
bh → bʰ;
ch → ǀʰ;
dl → ɮ;
dy → ɟ;
gc → ɡ\u0361ǀ;
gq → ɡ\u0361ǃ;
gr → ɣ;
gx → ɡ\u0361ǁ;
hl → ɬ;
kh → kʰ;
kr → k\u0361x;
mh } [^l] → mʰ; # <mhl> denotes /mɬ/ instead
nh → nʰ;
ny → ɲ;
ph → pʰ;
qh → ǃʰ;
sh → ʃ;
th → tʰ;
tl → t\u0361ɬʼ;
ts → t\u0361sʼ;
ty → cʼ;
xh → ǁʰ;
aa → | a;
ee → | e;
ii → | i;
kc → | c;
kq → | q;
mm → | m;
oo → | o;
rh → | r;
uu → | u;
a → a;
b → ɓ;
c → ǀ;
d → d;
e → ɛ;
f → f;
g → ɡ;
h → h;
i → i;
j → d\u0361ʒ;
k → kʼ;
l → l;
m → m;
n } g → ŋ;
n → n;
o → ɔ;
p → pʼ;
q → ǃ;
r → r;
s → s;
t → tʼ;
u → u;
v → v;
w → w;
x → ǁ;
y → j;
z → z;