Commit | Line | Data |
---|---|---|
2ca993e8 A |
1 | # *************************************************************************** |
2 | # * | |
3 | # * Copyright (C) 2004-2016, International Business Machines | |
4 | # * Corporation; Unicode, Inc.; and others. All Rights Reserved. | |
5 | # * | |
6 | # *************************************************************************** | |
7 | # File: zu_zu_FONIPA.txt | |
8 | # Generated from CLDR | |
9 | # | |
10 | ||
11 | # Pronunciation rules for isiZulu. | |
12 | # | |
13 | # Author: mjansche@google.com (Martin Jansche) | |
14 | # | |
15 | # These rules transcribe isiZulu into the phoneme inventory used within the | |
16 | # NCHLT Speech Corpus (https://sites.google.com/site/nchltspeechcorpus/home). | |
17 | # | |
18 | # The rules were tested using the NCHLT-inlang isiZulu pronunciation dictionary | |
19 | # (http://rma.nwu.ac.za/index.php/resource-catalogue/nchlt-inlang-dictionaries.html). | |
20 | # They correctly account for all 15,000 entries in the dictionary. | |
21 | # | |
22 | # The NCHLT 2013 phone set does not indicate tone in any way. Transcription of | |
23 | # tone is out of scope without a dictionary, since tone is generally not | |
24 | # indicated in the orthography. Nasal clicks are not treated as separate | |
25 | # phonemes in the NCHLT 2013 phone set and are transcribed as a sequence of | |
26 | # nasal plus click instead. | |
27 | # | |
28 | # One minor notational deviation from the NCHLT 2013 phone set is that we use a | |
29 | # tie bar within the complex (depressor) clicks, e.g. ɡ\u0361ǀ instead of ɡǀ, to | |
30 | # avoid ambiguity and make the phoneme inventory uniquely decodable. | |
31 | ::Lower; | |
32 | tsh → t\u0361ʃʼ; | |
33 | bh → b; | |
34 | ch → ǀʰ; | |
35 | dl → ɮ; | |
36 | gc → ɡ\u0361ǀ; | |
37 | gq → ɡ\u0361ǃ; | |
38 | gx → ɡ\u0361ǁ; | |
39 | hh → ɦ; # To investigate: /ɦ/ and /h/ may be switched in the NCHLT dictionary. | |
40 | hl → ɬ; | |
41 | kh → kʰ; | |
42 | kl → k\u0361ɬ; | |
43 | ny → ɲ; | |
44 | ph → pʰ; | |
45 | qh → ǃʰ; | |
46 | n { sh → t\u0361ʃʼ; |
47 | sh → ʃ; | |
48 | th → tʰ; | |
49 | xh → ǁʰ; | |
50 | a → a; | |
51 | m { b → b; | |
52 | b → ɓ; | |
53 | c → ǀ; | |
54 | d → d; | |
55 | e → ɛ; | |
56 | f → f; | |
57 | g → ɡ; | |
58 | h → h; | |
59 | i → i; | |
60 | j → d\u0361ʒ; | |
61 | k → k; | |
62 | l → l; | |
63 | m → m; | |
64 | [$] { n } gc → n; | |
65 | n } [gk] → ŋ; | |
66 | n } j → ɲ; | |
67 | n → n; | |
68 | o → ɔ; | |
69 | p → pʼ; | |
70 | q → ǃ; | |
71 | n { s → t\u0361sʼ; | |
72 | s → s; | |
73 | t → tʼ; | |
74 | u → u; | |
75 | v → v; | |
76 | w → w; | |
77 | x → ǁ; | |
78 | y → j; | |
79 | n { z → d\u0361z; | |
80 | z → z; | |
81 |
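
The rules above are ordered: multi-letter keys come before single letters, and context-restricted rules (`n { s`, `m { b`, `n } [gk]`) come before their unrestricted fallbacks. The following is a minimal Python sketch of that first-match-wins behaviour for a small subset of the table. It is not the ICU engine. The names `RULES`, `START`, and `transcribe` are hypothetical, the `^` marker stands in for the `[$]` anchor and is assumed to mean start of input, and matching the before-context against already rewritten output is an assumption about in-place rewriting.

```python
# Hypothetical subset of the rule table; each entry is
# (before_context, key, after_alternatives, output).
START = "^"  # assumption: stands in for the [$] start-of-text anchor

RULES = [
    ("", "gc", (), "ɡ\u0361ǀ"),
    ("", "a", (), "a"),
    ("m", "b", (), "b"),        # m { b → b
    ("", "b", (), "ɓ"),
    ("", "e", (), "ɛ"),
    ("", "g", (), "ɡ"),
    ("", "i", (), "i"),
    ("", "j", (), "d\u0361ʒ"),
    ("", "k", (), "k"),
    ("", "m", (), "m"),
    (START, "n", ("gc",), "n"),  # [$] { n } gc → n
    ("", "n", ("g", "k"), "ŋ"),  # n } [gk] → ŋ
    ("", "n", ("j",), "ɲ"),      # n } j → ɲ
    ("", "n", (), "n"),
    ("", "o", (), "ɔ"),
    ("n", "s", (), "t\u0361sʼ"),  # n { s → t͡sʼ
    ("", "s", (), "s"),
    ("", "t", (), "tʼ"),
    ("", "u", (), "u"),
]


def transcribe(text):
    """Apply RULES left to right; the first matching rule wins."""
    src = text.lower()  # mirrors the ::Lower; step
    out = []
    pos = 0
    while pos < len(src):
        done = "".join(out)  # text already rewritten, used for before-context
        for before, key, after, result in RULES:
            if not src.startswith(key, pos):
                continue
            if before == START:
                if pos != 0:
                    continue
            elif not done.endswith(before):
                continue
            # An empty tuple means the rule has no after-context.
            if after and not src.startswith(after, pos + len(key)):
                continue
            out.append(result)
            pos += len(key)
            break
        else:
            # No rule matched: copy the character through unchanged.
            out.append(src[pos])
            pos += 1
    return "".join(out)


print(transcribe("Ubuntu"))   # uɓuntʼu
print(transcribe("insimbi"))  # int͡sʼimbi
```

Reordering the list changes the result: if the plain `n → n` entry were placed ahead of `n } [gk]`, the velar nasal in a word such as "ingane" would never be produced, which is why the table lists the restricted rules first.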