1 # © 2016 and later: Unicode, Inc. and others.
2 # License & terms of use: http://www.unicode.org/copyright.html#License
3 #
4 # File: el_el_Latn_BGN.txt
5 # Generated from CLDR
6 #
7
8 #
9 ########################################################################
10 # BGN/PCGN 1962 System
11 #
12 # This system is a simplified version of the system devised by the PCGN
13 # in 1941 and later adopted by the BGN. In 1962 the two organizations
14 # agreed to joint adoption of certain changes in the original system,
15 # specifically the omission of special rules for the treatment of Greek
16 # geographic names of Albanian, Bulgarian, Italian, Macedonian, and
17 # Turkish origin. That revision eliminated the need to consider the
18 # origin of names and removed ambiguity from the romanization of Greek
19 # expressions of possible non-Greek origin. This system is based on
20 # the pronunciation of modern Greek and is not intended for use in
21 # the romanization of classical Greek.
22 #
23 # The Greek Alphabet as defined by the BGN (Pages 29-31):
24 #
25 # ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
26 # αβγδεζηθικλμνξοπρσςτυφχψω
27 #
28 # Originally prepared by Michael Everson <everson@evertype.com>
29 ########################################################################
30 #
31 # MINIMAL FILTER: Greek-Latin
32 #
33 :: [ΆΈΉΊΌΎΏΐΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩΪΫάέήίΰαβγδεζηθικλμνξοπρςστυφχψωϊϋόύώἀἁἂἃἄἅἆἇἈἉἊἋἌἍἎἏἐἑἒἓἔἕἘἙἚἛἜἝἠἡἢἣἤἥἦἧἨἩἪἫἬἭἮἯἰἱἲἳἴἵἶἷἸἹἺἻἼἽἾἿὀὁὂὃὄὅὈὉὊὋὌὍὐὑὒὓὔὕὖὗὙὛὝὟὠὡὢὣὤὥὦὧὨὩὫὬὭὮὯὰάὲέὴήὶίὸόὺύὼώᾀᾁᾂᾃᾄᾅᾆᾇᾈᾉᾊᾋᾌᾍᾎᾏᾐᾑᾒᾓᾔᾕᾖᾗᾘᾙᾚᾛᾜᾝᾞᾟᾠᾡᾢᾣᾤᾥᾦᾧᾨᾩᾪᾫᾬᾭᾮᾯᾲᾳᾴᾶᾷᾺΆᾼῂῃῄῆῇῈΈῊΉῌῖῚΊῤῥῦῪΎῲῳῴῶῷῸΌῺΏῼ῾] ;
34 :: NFD (NFC) ;
35 #
36 #
37 ########################################################################
38 #
39 ########################################################################
40 #
41 # Define All Transformation Variables
42 #
43 ########################################################################
44 #
45 $upperConsonants = [ΒΓΔΖΘΚΛΜΝΞΠΡΣΤΦΧΨ] ;
46 $lowerConsonants = [βγδζθκλμνξπρσςτφχψ] ;
47 $consonants = [$upperConsonants $lowerConsonants] ;
48 $upperVowels = [ΑΕΗΙΟΥΩ] ;
49 $lowerVowels = [αεηιουω] ;
50 $vowels = [$upperVowels $lowerVowels] ;
51 $lower = [$lowerConsonants $lowerVowels] ;
52 #
53 #
54 # Use this $wordBoundary until bug 2034 is fixed in ICU:
55 # http://bugs.icu-project.org/cgi-bin/icu-bugs/transliterate?id=2034;expression=boundary;user=guest
56 #
57 $wordBoundary = [^[:L:][:M:][:N:]] ;
58 #
59 #
60 ########################################################################
61 #
62 ########################################################################
63 #
64 # Rules moved to front to avoid masking
65 #
66 ########################################################################
67 #
68 ########################################################################
69 #
70 # BGN Page 32 Rule 1:
71 #
72 # The apostrophe and reversed apostrophe, one or the other of which is
73 # written in Greek in front of all initial uppercase vowel characters,
74 # above all initial lowercase vowel characters, and above the second
75 # character of all initial two-vowel character sequences, should not
76 # be romanized, e.g., Ἀθῆναι → Athínai, Ἠράκλειον → Iráklion,
77 # Οἰνόφυτα → Oinófita. These apostrophes must be distinguished from
78 # accent marks when they occur together, e.g. Ἄβατον → Ávaton,
79 # Ἤλια → Ília, Οἴτη → Oíti. The reversed apostrophe is sometimes found
80 # also with ρ and should, likewise, not be romanized: ῥέμα → réma.
81 #
82 # BGN Page 32 Rule 2a:
83 #
84 # Stress is shown in Greek by the use of the tilde or circumflex,
85 # the acute accent, or the grave accent; all of those marks should
86 # be represented in romanization by an acute accent, e.g.,
87 # Ἀθῆναι → Athínai, Νδία → Día, Ζεμενὸν → Zemenón.
88 #
89 # BGN Page 32 Rule 4:
90 #
91 # The character ι (ióta) is sometimes found written under, or,
92 # in uppercase, to the right of the vowel characters α, η, and ω.
93 # This "subscript iota" should not be romanized, e.g.,
94 # Μυρτῷον Πέλαγος or ΜΥΡΤῼΟΝ ΠΕΛΑΓΟΣ [but not ΜΥΡΤΩΙΟΝ ΠΕΛΑΓΟΣ]
95 # → Mirtóön Pélagos.
96 #
97 ########################################################################
98 #
99 [ἈἉᾼᾈᾉ] → Α ; # GREEK CAPITAL LETTER ALPHA
100 [ἀἁᾳᾀᾁ] → α ; # GREEK SMALL LETTER ALPHA
101 [ἊἋἌἍἎἏᾊᾋᾌᾍᾎᾏᾺΆ] → Ά ; # GREEK CAPITAL LETTER ALPHA WITH TONOS
102 [ἂἃἄἅἆἇὰάᾂᾃᾄᾅᾆᾇᾲᾴᾶᾷ] → ά ; # GREEK SMALL LETTER ALPHA WITH TONOS
103 [ἘἙ] → Ε ; # GREEK CAPITAL LETTER EPSILON
104 [ἐἑ] → ε ; # GREEK SMALL LETTER EPSILON
105 [ἚἛἜἝῈΈ] → Έ ; # GREEK CAPITAL LETTER EPSILON WITH TONOS
106 [ἒἓἔἕὲέ] → έ ; # GREEK SMALL LETTER EPSILON WITH TONOS
107 [ἨἩᾘᾙῌ] → Η ; # GREEK CAPITAL LETTER ETA
108 [ἠἡᾐᾑῃ] → η ; # GREEK SMALL LETTER ETA
109 [ἪἫἬἭἮἯᾚᾛᾜᾝᾞᾟῊΉ] → Ή ; # GREEK CAPITAL LETTER ETA WITH TONOS
110 [ἢἣἤἥἦἧὴήᾒᾓᾔᾕᾖᾗῂῄῆῇ] → ή ; # GREEK SMALL LETTER ETA WITH TONOS
111 [ἸἹ] → Ι ; # GREEK CAPITAL LETTER IOTA
112 [ἰἱ] → ι ; # GREEK SMALL LETTER IOTA
113 [ἺἻἼἽἾἿῚΊ] → Ί ; # GREEK CAPITAL LETTER IOTA WITH TONOS
114 [ἲἳἴἵἶἷὶίῖ] → ί ; # GREEK SMALL LETTER IOTA WITH TONOS
115 [ὈὉ] → Ο ; # GREEK CAPITAL LETTER OMICRON
116 [ὀὁ] → ο ; # GREEK SMALL LETTER OMICRON
117 [ὊὋὌὍῸΌ] → Ό ; # GREEK CAPITAL LETTER OMICRON WITH TONOS
118 [ὂὃὄὅὸό] → ό ; # GREEK SMALL LETTER OMICRON WITH TONOS
119 Ὑ → Υ ; # GREEK CAPITAL LETTER UPSILON
120 [ὐὑ] → υ ; # GREEK SMALL LETTER UPSILON
121 [ὛὝὟῪΎ] → Ύ ; # GREEK CAPITAL LETTER UPSILON WITH TONOS
122 [ὒὓὔὕὖὗὺύῦ] → ύ ; # GREEK SMALL LETTER UPSILON WITH TONOS
123 [ὨὩᾨᾩῼ] → Ω ; # GREEK CAPITAL LETTER OMEGA
124 [ὠὡᾠᾡῳ] → ω ; # GREEK SMALL LETTER OMEGA
125 [ὪὫὬὭὮὯᾪᾫᾬᾭᾮᾯῺΏ] → Ώ ; # GREEK CAPITAL LETTER OMEGA WITH TONOS
126 [ὢὣὤὥὦὧὼώᾢᾣᾤᾥᾦᾧῲῴῶῷ] → ώ ; # GREEK SMALL LETTER OMEGA WITH TONOS
127 Ῥ → Ρ ; # GREEK CAPITAL LETTER RHO
128 [ῤῥ] → ρ ; # GREEK SMALL LETTER RHO
129 #
130 #
131 ########################################################################
132 #
133 # End of Rules 1, 2a, and 4
134 #
135 ########################################################################
136 #
137 ########################################################################
138 #
139 # BGN Page 32 Rules 2b and 2c:
140 #
141 # If the stressed vowel is written as a sequence of two vowel characters
142 # in Greek, the second vowel character should carry the accent;
143 # similarly, in Romanization the acute accent should be placed over the
144 # second vowel letter, e.g., Οἰνοῦσαι → Oinoúsai, Οἴτη → Oíti,
145 # Θεσπιαὶ → Thespiaí.
146 #
147 # Where a syllable containing one of the combinations αυ, ευ, or ηυ
148 # carries the stress, this is marked in Greek on the character υ.
149 # In romanization it should be shown on the preceding vowel
150 # letter, e.g., Πειραιεύς → Piraiévs, Αὔρα → Ávra.
151 #
152 Αί → Aí ;
153 αί → aí ;
154 Οί → Oí ;
155 οί → oí ;
156 Ού → Oú ;
157 ού → oú ;
158 Αύ → Άυ ;
159 αύ → άυ ;
160 Εύ → Έυ ;
161 εύ → έυ ;
162 Ηύ → Ήυ ;
163 ηύ → ήυ ;
164 #
165 #
166 ########################################################################
167 #
168 # End of Rules 2b and 2c
169 #
170 ########################################################################
171 #
172 ########################################################################
173 #
174 # BGN Page 32 Rule 3:
175 #
176 # The dieresis should be shown in romanization where it occurs in Greek,
177 # e.g., Μαρινέϊκα → Marinéïka, Ἀχαΐα → Akhaï\u0301a; and over the second vowel
178 # letter in romanization of the following combinations of Greek vowel
179 # characters: αε, e.g., Ἀετὸς → Aëtos; αη, e.g., Ἀηδὼν → Aïdhon; οη,
180 # e.g. Οἰνόη → Oinóï; ωο, e.g., Ἠρῶον → Iróön.
181 #
182 [ΪΫ] → Ï ;
183 [ϊϋ] → ï ;
184 [ΐΰ] → ï\u0301 ;
185 Αε → Aë ;
186 αε → aë ;
187 Αη → Aï ;
188 αη → aï ;
189 Οη → Oï ;
190 οη → oï ;
191 Ωο → Oö ;
192 ωο → oö ;
193 Άε → Áë ;
194 άε → áë ;
195 Άη → Áï ;
196 άη → áï ;
197 Όη → Óï ;
198 όη → óï ;
199 Ώο → Óö ;
200 ώο → óö ;
201 #
202 #
203 ########################################################################
204 #
205 # End of Rule 3
206 #
207 ########################################################################
208 #
209 ########################################################################
210 #
211 # Start of Alphabetic Transformations
212 #
213 ########################################################################
214 #
215 ΑΙ → AI ; # GREEK CAPITAL LETTER ALPHA + CAPITAL IOTA
216 Αι → Ai ; # GREEK CAPITAL LETTER ALPHA + SMALL IOTA
217 αι → ai ; # GREEK SMALL LETTER ALPHA + SMALL IOTA
218 ΑΥ → AV ; # GREEK CAPITAL LETTER ALPHA + CAPITAL UPSILON
219 Αυ → Av ; # GREEK CAPITAL LETTER ALPHA + SMALL UPSILON
220 αυ → av ; # GREEK SMALL LETTER ALPHA + SMALL UPSILON
221 Α → A ; # GREEK CAPITAL LETTER ALPHA
222 α → a ; # GREEK SMALL LETTER ALPHA
223 Ά → Á ; # GREEK CAPITAL LETTER ALPHA WITH TONOS
224 ά → á ; # GREEK SMALL LETTER ALPHA WITH TONOS
225 Β → V ; # GREEK CAPITAL LETTER BETA
226 β → v ; # GREEK SMALL LETTER BETA
227 ΓΓ → NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL GAMMA
228 Γγ → Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL GAMMA
229 γγ → ng ; # GREEK SMALL LETTER GAMMA + SMALL GAMMA
230 $wordBoundary{ΓΚ → G ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA
231 $wordBoundary{Γκ → G ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA
232 $wordBoundary{γκ → g ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA
233 ΓΚ → NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA
234 Γκ → Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA
235 γκ → ng ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA
236 #
237 #
238 ########################################################################
239 #
240 # BGN Page 29 Rule 3a:
241 #
242 # The character γ should be romanized g before α, ο, ου, ω, and
243 # consonants other than γ, ξ, and χ.
244 #
245 ########################################################################
246 #
247 Γ}[ΑΟΩ [$upperConsonants - [ΓΞΧ]]] → G ; # GREEK CAPITAL LETTER GAMMA
248 Γ}[αοω [$lowerConsonants - [γξχ]]] → G ; # GREEK CAPITAL LETTER GAMMA
249 Γ}ΟΥ → G ; # GREEK CAPITAL LETTER GAMMA
250 Γ}ου → G ; # GREEK CAPITAL LETTER GAMMA
251 γ}[αοω [$lowerConsonants - [γξχ]]] → g ; # GREEK SMALL LETTER GAMMA
252 γ}ου → g ; # GREEK SMALL LETTER GAMMA
253 #
254 #
255 ########################################################################
256 #
257 # End of Rule 3a
258 #
259 ########################################################################
260 #
261 ########################################################################
262 #
263 # BGN Page 29 Rule 3b:
264 #
265 # The character γ should be romanized y before αι, ε, ει, η, ι, οι, υ,
266 # and υι.
267 #
268 ########################################################################
269 #
270 Γ}[ΑΕΟΥ]Ι → Y ; # GREEK CAPITAL LETTER GAMMA
271 Γ}[ΕΗΙΥ] → Y ; # GREEK CAPITAL LETTER GAMMA
272 Γ}[αεου]ι → Y ; # GREEK CAPITAL LETTER GAMMA
273 Γ}[εηιυ] → Y ; # GREEK CAPITAL LETTER GAMMA
274 γ}[αεου]ι → y ; # GREEK SMALL LETTER GAMMA
275 γ}[εηιυ] → y ; # GREEK SMALL LETTER GAMMA
276 #
277 #
278 ########################################################################
279 #
280 # End of Rule 3b
281 #
282 ########################################################################
283 #
284 ########################################################################
285 #
286 # BGN Page 29 Rule 3c:
287 #
288 # The character γ should be romanized n before ξ and χ.
289 #
290 ########################################################################
291 #
292 Γ}[ΞΧ] → N ; # GREEK CAPITAL LETTER GAMMA
293 Γ}[ξχ] → N ; # GREEK CAPITAL LETTER GAMMA
294 γ}[ξχ] → n ; # GREEK SMALL LETTER GAMMA
295 #
296 #
297 ########################################################################
298 #
299 # End of Rule 3c
300 #
301 ########################################################################
302 #
303 Γ → G ; # GREEK CAPITAL LETTER GAMMA
304 γ → g ; # GREEK SMALL LETTER GAMMA
305 #
306 #
307 ########################################################################
308 #
309 # BGN Page 29 Rule 4a:
310 #
311 # The character δ should be romanized d when between ν and ρ.
312 #
313 ########################################################################
314 #
315 Ν{Δ}Ρ → D ; # GREEK CAPITAL LETTER DELTA
316 ν{δ}ρ → d ; # GREEK SMALL LETTER DELTA
317 #
318 #
319 ########################################################################
320 #
321 # End of Rule 4a
322 #
323 ########################################################################
324 #
325 Δ} $lower → Dh ; # GREEK CAPITAL LETTER DELTA
326 Δ → DH ; # GREEK CAPITAL LETTER DELTA
327 δ → dh ; # GREEK SMALL LETTER DELTA
328 ΕΙ → I ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA
329 Ει → I ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA
330 ει → i ; # GREEK SMALL LETTER EPSILON + SMALL IOTA
331 ΕΪ → EÏ ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA DIAERESIS
332 Εϊ → Eï ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA DIAERESIS
333 εϊ → eï ; # GREEK SMALL LETTER EPSILON + SMALL IOTA DIAERESIS
334 ΕΥ → EV ; # GREEK CAPITAL LETTER EPSILON + CAPITAL UPSILON
335 Ευ → Ev ; # GREEK CAPITAL LETTER EPSILON + SMALL UPSILON
336 ευ → ev ; # GREEK SMALL LETTER EPSILON + SMALL UPSILON
337 Ε → E ; # GREEK CAPITAL LETTER EPSILON
338 ε → e ; # GREEK SMALL LETTER EPSILON
339 Έ → É ; # GREEK CAPITAL LETTER EPSILON WITH TONOS
340 έ → é ; # GREEK SMALL LETTER EPSILON WITH TONOS
341 Ζ → Z ; # GREEK CAPITAL LETTER ZETA
342 ζ → z ; # GREEK SMALL LETTER ZETA
343 ΗΥ → IV ; # GREEK CAPITAL LETTER ETA + CAPITAL UPSILON
344 Ηυ → Iv ; # GREEK CAPITAL LETTER ETA + SMALL UPSILON
345 ηυ → iv ; # GREEK SMALL LETTER ETA + SMALL UPSILON
346 Η → I ; # GREEK CAPITAL LETTER ETA
347 η → i ; # GREEK SMALL LETTER ETA
348 Ή → Í ; # GREEK CAPITAL LETTER ETA WITH TONOS
349 ή → í ; # GREEK SMALL LETTER ETA WITH TONOS
350 Θ} $lower → Th ; # GREEK CAPITAL LETTER THETA
351 Θ → TH ; # GREEK CAPITAL LETTER THETA
352 θ → th ; # GREEK SMALL LETTER THETA
353 Ι → I ; # GREEK CAPITAL LETTER IOTA
354 ι → i ; # GREEK SMALL LETTER IOTA
355 Ί → Í ; # GREEK CAPITAL LETTER IOTA WITH TONOS
356 ί → í ; # GREEK SMALL LETTER IOTA WITH TONOS
357 Κ → K ; # GREEK CAPITAL LETTER KAPPA
358 κ → k ; # GREEK SMALL LETTER KAPPA
359 Λ → L ; # GREEK CAPITAL LETTER LAMDA
360 λ → l ; # GREEK SMALL LETTER LAMDA
361 $wordBoundary{ΜΠ → B ; # GREEK CAPITAL LETTER MU + CAPITAL PI
362 $wordBoundary{Μπ → B ; # GREEK CAPITAL LETTER MU + SMALL PI
363 $wordBoundary{μπ → b ; # GREEK SMALL LETTER MU + SMALL PI
364 ΜΠ → MB ; # GREEK CAPITAL LETTER MU + CAPITAL PI
365 Μπ → Mb ; # GREEK CAPITAL LETTER MU + SMALL PI
366 μπ → mb ; # GREEK SMALL LETTER MU + SMALL PI
367 Μ → M ; # GREEK CAPITAL LETTER MU
368 μ → m ; # GREEK SMALL LETTER MU
369 $wordBoundary{ΝΤ → D ; # GREEK CAPITAL LETTER NU + CAPITAL TAU
370 $wordBoundary{Ντ → D ; # GREEK CAPITAL LETTER NU + SMALL TAU
371 $wordBoundary{ντ → d ; # GREEK SMALL LETTER NU + SMALL TAU
372 ΝΤ → ND ; # GREEK CAPITAL LETTER NU + CAPITAL TAU
373 Ντ → Nd ; # GREEK CAPITAL LETTER NU + SMALL TAU
374 ντ → nd ; # GREEK SMALL LETTER NU + SMALL TAU
375 Ν → N ; # GREEK CAPITAL LETTER NU
376 ν → n ; # GREEK SMALL LETTER NU
377 Ξ → X ; # GREEK CAPITAL LETTER KSI
378 ξ → x ; # GREEK SMALL LETTER KSI
379 ΟΙ → OI ; # GREEK CAPITAL LETTER OMICRON + CAPITAL IOTA
380 Οι → Oi ; # GREEK CAPITAL LETTER OMICRON + SMALL IOTA
381 οι → oi ; # GREEK SMALL LETTER OMICRON + SMALL IOTA
382 ΟΥ → OU ; # GREEK CAPITAL LETTER OMICRON + CAPITAL UPSILON
383 Ου → Ou ; # GREEK CAPITAL LETTER OMICRON + SMALL UPSILON
384 ου → ou ; # GREEK SMALL LETTER OMICRON + SMALL UPSILON
385 Ο → O ; # GREEK CAPITAL LETTER OMICRON
386 ο → o ; # GREEK SMALL LETTER OMICRON
387 Ό → Ó ; # GREEK CAPITAL LETTER OMICRON WITH TONOS
388 ό → ó ; # GREEK SMALL LETTER OMICRON WITH TONOS
389 Π → P ; # GREEK CAPITAL LETTER PI
390 π → p ; # GREEK SMALL LETTER PI
391 Ρ → R ; # GREEK CAPITAL LETTER RHO
392 ρ → r ; # GREEK SMALL LETTER RHO
393 Σ → S ; # GREEK CAPITAL LETTER SIGMA
394 σ → s ; # GREEK SMALL LETTER SIGMA
395 ς → s ; # GREEK SMALL LETTER FINAL SIGMA
396 Τ → T ; # GREEK CAPITAL LETTER TAU
397 τ → t ; # GREEK SMALL LETTER TAU
398 #
399 #
400 ########################################################################
401 #
402 # End Rule 3.5
403 #
404 ########################################################################
405 #
406 Υ → I ; # GREEK CAPITAL LETTER UPSILON
407 υ → i ; # GREEK SMALL LETTER UPSILON
408 Ύ → Í ; # GREEK CAPITAL LETTER UPSILON WITH TONOS
409 ύ → í ; # GREEK SMALL LETTER UPSILON WITH TONOS
410 Φ → F ; # GREEK CAPITAL LETTER PHI
411 φ → f ; # GREEK SMALL LETTER PHI
412 Χ} $lower → Kh ; # GREEK CAPITAL LETTER CHI
413 Χ → KH ; # GREEK CAPITAL LETTER CHI
414 χ → kh ; # GREEK SMALL LETTER CHI
415 Ψ} $lower → Ps ; # GREEK CAPITAL LETTER PSI
416 Ψ → PS ; # GREEK CAPITAL LETTER PSI
417 ψ → ps ; # GREEK SMALL LETTER PSI
418 Ω → O ; # GREEK CAPITAL LETTER OMEGA
419 ω → o ; # GREEK SMALL LETTER OMEGA
420 Ώ → Ó ; # GREEK CAPITAL LETTER OMEGA WITH TONOS
421 ώ → ó ; # GREEK SMALL LETTER OMEGA WITH TONOS
422 #
423 #
424 ########################################################################
425