]> git.saurik.com Git - apple/icu.git/blame - icuSources/data/translit/el_el_Latn_BGN.txt
ICU-62107.0.1.tar.gz
[apple/icu.git] / icuSources / data / translit / el_el_Latn_BGN.txt
CommitLineData
f3c0d7a5
A
1# © 2016 and later: Unicode, Inc. and others.
2# License & terms of use: http://www.unicode.org/copyright.html#License
3#
2ca993e8 4# File: el_el_Latn_BGN.txt
f3c0d7a5 5# Generated from CLDR
729e4ab9 6#
2ca993e8
A
7
8#
9########################################################################
10# BGN/PCGN 1962 System
11#
12# This system is a simplified version of the system devised by the PCGN
13# in 1941 and later adopted by the BGN. In 1962 the two organizations
14# agreed to joint adoption of certain changes in the original system,
15# specifically the omission of special rules for the treatment of Greek
16# geographic names of Albanian, Bulgarian, Italian, Macedonian, and
17# Turkish origin. That revision eliminated the need to consider the
18# origin of names and removed ambiguity from the romanization of Greek
19# expressions of possible non-Greek origin. This system is based on
20# the pronunciation of modern Greek and is not intended for use in
21# the romanization of classical Greek.
22#
23# The Greek Alphabet as defined by the BGN (Pages 29-31):
24#
25# ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
26# αβγδεζηθικλμνξοπρσςτυφχψω
27#
28# Originally prepared by Michael Everson <everson@evertype.com>
29########################################################################
30#
31# MINIMAL FILTER: Greek-Latin
32#
729e4ab9
A
33:: [ΆΈΉΊΌΎΏΐΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩΪΫάέήίΰαβγδεζηθικλμνξοπρςστυφχψωϊϋόύώἀἁἂἃἄἅἆἇἈἉἊἋἌἍἎἏἐἑἒἓἔἕἘἙἚἛἜἝἠἡἢἣἤἥἦἧἨἩἪἫἬἭἮἯἰἱἲἳἴἵἶἷἸἹἺἻἼἽἾἿὀὁὂὃὄὅὈὉὊὋὌὍὐὑὒὓὔὕὖὗὙὛὝὟὠὡὢὣὤὥὦὧὨὩὫὬὭὮὯὰάὲέὴήὶίὸόὺύὼώᾀᾁᾂᾃᾄᾅᾆᾇᾈᾉᾊᾋᾌᾍᾎᾏᾐᾑᾒᾓᾔᾕᾖᾗᾘᾙᾚᾛᾜᾝᾞᾟᾠᾡᾢᾣᾤᾥᾦᾧᾨᾩᾪᾫᾬᾭᾮᾯᾲᾳᾴᾶᾷᾺΆᾼῂῃῄῆῇῈΈῊΉῌῖῚΊῤῥῦῪΎῲῳῴῶῷῸΌῺΏῼ῾] ;
34:: NFD (NFC) ;
2ca993e8
A
35#
36#
37########################################################################
38#
39########################################################################
40#
41# Define All Transformation Variables
42#
43########################################################################
44#
729e4ab9
A
45$upperConsonants = [ΒΓΔΖΘΚΛΜΝΞΠΡΣΤΦΧΨ] ;
46$lowerConsonants = [βγδζθκλμνξπρσςτφχψ] ;
47$consonants = [$upperConsonants $lowerConsonants] ;
48$upperVowels = [ΑΕΗΙΟΥΩ] ;
49$lowerVowels = [αεηιουω] ;
50$vowels = [$upperVowels $lowerVowels] ;
51$lower = [$lowerConsonants $lowerVowels] ;
2ca993e8
A
52#
53#
54# Use this $wordBoundary until bug 2034 is fixed in ICU:
55# http://bugs.icu-project.org/cgi-bin/icu-bugs/transliterate?id=2034;expression=boundary;user=guest
56#
51004dcb 57$wordBoundary = [^[:L:][:M:][:N:]] ;
2ca993e8
A
58#
59#
60########################################################################
61#
62########################################################################
63#
64# Rules moved to front to avoid masking
65#
66########################################################################
67#
68########################################################################
69#
70# BGN Page 32 Rule 1:
71#
72# The apostrophe and reversed apostrophe, on or the other of which is
73# written in Greek in front of all initial uppercase vowel characters,
74# above all initial lowercase vowel characters, and above the second
75# character of all initial two-vowel character sequences, should not
76# be romanized, e.g., Ἀθῆναι → Athínai, Ἠράκλειον → Iráklion,
77# Οἰνόφυτα → Oinófita. These apostrophes must be distinguished from
78# accent marks hen they occur together, e.g. Ἄβατον → Ávaton,
79# Ἤλια → Ília, Οἴτη → Oíti. The reversed apostrophe is sometimes found
80# also with ρ and should, likewise, not be romanized: ῥέμα → réma.
81#
82# BGN Page 32 Rule 2a:
83#
84# Stress is shown in Greek by the use of the tilde or circumflex,
85# the acute accent, or the grave accent; all of those marks should
86# be represented in romanization by an acute accent, e.g.,
87# Ἀθῆναι → Athínai, Νδία → Día, Ζεμενὸν → Zemenón.
88#
89# BGN Page 32 Rule 4:
90#
91# The character ι (ióta) is sometimes found written under, or,
92# in uppercase, to the right of the vowel characters α, η, and ω.
93# This "subscript iota" should not be romanized, e.g.,
94# Μυρτῷον Πέλαγος or ΜΥΡΤῼΟΝ ΠΕΛΑΓΟΣ [but not ΜΥΡΤΩΙΟΝ ΠΕΛΑΓΟΣ]
95# → Mirtóön Pélagos.
96#
97########################################################################
98#
729e4ab9
A
99[ἈἉᾼᾈᾉ] → Α ; # GREEK CAPITAL LETTER ALPHA
100[ἀἁᾳᾀᾁ] → α ; # GREEK SMALL LETTER ALPHA
101[ἊἋἌἍἎἏᾊᾋᾌᾍᾎᾏᾺΆ] → Ά ; # GREEK CAPITAL LETTER ALPHA WITH TONOS
102[ἂἃἄἅἆἇὰάᾂᾃᾄᾅᾆᾇᾲᾴᾶᾷ] → ά ; # GREEK SMALL LETTER ALPHA WITH TONOS
103[ἘἙ] → Ε ; # GREEK CAPITAL LETTER EPSILON
104[ἐἑὲέ] → ε ; # GREEK SMALL LETTER EPSILON
105[ἚἛἜἝῈΈ] → Έ ; # GREEK CAPITAL LETTER EPSILON WITH TONOS
106[ἒἓἔἕ] → έ ; # GREEK SMALL LETTER EPSILON WITH TONOS
107[ἨἩᾘᾙῌ] → Η ; # GREEK CAPITAL LETTER ETA
108[ἠἡᾐᾑῃ] → η ; # GREEK SMALL LETTER ETA
109[ἪἫἬἭἮἯᾚᾛᾜᾝᾞᾟῊΉ] → Ή ; # GREEK CAPITAL LETTER ETA WITH TONOS
110[ἢἣἤἥἦἧὴήᾒᾓᾔᾕᾖᾗῂῄῆῇ] → ή ; # GREEK SMALL LETTER ETA WITH TONOS
111[ἸἹ] → Ι ; # GREEK CAPITAL LETTER IOTA
112[ἰἱ] → ι ; # GREEK SMALL LETTER IOTA
113[ἺἻἼἽἾἿῚΊ] → Ί ; # GREEK CAPITAL LETTER IOTA WITH TONOS
114[ἲἳἴἵἶἷὶίῖ] → ί ; # GREEK SMALL LETTER IOTA WITH TONOS
115[ὈὉ] → Ο ; # GREEK CAPITAL LETTER OMICRON
116[ὀὁ] → ο ; # GREEK SMALL LETTER OMICRON
117[ὊὋὌὍῸΌ] → Ό ; # GREEK CAPITAL LETTER OMICRON WITH TONOS
118[ὂὃὄὅὸό] → ό ; # GREEK SMALL LETTER OMICRON WITH TONOS
119Ὑ → Υ ; # GREEK CAPITAL LETTER UPSILON
120[ὐὑ] → υ ; # GREEK SMALL LETTER UPSILON
121[ὛὝὟῪΎ] → Ύ ; # GREEK CAPITAL LETTER UPSILON WITH TONOS
122[ὒὓὔὕὖὗὺύῦ] → ύ ; # GREEK SMALL LETTER UPSILON WITH TONOS
123[ὨὩᾨᾩῼ] → Ω ; # GREEK CAPITAL LETTER OMEGA
124[ὠὡᾠᾡῳ] → ω ; # GREEK SMALL LETTER OMEGA
125[ὬὫὬὭὮὯᾪᾫᾬᾭᾮᾯῺΏ] → Ώ ; # GREEK CAPITAL LETTER OMEGA WITH TONOS
126[ὢὣὤὥὦὧὼώᾢᾣᾤᾥᾦᾧῲῴῶῷ] → ώ ; # GREEK SMALL LETTER OMEGA WITH TONOS
127Ῥ → Ρ ; # GREEK CAPITAL LETTER RHO
128[ῤῥ] → ρ ; # GREEK SMALL LETTER RHO
2ca993e8
A
129#
130#
131########################################################################
132#
133# End of Rules 1, 2a, and 4
134#
135########################################################################
136#
137########################################################################
138#
139# BGN Page 32 Rules 2b and 2c:
140#
141# If the stressed vowel is written as a sequence of two vowel characters
142# in Greek, the # second vowel character should carry the accent;
143# similarly, in Romanization the acute accent should be placed over the
144# second vowel letter, e.g., Οἰνοῦσαι → Oinoúsai, Οἴτη → Oíti,
145# Θεσπιαὶ → Thespiaí.
146#
147# Where a syllable containing on the combinations αυ, ευ, or ηυ
148# carries the stress, this is marked in Greek on the character υ.
149# In romanization it should be shown on the preceding vowel
150# letter, e.g., Πειραιεύς → Piraiévs, Αὔρα → Ávra.
151#
729e4ab9
A
152Αί → Aí ;
153αί → aí ;
154Οί → Oí ;
155οί → Oí ;
156Ού → Oú ;
157ού → oú ;
158Αύ → Άυ ;
159αύ → άυ ;
160Εύ → Έυ ;
161εύ → έυ ;
162Ηύ → Ήυ ;
163ηύ → ήυ ;
2ca993e8
A
164#
165#
166########################################################################
167#
168# End of Rules 2b and 2c
169#
170########################################################################
171#
172########################################################################
173#
174# BGN Page 32 Rule 3:
175#
176# The dieresis should be shown in romanization where it occurs in Greek,
177# e.g., Μαρινέϊκα → Marinéïka, Ἀχαΐα → Akhaï\u0301a; and over the second vowel
178# etter in romanization of the following combinations fo Greek vowel
179# characters: αε, e.g., Ἀετὸς → Aëtos; αη, e.g., Ἀηδὼν → Aïdhon; οη,
180# e.g. Οἰνόη → Oinóï; ωο, e.g., Ἠρῶον → Iróön.
181#
729e4ab9
A
182[ΪΫ] → Ï ;
183[ϊϋ] → ï ;
184[ΐΰ] → ï\u0301 ;
185Αε → Aë ;
186αε → aë ;
187Αη → Aï ;
188αη → aï ;
189Οη → Oï ;
190οη → oï ;
191Ωο → Oö ;
192ωο → oö ;
193Άε → Áë ;
194άε → áë ;
195Άη → Áï ;
196άη → áï ;
197Όη → Óï ;
198όη → óï ;
199Ώο → Óö ;
200ώο → óö ;
2ca993e8
A
201#
202#
203########################################################################
204#
205# End of Rule 3
206#
207########################################################################
208#
209########################################################################
210#
211# Start of Alphabetic Transformations
212#
213########################################################################
214#
729e4ab9
A
215ΑΙ → AI ; # GREEK CAPITAL LETTER ALPHA + CAPITAL IOTA
216Αι → Ai ; # GREEK CAPITAL LETTER ALPHA + SMALL IOTA
217αι → ai ; # GREEK SMALL LETTER ALPHA + SMALL IOTA
218ΑΥ → AV ; # GREEK CAPITAL LETTER ALPHA + CAPITAL UPSILON
219Αυ → Av ; # GREEK CAPITAL LETTER ALPHA + SMALL UPSILON
220αυ → av ; # GREEK SMALL LETTER ALPHA + SMALL UPSILON
221Α → A ; # GREEK CAPITAL LETTER ALPHA
222α → a ; # GREEK SMALL LETTER ALPHA
223Ά → Á ; # GREEK CAPITAL LETTER ALPHA WITH TONOS
224ά → á ; # GREEK SMALL LETTER ALPHA WITH TONOS
225Β → V ; # GREEK CAPITAL LETTER BETA
226β → v ; # GREEK SMALL LETTER BETA
227ΓΓ → NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL GAMMA
228Γγ → Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL GAMMA
229γγ → ng ; # GREEK SMALL LETTER GAMMA + SMALL GAMMA
230$wordBoundary{ΓΚ → G ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA
231$wordBoundary{Γκ → G ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA
232$wordBoundary{γκ → g ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA
233ΓΚ → NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA
234Γκ → Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA
235γκ → ng ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA
2ca993e8
A
236#
237#
238########################################################################
239#
240# BGN Page 29 Rule 3a:
241#
242# The character γ should be romanized g before α, ο, ου, ω, and
243# consonants other than γ, ξ, and χ.
244#
245########################################################################
246#
729e4ab9
A
247Γ}[ΑΟΩ [$upperConsonants - [ΓΞΧ]]] → G ; # GREEK CAPITAL LETTER GAMMA
248Γ}[αοω [$lowerConsonants - [γξχ]]] → G ; # GREEK CAPITAL LETTER GAMMA
249Γ}ΟΥ → G ; # GREEK CAPITAL LETTER GAMMA
250Γ}ου → G ; # GREEK CAPITAL LETTER GAMMA
251γ}[αοω [$lowerConsonants - [γξχ]]] → g ; # GREEK SMALL LETTER GAMMA
252γ}ου → g ; # GREEK SMALL LETTER GAMMA
2ca993e8
A
253#
254#
255########################################################################
256#
257# End of Rule 3a
258#
259########################################################################
260#
261########################################################################
262#
263# BGN Page 29 Rule 3b:
264#
265# The character γ should be romanized y before αι, ε, ει, η, ι, οι, υ,
266# and υι.
267#
268########################################################################
269#
729e4ab9 270Γ}[ΑΕΟΥ]Ι → Y ; # GREEK CAPITAL LETTER GAMMA
51004dcb 271Γ}[ΕΗΙΥ] → Y ; # GREEK CAPITAL LETTER GAMMA
729e4ab9 272Γ}[αεου]ι → Y ; # GREEK CAPITAL LETTER GAMMA
51004dcb 273Γ}[εηιυ] → Y ; # GREEK CAPITAL LETTER GAMMA
729e4ab9 274γ}[αεου]ι → y ; # GREEK SMALL LETTER GAMMA
51004dcb 275γ}[εηιυ] → y ; # GREEK SMALL LETTER GAMMA
2ca993e8
A
276#
277#
278########################################################################
279#
280# End of Rule 3b
281#
282########################################################################
283#
284########################################################################
285#
286# BGN Page 29 Rule 3c:
287#
288# The character γ should be romanized n before ξ and χ.
289#
290########################################################################
291#
729e4ab9
A
292Γ}[ΞΧ] → N ; # GREEK CAPITAL LETTER GAMMA
293Γ}[ξχ] → N ; # GREEK CAPITAL LETTER GAMMA
294γ}[ξχ] → n ; # GREEK SMALL LETTER GAMMA
2ca993e8
A
295#
296#
297########################################################################
298#
299# End of Rule 3c
300#
301########################################################################
302#
729e4ab9
A
303Γ → G ; # GREEK CAPITAL LETTER GAMMA
304γ → g ; # GREEK SMALL LETTER GAMMA
2ca993e8
A
305#
306#
307########################################################################
308#
309# BGN Page 29 Rule 4a:
310#
311# The character δ should be romanized d when between ν and ρ.
312#
313########################################################################
314#
729e4ab9
A
315Ν{Δ}Ρ → D ; # GREEK CAPITAL LETTER DELTA
316ν{δ}ρ → d ; # GREEK SMALL LETTER GAMMA
2ca993e8
A
317#
318#
319########################################################################
320#
321# End of Rule 4a
322#
323########################################################################
324#
729e4ab9
A
325Δ} $lower → Dh ; # GREEK CAPITAL LETTER PSI
326Δ → DH ; # GREEK CAPITAL LETTER DELTA
327δ → dh ; # GREEK SMALL LETTER DELTA
328ΕΙ → I ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA
329Ει → I ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA
330ει → i ; # GREEK SMALL LETTER EPSILON + SMALL IOTA
331ΕΪ → EÏ ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA DIAERESIS
332Εϊ → Eï ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA DIAERESIS
333εϊ → eï ; # GREEK SMALL LETTER EPSILON + SMALL IOTA DIAERESIS
334ΕΥ → EV ; # GREEK CAPITAL LETTER EPSILON + CAPITAL UPSILON
335Ευ → Ev ; # GREEK CAPITAL LETTER EPSILON + SMALL UPSILON
336ευ → ev ; # GREEK SMALL LETTER EPSILON + SMALL UPSILON
337Ε → E ; # GREEK CAPITAL LETTER EPSILON
338ε → e ; # GREEK SMALL LETTER EPSILON
339Έ → É ; # GREEK CAPITAL LETTER EPSILON WITH TONOS
340έ → é ; # GREEK SMALL LETTER EPSILON WITH TONOS
341Ζ → Z ; # GREEK CAPITAL LETTER ZETA
342ζ → z ; # GREEK SMALL LETTER ZETA
343ΗΥ → IV ; # GREEK CAPITAL LETTER ALPHA + CAPITAL UPSILON
344Ηυ → Iv ; # GREEK CAPITAL LETTER ALPHA + SMALL UPSILON
345ηυ → iv ; # GREEK SMALL LETTER ALPHA + SMALL UPSILON
346Η → I ; # GREEK CAPITAL LETTER ETA
347η → i ; # GREEK SMALL LETTER ETA
348Ή → Í ; # GREEK CAPITAL LETTER ETA WITH TONOS
349ή → í ; # GREEK SMALL LETTER ETA WITH TONOS
350Θ} $lower → Th ; # GREEK CAPITAL LETTER THETA
351Θ → TH ; # GREEK CAPITAL LETTER THETA
352θ → th ; # GREEK SMALL LETTER THETA
353Ι → I ; # GREEK CAPITAL LETTER IOTA
354ι → i ; # GREEK SMALL LETTER IOTA
355Ί → Í ; # GREEK CAPITAL LETTER IOTA WITH TONOS
356ί → í ; # GREEK SMALL LETTER IOTA WITH TONOS
357Κ → K ; # GREEK CAPITAL LETTER KAPPA
358κ → k ; # GREEK SMALL LETTER KAPPA
359Λ → L ; # GREEK CAPITAL LETTER LAMDA
360λ → l ; # GREEK SMALL LETTER LAMDA
361$wordBoundary{ΜΠ → B ; # GREEK CAPITAL LETTER MU + CAPITAL PI
362$wordBoundary{Μπ → B ; # GREEK CAPITAL LETTER MU + SMALL PI
363$wordBoundary{μπ → b ; # GREEK SMALL LETTER MU + SMALL PI
364ΜΠ → MB ; # GREEK CAPITAL LETTER MU + CAPITAL PI
365Μπ → Mb ; # GREEK CAPITAL LETTER MU + SMALL PI
366μπ → mb ; # GREEK SMALL LETTER MU + SMALL PI
367Μ → M ; # GREEK CAPITAL LETTER MU
368μ → m ; # GREEK SMALL LETTER MU
369$wordBoundary{ΝΤ → D ; # GREEK CAPITAL LETTER NU + CAPITAL TAU
370$wordBoundary{Ντ → D ; # GREEK CAPITAL LETTER NU + SMALL TAU
371$wordBoundary{ντ → d ; # GREEK SMALL LETTER NU + SMALL TAU
372ΝΤ → ND ; # GREEK CAPITAL LETTER NU + CAPITAL TAU
373Ντ → Nd ; # GREEK CAPITAL LETTER NU + SMALL TAU
374ντ → nd ; # GREEK SMALL LETTER NU + SMALL TAU
375Ν → N ; # GREEK CAPITAL LETTER NU
376ν → n ; # GREEK SMALL LETTER NU
377Ξ → X ; # GREEK CAPITAL LETTER KSI
378ξ → x ; # GREEK SMALL LETTER KSI
379ΟΙ → OI ; # GREEK CAPITAL LETTER OMICRON + CAPITAL IOTA
380Οι → Oi ; # GREEK CAPITAL LETTER OMICRON + SMALL IOTA
381οι → oi ; # GREEK SMALL LETTER OMICRON + SMALL IOTA
382ΟΥ → OU ; # GREEK CAPITAL LETTER OMICRON + CAPITAL UPSILON
383Ου → Ou ; # GREEK CAPITAL LETTER OMICRON + SMALL UPSILON
384ου → ou ; # GREEK SMALL LETTER OMICRON + SMALL UPSILON
385Ο → O ; # GREEK CAPITAL LETTER OMICRON
386ο → o ; # GREEK SMALL LETTER OMICRON
387Ό → Ó ; # GREEK CAPITAL LETTER OMICRON WITH TONOS
388ό → ó ; # GREEK SMALL LETTER OMICRON WITH TONOS
389Π → P ; # GREEK CAPITAL LETTER PI
390π → p ; # GREEK SMALL LETTER PI
391Ρ → R ; # GREEK CAPITAL LETTER RHO
392ρ → r ; # GREEK SMALL LETTER RHO
393Σ → S ; # GREEK CAPITAL LETTER SIGMA
394σ → s ; # GREEK SMALL LETTER SIGMA
395ς → s ; # GREEK SMALL LETTER FINAL SIGMA
396Τ → T ; # GREEK CAPITAL LETTER TAU
397τ → t ; # GREEK SMALL LETTER TAU
2ca993e8
A
398#
399#
400########################################################################
401#
402# End Rule 3.5
403#
404########################################################################
405#
729e4ab9
A
406Υ → I ; # GREEK CAPITAL LETTER UPSILON
407υ → i ; # GREEK SMALL LETTER UPSILON
408Ύ → Í ; # GREEK CAPITAL LETTER UPSILON WITH TONOS
409ύ → í ; # GREEK SMALL LETTER UPSILON WITH TONOS
410Φ → F ; # GREEK CAPITAL LETTER PHI
411φ → f ; # GREEK SMALL LETTER PHI
412Χ} $lower → Kh ; # GREEK CAPITAL LETTER CHI
413Χ → KH ; # GREEK CAPITAL LETTER CHI
414χ → kh ; # GREEK SMALL LETTER CHI
415Ψ} $lower → Ps ; # GREEK CAPITAL LETTER PSI
416Ψ → PS ; # GREEK CAPITAL LETTER PSI
417ψ → ps ; # GREEK SMALL LETTER PSI
418Ω → O ; # GREEK CAPITAL LETTER OMEGA
419ω → o ; # GREEK SMALL LETTER OMEGA
420Ώ → Ó ; # GREEK CAPITAL LETTER OMEGA WITH TONOS
421ώ → ó ; # GREEK SMALL LETTER OMEGA WITH TONOS
2ca993e8
A
422#
423#
424########################################################################
425