Use accesskey "n" to jump to the internal navigation links at any point. Right now you can

 
r12a >> docs

Mongolian

Cyrillic orthography notes

Updated 15 December, 2024 • recent changes scripts/cyrl/mn • leave a comment

This page brings together basic information about the Cyrillic script and its use for the Mongolian language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Mongolian using Unicode.

Referencing this document

Richard Ishida, Mongolian (Cyrillic) Orthography Notes, 15-Dec-2024, https://r12a.github.io/scripts/cyrl/mn

 

Click to toggle Table of Contents.

Phonological transcriptions should be treated as a guide, only. They are taken from the sources consulted, and may be narrow or broad, phonemic or phonetic, depending on what is available. They mostly represent pronunciation of words in isolation. For more detailed information about allophones, alternations, sandhi, dialectal differences, and so on, follow the links to cited references.

This is an interactive document. Click/tap on the following to reveal detailed information and examples for each character: (a) coloured characters in examples and lists; (b) link text on character names. If your browser supports it, your cursor will change to look like as you hover over these items.

More about using this page

Character names. The names of characters in codepoint markup drop the initial CYRILLIC label (purely to reduce the length of the examples). In other places the full name can be found.

Navigation. The Toggle images icon opens the table of contents in a popup window. Dismiss it by clicking on the X alongside it, or by hitting the ESC key.

Detailed character notes. Clicking on coloured characters in lists or on character names opens panels that give detailed information about each character. This information is taken from the companion document, Cyrillic Character Notes. (Those panels can be dismissed by pressing on the ESC key.)

Transcriptions & transliterations. Phonological transcriptions are surrounded by ⌈corner brackets⌋, to indicate that they vary between narrow, [phonetic] and broad, /phonemic/ transcriptions.
Latin transcriptions between <angle brackets>, represent the letters as commonly written in the Latin script.
A transliteration has also been developed especially for this orthography, and is generally based on the sound of a letter where possible, but where a letter has multiple pronunciations, the transliteration represents only one.
Transliterations provide perfect round-trip conversion between the native script and Latin, whereas Latin transcriptions rarely do.
When you click on an example to see its composition, the top of the panel that opens contains a transliteration, followed by the native text, then (if available) an IPA transcription.

Copied !
TOC.
Accessibility settings
ˇ

Languages using the Cyrillic scriptMongolian pickerTerms listCharacter notesCyrillic linksOther orthography notes

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size:   24px

Хүн бүр төрж мэндлэхэд эрх чөлөөтэй, адилхан нэр төртэй, ижил эрхтэй байдаг. Оюун ухаан, нандин чанар заяасан хүн гэгч өөр хоорондоо ахан дүүгийн үзэл санаагаар харьцах учиртай.

Хүн бүр энэ Тунхаглалд заасан бүхий л эрх, эрх чөлөөг ямар ч ялгаваргүйгээр, яс үндэс, арьс өнгө, хүйс, хэл, шашин шүтлэг, улс төрийн болон бусад үзэл бодол, үндэсний буюу нийгмийн гарал, эд хөрөнгийн байдал, язгуур угсаа, бусад ялгааг эс харгалзан адилхан эдлэх ёстой. Мөн түүнчлэн тухайн хүний харъяалдаг улс орон буюу нутаг дэвсгэрийн улс төр, эрх зүйн буюу олон улсын статус ямар ч байлаа гэсэн, тэрхүү нутаг дэвсгэр нь тусгаар тогтносон, бусдын асрамжид байгаа, өөртөө захиргаагүй буюу бүрэн эрхт байдал нь өөр ямар ч байдлаар хязгаарлагдмал байсан, хүнийг ялгаварлаж үл болно.

Source: Unicode UDHR, articles 1 & 2

Usage & history

Origins of the Latin script, 7thC – today.

Phoenician

└ Greek

└ Old Italic

└ Cyrillic

+ Glagolitic

+ Latin

+ Armenian

+ Georgian

+ Coptic

+ Runes

The Mongolian Cyrillic alphabet is used for the standard dialect of the Mongolian language in the modern state of Mongolia. Ethnologue lists 2,640,000 native speakers of Halh Mongolian, but Wikipedia list 5.2 million speakers across all dialects, including the Inner Mongolia Autonomous Region of the People's Republic of China. Cyrillic has not been adopted as the writing system in the Inner Mongolia region of China, which continues to use the traditional Mongolian script. In Mongolia, the Halh (or Khalkha) dialect is predominant.

Монгол Кирилл үсэг mongol kirill üseg Mongolian cyrillic alphabet Кирилл цагаан толгой kirill tsagaan tolgoi Mongolian cyrillic alphabet

In the Mongolian People's Republic (Outer Mongolia), the traditional script was replaced by a Cyrillic orthography since the early 1940s, as a result of the spreading of Russian influence following the expansion of Russian Empire and the subsequent Soviet Union. Its introduction is credited with an increase in the literacy rate from 17.3% to 73.5% between 1941 and 1950.8

Source: Wikipedia.

Script codecyrl
Language codekhk
Script typealphabet
Origineasia
Native speakers2,640,000
  
Total characters75
Letters67
Combining marks2
Symbols1
Punctuation5
Possible other19
Unicode blocks6
  
Character counts above are for this
orthography but exclude ASCII.
  
Text directionltr
Post-consonant vowelsletters
Standalone vowelsletters
Case distinctionyes
Cursive scriptno
Combining marksno
Clusters markedno
Other ligaturesno
Word separatorspace
Wraps atword
Hyphenationyes -
G Clusters OK?yes
Justificationspaces
Baselineromn

Basic features

Cyrillic is an alphabet. Letters typically represent a consonant or vowel sound. See the table to the right for a brief overview of features for the Mongolian language.

Cyrillic Mongolian text runs left-to-right in horizontal lines. Words are separated by spaces.

The script is bicameral. The shapes of the upper and lowercase forms are typically the same. There can be a significant difference, however, between regular and cursive/italic shapes for the same character.

Normal text contains no combining marks (and decomposed text contains only 2). The visual forms of letters don't usually interact.

❯ Consonant summary table

Mongolian has 21 basic consonant letters, including 3 for writing sounds from foreign loan words, and one of which is not used in uppercase. The letter inventory also includes a hard sign and a soft sign.

❯ Vowel summary table

The orthography is an alphabet that writes vowels using 16 vowel letters (30 in total, because 2 are only used in lowercase), including 4 ioticised vowels which may also indicate palatalisation of the previous consonant. Long vowels are indicated by doubling the vowel letters. A number of diphthongs are written using the semi-vowel letter йU+0439 SMALL LETTER SHORT I.

Vowel reduction is a significant feature of Mongolian. Non-initial short vowels are reduced to vestiges or to zero, and non-initial long vowels in the orthography are reduced to short vowel length.

Vowel harmony is another key feature, grouping vowels in a way that indicates a front or back position for the tongue root (ATR).

There are no special mechanisms to represent standalone vowels. Combining marks are normally not used, and only occur in decomposed text.

Text is generally wrapped at word boundaries, and justification predominantly stretches the spaces between words.

Numbers use ASCII digits.

Character index

The index points to locations where a character is mentioned in this page, and indicates whether it is used by the Cyrillic orthography described here.

Manage characters.

Click on the image to the left to view all the 'main' and 'infrequent' characters in the index in various groupings or open related apps.

Letters

Show

Basic consonants

list all 21
б0431
CYRILLIC SMALL LETTER BEplosive b b
в0432
CYRILLIC SMALL LETTER VEfricative w̜ v f v
г0433
CYRILLIC SMALL LETTER GHEplosive ɡ ɣ ɢ g
д0434
CYRILLIC SMALL LETTER DEplosive d d
ж0436
CYRILLIC SMALL LETTER ZHEaffricate d͡ʒ j
з0437
CYRILLIC SMALL LETTER ZEaffricate d͡z z
к043A
(loan)    CYRILLIC SMALL LETTER KAplosive/fricative Used for transliteration. k x k
л043B
CYRILLIC SMALL LETTER ELapproximant ɮ l
м043C
CYRILLIC SMALL LETTER EMnasal m m
н043D
CYRILLIC SMALL LETTER ENnasal n ŋ n
п043F
(loan)    CYRILLIC SMALL LETTER PEplosive Used mostly for foreign loan words. p p
р0440
CYRILLIC SMALL LETTER ERapproximant r r
с0441
CYRILLIC SMALL LETTER ESfricative s s
т0442
CYRILLIC SMALL LETTER TEplosive t t
ф0444
(loan)    CYRILLIC SMALL LETTER EFfricative Used for transliteration. f pʰ f
х0445
CYRILLIC SMALL LETTER HAfricative x kh
ц0446
CYRILLIC SMALL LETTER TSEaffricate t͡s ts
ч0447
CYRILLIC SMALL LETTER CHEaffricate t͡ʃ ch
ш0448
CYRILLIC SMALL LETTER SHAfricative ʃ sh
щ0449
(loan)    CYRILLIC SMALL LETTER SHCHAfricative Used for Russian words only. ʃ sh
ы044B
CYRILLIC SMALL LETTER YERUvowel Only used in suffixes for masculine words (a, o, y). No uppercase. Not word-initial. i y
list all 20
Б0411
CYRILLIC CAPITAL LETTER BEplosive b B
В0412
CYRILLIC CAPITAL LETTER VEfricative w̜ v f V
Г0413
CYRILLIC CAPITAL LETTER GHEplosive ɡ ɣ ɢ G
Д0414
CYRILLIC CAPITAL LETTER DEplosive d D
Ж0416
CYRILLIC CAPITAL LETTER ZHEaffricate d͡ʒ J
З0417
CYRILLIC CAPITAL LETTER ZEaffricate d͡z Z
К041A
(loan)    CYRILLIC CAPITAL LETTER KAplosive/fricative Used for transliteration. k x K
Л041B
CYRILLIC CAPITAL LETTER ELapproximant ɮ L
М041C
CYRILLIC CAPITAL LETTER EMnasal m M
Н041D
CYRILLIC CAPITAL LETTER ENnasal n ŋ N
П041F
(loan)    CYRILLIC CAPITAL LETTER PEplosive Used mostly for foreign loan words. p P
Р0420
CYRILLIC CAPITAL LETTER ERapproximant r R
С0421
CYRILLIC CAPITAL LETTER ESfricative s S
Т0422
CYRILLIC CAPITAL LETTER TEplosive t T
Ф0424
(loan)    CYRILLIC CAPITAL LETTER EFfricative Used for transliteration. f pʰ F
Х0425
CYRILLIC CAPITAL LETTER HAfricative x KH
Ц0426
CYRILLIC CAPITAL LETTER TSEaffricate t͡s TS
Ч0427
CYRILLIC CAPITAL LETTER CHEaffricate t͡ʃ CH
Ш0428
CYRILLIC CAPITAL LETTER SHAfricative ʃ SH
Щ0429
(loan)    CYRILLIC CAPITAL LETTER SHCHAfricative Used for Russian words only. ʃ SH

Vowels

list all 12
а0430
CYRILLIC SMALL LETTER Aback (-ATR) vowel a a
е0435
CYRILLIC SMALL LETTER IEioticised vowel ji je ye
и0438
CYRILLIC SMALL LETTER Ineutral vowel i i
й0439
CYRILLIC SMALL LETTER SHORT Iglide Used for diphthongs. i
о043E
CYRILLIC SMALL LETTER Oback (-ATR) vowel ɔ o
у0443
CYRILLIC SMALL LETTER Uback (-ATR) vowel ʊ u
э044D
CYRILLIC SMALL LETTER Efront (+ATR) vowel e i e
ю044E
CYRILLIC SMALL LETTER YUioticised vowel jʊ ju yu
я044F
CYRILLIC SMALL LETTER YAioticised vowel ja ya
ё0451
CYRILLIC SMALL LETTER IOioticised vowel yo
ү04AF
CYRILLIC SMALL LETTER STRAIGHT Ufront (+ATR) vowel u ü
ө04E9
CYRILLIC SMALL LETTER BARRED Ofront (+ATR) vowel ɵ o ö
list all 12
Ё0401
CYRILLIC CAPITAL LETTER IOioticised vowel yo
А0410
CYRILLIC CAPITAL LETTER Aback (-ATR) vowel a a
Е0415
CYRILLIC CAPITAL LETTER IEioticised vowel ye
И0418
CYRILLIC CAPITAL LETTER Ineutral vowel i i
Й0419
CYRILLIC CAPITAL LETTER SHORT Iglide i
О041E
CYRILLIC CAPITAL LETTER Oback (-ATR) vowel ɔ o
У0423
CYRILLIC CAPITAL LETTER Uback (-ATR) vowel ʊ u
Э042D
CYRILLIC CAPITAL LETTER Efront (+ATR) vowel e i e
Ю042E
CYRILLIC CAPITAL LETTER YUioticised vowel jʊ ju yu
Я042F
CYRILLIC CAPITAL LETTER YAioticised vowel ja ya
Ү04AE
CYRILLIC CAPITAL LETTER STRAIGHT Ufront (+ATR) vowel u ü
Ө04E8
CYRILLIC CAPITAL LETTER BARRED Ofront (+ATR) vowel ɵ o ö

Other

list both
ъ044A
CYRILLIC SMALL LETTER HARD SIGN "
ь044C
CYRILLIC SMALL LETTER SOFT SIGNshort vowel / palatalisation marker ĭ ʲ i

Combining marks

Show
list both
̆0306
(rare)    COMBINING BREVEdiacritic Found in decomposed text only.
̈0308
(rare)    COMBINING DIAERESISdiacritic Found in decomposed text only.

Punctuation

Show
list all 7
«00AB
LEFT-POINTING DOUBLE ANGLE QUOTATION MARKquotation mark
»00BB
RIGHT-POINTING DOUBLE ANGLE QUOTATION MARKquotation mark
201E
DOUBLE LOW-9 QUOTATION MARKquotation mark
201C
LEFT DOUBLE QUOTATION MARKquotation mark
2014
(tbc)    EM DASHem dash
2026
HORIZONTAL ELLIPSISellipsis

ASCII

list all 8
,002C
COMMAcomma ,
;003B
SEMICOLONsemicolon ;
:003A
COLONcolon :
.002E
FULL STOPfull stop .
?003F
QUESTION MARKquestion mark ?
!0021
EXCLAMATION MARKexclamation mark !
(0028
LEFT PARENTHESISparenthesis (
)0029
RIGHT PARENTHESISparenthesis )

Symbols

Show
list
20AE
TUGRIK SIGNcurrency symbol

Other

Show

To be investigated

list all 19
%0025
(tbc)    PERCENT SIGNpercentage mark
[005B
(tbc)    LEFT SQUARE BRACKETbracket [
]005D
(tbc)    RIGHT SQUARE BRACKETbracket ]
§00A7
(tbc)    SECTION SIGNsection sign §
ʼ02BC
(tbc)    MODIFIER LETTER APOSTROPHEapostrophe ʼ
Ы042B
(unused)    CYRILLIC CAPITAL LETTER YERUvowel i y
2011
(tbc)    NON-BREAKING HYPHENnon-breaking hyphen
2013
(tbc)    EN DASHen dash
2014
(tbc)    EM DASHem dash
2018
(tbc)    LEFT SINGLE QUOTATION MARKquotation mark
201A
(tbc)    U+201A SINGLE LOW-9 QUOTATION MARKquotation mark
2020
(tbc)    DAGGERdagger
2021
(tbc)    DOUBLE DAGGERdouble dagger
2030
(tbc)    PER MILLE SIGNper mille mark
2032
(tbc)    PRIMEprime
2033
(tbc)    DOUBLE PRIMEdouble prime
2039
(tbc)    LEFT SINGLE QUOTATION MARKquotation mark
203A
(tbc)    RIGHT SINGLE QUOTATION MARKquotation mark
2116
(tbc)    NUMERO SIGNnumber symbol

Phonology

These are the sounds of Halh (or Khalkha) Mongolian.

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Vowel sounds

Plain vowels

i u ʊ ʊː e ɵ ɵ ɔ ɔː a

Diphthongs

ui ʊi ɔi ai

A significant feature of Mongolian phonology is that vowel sounds are divided into front (+ATR), back (-ATR), and neutral groups (see Vowel harmony). The front and back distinction has to do with the position of the tongue root (ATR means Advanced Tongue Root). The phonology is more complicated, and sounds are somewhat more fluid than described here. See the sources for more detailed information.

Consonant sounds

labial alveolar post-
alveolar
palatal velar uvular glottal
stop p b d t     k ɡ ɢ  
affricate   t͡s d͡z t͡ʃ d͡ʒ        
fricative f s
ɮ
ʃ   x    
nasal m n     ŋ  
approximant w     j    
trill/flap   r    

Some phonological transcriptions use t and where others use d and t for the same sounds, respectively. Similar contrasts are applied to the bilabial and affricate pairs in the repertoire (but not to the k/g pairing). Here we use the latter, partly because it is probably better indicative to the non-expert of the approximate sounds involved, and also because that corresponds with the Cyrillic letters used.

Other sources also indicate palatised versions of most consonants (eg. and ) in a table such as this, but they are not shown here. Palatalisation appears to be restricted to words containing -ATR (back) vowels8.

Tone

Mongolian is not a tonal language.

Structure

The basic unit of text is a word, however words can contain prefixes and suffixes.

Syllables tend to follow the pattern:

(C)V(V)(C)(C)(C)

Long vowels only occur in initial syllables. Mongolian has a strong tendency to reduce non-initial short vowels, either to epenthetic remnants or to zero. Non-initial vowels written as long are pronounced with normal length. See Vowel stress & reduction.

Vowels

The orthography is an alphabet that writes vowels using 16 vowel letters (30 in total, because 2 are only used in lowercase), including 4 ioticised vowels which may also indicate palatalisation of the previous consonant. Long vowels are indicated by doubling the vowel letters. A number of diphthongs are written using the semi-vowel letter йU+0439 SMALL LETTER SHORT I.

Vowel reduction is a significant feature of Mongolian. Non-initial short vowels are reduced to vestiges or to zero, and non-initial long vowels in the orthography are reduced to short vowel length.

Vowel harmony is another key feature, grouping vowels in a way that indicates a front or back position for the tongue root (ATR).

There are no special mechanisms to represent standalone vowels. Combining marks are normally not used, and only occur in decomposed text.

Vowel summary table

The following table summarises the main vowel to character assigments.

These are nominal pronunciations that don't take into account vowel harmony or vowel reduction. The vowels with IPA beginning j.. have transcriptions for standalone contexts; after a consonant the j generally transmutes to ʲ in the sense that it palatalises the consonant.

Neutral:

5
iиii0438
ийii 0438
0439
йiĭ0439
ʲьiʲ044C
iыyɪ044B

4
iИ iI0418
ИЙ ii 0418
0419
Й i0419
ʲЫunusedy042B
ATR+:

9
eэee044D
ээeeee044D
044D
ɵ oөöɵ04E9
ɵːөөööɵɵ04E9
04E9
uүüu04AF
үүüü 04AF
04AF
  
еyejᵉ0435
juюyujᵘ044E
juːюүyüüjᵘu044E
04AF

9
eЭeE042D
ЭЭeeEE042D
042D
ɵ oӨöƟ04E8
ɵːӨӨööƟƟ04E8
04E8
uҮüU04AE
ҮҮüü 04AE
04AE
  
ЕyeJᵉ0415
juЮyuJᵘ042E
juːЮҮyüüJᵘU042E
04AE
ATR-:

12
aаaa0430
ааaaaa0430
0430
ɔоoɔ043E
ɔːооooɔɔ043E
043E
ʊуuʊ0443
ʊːууuu 0443
0443
  
jaяyajᵃ044F
jaːяаyaajᵃa044F
0430
ёёyojᵒ0451
ёoёоyoojᵒɔ0451
043E
юyujᵘ044E
jʊːюуyuujᵘʊ044E
0443

12
aАaA0410
ААaaAA0410
0410
ɔОoƆ041E
ɔːООooƆƆ041E
041E
ʊУuƱ0423
ʊːУУuu 0423
0423
  
jaЯyaJᵃ042F
jaːЯАyaaJᵃA042F
0410
ёЁyoJᵒ0401
ёoЁОyooJᵒƆ0401
041E
ЮyuJᵘ042E
jʊːЮУyuuJᵘƱ042E
0423
Digraphs:

4
i̯aиаia 0438
0430
ʊi̯уйui 0443
0439
u̯aуаua 0443
0430
ui̯үйui 04AF
0439

4
i̯aИАia 0418
0410
ʊi̯УЙui 0423
0419
u̯aУАua 0423
0410
ui̯ҮЙui 04AE
0419

both
ei̯эйei 044D
0439
ɔi̯ойoi 043E
0439

both
ei̯ЭЙei 042D
0419
ɔi̯ОЙoi 041E
0419

ɛːайai 0430
0439

ɛːАЙ  0410
0419

For additional details see Vowel sounds to characters.

Post-consonant vowels

Basic vowels

Halh Mongolian uses twelve plain vowel and one semi-vowel letters.


10
иii i0438
ыiy ɪ044B
ьĭ ʲi ʲ044C
йi ĭ0439
уʊu ʊ0443
үuü u04AF
эe ie e044D
өɵ oö ɵ04E9
оɔo ɔ043E
аaa a0430

8
Иii I0418
    
    
Йi 0419
Уʊu Ʊ0423
Үuü U04AE
Эe ie E042D
Өɵ oö Ɵ04E8
Оɔo Ɔ041E
Аaa A0410

йU+0439 SMALL LETTER SHORT I is used for diphthongs. 

A number of additional letters represent vowel sounds that begin with a y-glide:


4
еji jeye jᵉ0435
юjʊ juyu jᵘ044E
ёyo jᵒ0451
яjaya jᵃ044F

4
Еye Jᵉ0415
Юjʊ juyu Jᵘ042E
Ёyo Jᵒ0401
Яjaya Jᵃ042F

When these letters are used after a consonant, they indicate that the consonant is palatalised. When they occur as standalone vowels (at the beginning of a word or after another vowel), they are usually transcribed phonetically as j…. Note that юU+044E SMALL LETTER YU can represent either a +ATR or a -ATR vowel.

Reduction plays an important part in the realisation of these vowel sounds. See Vowel stress & reduction.

Diphthongs / digraphs


7
иаi̯aia  0438
0430
уйʊi̯ui  0443
0439
уаu̯aua  0443
0430
үйui̯ui  04AF
0439
эйei̯ei  044D
0439
ойɔi̯oi  043E
0439
айɛːai  0430
0439

Note that the final digraph is pronounced ɛː, rather than as a diphthong.

Vowel length

Length is phonemically distinctive. Long vowels are most commonly indicated by a doubling of the vowel letter, eg. compare цас t͡sʰas snow цаас t͡saːs paper

These are the long plain vowels. Note the slight difference for .


7
ийii  0438
0439
ууʊːuu  0443
0443
үүüü  04AF
04AF
ээee ee044D
044D
өөɵːöö ɵɵ04E9
04E9
ооɔːoo ɔɔ043E
043E
ааaa aa0430
0430

When the long vowel begins with a glide, a combination of letters is used to lengthen the sound.


4
яаjaːyaa jᵃa044F
0430
ёоjɔːyoo jᵒɔ0451
043E
юүjuːyüü jᵘu044E
04AF
юуjʊːyuu jᵘʊ044E
0443

Vowel harmony

Vowel harmony is an important aspect of the Mongolian language. Vowels are classed under one of the following 3 types:

ATR stands for Advanced Tongue Root.

A native word that begins with a -ATR vowel continues with only -ATR and/or neutral vowels. A word beginning with +ATR vowels continues with only +ATR and/or neutral vowels. Foreign loan words don't follow this pattern, and compound words (especially place names) may be made up of two words of different type, eg. Cүхбаатар sʊxbaːtr Sükhbaatar

The +ATR vowel letters are:


5
эe i044D
өɵ o04E9
үu04AF
 
еji je0435
юjʊ ju044E

The -ATR vowel letters are:


6
аa0430
оɔ043E
уʊ0443
 
яja044F
ё0451
юjʊ ju044E

The following vowel letter is neutral, and can appear in words with either +ATR or -ATR vowels.


иi0438

Grammatical suffixes usually also have +ATR and -ATR versions.

Vowel stress & reduction

For non-stressed, non-initial syllables, some sources group consonants into those which need to be preceded or followed by a vowel:


7
мm043C
нn ŋ043D
гɡ ɣ ɢ0433
лɮ043B
бb0431
вw̜ v f0432
рr0440

And those which don't:


9
дd0434
жd͡ʒ0436
зd͡z0437
сs0441
тt0442
хx0445
цt͡s0446
чt͡ʃ0447
шʃ0448

However, Mongolian pronunciation can still appear to be very different from the written text because unstressed vowels are typically reduced or omitted when a word is pronounced, eg.

хадгалагдах adɡᵊɮᵊɡdᵊx to be saved

Word stress always falls on the first syllable of a Mongolian word, unless there are long vowels or diphthongs later in the word, in which case those take the stress.

The first vowel in a word is never reduced, even if unstressed, eg.

цагдаа t͡saɡˈtaː police

харандаа arəntaarᵊnˈda/χaɾɐnˈda/kharandaa pencil

If there is more than one long vowel, the first long vowel is long, and the second is short, but not otherwise reduced, eg.

хаашаа ˈxaːʃa where to

Different rules apply to foreign loan words, eg.

автобyс avˈtɔbʊs bus

машин maˈʃiŋ car

Observation: Sometimes vowels appear to move to places they are not in the orthography, eg. ойлгосон ɔilɔksᵊn I understood

Observation: Also, ioticised vowels may lose the second part of their sound, resulting in a remnant that sounds like j, eg. баярлалаа bajrɮɮa thank you баяртай bajrtɛː goodbye

Vowel sounds to characters

This section maps Halh Mongolian vowel sounds to common graphemes in the Cyrillic orthography.

Plain vowels

Ioticised vowels

Diphthongs

Consonants

Mongolian has 21 basic consonant letters, including 3 for writing sounds from foreign loan words, and one of which is not used in uppercase. The letter inventory also includes a hard sign and a soft sign.

Consonant summary table

The following table summarises the main consonant to character assigments.

The left column is lowercase, and the right uppercase.

Onsets

6
pпloanpp043F
bб bb0431
tт tt0442
dд dd0434
kкloankk043A
ɡ ɢг gg0433

6
pПloanPP041F
bБ BB0411
tТ TT0422
dД DD0414
kКloanKK041A
ɡ ɢГ GG0413

4
t͡sцtsʦ0446
d͡zзzʣ0437
t͡ʃчchʧ0447
d͡ʒжjʤ0436

4
t͡sЦTSʦ̿0426
d͡zЗZʣ̿0417
t͡ʃЧCHʧ̿0427
d͡ʒЖJʤ̿0416

6
f~pʰфloanff0444
sс ss0441
ʃш shʃ0448
ʃщloanshš0449
xх khx0445
xкloankk043A

6
f~pʰФloanFF0424
sС SS0421
ʃШ SHƩ0428
ʃЩloanSH0429
xХ KHX0425
xКloanKK041A

both
mмmm043C
nнnn043D

both
mМMM041C
nНNN041D

3
вvv0432
ɮлlɮ043B
rрrr0440

3
ВVV0412
ɮЛLɮ041B
rРRR0420
Finals

4
-tдdd0434
-kгgg0433
нnn043D
-fвvv0432

4
-tДDD0414
-kГGG0413
НNN041D
-fВVV0412

For additional details see Consonant sounds to characters.

Mongolian consonants

The Mongolian language has a basic set of 16 consonants.


16
бbb b0431
тtt t0442
дdd d0434
гɡ ɣ ɢg g0433
цt͡sts ʦ0446
зd͡zz ʣ0437
чt͡ʃch ʧ0447
жd͡ʒj ʤ0436
сss s0441
шʃsh ʃ0448
хxkh x0445
мmm m043C
нn ŋn n043D
вw̜ v fv v0432
лɮl ɮ043B
рrr r0440

16
БbB B0411
ТtT T0422
ДdD D0414
Гɡ ɣ ɢG G0413
Цt͡sTS ʦ̿0426
Зd͡zZ ʣ̿0417
Чt͡ʃCH ʧ̿0427
Жd͡ʒJ ʤ̿0416
СsS S0421
ШʃSH Ʃ0428
ХxKH X0425
МmM M041C
Нn ŋN N041D
Вw̜ v fV V0412
ЛɮL ɮ041B
РrR R0420

гU+0433 SMALL LETTER GHE represents either ɡ or ɢ. In words with +ATR (front/feminine) vowels (үэө) it is always ɡ. In words with −ATR (-ATR/masculine) vowels (уоа) it is ɢ unless it occurs in syllable-final position, when it normally reverts to ɡ (but see syllable_final).7

Foreign sounds


4
пloanpp p043F
фloanf pʰf f0444
кloank xk k043A
щloanʃsh š0449

пU+043F SMALL LETTER PE, фU+0444 SMALL LETTER EF and кU+043A SMALL LETTER KA are usually only used for foreign loan words, and the latter two may be pronounced and x, respectively.7

щU+0449 SMALL LETTER SHCHA is only used for Russian words.7

Diacritics


both
̆rare0306
̈rare0308

Typically, Cyrillic Mongolian text will use no combining marks at all. However, when the text is decomposed, the letters йU+0439 SMALL LETTER SHORT I and ёU+0451 SMALL LETTER IO become и + ̆U+0438 SMALL LETTER I + U+0306 COMBINING BREVE and е + ̈U+0435 SMALL LETTER IE + U+0308 COMBINING DIAERESIS.

Hard & soft signs


both
ьĭ ʲi ʲ044C
ъ" ʲ044A

ьU+044C SMALL LETTER SOFT SIGN does one of two things:4

This may result in a short ĭ sound, eg. арьс skin амьтан ȧmĭtan animal

ъU+044A SMALL LETTER HARD SIGN is only used to separate яU+044F SMALL LETTER YA and ёU+0451 SMALL LETTER IO from a -ATR verb stem ending with a consonant,4 eg. явъя let's go уулзъя let's meet бодъё let's think

Finals


4
дdd d0434
гɡ ɣ ɢg g0433
нn ŋn n043D
вw̜ v fv v0432

A number of consonants change their sound in final position. These include:

letternormalfinalexample
гU+0433 SMALL LETTER GHE (in female words)ɢ ɡ~k өндөг ɵndᵊk egg
нU+043D SMALL LETTER ENnŋ будан budaŋ fog
дU+0434 SMALL LETTER DE d t гадаад ɢadaːt foreign
вU+0432 SMALL LETTER VE v w арав arᵊw̜ ten

In a number of words, the syllable-final sound change is prevented by following the consonant with a mute, syllable-final vowel letter7, eg. халбага aɮbaɡᵊ spoon энэ en this

Consonant clusters & gemination

Because the script is alphabetic, there are no special mechanisms for representing clusters of consonants without intervening vowels, or doubled consonants.

Consonant sounds to characters

This section maps Halh Mongolian consonant sounds to common graphemes in the Cyrillic orthography.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

p pʲ

lc пU+043F SMALL LETTER PE Used mostly for foreign loan words.

uc ПU+041F CAPITAL LETTER PE Used mostly for foreign loan words.

k kʲ

lc кU+043A SMALL LETTER KA Used for transliteration.

uc КU+041A CAPITAL LETTER KA Used for transliteration.

codaгU+0433 SMALL LETTER GHE when word-final.

ɢ

lc гU+0433 SMALL LETTER GHE in words with −ATR (back/masculine) vowels, unless it occurs in syllable-final position.

uc ГU+0413 CAPITAL LETTER GHE

f fʲ

lc фU+0444 SMALL LETTER EF Used for transliteration.

uc ФU+0424 CAPITAL LETTER EF

lc вU+0432 SMALL LETTER VE at the beginning of a cluster, sometimes.

uc ВU+0412 CAPITAL LETTER VE

ʃ

lc шU+0448 SMALL LETTER SHA

uc ШU+0428 CAPITAL LETTER SHA

lc щU+0449 SMALL LETTER SHCHA Used for Russian words only.

uc ЩU+0429 CAPITAL LETTER SHCHA Used for Russian words only.

x xʲ

lc хU+0445 SMALL LETTER HA

uc ХU+0425 CAPITAL LETTER HA

lc кU+043A SMALL LETTER KA Used for transliteration.

uc КU+041A CAPITAL LETTER KA Used for transliteration.

Numbers

This section describes typographic features related to digits, dates, currencies, etc.

The Cyrillic orthography of Mongolian uses ASCII digits.

Currency


20AE

The Mongolian unit of currency is the tugrik, formerly subdivided into 100 möngö. The standard abbreviation is MNT, and the currency symbol is U+20AE TUGRIK SIGN.

Text direction

Mongolian in Cyrillic is written in horizontal lines with text running from left to right.

Show default bidi_class properties for characters in the Mongolian orthography described here.

Glyph shaping & positioning

This section describes typographic features related to font/writing styles, cursive text, context-based shaping, context-based positioning, letterform slopes, weights & italics, and case & other character transforms.

You can experiment with examples using the All Cyrillic character app and the Mongolian character app.

Letterform slopes, weights, & italics

Are italicisation, bolding, oblique, etc relevant? Do italic fonts lean in the right direction? Is synthesised italicisation problematic? Are there other problems relating to bolding or italicisation - perhaps relating to generalised assumptions of applicability?

Cyrillic doesn't normally have any of the changeability of complex scripts. Characters are typically separate and self-contained. However, there can be a significant difference in shape between regular and italic/cursive font shapes for the same character.

вшйм

вшйм

Conservative transformations between regular and italic.

гдт

гдт

More radical transformations between regular and italic.

Note in particular the italic form of т in the figure just above, which looks similar to the italic form of м shown in the previous figure.

The shapes of the italic forms can also vary by language.w

The shape of the breve sign in Cyrillic is different from that used for Latin text.5 A font such as Brill can detect the appropriate shape from the adjacent characters.

й ̆ й i ̆ i

̆U+0306 COMBINING BREVE between cyrillic and latin characters changes shape in the Brill font.

Case & other character transforms

Is the orthography bicameral? Are there other character pairings, especially when transforms are needed to convert between the two?

Cyrillic is bicameral, and applications may need to enable transforms to allow the user to switch between cases.

Typographic units

Word boundaries

Are words separated by spaces, or other characters? Are there special requirements when double-clicking on the text? Are words hyphenated?

The concept of 'word' is difficult to define in any language (see What is a word?). Here, a word is a vaguely-defined, but recognisable semantic unit that is typically smaller than a phrase and may comprise one or more syllables.

Words are separated by spaces.

Graphemes

A grapheme is a user-perceived unit of text. Text operations that use graphemes as a unit of text include line-breaking, forwards deletion, cursor movement & selection, character counts, text spacing, text insertion, justification, case conversions, and sorting. The Unicode Standard uses generalised rules to define 'grapheme clusters', which approximate the likely grapheme boundaries in a writing system, however they don't work well with many complex scripts.

Cyrillic Mongolian graphemes are straightforward, and can be mapped to Unicode grapheme clusters.

Grapheme clusters

Base (Combining_mark)*

The 2 combining marks that occur in Cyrillic Mongolian appear only on the rare occasions when the text is decomposed, and only one combining mark at a time appears after any base. All such decompositions conform to Unicode grapheme clusters.

Click on the text version of this word to see more detail about the composition.

лийр ɮiːr pear
(decomposed)

Punctuation & inline features

This section describes typographic features related to word boundaries, phrase & section boundaries, bracketed text, quotations & citations, emphasis, abbreviation, ellipsis & repetition, inline notes & annotations, other punctuation, and other inline text decoration.

Phrase & section boundaries

What characters are used to indicate the boundaries of phrases, sentences, and sections?


7
,002C
;003B
:003A
.002E
?003F
!0021
2010

The cyrillic orthography uses ASCII punctuation.

phrase

,U+002C COMMA

;U+003B SEMICOLON

:U+003A COLON

sentence

.U+002E FULL STOP

?U+003F QUESTION MARK

!U+0021 EXCLAMATION MARK

Bracketed text


both
(0028
)0029

Mongolian commonly uses ASCII parentheses to insert parenthetical information into text.

  start end
standard

(U+0028 LEFT PARENTHESIS

)U+0029 RIGHT PARENTHESIS

Quotations & citations

What characters are used to indicate quotations? Do quotations within quotations use different characters? What characters are used to indicate dialogue? Are the same mechanisms used to cite words, or for scare quotes, etc? What about citing book or article names?


5
« 00AB
» 00BB
 201E
 201C
 2014

The standard approach is to use angle brackets by default, and the quotation marks for nested quotes. An alternative is to use the quotation marks at the top level.9

  start end
initial

«U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK

»U+00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK

nested

U+201E DOUBLE LOW-9 QUOTATION MARK

U+201C LEFT DOUBLE QUOTATION MARK

Үндсэн хүн амын нэрээс улсын оноосон нэрийг «Монгол» хэмээжээ.
Mongolian quotation marks.

For dialogue, the quotation dash is commonly used to introduce the spoken text, but also to terminate it before identifying the speaker. U+2014 EM DASH could be used for this, with spaces around it.9

Abbreviation, ellipsis & repetition

What characters are used to indicate abbreviation, ellipsis & repetition?


2026

Line & paragraph layout

This section describes typographic features related to line breaking & hyphenation, text alignment & justification, text spacing, baselines, line height, counters, lists, and styling initials.

Line breaking & hyphenation

Are there special rules about the way text wraps when it hits the end of a line? Does line-breaking wrap whole 'words' at a time, or characters, or something else (such as syllables in Tibetan and Javanese)? What characters should not appear at the end or start of a line, and what should be done to prevent that? Is hyphenation used, or something else? What rules are used? What difficulties exist?

Spaces between words provide the primary line break opportunities.6

Line-edge rules

As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.

Show (default) line-breaking properties for characters in the Mongolian orthography.

The following list gives examples of typical behaviours for some of the characters used in Mongolian. Context may affect the behaviour of some of these and other characters.

Click/tap on the characters to show what they are.

  • « „ (   should not be the last character on a line.
  • » “ ) . , ; ! ? %   should not begin a new line.
  •   should be kept with any number, even if separated by a space or parenthesis.

Text alignment & justification

Does text in a paragraph needs to have flush lines down both sides? Does the script allow punctuation to hang outside the text box at the start or end of a line? Where adjustments are need to make a line flush, how is that done? Does the script shrink/stretch space between words and/or letters? Are word baselines stretched, as in Arabic? What about paragraph indents?

Justification is done, principally, by adjusting the space between words.

Baselines, line height, etc.

Does the script have special requirements for baseline alignment between mixed scripts and in general? Is line height special for this script? Are there other aspects that affect line spacing, or positioning of items vertically within a line?

Cyrillic uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

Cyrillic has little in the way of ascenders and descenders, and mostly the font metrics are the same as for ASCII text. One difference is the use of a couple of diacritics, which rise above the ASCII ascender height in capital letters..

To give an approximate idea, Figure 5 compares Latin and Cyrillic glyphs from Noto fonts.

HhqxюбдфйЮБДФЙ HhqxюбдфйЮБДФЙ
Font metrics for Latin text compared with Cyrillic glyphs in the Noto Serif (top) and Noto Sans (bottom) fonts.

Figure 6 shows similar comparisons for the Doulos SIL and Helvetica fonts.

HhqxюбдфйЮБДФЙ HhqxюбдфйЮБДФЙ
Latin font metrics compared with Cyrillic glyphs in the Doulos SIL (top) and Helvetica (bottom) fonts.

Page & book layout

This section describes typographic features related to general page layout & progression; grids & tables, notes, footnotes, etc, forms & user interaction, and page numbering, running headers, etc.

Online resources

  1. President of Mongolia Website
  2. Wikipedia home page
  3. Wikipedia page: Монгол Улс

References & sources

1J Bat-Ireedui, Alan JK Sanders (1999), Colloquial Mongolian, Taylor & Francis Ltd

2Jan-Olof Svantesson, Anna Tsendina, Anastasia Karlsson, Vivan Franzen (2005), The Phonology of Mongolian, Oxford University Press, ISBN 9780199554270

3Gov.uk (2019), Romanization of Mongolian Cyrillic, BGN/PCGN 1964 System

4номын гэр, The Mongolian Alphabet

5ScriptSource, Glyph Variant for U+0306 (retr. Apr 2021)

6Unicode Consortium (2020), The Unicode Standard, Version 13.0, Chapter 7.4: Europe-I, Cyrillic, 311-315, ISBN 978-1-936213-16-0.

7Wikipedia, Mongolian Cyrillic alphabet (retr. May 2021)

8Wikipedia, Mongolian Language (retr. May 2021)

9Wikipedia, Quotation mark (retr. Apr 2021)

See recent changes.  •  Make a comment.  •  Licence CC-By © r12a.