Mongolian

Cyrillic orthography notes

Updated 21 April, 2024

This page brings together basic information about the Cyrillic script and its use for the Mongolian language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Mongolian using Unicode.

Referencing this document

Richard Ishida, Mongolian (Cyrillic) Orthography Notes, 21-Apr-2024, https://r12a.github.io/scripts/cyrl/mn

Sample

Хүн бүр төрж мэндлэхэд эрх чөлөөтэй, адилхан нэр төртэй, ижил эрхтэй байдаг. Оюун ухаан, нандин чанар заяасан хүн гэгч өөр хоорондоо ахан дүүгийн үзэл санаагаар харьцах учиртай.

Хүн бүр энэ Тунхаглалд заасан бүхий л эрх, эрх чөлөөг ямар ч ялгаваргүйгээр, яс үндэс, арьс өнгө, хүйс, хэл, шашин шүтлэг, улс төрийн болон бусад үзэл бодол, үндэсний буюу нийгмийн гарал, эд хөрөнгийн байдал, язгуур угсаа, бусад ялгааг эс харгалзан адилхан эдлэх ёстой. Мөн түүнчлэн тухайн хүний харъяалдаг улс орон буюу нутаг дэвсгэрийн улс төр, эрх зүйн буюу олон улсын статус ямар ч байлаа гэсэн, тэрхүү нутаг дэвсгэр нь тусгаар тогтносон, бусдын асрамжид байгаа, өөртөө захиргаагүй буюу бүрэн эрхт байдал нь өөр ямар ч байдлаар хязгаарлагдмал байсан, хүнийг ялгаварлаж үл болно.

Source: Unicode UDHR, articles 1 & 2

Usage & history

Origins of the Cyrillic script, 9thC – today.

Phoenician

└ Greek

└ Old Italic

└ Cyrillic

+ Glagolitic

+ Latin

+ Armenian

+ Georgian

+ Coptic

+ Runes

The Mongolian Cyrillic alphabet is used for the standard dialect of the Mongolian language in the modern state of Mongolia. Ethnologue lists 2,640,000 native speakers of Halh Mongolian, but Wikipedia lists 5.2 million speakers across all dialects, including those in the Inner Mongolia Autonomous Region of the People's Republic of China. Cyrillic has not been adopted as the writing system in the Inner Mongolia region of China, which continues to use the traditional Mongolian script. In Mongolia, the Halh (or Khalkha) dialect is predominant.

Монгол Кирилл үсэг (mongol kirill üseg): Mongolian Cyrillic alphabet
Кирилл цагаан толгой (kirill tsagaan tolgoi): Mongolian Cyrillic alphabet

In the Mongolian People's Republic (Outer Mongolia), the traditional script was replaced by a Cyrillic orthography in the early 1940s, as a result of the spread of Russian influence following the expansion of the Russian Empire and, subsequently, the Soviet Union. Its introduction is credited with an increase in the literacy rate from 17.3% to 73.5% between 1941 and 1950.

Source: Wikipedia.

Basic features

Cyrillic is an alphabet. Letters typically represent a consonant or vowel sound. See the table to the right for a brief overview of features for the Mongolian language.

Cyrillic Mongolian text runs left-to-right in horizontal lines. Words are separated by spaces.

The script is bicameral. The shapes of the upper and lowercase forms are typically the same. There can be a significant difference, however, between regular and cursive/italic shapes for the same character.

Normal text contains no combining marks (and decomposed text contains only 2). The visual forms of letters don't usually interact.

Mongolian has 21 basic consonant letters, including 3 for writing sounds from foreign loan words and one that is not used in uppercase. The letter inventory also includes a hard sign and a soft sign.

The orthography is an alphabet that writes vowels using 16 vowel letters (30 in total, because 2 are only used in lowercase), including 4 ioticised vowels which may also indicate palatalisation of the previous consonant. Long vowels are indicated by doubling the vowel letters. A number of diphthongs are written using the semi-vowel letter й.

Vowel reduction is a significant feature of Mongolian. Non-initial short vowels are reduced to vestiges or to zero, and non-initial long vowels in the orthography are reduced to short vowel length.

Vowel harmony is another key feature, grouping vowels in a way that indicates a front or back position for the tongue root (ATR).

There are no special mechanisms to represent standalone vowels. Combining marks are normally not used, and only occur in decomposed text.

Text is generally wrapped at word boundaries, and justification predominantly stretches the spaces between words.

Numbers use ASCII digits.

Character index

Letters

Basic consonants

б␣в␣г␣д␣ж␣з␣к␣л␣м␣н␣п␣р␣с␣т␣ф␣х␣ц␣ч␣ш␣щ␣ы
Б␣В␣Г␣Д␣Ж␣З␣К␣Л␣М␣Н␣П␣Р␣С␣Т␣Ф␣Х␣Ц␣Ч␣Ш␣Щ

Vowels

а␣е␣и␣й␣о␣у␣э␣ю␣я␣ё␣ү␣ө
Ё␣А␣Е␣И␣Й␣О␣У␣Э␣Ю␣Я␣Ү␣Ө

Other

ъ␣ь

Combining marks

̆␣̈

Punctuation

«␣»␣‐␣„␣“␣—␣…

ASCII

,␣;␣:␣.␣?␣!␣(␣)

To be investigated

%␣[␣]␣§␣ʼ␣Ы␣‑␣–␣—␣‘␣‚␣†␣‡␣‰␣′␣″␣‹␣›␣№

Phonology

These are the sounds of Halh (or Khalkha) Mongolian.

Phones in a lighter colour are non-native or allophones. Source: Wikipedia.

Vowel sounds

Plain vowels

i u ʊ ʊː e ɵ ɵ ɔ ɔː a

Diphthongs

ui ʊi ɔi ai

A significant feature of Mongolian phonology is that vowel sounds are divided into front (+ATR), back (-ATR), and neutral groups (see harmony). The front and back distinction has to do with the position of the tongue root (ATR means Advanced Tongue Root). The phonology is more complicated, and sounds are somewhat more fluid than described here. See the sources for more detailed information.

Consonant sounds

             labial  alveolar  post-alveolar  palatal  velar  uvular  glottal
stop         p b     d t                               k ɡ    ɢ
affricate            t͡s d͡z    t͡ʃ d͡ʒ
fricative    f       s ɮ       ʃ                       x
nasal        m       n                                 ŋ
approximant  w                                j
trill/flap           r

Some phonological transcriptions use t and tʰ where others use d and t for the same sounds, respectively. Similar contrasts are applied to the bilabial and affricate pairs in the repertoire (but not to the k/g pairing). Here we use the latter, partly because it is probably more indicative to the non-expert of the approximate sounds involved, and also because it corresponds with the Cyrillic letters used.

Other sources also indicate palatalised versions of most consonants in a table such as this, but they are not shown here. Palatalisation appears to be restricted to words containing -ATR (back) vowels.

Tone

Mongolian is not a tonal language.

Structure

The basic unit of text is the word, although words can contain prefixes and suffixes.

Syllables tend to follow the pattern:

(C)V(V)(C)(C)(C)

Long vowels only occur in initial syllables. Mongolian has a strong tendency to reduce non-initial short vowels, either to epenthetic remnants or to zero. Non-initial vowels written as long are pronounced with normal length. See reduction.

Vowels

Vowel summary table

The following table summarises the main vowel to character assignments.

These are nominal pronunciations that don't take into account vowel harmony or vowel reduction. The vowels with IPA beginning j.. have transcriptions for standalone contexts; after a consonant the j generally transmutes to ʲ in the sense that it palatalises the consonant.

Neutral:
и␣ий␣й␣ь␣ы
И␣ИЙ␣Й␣Ь␣Ы
ATR+:
э␣ээ␣ө␣өө␣ү␣үү␣ ␣е␣ю␣юү
Э␣ЭЭ␣Ө␣ӨӨ␣Ү␣ҮҮ␣ ␣Е␣Ю␣ЮҮ
ATR-:
а␣аа␣о␣оо␣у␣уу␣ ␣я␣яа␣ё␣ёо␣ю␣юу
А␣АА␣О␣ОО␣У␣УУ␣ ␣Я␣ЯА␣Ё␣ЁО␣Ю␣ЮУ
Digraphs:
иа␣уй␣уа␣үй
ИА␣УЙ␣УА␣ҮЙ
эй␣ой
ЭЙ␣ОЙ
ай
АЙ

For additional details see Vowel sounds to characters.

Here is the set of characters described in this section.

Ё␣А␣Е␣И␣Й␣О␣У␣Ы␣Ь␣Э␣Ю␣Я␣а␣е␣и␣й␣о␣у␣ы␣ь␣э␣ю␣я␣ё␣Ү␣ү␣Ө␣ө

Post-consonant vowels

Basic vowels

Halh Mongolian uses twelve plain vowel and one semi-vowel letters.

и␣ы␣ь␣й␣у␣ү␣э␣ө␣о␣а
И␣ ␣ ␣Й␣У␣Ү␣Э␣Ө␣О␣А

й is used for diphthongs. 

A number of additional letters represent vowel sounds that begin with a y-glide:

е␣ю␣ё␣я
Е␣Ю␣Ё␣Я

When these letters are used after a consonant, they indicate that the consonant is palatalised. When they occur as standalone vowels (at the beginning of a word or after another vowel), they are usually transcribed phonetically as j…. Note that ю can represent either a +ATR or a -ATR vowel.

Reduction plays an important part in the realisation of these vowel sounds. See reduction.

Diphthongs / digraphs

иа␣уй␣уа␣үй␣эй␣ой␣ай

Note that the final digraph is pronounced ɛː, rather than as a diphthong.

Vowel length

Length is phonemically distinctive. Long vowels are most commonly indicated by a doubling of the vowel letter, eg. compare цас цаас

These are the long plain vowels. Note the slight difference for ий.

ий␣уу␣үү␣ээ␣өө␣оо␣аа

When the long vowel begins with a glide, a combination of letters is used to lengthen the sound.

яа␣ёо␣юү␣юу

Vowel harmony

Vowel harmony is an important aspect of the Mongolian language. Vowels are classed under one of 3 types: +ATR (front), -ATR (back), or neutral.

ATR stands for Advanced Tongue Root.

A native word that begins with a -ATR vowel continues with only -ATR and/or neutral vowels. A word beginning with +ATR vowels continues with only +ATR and/or neutral vowels. Foreign loan words don't follow this pattern, and compound words (especially place names) may be made up of two words of different type, eg. Сүхбаатар

The +ATR vowel letters are:

э␣ө␣ү␣ ␣е␣ю

The -ATR vowel letters are:

а␣о␣у␣ ␣я␣ё␣ю

The following vowel letter is neutral, and can appear in words with either +ATR or -ATR vowels.

и

Grammatical suffixes usually also have +ATR and -ATR versions.
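
The classification above can be sketched in code. The following is a minimal Python illustration, not a definitive implementation. The names harmony_class, PLUS_ATR and MINUS_ATR are hypothetical. The letter ю is left out because it can belong to either group, and the check only looks at letters, so it does not account for the exceptions noted above for loan words and compounds.

```python
import unicodedata

# Letter sets taken from the lists above. и is neutral and ю is ambiguous,
# so neither appears in these (hypothetical) sets.
PLUS_ATR = set("эөүе")
MINUS_ATR = set("аоуяё")


def harmony_class(word: str) -> str:
    """Return '+ATR', '-ATR', 'mixed' or 'neutral' for a Cyrillic Mongolian word."""
    # Compose first, so that a decomposed ё is not mistaken for е.
    letters = set(unicodedata.normalize("NFC", word).lower())
    has_plus = bool(letters & PLUS_ATR)
    has_minus = bool(letters & MINUS_ATR)
    if has_plus and has_minus:
        return "mixed"
    if has_plus:
        return "+ATR"
    if has_minus:
        return "-ATR"
    return "neutral"


print(harmony_class("өндөг"))      # +ATR
print(harmony_class("монгол"))     # -ATR
print(harmony_class("Сүхбаатар"))  # mixed
```

A result of 'mixed' is a hint that the word may be a loan word or a compound, rather than evidence of an error.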

Vowel stress & reduction

For non-stressed, non-initial syllables, some sources group consonants into those which need to be preceded or followed by a vowel:

м␣н␣г␣л␣б␣в␣р

And those which don't:

д␣ж␣з␣с␣т␣х␣ц␣ч␣ш

However, Mongolian pronunciation can still appear to be very different from the written text because unstressed vowels are typically reduced or omitted when a word is pronounced, eg.

хадгалагдах

Word stress always falls on the first syllable of a Mongolian word, unless there are long vowels or diphthongs later in the word, in which case those take the stress.

The first vowel in a word is never reduced, even if unstressed, eg.

цагдаа

харандаа

If there is more than one long vowel, the first long vowel is long, and the second is short, but not otherwise reduced, eg.

хаашаа

Different rules apply to foreign loan words, eg.

автобус

машин

Observation: Sometimes vowels appear to move to places they are not in the orthography, eg. ойлгосон

Observation: Also, ioticised vowels may lose the second part of their sound, resulting in a remnant that sounds like j, eg. баярлалаа баяртай

Vowel sounds to characters

This section maps Halh Mongolian vowel sounds to common graphemes in the Cyrillic orthography.

Plain vowels

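i: и (жижиг); ы (огурцы); э (ээмэр); ий (лийр). Uppercase: И, Ы, Э.
u: ү (хүн); үү (сүү). Uppercase: Ү.
ʊ: у (утас). Uppercase: У.
ʊː: уу (уугуул).
e: э (гэдэс); ээ (ээж). Uppercase: Э.
ɵ~o: ө (өндөг). Uppercase: Ө.
ɵː: өө (өөрөө).
ɛː: ай (баяртай). Uppercase: АЙ.
ɔ: о (хол); оо (хоосон). Uppercase: О.
a: а (цас); аа (цаас). Uppercase: А.

Ioticised vowels

е
jʊ ju: ю (валют). Uppercase: Ю.
jʊː juː: юү; юу.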

Diphthongs

Consonants

Consonant summary table

The following table summarises the main consonant to character assignments.

In each pair of rows, the first is lowercase and the second is uppercase.

Onsets
п␣б␣т␣д␣к␣г
П␣Б␣Т␣Д␣К␣Г
ц␣з␣ч␣ж
Ц␣З␣Ч␣Ж
ф␣с␣ш␣щ␣х␣к
Ф␣С␣Ш␣Щ␣Х␣К
м␣н
М␣Н
в␣л␣р
В␣Л␣Р
Finals
д␣г␣н␣в
Д␣Г␣Н␣В

For additional details see Consonant sounds to characters.

Mongolian consonants

The Mongolian language has a basic set of 16 consonants.

б␣т␣д␣г␣ц␣з␣ч␣ж␣с␣ш␣х␣м␣н␣в␣л␣р
Б␣Т␣Д␣Г␣Ц␣З␣Ч␣Ж␣С␣Ш␣Х␣М␣Н␣В␣Л␣Р

г represents either ɡ or ɢ. In words with +ATR (front/feminine) vowels (үэө) it is always ɡ. In words with −ATR (back/masculine) vowels (уоа) it is ɢ unless it occurs in syllable-final position, when it normally reverts to ɡ (but see Finals).

Foreign sounds

п␣ф␣к␣щ

п, ф and к are usually only used for foreign loan words, and the latter two may be pronounced p and x, respectively.

щ is only used for Russian words.

Diacritics

̆␣̈

Typically, Cyrillic Mongolian text will use no combining marks at all. However, when the text is decomposed, the letters й and ё become 0438 0306 and 0435 0308.
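
The decomposition described above can be reproduced with a short Python sketch that uses the standard unicodedata module. The helper name code_points is hypothetical, introduced only for this illustration.

```python
import unicodedata


def code_points(text: str) -> str:
    """List the code points of a string as 4-digit hex numbers."""
    return " ".join(f"{ord(ch):04X}" for ch in text)


# Canonical decomposition (NFD) splits й and ё into base letter + combining mark.
decomposed = unicodedata.normalize("NFD", "йё")
print(code_points(decomposed))   # 0438 0306 0435 0308

# Canonical composition (NFC) restores the precomposed letters.
recomposed = unicodedata.normalize("NFC", decomposed)
print(code_points(recomposed))   # 0439 0451
```

Comparing or searching Mongolian text is generally safer after normalising both sides to the same form.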

Hard & soft signs

ь␣ъ

ь does one of two things:

This may result in a short ĭ sound, eg. арьс амьтан

ъ is only used to separate я and ё from a -ATR verb stem ending with a consonant, eg. явъя уулзъя бодъё

Finals

д␣г␣н␣в

A number of consonants change their sound in final position. These include:

letter               normal  final  example
г (in female words)  ɢ       ɡ~k    өндөг
н                    n       ŋ      будан
д                    d       t      гадаад
в                    v       w      арав

In a number of words, the syllable-final sound change is prevented by following the consonant with a mute, syllable-final vowel letter, eg. халбага энэ

Consonant clusters & gemination

Because the script is alphabetic, there are no special mechanisms for representing clusters of consonants without intervening vowels, or doubled consonants.

Consonant sounds to characters

This section maps Halh Mongolian consonant sounds to common graphemes in the Cyrillic orthography.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.

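p pʲ: п (пүүжин). Uppercase: П.
b bʲ: б (будан). Uppercase: Б.
t tʲ: т (тийм); д when word-final (гадаад). Uppercase: Т.
t͡s: ц (цас). Uppercase: Ц.
t͡ʃ: ч (t͡ʃixᵊ). Uppercase: Ч.
d dʲ: д (дэр). Uppercase: Д.
d͡z: з (зун). Uppercase: З.
d͡ʒ: ж (жаал). Uppercase: Ж.
k kʲ: к (кофе); г when word-final (өндөг). Uppercase: К.
ɡ ɡʲ: г in words with +ATR (front/feminine) vowels (гэргий). Uppercase: Г.
ɢ: г in words with −ATR (back/masculine) vowels, unless it occurs in syllable-final position (монгол). Uppercase: Г.
f fʲ: ф (фабрик); в at the beginning of a cluster, sometimes (навч). Uppercase: Ф, В.
s: с (саваа). Uppercase: С.
ʃ: ш (ширээ); щ (щит). Uppercase: Ш, Щ.
ɣ: г (зөгий). Uppercase: Г.
x xʲ: х (харах). Uppercase: Х.
m mʲ: м (мөс). Uppercase: М.
n nʲ: н (намар). Uppercase: Н.
ŋ: н (будан). Uppercase: Н.
w̜ w̜ʲ: в, esp. word final (вино). Uppercase: В.
r rʲ: р (ширээ). Uppercase: Р.
ɮ ɮʲ: л (лийр). Uppercase: Л.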

Numbers

The Cyrillic orthography of Mongolian uses ASCII digits.

Currency

The Mongolian unit of currency is the tugrik, formerly subdivided into 100 möngö. The standard abbreviation is MNT, and the currency symbol is ₮.

Text direction

Mongolian in Cyrillic is written in horizontal lines with text running from left to right.

Glyph shaping & positioning

You can experiment with examples using the All Cyrillic character app and the Mongolian character app.

Letterform slopes, weights, & italics

Cyrillic doesn't normally have any of the changeability of complex scripts. Characters are typically separate and self-contained. However, there can be a significant difference in shape between regular and italic/cursive font shapes for the same character.

вшйм

вшйм

Conservative transformations between regular and italic.

гдт

гдт

More radical transformations between regular and italic.

Note in particular the italic form of т in the figure just above, which looks similar to the italic form of м shown in the previous figure.

The shapes of the italic forms can also vary by language.

The shape of the breve sign in Cyrillic is different from that used for Latin text. A font such as Brill can detect the appropriate shape from the adjacent characters.

й ̆ й i ̆ i

0306 between Cyrillic and Latin characters changes shape in the Brill font.

Case & other character transforms

Cyrillic is bicameral, and applications may need to enable transforms to allow the user to switch between cases.
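
As a rough illustration, Python's built-in string methods apply the Unicode case mappings, which cover the Mongolian additions ү and ө as well as the basic Cyrillic letters. This is only a sketch of a simple case switch. The variable names are hypothetical.

```python
sample = "хүн бүр"

# Full uppercase, as might be used for a heading.
upper = sample.upper()
print(upper)          # ХҮН БҮР

# Sentence-style capitalisation: first letter upper, the rest lower.
capitalised = sample.capitalize()
print(capitalised)    # Хүн бүр

# Lowercasing is the inverse mapping.
print("ӨНДӨГ".lower())  # өндөг
```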

Typographic units

Word boundaries

Words are separated by spaces.

Graphemes

Cyrillic Mongolian graphemes are straightforward, and can be mapped to Unicode grapheme clusters.

Grapheme clusters

Base (Combining_mark)*

The 2 combining marks that occur in Cyrillic Mongolian appear only on the rare occasions when the text is decomposed, and only one combining mark at a time appears after any base. All such decompositions conform to Unicode grapheme clusters.
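
The Base (Combining_mark)* pattern can be sketched in Python as follows. This is a simplified illustration that only attaches combining marks to the preceding base, which is enough for the decomposed Mongolian text described here. It does not implement the full Unicode segmentation rules, and the function name grapheme_clusters is hypothetical.

```python
import unicodedata


def grapheme_clusters(text: str) -> list[str]:
    """Group each base character with any combining marks that follow it."""
    clusters: list[str] = []
    for ch in text:
        # General categories Mn, Mc and Me all start with "M".
        if clusters and unicodedata.category(ch).startswith("M"):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


decomposed = unicodedata.normalize("NFD", "лийр")
print(len(decomposed))                     # 5 code points
print(len(grapheme_clusters(decomposed)))  # 4 graphemes
```

The same word therefore counts as four user-perceived characters whether or not the text has been decomposed.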

лийр
(decomposed)

Punctuation & inline features

Phrase & section boundaries

,␣;␣:␣.␣?␣!␣‐

The Cyrillic orthography uses ASCII punctuation.

phrase    , ; :
sentence  . ? !

Bracketed text

(␣)

Mongolian commonly uses ASCII parentheses to insert parenthetical information into text.

          start  end
standard  (      )

Quotations & citations

«␣»␣„␣“␣—

The standard approach is to use angle brackets by default, and the quotation marks for nested quotes. An alternative is to use the quotation marks at the top level.

         start  end
initial  «      »
nested   „      “

Үндсэн хүн амын нэрээс улсын оноосон нэрийг «Монгол» хэмээжээ.
Mongolian quotation marks.

For dialogue, the quotation dash is commonly used to introduce the spoken text, but also to terminate it before identifying the speaker. The dash listed above (—) could be used for this, with spaces around it.

Abbreviation, ellipsis & repetition

Line & paragraph layout

Line breaking & hyphenation

Spaces between words provide the primary line break opportunities.

Line-edge rules

As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.

The following list gives examples of typical behaviours for some of the characters used in Mongolian. Context may affect the behaviour of some of these and other characters.

  • « „ (   should not be the last character on a line.
  • » “ ) . , ; ! ? %   should not begin a new line.
  •   should be kept with any number, even if separated by a space or parenthesis.
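
The first two behaviours in the list can be checked with a minimal Python sketch. It assumes the text has already been broken into lines, and it only tests the characters named in the list, so it is far simpler than the Unicode line-breaking algorithm. The names NO_LINE_END, NO_LINE_START and line_edge_violations are hypothetical.

```python
# Characters taken from the list above.
NO_LINE_END = set("«„(")
NO_LINE_START = set("»“).,;!?%")


def line_edge_violations(lines: list[str]) -> list[tuple[int, str, str]]:
    """Return (line number, 'start' or 'end', character) for each bad line edge."""
    problems = []
    for number, line in enumerate(lines, start=1):
        stripped = line.strip()
        if not stripped:
            continue
        if stripped[0] in NO_LINE_START:
            problems.append((number, "start", stripped[0]))
        if stripped[-1] in NO_LINE_END:
            problems.append((number, "end", stripped[-1]))
    return problems


print(line_edge_violations(["нэрийг «", "Монгол» хэмээжээ."]))
# [(1, 'end', '«')]
```

A layout engine would avoid such breaks in the first place. A check like this is only useful for spotting problems in text that has been wrapped by hand.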

Text alignment & justification

Justification is done, principally, by adjusting the space between words.

Baselines, line height, etc.

Cyrillic uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

Cyrillic has little in the way of ascenders and descenders, and mostly the font metrics are the same as for ASCII text. One difference is the use of a couple of diacritics, which rise above the ASCII ascender height in capital letters.

To give an approximate idea, the figure below compares Latin and Cyrillic glyphs from Noto fonts.

HhqxюбдфйЮБДФЙ HhqxюбдфйЮБДФЙ
Font metrics for Latin text compared with Cyrillic glyphs in the Noto Serif (top) and Noto Sans (bottom) fonts.

The figure below shows similar comparisons for the Doulos SIL and Helvetica fonts.

HhqxюбдфйЮБДФЙ HhqxюбдфйЮБДФЙ
Latin font metrics compared with Cyrillic glyphs in the Doulos SIL (top) and Helvetica (bottom) fonts.

Page & book layout

Online resources

  1. President of Mongolia Website
  2. Wikipedia home page
  3. Wikipedia page: Монгол Улс

References