/* */ var charDetails = { // MAIN BLOCK '\u{0430}': `

а

a eg. цас

Combinations

is аа [U+0430 CYRILLIC SMALL LETTER A + U+0430 CYRILLIC SMALL LETTER A], eg. цаас харандаа

u̯a is уа [U+0443 CYRILLIC SMALL LETTER U + U+0430 CYRILLIC SMALL LETTER A], eg. хуаран

ɛː is ай [U+0430 CYRILLIC SMALL LETTER A + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. баяртай

jaː is яа [U+044F CYRILLIC SMALL LETTER YA + U+0430 CYRILLIC SMALL LETTER A], eg. яаж

See also я [U+044F CYRILLIC SMALL LETTER YA].

`, '\u{0431}': `

б

b bʲ, mostly used in word-initial position or after one of л [U+043B CYRILLIC SMALL LETTER EL], м [U+043C CYRILLIC SMALL LETTER EM] or б [U+0431 CYRILLIC SMALL LETTER BE]gov, eg. будан халбага

Sometimes transcribed as p.

`, '\u{0432}': `

в

v ~ w̜ eg. вино аав

f sometimes in cluster-initial position, eg. навч

Usually only word-initial in foreign words.gov

`, '\u{0433}': `

г

ɡ in words with +ATR (front/feminine) vowels (үэө)wc, eg. өргөн

ɢ in words with −ATR (back/masculine) vowels (уоа) unless it occurs in syllable-final positionwc, eg. монгол

-k normally in syllable-final positionbat,11, eg. өндөг

A final vowel is usually not pronounced, but causes pronunciation to remain ɡbat,11, eg. өнгө халбага

`, '\u{0434}': `

д

d eg. дэр

-t occurs in syllable-final positionbat,11, eg. гадаад

Sometimes transcribed as t.

`, '\u{0435}': `

е

eg. ес

Following a consonant it indicates that the consonant is palatalised.

`, '\u{0436}': `

ж

d͡ʒ eg. жаал

Sometimes transcribed as t͡ʃ.

`, '\u{0437}': `

з

d͡z eg. зун

Sometimes transcribed as t͡s or z.

`, '\u{0438}': `

и

i eg. жижиг

Combinations

is ий [U+0438 CYRILLIC SMALL LETTER I + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. лийр

i̯a is иа [U+0438 CYRILLIC SMALL LETTER I + U+0430 CYRILLIC SMALL LETTER A], eg. амиараа

`, '\u{0439}': `

й

Used for diphthongs.

Also makes the long sound, eg. тийм

`, '\u{043A}': `

к

k as a loan consonant, but often adopted into the Mongolian sound system as x, eg. кофе

`, '\u{043B}': `

л

ɮ eg. лийр

`, '\u{043C}': `

м

m eg. мөс

`, '\u{043D}': `

н

n eg. намар

ŋ sometimes when used as syllable-finalbat,11, eg. будан мөнгө

A final vowel is usually not pronounced, but causes pronunciation to remain nbat,11, eg. энэ чоно

`, '\u{043E}': `

о

ɔ eg. хол

Combinations

is оо [U+043E CYRILLIC SMALL LETTER O + U+043E CYRILLIC SMALL LETTER O], eg. хоосон

ɔi̯ is ой [U+043E CYRILLIC SMALL LETTER O + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. ойлгосон

jɔː is ёо [U+0451 CYRILLIC SMALL LETTER IO + U+043E CYRILLIC SMALL LETTER O], eg. ёотон

See also ё [U+0451 CYRILLIC SMALL LETTER IO].

`, '\u{043F}': `

п

p for foreign loan words, eg. пүүжин

Sometimes transcribed as .

`, '\u{0440}': `

р

r eg. ширээ

Mongolians find words beginning with r difficult to pronounce, and so put a vowel at the beginningbat,11, eg. орос

`, '\u{0441}': `

с

s eg. саваа

`, '\u{0442}': `

т

t tʲ eg. тийм

Sometimes transcribed as .

`, '\u{0443}': `

у

ʊ eg. утас

As a short vowel, it appears only in the first syllable (but can appear later as part of a long vowel or diphthong).bat,14

Combinations

ʊː is уу [U+0443 CYRILLIC SMALL LETTER U + U+0443 CYRILLIC SMALL LETTER U], eg. уугуул

ʊi̯ is уй [U+0443 CYRILLIC SMALL LETTER U + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. дугуй

u̯a is уа [U+0443 CYRILLIC SMALL LETTER U + U+0430 CYRILLIC SMALL LETTER A], eg. хуаран

jʊː is юу [U+044E CYRILLIC SMALL LETTER YU + U+0443 CYRILLIC SMALL LETTER U], eg. юу

See also ю [U+044E CYRILLIC SMALL LETTER YU].

`, '\u{0444}': `

ф

f for foreign loan words, but often pronounced , eg. фабрик

`, '\u{0445}': `

х

x eg. хөөрхөн

`, '\u{0446}': `

ц

t͡s eg. цас

Sometimes transcribed as t͡sʰ.

`, '\u{0447}': `

ч

t͡ʃ eg. чих

Sometimes transcribed as t͡ʃʰ.

`, '\u{0448}': `

ш

ʃ eg. ширээ

`, '\u{0449}': `

щ

ʃ in Russian words.

`, '\u{044A}': `

ъ

Only used to separate я [U+044F CYRILLIC SMALL LETTER YA] and ё [U+0451 CYRILLIC SMALL LETTER IO] from a -ATR (back) verb stem ending with a consonant4, eg. явъя явъя let's go уулзъя уулзъя let's meet бодъё бодъё let's think

`, '\u{044B}': `

ы

i used only in suffixes of masculine words (ie. containing а, о, or у)gov, eg. таны

Not used in uppercase. Cannot begin a word.

`, '\u{044C}': `

ь

Different descriptions appear.

ʲ or sometimes nothing, eg. арьс амьтан

In non-phonetic transcriptions it may appear as ĭbat,4 or ʲwc.

Not used in uppercase. Cannot begin a word.

Does one of two things:

`, '\u{044D}': `

э

e eg. ээмэр

Combinations

is ээ [U+044D CYRILLIC SMALL LETTER E + U+044D CYRILLIC SMALL LETTER E], eg. ээж

ei̯ is эй [U+044D CYRILLIC SMALL LETTER E + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. эмэгтэй

See also е [U+0435 CYRILLIC SMALL LETTER IE].

`, '\u{044E}': `

ю

jʊ ju eg. юу

Unlike other ioticised vowels, this letter represents both the front and back vowels.

Following a consonant it indicates that the consonant is palatalised.

Combinations

juː is юү [U+044E CYRILLIC SMALL LETTER YU + U+04AF CYRILLIC SMALL LETTER STRAIGHT U], eg. ширээ юү?

jʊː is юу [U+044E CYRILLIC SMALL LETTER YU + U+0443 CYRILLIC SMALL LETTER U], eg. юу

`, '\u{044F}': `

я

ja eg. яс

Following a consonant it indicates that the consonant is palatalised.

Combinations

jaː is яа [U+044F CYRILLIC SMALL LETTER YA + U+0430 CYRILLIC SMALL LETTER A], eg. яаж

`, '\u{0451}': `

ё

eg. хоёр

Following a consonant it indicates that the consonant is palatalised.

Combinations

jɔː is ёо [U+0451 CYRILLIC SMALL LETTER IO + U+043E CYRILLIC SMALL LETTER O], eg. ёотон

`, '\u{04AF}': `

ү

u eg. хүн

As a short vowel, it appears only in the first syllable (but can appear later as part of a long vowel or diphthong).bat,14

Combinations

is үү [U+04AF CYRILLIC SMALL LETTER STRAIGHT U + U+04AF CYRILLIC SMALL LETTER STRAIGHT U], eg. сүү

ui̯ is үй [U+04AF CYRILLIC SMALL LETTER STRAIGHT U + U+0439 CYRILLIC SMALL LETTER SHORT I], eg. бүйлс

juː is юү [U+044E CYRILLIC SMALL LETTER YU + U+04AF CYRILLIC SMALL LETTER STRAIGHT U], eg. ширээ юү?

`, '\u{04E9}': `

ө

ɵ o eg. өндөг

Combinations

is ɵɵ [U+0275 LATIN SMALL LETTER BARRED O + U+0275 LATIN SMALL LETTER BARRED O], eg. өөрөө

`, // COMMON PUNCTUATION // ".. '\u{201C}': `

`, // .." '\u{201D}': ` `, // '.. '\u{2018}': ` `, // ..' '\u{2019}': ` `, // « '\u{00AB}': ` `, // » '\u{00BB}': ` `, // ; '\u{003B}': ` `, // : '\u{003A}': ` `, // . '\u{002E}': ` `, // ? '\u{003F}': ` `, // ! '\u{0021}': ` `, // ( '\u{0028}': ` `, // ) '\u{0029}': ` `, // … '\u{2026}': ` `, // – '\u{2013}': ` `, // — '\u{2014}': ` `, // § '\u{00A7}': ` `, '\u{2020}': `

Called dagger, but also known as obelisk, obelus, or long cross.b321

A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321

In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321

`, '\u{2021}': `

Called dagger, but also known as diesis, or double obelisk.b321

A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

`, '\u{2032}': `

Abbreviation for feet (1′ = 12″).b330

Also used for minutes of arc (eg. 60′=1°).b330

`, '\u{2033}': `

Abbreviation for inches (1′ = 12″).b321

Also used for seconds of arc (eg. 360″=1°).b321

`, // FORMATTING CHARACTERS // zwsp '\u{200B}': `

An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.

Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษร​ไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).

`, // zwj '\u{200D}': `

Creates glyph joining behaviour in the absence of normal joining contexts.

`, // zwnj '\u{200C}': `

Prevents glyph joining behaviour.

`, // word-break '\u{2060}': `

An invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks, eg. it can be used around the + sign in base⁠+delta⁠ to prevent a line break occuring in that sequence of characters. It has no effect on word segmentation.

It can also be used to bracket other characters to turn them into non-breaking characters, such as U+2009 THIN SPACE or [U+2015 HORIZONTAL BAR].

Not to be confused with U+200D ZERO WIDTH JOINER or U+034F COMBINING GRAPHEME JOINER​, since it has no effect on shaping.

This functionality is also provided by U+FEFF ZERO WIDTH NO-BREAK SPACE, but since that character also represents the byte-order mark, the use of this word joiner character (added in Unicode 3.2) is strongly preferred over the latter.

`, // rli '\u{2067}': `

Sets the base direction for the following text to RTL, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // lri '\u{2066}': `

Sets the base direction for the following text to LTR, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, //fsi '\u{2068}': `

Sets the base direction for the following text to the direction of the first strong directional character, per Unicode Bidirectional Algorithm rules, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // pdi '\u{2069}': `

Ends the range of text that started with RLI, LRI, or FSI.

`, // rle '\u{202B}': `

Sets the base direction for the following text to RTL, with no isolation. The Unicode Standard recommends use of RLI, instead.

`, // lre '\u{202A}': `

Sets the base direction for the following text to LTR, with no isolation. The Unicode Standard recommends use of LRI, instead.

`, // pdf '\u{202C}': `

Ends the range of text that started with RLE, or LRE.

`, // rlm '\u{200F}': `

An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // lrm '\u{200E}': `

An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // cgj '\u{034F}': `

Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.

`, // alm '\u{061C}': `

Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.

`, // CURRENCY '\u{20AE}': `

Currency symbol for the Mongolian unit of currency, the tugrik, formerly subdivided into 100 möngö. The standard abbreviation is MNT.

`, }