Use accesskey "n" to jump to the internal navigation links at any point. Right now you can

 
r12a >> docs

Khmer block notes

Recent commits: kmblock [v28] • Leave a comment

Information about 104 characters out of 146 in Unicode blocks associated with the Khmer script.

Referencing this document

Ishida R, Khmer Block Notes, 21-Dec-2024, https://r12a.github.io/scripts/khmr/block

 

Click to toggle Table of Contents.

Phonological transcriptions should be treated as a guide, only. They are taken from the sources consulted, and may be narrow or broad, phonemic or phonetic, depending on what is available. They mostly represent pronunciation of words in isolation. For more detailed information about allophones, alternations, sandhi, dialectal differences, and so on, follow the links to cited references.

This is an interactive document. Click/tap on coloured examples to reveal detailed compositional information. If your browser supports it, your cursor will change to look like as you hover over these items.

More about using this page

Character names. The names of characters in codepoint markup drop the initial KHMER label (purely to reduce the length of the examples). In other places the full name can be found.

Fonts. The large character in the box will not be rendered unless the webfont downloaded with the page or a system font has a glyph for it. If there is no glyph and you want to see what it looks like, click on the Toggle images icon to toggle the large characters between font glyphs and graphics.

Navigation. The Toggle images icon takes you to the top of the page.

Transcriptions & transliterations. Phonological transcriptions are surrounded by ⌈corner brackets⌋, to indicate that they vary between narrow, [phonetic] and broad, /phonemic/ transcriptions.
Latin transcriptions between <angle brackets>, represent the letters as commonly written in the Latin script.
A transliteration has also been developed especially for this orthography, and is generally based on the sound of a letter where possible, but where a letter has multiple pronunciations, the transliteration represents only one.
Transliterations provide perfect round-trip conversion between the native script and Latin, whereas Latin transcriptions rarely do.
When you click on an example to see its composition, the top of the panel that opens contains a transliteration, followed by the native text, then (if available) an IPA transcription.

Show all characters listed here

Copied !

Languages using the Khmer scriptKhmer linksOther character notes

Khmer

U+1780 KHMER LETTER KA

Used by: Khmer

្ក

Khmer

a-series consonantkɑːk  Lo

UNGEN: k

k with inherent vowel ɑː or followed by a vowel, eg. ក៏ kɑː also, therefore

k before a subscript consonant, eg. បក្សី ɓak.ˈsəj bird

when followed by a subscript [U+1798 KHMER LETTER MO], [U+1793 KHMER LETTER NO] or [U+179B KHMER LETTER LO], eg. ក្មេង kʰmeiŋ child, youngster

k in final position, eg. លើក ləːk to lift

ʔ in final position after one of these vowels a, aː, ɑ, ɑː, eə̆, uə̆, iə, ɨə, uə, eg. កក kɑːʔ to freeze

The ɔ-series equivalent is [U+1782 KHMER LETTER KO].

U+1781 KHMER LETTER KHA

Used by: Khmer

្ខ

Khmer

a-series consonantkʰɑː  Lo

UNGEN: kh

with inherent vowel ɑː or before a vowel, eg. ខាង kʰaːŋ direction

k before a subscript consonant.

ខ្ពុរ kpuː to gargle

k in final position, eg. មុខ muk face, front

ʔ in final position after one of these vowels a, aː, ɑ, ɑː, eə̆, uə̆, iə, ɨə, uə, eg. ពិសាខ pisaːʔ April-May

The ɔ-series equivalent is [U+1783 KHMER LETTER KHO].

U+1782 KHMER LETTER KO

Used by: Khmer

្គ

Khmer

o-series consonantkɔː  Lo

UNGEN: k

k with inherent vowel ɔː or before a vowel, eg. គេ keː he, she, they (polite)

k before a subscript consonant.

k in final position, eg. រោគ roːk rook, disease

when followed by a subscript or , eg. គ្នា kʰniə together

ʔ in final position after one of these vowels a, aː, ɑ, ɑː, eə̆, uə̆, iə, ɨə, uə, eg. នាគ niəʔ dragon

The ɑ-series equivalent is [U+1780 KHMER LETTER KA].

U+1783 KHMER LETTER KHO

Used by: Khmer

្ឃ*

Khmer

o-series consonantkʰɔːḵʰ  Lo

UNGEN: kh

with inherent vowel ɔː or before a vowel, eg. ឃត់ kʰuə̆t to prevent

k before a subscript consonant, eg. ឃ្លាន kʰliən hungry

k (not common) in final position, eg. មេឃ meːk sky

ʔ in final position after one of these vowels a, aː, ɑ, ɑː, eə̆, uə̆, iə, ɨə, uə, eg. មាឃ miəʔ January

* Subscript consonant seldom used.

The ɑ-series equivalent is [U+1781 KHMER LETTER KHA].

U+1784 KHMER LETTER NGO

Used by: Khmer

្ង

Khmer

o-series consonantŋɔːŋ  Lo

UNGEN: ng

ŋ with inherent vowel ɔː or before a vowel, eg. ងងឹត ŋɔ.ˈŋɨt dark

(Note that this sound appears in syllable initial position in Khmer.)

Not used before a subscript consonant.

ŋ in final position, eg. ដឹង ɗəŋ to know

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations

The combination 1784 17C9 produces the ɑ-series equivalent, eg.

ង៉ាវ ŋaːw noise-maker, sabre

U+1785 KHMER LETTER CA

Used by: Khmer

្ច

Khmer

a-series consonantcɑːc  Lo

UNGEN: ch

c with inherent vowel ɑː or before a vowel, eg. ចង់ cɑŋ to want

c before a subscript consonant.

ច្រៀង criəŋ to sing

c in final position. [c] according to Huffman, eg. តូច touc / tuːc small

The ɔ-series equivalent is [U+1787 KHMER LETTER CO].

U+1786 KHMER LETTER CHA

Used by: Khmer

្ឆ*

Khmer

a-series consonantcʰɑː  Lo

UNGEN: chh

with inherent vowel ɑː or before a vowel.

ឆា cʰaː stir fry

or c before a subscript consonant.

ឆ្អឹង cʔəŋ bone

ឆ្នាំ cʰnam year

Not found in final position.

* Subscript consonant seldom used.

The ɔ-series equivalent is [U+1788 KHMER LETTER CHO].

U+1787 KHMER LETTER CO

Used by: Khmer

្ជ

Khmer

o-series consonantcɔː  Lo

UNGEN: ch

c with inherent vowel ɔː or before a vowel, eg. ជា ciə to be

c before a subscript consonant.

កញ្ជ្រេង kɑɲ.creːŋ fox

c in final position. [c] according to Huffman., eg. រាជ riəc reign

The ɑ-series equivalent is [U+1785 KHMER LETTER CA].

U+1788 KHMER LETTER CHO

Used by: Khmer

្ឈ*

Khmer

o-series consonantcʰɔːc̱ʰ  Lo

UNGEN: chh

with inherent vowel ɔː or before a vowel.

ឈឺ cʰɨː sick

c(ʰ) before a subscript consonant.

ឈ្លើង cʰləːŋ leech

Not found in final position.

* Subscript consonant seldom used.

The ɑ-series equivalent is [U+1785 KHMER LETTER CA].

U+1789 KHMER LETTER NYO

Used by: Khmer

្ញ

Khmer

o-series consonantɲɔːɲ  Lo

UNGEN: nh

ɲ with inherent vowel ɔː or before a vowel, eg. ញី ɲiː female

ɲ in final position, eg. ទាញ tiəɲ to pull

ŋ in final position after , eg. រីញ riːŋ dried up

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Shape

The bottom of this character is dropped when followed by a subscript consonant, eg. បញ្ឆោត ɓɑɲ.ˈcʰaot to trick

There are two shapes used for the subscript. When this character appears twice in a cluster, the full form is used. Elsewhere a reduced form is used. For example, compare ប្រាជ្ញា praːc.ˈɲaː intelligence កញ្ញា kaɲ.ˈɲaː young lady

Combinations

The combination ញ៉ [U+1789 KHMER LETTER NYO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the ɑ-series equivalent, eg. ញ៉ាំ ɲam to eat (coll.)

U+178A KHMER LETTER DA

Used by: Khmer

្ដ

Khmer

a-series consonantɗɑːɗ  Lo

UNGEN: d

ɗ with inherent vowel ɑː or before a vowel, eg. ដុល្លារ ɗol.ˈlaː dollar

ɗ before a subscript consonant.

t in final position, eg. ប្រាកដ praː.ˈkɑt exact

The subscript consonant is the same shape as the subscript of tɑː .

Combinations. The syllable-final combination ដ្ឋ [U+178A KHMER LETTER DA + U+17D2 KHMER SIGN COENG + U+178B KHMER LETTER TTHA] is also pronounced simply ast.

The ɔ-series equivalent is [U+178C KHMER LETTER DO].

U+178B KHMER LETTER TTHA

Used by: Khmer

្ឋ*

Khmer

a-series consonanttʰɑːṫʰ  Lo

UNGEN: th

with inherent vowel ɑː or before a vowel, eg. ឋាន tʰaːn place, site

t before a subscript consonant.

t in final position, eg. បាឋ ɓaːt to tell, recite (clergy)

This consonant is only used in a few words of Pali or Sanskrit origin.

* Subscript is seldom used: often a silent final subscript.

The ɔ-series equivalent is [U+178D KHMER LETTER TTHO].

U+178C KHMER LETTER DO

Used infrequently by: Khmer

្ឌ*

Khmer

o-series consonantrare ɗɔːɗ̱  Lo

UNGEN: d

ɗ with inherent vowel ɔː or before a vowel, eg. ឌុន ɗun alike

Not found before a subscript consonant.

t in final position, eg. គ្រុឌ krut Garuda

This consonant is rare and is only used in a few words of Pali or Sanskrit origin.

* Subscript is seldom used

The ɑ-series equivalent is [U+178A KHMER LETTER DA].

U+178D KHMER LETTER TTHO

Used infrequently by: Khmer

្ឍ*

Khmer

o-series consonantrare tʰɔːṯ̇ʰ  Lo

UNGEN: th

with inherent vowel ɔː or before a vowel, eg. ឍាល tʰiəl large drum

Not found before a subscript consonant.

t in final position, eg. អាសាឍ ʔaːsaːt June-July

This consonant is only used in a few words of Pali or Sanskrit origin.

* Subscript consonant is obsolete, or rarely, if ever, used.

The ɑ-series equivalent is [U+178B KHMER LETTER TTHA].

U+178E KHMER LETTER NNO

• as this character belongs to the first register, its correct transliteration is nna, not nno

Used by: Khmer

្ណ*

Khmer

a-series consonantnɑːn  Lo

UNGEN: n

n with inherent vowel ɑː or before a vowel, eg. ណាស់ nah very

n before a subscript consonant.

អណ្ដែត ʔɑn.ˈɗaet to float

n in final position, eg. បូរាណ ɓouraːn ancient

* Subscript consonant is seldom used: often a silent final subscript.

The ɔ-series equivalent is [U+1793 KHMER LETTER NO].

U+178F KHMER LETTER TA

Used by: Khmer

្ត

Khmer

a-series consonanttɑːt  Lo

UNGEN: t

t with inherent vowel ɑː or before a vowel, eg. តា taː grandfather

ɗ at the beginning of two syllable words where the first syllable ends with final nasal, eg. តង្វាយ ɗɔŋʋaaj gift

t before a subscript consonant, eg. ត្រី trəj fish

t in final position, eg. កាត់ kat to cut

Subscript consonant is the same shape as the subscript of ɗɑː .

The pronunciation when a subscript in medial position is unpredictable, sometimes t and sometimes ɗ. As a general rule, but not always, it is pronounced t when a subscript to nɔː , and ɗ when a subscript to nɑː , eg. បន្តុះ ɓɑntoh to criticize បណ្តុះ ɓɑnɗoh to grow

The ɔ-series equivalent is [U+1791 KHMER LETTER TO].

U+1790 KHMER LETTER THA

Used by: Khmer

្ថ

Khmer

a-series consonanttʰɑː  Lo

UNGEN: th

with inherent vowel ɑː or before a vowel, eg. ថា tʰaː that; to say

t(ʰ) before a subscript consonant.

ថ្លើម tʰlaəm liver

t in final position, eg. ប្រមាថ prɑmaːt to scorn, despise

The ɔ-series equivalent is [U+1792 KHMER LETTER THO].

U+1791 KHMER LETTER TO

Used by: Khmer

្ទ

Khmer

o-series consonanttɔː  Lo

UNGEN: t

t with inherent vowel ɔː or before a vowel, eg. ទម្ងន់ tum.ˈŋŭən weight

t before a subscript consonant.

t in final position, eg. បាទ ɓaːt male polite response

The ɑ-series equivalent is [U+178F KHMER LETTER TA].

U+1792 KHMER LETTER THO

Used by: Khmer

្ធ*

Khmer

o-series consonanttʰɔːṯʰ  Lo

UNGEN: th

with inherent vowel ɔː or before a vowel.

ភំ tʰom big

t before a subscript consonant.

ធ្មេញ tʰmɨɲ tooth

t in final position.

អាវុធ ʔaː.ˈʋut weapon

* Subscript consonant is seldom used: often a silent final subscript.

The ɑ-series equivalent is [U+1790 KHMER LETTER THA].

U+1793 KHMER LETTER NO

Used by: Khmer

្ន

Khmer

o-series consonantnɔː  Lo

UNGEN: n

n with inherent vowel ɔː or before a vowel, eg. នឹង nɨŋ future tense marker

Not found before a subscript consonant.

n in final position, eg. មាន miən to have

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations

In some words, the combination ហ្ន [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+1793 KHMER LETTER NO] produces the ɑ-series equivalent, eg. ហ្នឹង nəŋ that's right

The ɑ-series equivalent is [U+178E KHMER LETTER NNO].

U+1794 KHMER LETTER BA

Used by: Khmer

្ប

Khmer

a-series consonant (/p/ in some words, by convention.)ɓɑːb  Lo

UNGEN: b

ɓ with inherent vowel ɑː or before a vowel, eg. បន្ទប់ ɓɑn.ˈtup room

p when followed by a subscript consonant.

ប្រយ័ត្ន prɑ.ˈjat to be careful

p in final position, eg. ឈប់ cʰup to stop

p in some words just by convention, eg. បច្ច័យ pac.ˈcaj money

Shape. A ligature បា is used when this character is followed by sra-aː, to avoid similarity with hɑː , eg. បាយ ɓaːj cooked rice, food

The same applies when followed by sra-ao បោ and sra-aw បៅ.

Combinations. The combination ប៊ [U+1794 KHMER LETTER BA + U+17CA KHMER SIGN TRIISAP] produces the ɔ-series equivalent, eg. ប៊ុត ɓut a personal name

p in the combination ប៉ [U+1794 KHMER LETTER BA + U+17C9 KHMER SIGN MUUSIKATOAN], eg. ប៉ា paː father

U+1795 KHMER LETTER PHA

Used by: Khmer

្ផ*

Khmer

a-series consonantpʰɑː  Lo

UNGEN: ph

with inherent vowel ɑː, eg. ផាយ pʰaːj run at full speed

p before a subscript consonant, eg. ផ្សារ psaː market

p in final position. Not common.

* Subscript consonant is obsolete, or rarely, if ever, used.

The ɔ-series equivalent is [U+1797 KHMER LETTER PHO].

U+1796 KHMER LETTER PO

Used by: Khmer

្ព

Khmer

o-series consonantpɔː  Lo

UNGEN: p

p with inherent vowel ɔː or before a vowel, eg. ពី piː from

p before a subscript consonant.

ព្រៃ prɨj forest

p in final position, eg. ភាព pʰiəp aspect

Combinations. The combination ព៉ [U+1796 KHMER LETTER PO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the ɑ-series equivalent, eg. ព៉ី pəj flute

U+1797 KHMER LETTER PHO

Used by: Khmer

្ភ

Khmer

o-series consonantpʰɔːp̱ʰ  Lo

UNGEN: ph

with inherent vowel ɔː or before a vowel.

ភាសា pʰiə.ˈsaː language

p before a subscript consonant.

ភ្នែក pnɛːk eye

p in final position.

លោភ loːp greed

The ɑ-series equivalent is [U+1795 KHMER LETTER PHA].

U+1798 KHMER LETTER MO

Used by: Khmer

្ម

Khmer

o-series consonantmɔː  Lo

UNGEN: m

m with inherent vowel ɔː or before a vowel.

មុខ muk face, front

m before a subscript consonant.

សម្លាប់ sɑm.ˈlap to kill

m in final position.

តាម taːm to follow

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations

The combination ម៉ [U+1798 KHMER LETTER MO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the a-series equivalent.

ម៉ែ mae mother

It may also be produced by the combination ហ្ម [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+1798 KHMER LETTER MO].

ហ្មត់ mɑt fine, powdered

U+1799 KHMER LETTER YO

Used by: Khmer

្យ

Khmer

o-series consonantjɔː  Lo

UNGEN: y

j with inherent vowel ɔː or before a vowel, eg. យល់ jŭəl to understand

Not found before a subscript consonant.

in final position. Huffman says j, eg. បាយ ɓaːj cooked rice, food

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations. The combination យ៉ [U+1799 KHMER LETTER YO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the ɑ-series equivalent, eg. យ៉ាង jaːŋ kind; variety

U+179A KHMER LETTER RO

Used by: Khmer

្រ

Khmer

o-series consonantrɔː  Lo

UNGEN: r

r with inherent vowel ɔː or before a vowel, eg. រូប ruːp image, figure

Not found before a subscript consonant.

Silent in final position, eg. ខ្មែរ kʰmae Cambodian

There is no final -r sound in Cambodian, but this letter can sometimes disambiguate homonyms, eg. compare កា kaː to address (a letter) ការ kaː work, occupation ពី piː from ពីរ piː two

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations. The combination រ៉ [U+179A KHMER LETTER RO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the ɑ-series equivalent, eg. រ៉ូង ruːŋ rattling noise

U+179B KHMER LETTER LO

Used by: Khmer

្ល

Khmer

o-series consonantlɔː  Lo

UNGEN: l

l with inherent vowel ɔː or before a vowel.

លុយ luj money

l before a subscript consonant.

ពេលល្ងាច peil ˈlŋiəc evening

l in final position.

កាល kaːl time

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations. In some words, the combination ហ្ល [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+179B KHMER LETTER LO] produces the ɑ-series equivalent.

ហ្លួង luəŋ king

The ɑ-series equivalent letter is [U+17A1 KHMER LETTER LA].

U+179C KHMER LETTER VO

Used by: Khmer

្វ

Khmer

o-series consonantʋɔː  Lo

UNGEN: v

ʋ with inherent vowel ɔː or before a vowel, eg. វៃ ʋɨj quick, clever

Not found before a subscript consonant.

w in final position, eg. អាវ ʔaːw shirt

As a subscript this consonant doesn't determine the pronunciation of the vowel sound, it is determined by the class of the non-subscript consonant.

Combinations. The combination វ៉ [U+179C KHMER LETTER VO + U+17C9 KHMER SIGN MUUSIKATOAN] produces the ɑ-series equivalent, eg. វ៉ៃ ʋaj to hit

In some words the same effect is produced using the combination ហ្វ [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+179C KHMER LETTER VO], eg. ហ្វូង fouŋ / ʋouŋ / pʰouŋ flock, crowd

f is also produced by the combination ហ្វ [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+179C KHMER LETTER VO], eg. ហ្វឹក fək to practice កាហ្វេ kaː.ˈfei coffee

U+179D KHMER LETTER SHA

• used only for Pali/Sanskrit transliteration

Khmer

consonantobsolete ɕ  Lo

Used only for Pali/Sanskrit transliteration.

U+179E KHMER LETTER SSO

• used only for Pali/Sanskrit transliteration
• as this character belongs to the first register, its correct transliteration is ssa, not sso

Khmer

consonantobsolete ʂ  Lo

Used only for Pali/Sanskrit transliteration.

U+179F KHMER LETTER SA

Used by: Khmer

្ស

Khmer

a-series consonantsɑːs  Lo

UNGEN: s

s with inherent vowel ɑː or before a vowel.

សី səj wicker ball

s before a subscript consonant.

ខ្សែស្ពាយ kʰsae ˈspiəj shoulder strap

h in final position

ចាស់ cah old

s in final position in very formal reading style

សូមទោស soum ˈtoːh I beg your pardon

Combinations. The combination ស៊ [U+179F KHMER LETTER SA + U+17CA KHMER SIGN TRIISAP] produces the ɔ-series equivalent, eg. ស៊ី siː to eat (animals)

U+17A0 KHMER LETTER HA

Used by: Khmer

្ហ

Khmer

a-series consonanthɑːh  Lo

UNGEN: h

h with inherent vowel ɑː or before a vowel.

ហាង haːŋ shop, store

Silent before a subscript consonant. See Combinations below.

Not found in final position.

Combinations. f is produced by the combinations:

The following combinations with ɔː class subscripts make the following vowel behave as if they were ɑː class consonants:

ហ្វ [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+179C KHMER LETTER VO].

ហ្វូង fouŋ / ʋouŋ / pʰouŋ flock, crowd

ហ្ម [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+1798 KHMER LETTER MO], .

ហ្មត់ mɑt fine, powdered

ហ្ន [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+1793 KHMER LETTER NO].

ហ្នឹង nəŋ that's right

ហ្ល [U+17A0 KHMER LETTER HA + U+17D2 KHMER SIGN COENG + U+179B KHMER LETTER LO].

ហ្ល ហ្ល lɑː

Combinations. The combination ហ៊ [U+17A0 KHMER LETTER HA + U+17CA KHMER SIGN TRIISAP] produces the ɔ-series equivalent, eg. ហ៊ាន hiən to dare

U+17A1 KHMER LETTER LA

Used by: Khmer

្ឡ**

Khmer

a-series consonantlɑːl  Lo

UNGEN: l

l with inherent vowel ɑː or before a vowel, eg. ឡេង leiŋ personal name

Not found before a subscript consonant or in final position.

** Subscript consonant not used in Cambodia (only in Khmer spoken in Thailand).

The ɔ-series equivalent is [U+179B KHMER LETTER LO].

U+17A2 KHMER LETTER QA

• glottal stop

Used by: Khmer

អ ្អ

Khmer

a-series consonantʔɑːʔ  Lo

ʔ with inherent vowel ɑː or before a vowel

អង្កាល់ ʔɑŋ.ˈkal when

អិស្លាម ʔih.ˈslaːm / ʔeh.ˈslaːm Islam

ʔ or before a subscript consonant. From Wiktionary data, the latter seems predominant and occurs mostly in word-initial position before n.

អ្នក nĕəʔ person, you (s)

Not found in final position.

When used as a subscript at the beginning of a word this may add an extra syllable after the initial consonant.

ផ្អែម paʔaem sweet (adj)

ស្អាត saʔaːt clean

Combinations. The combination អ៊ [U+17A2 KHMER LETTER QA + U+17CA KHMER SIGN TRIISAP] produces the ɔ-series equivalent. For example, compare the following (first without, then with).

អុំទូក ʔom ˈtuːk to row, paddle

អ៊ុយក្រែន ʔuj.ˈkraen Ukraine

U+17A3 KHMER INDEPENDENT VOWEL QAQ

• use of this character is strongly discouraged; 17A2 should be used instead

Khmer

independent vowel (strongly deprecated)deprecated A  Lo

This should be considered an error in the encoding. Use of this character is strongly discouraged; [U+17A2 KHMER LETTER QA] should be used instead.

Originally intended only for Pali/Sanskrit transliteration, but not actually a separate character in Khmer.

U+17A4 KHMER INDEPENDENT VOWEL QAA

• use of this character is strongly discouraged; the sequence 17A2 17B6 should be used instead

Khmer

independent vowel (strongly deprecated)deprecated Ä  Lo

This should be considered an error in the encoding. Use of this character is discouraged; the sequence អា [U+17A2 KHMER LETTER QA + U+17B6 KHMER VOWEL SIGN AA​] should be used instead.

Originally intended only for Pali/Sanskrit transliteration, but not actually a separate character in Khmer.

U+17A5 KHMER INDEPENDENT VOWEL QI

Used by: Khmer

Khmer

independent vowelsraʔ ʔeʔស្រៈឥ  Lo

The corresponding dependent vowel is [U+17B7 KHMER VOWEL SIGN I​]

UNGEN: i

ʔə eg. ឥត ʔət not

ʔɨ eg. ឥន្ទុ ʔɨntuʔ the moon

ʔəj eg. ឥឡូវ ʔəjləw now

U+17A6 KHMER INDEPENDENT VOWEL QII

Used by: Khmer

Khmer

independent vowelsraʔ ʔəjស្រៈឦị̄  Lo

The corresponding dependent vowel is [U+17B8 KHMER VOWEL SIGN II​]

UNGEN: ī

ʔəj eg. ឦសាន ʔəj.ˈsaːn / ʔiː.ˈsaːn northeast

U+17A7 KHMER INDEPENDENT VOWEL QU

Used by: Khmer

Khmer

independent vowelsraʔ ʔoស្រៈឧ  Lo

The corresponding dependent vowel is [U+17BB KHMER VOWEL SIGN U​]

UNGEN: u

u, eg. ឧបម៉ា uppəmaː example

o, eg. ឧកញ៉ា ʔok.ˈɲaː official of ministerial rank

ao, eg. ឧបាសក ʔaoɓaːsɑʔ layman

U+17A8 KHMER INDEPENDENT VOWEL QUK

• obsolete ligature for the sequence 17A7 1780
• use of the sequence is now preferred

Khmer

independent vowelobsolete U  Lo

U+17A9 KHMER INDEPENDENT VOWEL QUU

Used by: Khmer

Khmer

independent vowelsraʔ ʔouស្រៈឩụ̄  Lo

The corresponding dependent vowel is [U+17BC KHMER VOWEL SIGN UU​]

UNGEN: ū

ou eg. ឩដ្ឋ ʔout camel

eg. ឩន ʔuːn deficient (lit.)

U+17AA KHMER INDEPENDENT VOWEL QUUV

Used by: Khmer

Khmer

independent vowelsraʔ ʔəwស្រៈឪɨ̣  Lo

The corresponding dependent vowel is [U+17B9 KHMER VOWEL SIGN Y​]

UNGEN: ýu

ʔəw eg. ឪពុក ʔəw.puk father

U+17AB KHMER INDEPENDENT VOWEL RY

Used by: Khmer

Khmer

vocalicsraʔ raស្រៈឫ  Lo

eg. ឫស rɨh root ឫទ្ធិ rɨt power, strength

U+17AC KHMER INDEPENDENT VOWEL RYY

Used infrequently by: Khmer

Khmer

vocalicrare sraʔ raːស្រៈឬr̥̄  Lo

rɨː eg. ឬស្សី rɨːhsəj bamboo rɨː or

U+17AD KHMER INDEPENDENT VOWEL LY

Used by: Khmer

Khmer

vocalicsraʔ laស្រៈឭ  Lo

eg. រំឭក rum.ˈlɨk to commemorate

U+17AE KHMER INDEPENDENT VOWEL LYY

Used infrequently by: Khmer

Khmer

vocalicrare sraʔ laːស្រៈឮl̥̄  Lo

lɨː eg. ស្ដាប់ឮ sɗap ˈlɨː to hear

U+17AF KHMER INDEPENDENT VOWEL QE

Used by: Khmer

Khmer

independent vowelsraʔ ʔaeស្រៈឯæ̣  Lo

The corresponding dependent vowel is [U+17C2 KHMER VOWEL SIGN AE​]

UNGEN: ae

ae eg. ʔae at, as, for

U+17B0 KHMER INDEPENDENT VOWEL QAI

Used by: Khmer

Khmer

independent vowelsraʔ ʔajស្រៈឰạʲ  Lo

The corresponding dependent vowel is [U+17C3 KHMER VOWEL SIGN AI​]

UNGEN: ai

aj, eg. ឰរាវ័ណ ʔajrəʋoə̆n Erawan

U+17B1 KHMER INDEPENDENT VOWEL QOO TYPE ONE

Used by: Khmer

Khmer

independent vowelsraʔ ʔaoស្រៈឱ  Lo

The corresponding dependent vowel is [U+17C4 KHMER VOWEL SIGN OO​]

UNGEN: o

ao, eg. ឱប ʔaop to embrace

U+17B2 KHMER INDEPENDENT VOWEL QOO TYPE TWO

• this is a variant for 17B1, used in only two words
• 17B1 is the normal variant of this vowel

Used by: Khmer

Khmer

independent vowelsraʔ ʔaoស្រៈឲạᵒ  Lo

UNGEN: âu

ao, eg. ឲ្យ ʔaoj to give

This is a variant of [U+17B1 KHMER INDEPENDENT VOWEL QOO TYPE ONE] that is only used, according to Unicode, in two words, one of which (the example shown just above) is very common however.

U+17B3 KHMER INDEPENDENT VOWEL QAU

Used by: Khmer

Khmer

independent vowelsraʔ ʔawស្រៈឳạʷ  Lo

The corresponding dependent vowel is [U+17C5 KHMER VOWEL SIGN AU​]

UNGEN: au

aw, eg. ឳទក ʔawtuə̆k water-plant

U+17B4 KHMER VOWEL INHERENT AQ​

Khmer

inherent vowel markdeprecated   Mn

An invisible character, originally intended (along with 17B5) to represent a phonetic difference not expressed by the spelling, so as to assist in phonetic sorting, however, the Unicode Standard considers those characters to be insufficient for that purpose and errors in the encoding. It should not be used.8677

U+17B5 KHMER VOWEL INHERENT AA​

Khmer

inherent vowel markdeprecated   Mn

An invisible character, originally intended (along with 17B4) to represent a phonetic difference not expressed by the spelling, so as to assist in phonetic sorting, however, the Unicode Standard considers those characters to be insufficient for that purpose and errors in the encoding. It should not be used.8677

U+17B6 KHMER VOWEL SIGN AA​

Used by: Khmer

Khmer

vowel signsrà ʼāស្រៈ អាā  Mc

UNGEN: ā

after an ɑː class consonant, eg. ណា naː which, where

after an ɔː class consonant, eg. ជា ciə to be

Combinations

ាំ [U+17B6 KHMER VOWEL SIGN AA + U+17C6 KHMER SIGN NIKAHIT] is regarded as a letter of the Khmer alphabet. Sounds are:

  • am after an ɑː class consonant, eg. ចាំ cam to remember
  • oə̯m after an ɔː class consonant, eg. ជាំ cŏəm bruised

ាំង [U+17B6 KHMER VOWEL SIGN AA + U+17C6 KHMER SIGN NIKAHIT + U+1784 KHMER LETTER NGO] gives:

  • after an ɑː class consonant, eg. តាំង taŋ to establish
  • ɛaŋ after an ɔː class consonant, eg. ទាំង tĕəŋ including, both

ា​់ [U+17B6 KHMER VOWEL SIGN AA + consonant + U+17CB KHMER SIGN BANTOC], where the bantoc sits over a following consonant, gives:

  • eə̯ before a velar consonant, eg. compare ពាក់ pĕəʔ to wear ពាក្យ piəʔ word
  • oə̯ elsewhere, eg. compare មាន់ mŏən chicken មាន miən to have

U+17B7 KHMER VOWEL SIGN I​

Used by: Khmer

Khmer

vowel signsraʔ ʔiស្រៈ អិi  Mn

The corresponding independent vowel is [U+17A5 KHMER INDEPENDENT VOWEL QI]

UNGEN: i

e after an ɑː class consonant, eg. ចិត្ដ

i after an ɔː class consonant, eg. វិញ ʋɨɲ instead, again

Glottal stop

It is followed by a glottal stop in stressed syllables but not in unstressed, eg. compare the following pairs: មតិ mĕəʔ.ˈteʔ / ma.ˈteʔ opinion កិរិយា ke.ri.jaː conduct, behaviour លទ្ធិ lat.ˈtʰiʔ precept និទាន ni.ˈtiən story

Combinations.

eh / ih is ិះ [U+17B7 KHMER VOWEL SIGN I + U+17C7 KHMER SIGN REAHMUK]. This is noted especially because the combination is regarded as a unit in the Khmer alphabet. This combination has the same sound as េះ [U+17C1 KHMER VOWEL SIGN E + U+17C7 KHMER SIGN REAHMUK], but is much less common. ជិះ cih to ride

əj can be –ិយ [U+17B7 KHMER VOWEL SIGN I + U+1799 KHMER LETTER YO] after an ɔː class consonant, eg. ចេតិយ caet.ˈɗəj stupa

U+17B8 KHMER VOWEL SIGN II​

Used by: Khmer

Khmer

vowel signsraʔ ʔəjស្រៈ អីī  Mn

The corresponding independent vowel is [U+17A6 KHMER INDEPENDENT VOWEL QII]

UNGEN: ī

əj after an ɑː class consonant, eg. បី ɓəj three

after an ɔː class consonant, eg. ពីរ piː two

Combinations.

in ិយ [U+17B7 KHMER VOWEL SIGN I + U+1799 KHMER LETTER YO], eg. ឥន្ត្រិយ ʔəntriː power

U+17B9 KHMER VOWEL SIGN Y​

Used by: Khmer

Khmer

vowel signsraʔ ʔəស្រៈ អឹɨ  Mn

The corresponding independent vowel is [U+17AA KHMER INDEPENDENT VOWEL QUUV]

UNGEN:

ə after an ɑː class consonant, eg. ដឹក ɗək transport

ɨ after an ɔː class consonant, eg. ទឹក tɨk water

Combinations.

əh is –ឹះ [U+17B9 KHMER VOWEL SIGN Y + U+17C7 KHMER SIGN REAHMUK], eg. ឆ្កឹះ cʰkəh to prise out

U+17BA KHMER VOWEL SIGN YY​

Used by: Khmer

Khmer

vowel signsraʔ ʔəɨស្រៈ អឺɨ̄  Mn

UNGEN: ȳ

əɨ after an ɑː class consonant, eg. ដឺ ɗəɨ fallow

ɨː after an ɔː class consonant, eg. គឺ kɨː as follows

U+17BB KHMER VOWEL SIGN U​

Used by: Khmer

Khmer

vowel signsraʔ ʔoស្រៈ អុu  Mn

The corresponding independent vowel is [U+17A7 KHMER INDEPENDENT VOWEL QU]

UNGEN: u

o after an ɑː class consonant. It is followed by a glottal stop in open stressed syllables but not in unstressed, eg. compare វត្តុ ʋoə̆ttoʔ artifact កុមារ ko.ˈmaː child

u after an ɔː class consonant. It is followed by a glottal stop in stressed syllables but not in unstressed, eg. compare វិទ្យុ ʋi.ˈtʰjuʔ radio គុលិកា ku.li.kaː pill, tablet

Combinations

ុះ [U+17BB KHMER VOWEL SIGN U + U+17C7 KHMER SIGN REAHMUK] is regarded as a letter of the Khmer alphabet, and is pronounced:

  • oh after an ɑː class consonant, eg. ចុះ coh to go down, fall
  • uh after an ɔː class consonant, eg. ពុះ puh to boil; to split (wood)

ុំ [U+17BB KHMER VOWEL SIGN U + U+17C6 KHMER SIGN NIKAHIT] is pronounced:

  • om after an ɑː class consonant, eg. ដុំ ɗom piece, morsel
  • um after an ɔː class consonant, eg. ទុំ tum ripe

U+17BC KHMER VOWEL SIGN UU​

Used by: Khmer

Khmer

vowel signsraʔ ʔūស្រៈ អូū  Mn

The corresponding independent vowel is [U+17A9 KHMER INDEPENDENT VOWEL QUU]

UNGEN: ū

ou after an ɑː class consonant, eg. កូរ kou to stir

after an ɔː class consonant, eg. គូ kuː pair

Combinations

ូវ [U+17BC KHMER VOWEL SIGN UU + U+179C KHMER LETTER VO] is
ɨw, eg. នូវ nɨw consisting of

U+17BD KHMER VOWEL SIGN UA​

Used by: Khmer

Khmer

vowel signsraʔ ʔuaស្រៈ អួuᵊ  Mn

UNGEN: au

after any class of consonant, eg. កួរ kuə pod, ear គួរ kuə proper

U+17BE KHMER VOWEL SIGN OE​

Used by: Khmer

Khmer

vowel signsraʔ ʔaəស្រៈ អើaᵊ  Mc

UNGEN: oe

after an ɑː class consonant, eg. បើ ɓaə if

əː after an ɔː class consonant, eg. ឈើ cʰəː wood

Combinations

–ើះ [U+17BE KHMER VOWEL SIGN OE + U+17C7 KHMER SIGN REAHMUK] is
əh, eg. ចង្កើះ cɑŋ.ˈkəh chopsticks

The normal spelling is ចង្កឹះ ចង្កឹះ

U+17BF KHMER VOWEL SIGN YA​

Used by: Khmer

Khmer

vowel signsraʔ ʔɨəស្រៈ អឿɨᵊ  Mc

UNGEN: ia

ɨə after any class of consonant, eg. តឿ tɨə dwarfed ជឿ cɨə to believe

U+17C0 KHMER VOWEL SIGN IE​

Used by: Khmer

Khmer

vowel signsraʔ ʔiəស្រៈ អៀiᵊ  Mc

UNGEN: ia

after any class of consonant, eg. តៀប tiəp bowl ទៀប tiəp nearly

U+17C1 KHMER VOWEL SIGN E​

Used by: Khmer

Khmer

vowel signsraʔ ʔeiស្រៈ អេe  Mc

UNGEN: e

ei after an ɑː class consonant, eg. កេរ្តិ៍ kei reputation, honor

after an ɔː class consonant, eg. គេ keː he, she, they (polite)

ɨ after an ɔː class consonant but before a palatal, eg. ពេញ pɨɲ full

Combinations

េះ [U+17C1 KHMER VOWEL SIGN E + U+17C7 KHMER SIGN REAHMUK] is pronounced:

  • eh after an ɑː class consonant, eg. សេះ seh horse
  • ih after an ɔː class consonant, eg. នេះ nih this, here

This combination has the same sound as ិះ [U+17B7 KHMER VOWEL SIGN I + U+17C7 KHMER SIGN REAHMUK], but is much more common.

U+17C2 KHMER VOWEL SIGN AE​

Used by: Khmer

Khmer

vowel signsraʔ ʔaeស្រៈ អែæ  Mc

The corresponding independent vowel is [U+17AF KHMER INDEPENDENT VOWEL QE]

UNGEN: ae

ae after an ɑː class consonant, eg. កែ kae to repair

ɛː after an ɔː class consonant, eg. គែ kɛː craw (of bird)

Combinations

ែះ [U+17C2 KHMER VOWEL SIGN AE + U+17C7 KHMER SIGN REAHMUK] is
eh after an ɔ-class consonant.

កែះ keh wild goat

U+17C3 KHMER VOWEL SIGN AI​

Used by: Khmer

Khmer

vowel signsraʔ ʔajស្រៈ អៃ  Mc

The corresponding independent vowel is [U+17B0 KHMER INDEPENDENT VOWEL QAI]

UNGEN: ai

aj after an ɑː class consonant, eg. ប្រៃ praj salty

ɨj after an ɔː class consonant, eg. ព្រៃ prɨj forest

U+17C4 KHMER VOWEL SIGN OO​

Used by: Khmer

Khmer

vowel signsraʔ ʔaoស្រៈ អោo  Mc

The corresponding independent vowel is [U+17B1 KHMER INDEPENDENT VOWEL QOO TYPE ONE]

UNGEN: o

ao after an ɑː class consonant.

កោរ kao to shave

after an ɔː class consonant, eg. គោ koː cow

Combinations

ោះ [U+17C4 KHMER VOWEL SIGN OO + U+17C7 KHMER SIGN REAHMUK] is pronounced:

  • ɑh after an ɑː class consonant.

    កោះ kɑh island

  • uəh or uh after an ɔː class consonant, eg. ឈ្មោះ cʰmŭəh (given) name នោះ nuh that

U+17C5 KHMER VOWEL SIGN AU​

Used by: Khmer

Khmer

vowel signsraʔ ʔawស្រៈ អៅ  Mc

The corresponding independent vowel is [U+17B3 KHMER INDEPENDENT VOWEL QAU]

UNGEN: au

aw after an ɑː class consonant, eg. តៅ taw bushel

ɨw after an ɔː class consonant, eg. ទៅ tɨw to go

U+17C6 KHMER SIGN NIKAHIT​

= srak am
= anusvara
• final nasalization
• this character is usually regarded as a vowel sign am, along with om and aam
→ (thai character nikhahit - 0E4D)
→ (myanmar sign anusvara - 1036)

Used by: Khmer

Khmer

anusvaranikkɔːhetនិគ្គហិត  Mn

UNGEN:

Marks final nasalisation, but also changes and shorten the inherent or other vowel. Although it can be equated with the anusvara in Sanskrit, this is usually regarded as a vowel sign or a part of a vowel sign in Khmer.

ɑm after an ɑː class consonant, eg. កំពុង kɑm.ˈpuŋ ongoing action

um after an ɔː class consonant, eg. រំភើប rumɓəːp excited

Combinations.

ុំ [U+17BB KHMER VOWEL SIGN U + U+17C6 KHMER SIGN NIKAHIT] is pronounced:

  • om after an ɑː class consonant, eg. ធំ tʰom big
  • um after an ɔː class consonant, eg. ខ្លាឃ្មុំ kʰlaːˈkʰmum sun bear

ាំ [U+17B6 KHMER VOWEL SIGN AA + U+17C6 KHMER SIGN NIKAHIT] is:

  • am after an ɑː class consonant, eg. ដាំ ɗam to plant
  • oə̯m after an ɔː class consonant, eg. នាំ nŏəm to lead

ាំង [U+17B6 KHMER VOWEL SIGN AA + U+17C6 KHMER SIGN NIKAHIT + U+1784 KHMER LETTER NGO] is:

  • after an ɑː class consonant, eg. ម្ហូបបារាំង mhouɓɓaːraŋ French food
  • eə̯ŋ after an ɔː class consonant, eg. ទាំង tĕəŋ including, both

In some words of Sanskrit origin, the niʔkəhət represents or an, eg. សំស្ក្រិត sɑŋskret Sanskrit សំយោគសញ្ញា saŋ.joːk saɲ.ɲaː name of a diacritic

U+17C7 KHMER SIGN REAHMUK​

= srak ah
= visarga
→ (myanmar sign visarga - 1038)

Used by: Khmer

Khmer

visargareə̆hmukរះមុខ  Mc

UNGEN:

Although it can be equated with the visarga in Sanskrit, this is regarded as a vowel sign or part of a vowel sign in Khmer.

ah with an ɑː class inherent vowel, eg. ខ្លះ kʰlah some

eə̆h with an ɔː class inherent vowel or an ɔː class [U+17B6 KHMER VOWEL SIGN AA], eg. ផ្ទះ pʰtĕəh house, home

Combinations

h after the normal sounds of short vowels [U+17B7 KHMER VOWEL SIGN I], [U+17B9 KHMER VOWEL SIGN Y], and [U+17BB KHMER VOWEL SIGN U], eg. ជិះ cih to ride កឹះ kəh to tap on the shoulder ពុះ puh to boil; to split (wood)

–េះ [U+17C1 KHMER VOWEL SIGN E + U+17C7 KHMER SIGN REAHMUK] is:

  • eh with an ɑː class vowel, eg. សេះ seh horse
  • ih with an ɔː class vowel, eg. នេះ nih this, here

–ែះ [U+17C2 KHMER VOWEL SIGN AE + U+17C7 KHMER SIGN REAHMUK] is also
eh with an ɑː class vowel, eg. កែះ keh wild goat

–ើះ [U+17BE KHMER VOWEL SIGN OE + U+17C7 KHMER SIGN REAHMUK] is:
əh with an ɑː class vowel, eg. ចង្កើះ cɑŋ.ˈkəh chopsticks

–ោះ [U+17C4 KHMER VOWEL SIGN OO + U+17C7 KHMER SIGN REAHMUK] is:

  • ɑh with an ɑː class vowel, eg. កោះ kɑh island
  • uə̆h with an ɔː class vowel, eg. គោះ kuə̆h to strike

U+17C8 KHMER SIGN YUUKALEAPINTU​

• inserts a short inherent vowel with abrupt glottal stop
• the preferred transliteration is yukaleakpintu

Used by: Khmer

Khmer

short voweljuʔkɔluə̆ʔpintuʔយុគលពិន្ទុa  Mc

Used as a vowel after consonants that are to be used as stressed syllables at the end of a word, or preceding an internal juncture in compounds.

as an ɑː class vowel, eg. ស្រៈ sraʔ vowel

eə̆ʔ as an ɔː class vowel, eg. ធុរៈ tʰureə̆ʔ preoccupation

This character is a modern innovation. It is also used in the official Cambodian Dictionary to mark any consonant symbol which is pronounced independently.

U+17C9 KHMER SIGN MUUSIKATOAN​

• changes the second register to the first
• the preferred transliteration is muusekatoan

Used by: Khmer

Khmer

consonant shiftermuseʔkətoə̯nមូសិកទន្ត  Mn

Changes the register of a consonant from ɔː to ɑː, affecting the inherent vowel and any other vowel following the consonant, eg. ម៉ត់ចត់ mɑtcɑt careful រ៉ាប់ rap to guarrantee

It is used for the following consonants that don't have equivalents in the ɑː class: ង ញ ម យ រ វ. It is usually written over the right-hand side of the consonant glyph. This is also especially useful for spelling foreign names. Eg. compare យ៉ាង jaːŋ kind; variety យាង jiəŋ go (royalty)

Changes the sound of ɓɑː from ɓ to p, eg. ប៉ះ pah to touch

This is the only way to write an ɑː register p. Eg. compare ប៉ាន paːn to cover បាន ɓaːn to have

When a vowel sign appears above the consonant, the ក្បៀសក្រោម kɓiəhkraom form of this character is used. This looks exactly like sra-o អុ, eg. ម៉ឺន məɨn 10,000 ញ៉ាំ ɲam to eat (coll.)

You can prevent this behaviour using a zero-width non-joiner between this character and the following one, eg. ញ៉‌ាំ ញ៉‌ាំ

tmɨɲ kɑnɗao means "rat's teeth".

U+17CA KHMER SIGN TRIISAP​

• changes the first register to the second
• the preferred transliteration is treisap

Used by: Khmer

Khmer

consonant shiftertrəisaɓត្រីសព្ទ  Mn

Changes the class of a consonant from ɔː to , affecting the inherent vowel and also any other vowel following the consonant, eg. compare ក្រុមហ៊ុន krom ˈhun / krom.ˈmaʔ hun company ហ៊ាន hiən to dare ហាន haːn shop អ៊ូ ʔuː dry dock អូ ʔou exclamation

This is especially useful for spelling foreign names.

When this appears with a vowel sign above the consonant, the ក្បៀសក្រោម kɓiəhkraom form is used. This looks exactly like sra-o អុ, eg. in ស៊ី siː to eat (animals)

You can prevent this behaviour using a zero-width non-joiner between this character and the following one, eg. ស៊‌ី ស៊‌ី

U+17CB KHMER SIGN BANTOC​

• shortens the vowel sound in the previous orthographic syllable
• the preferred transliteration is bantak

Used by: Khmer

Khmer

vowel shortenerɓɑntɑkបន្តក់˘  Mn

Always placed above the final consonant in modern Khmer. Basically shortens the preceding vowel. Affects the preceding vowel sound in one of the following ways:

  • After an inherent vowel
    • ɑ after an ɑː class consonant, eg. compare ចប់ cɑp to finish ចប cɑːp hoe
    • u after an ɔː class consonant and before a labial consonant, eg. compare លប់ lup bird trap លប lɔːp fish trap
    • uə̯ otherwise after an ɔː class consonant, eg.compare លក់ luə̆ʔ to sell លក lɔːk to channel
  • After following an ɔː class consonant
    • eə̯ before a velar consonant, eg. compare ពាក់ pĕəʔ to wear ពាក្យ piəʔ word
    • oə̯ elsewhere, eg. compare មាន់ mŏən chicken មាន miən to have
  • Otherwise, shortens a long vowel, eg. compare ចាប់ cap to catch ចាប caːp sparrow

U+17CC KHMER SIGN ROBAT​

• a diacritic historically corresponding to the repha form of ra in Devanagari

Used by: Khmer

Khmer

consonant silencerinfrequent rɔɓaːtរបាទʳ  Mn

Not a very common mark. It represents a final r on the previous consonant (like the Devanagari repha), although it may be pronounced as , eg. ទុគ៌ត tuːrəkuə̆t destitute

When it appears word-finally, it typically also silences the final consonant, eg. បរិបូណ៌ ɓɑ.ri.ɓou abundant

It can also convert the vowel sound of the previous consonant from ɔː to ɔə at the same time as silencing the consonant it appears over, eg. ពណ៌ pɔə colour

Hosken says that it is only used on a consonant with an inherent vowel, or with one followed by [U+17B6 KHMER VOWEL SIGN AA] or [U+17BB KHMER VOWEL SIGN U]36.

U+17CD KHMER SIGN TOANDAKHIAT​

• indicates that the base character is not pronounced
→ (thai character thanthakhat - 0E4C)

Used by: Khmer

Khmer

syllable silencertɔnɗɔkʰiətទណ្ឌឃាតˣ  Mn

Used over a consonant, particularly in loan words, to silence it and any attached vowels or subscripts, eg. សាសន៍ saːh race, ethnicity កេរ្តិ៍ kei reputation, honor រេហ៍ពល rɔ.puə̆l army

It visually ligates with [U+17B7 KHMER VOWEL SIGN I], eg. តែនតិ៍ taen tent

U+17CE KHMER SIGN KAKABAT​

• sign used with some exclamations

Used by: Khmer

Khmer

exclamation signinfrequent kaːkɑɓaːtកាកបាទ  Mn

Very rare, but used over a consonant to convey excited emphasis, like an exclamation mark, eg. ណែ៎ nɛː Hey ! នុ៎ះន៎ nuh.nɔː Over there !

U+17CF KHMER SIGN AHSDA​

• denotes stressed intonation in some single-consonant words

Used by: Khmer

Khmer

vowel sign/word specifierinfrequent leːk ʔahsɗaːលេខអស្ដាɑ̄  Mn

Used over two consonants to indicate that they represent two specific words: ក៏ ក៏ kɑː auxiliary: also, then, therefore ដ៏ ដ៏ ɗɑː pronoun which; very

U+17D0 KHMER SIGN SAMYOK SANNYA​

• denotes deviation from the general rules of pronunciation, mostly used in loan words from Pali/Sanskrit, French, and so on

Used by: Khmer

Khmer

vowel changer/vowel signsanjoːksaɲɲaːសំយោគសញ្ញាă  Mn

Generally regarded to be a vowel-changer, but actually acts much like a vowel-sign.

a over an ɑː class consonant, eg. ស័ក្តិ saʔ rank ស័ក saʔ era

eə̆ before velar finals, eg. ល័ខ lĕəʔ red dye វ័ង ʋŭəŋ palace

oə̯ over an ɔː class consonant, in general, eg. ទ័ព toə̆p army

(Note that all of the above are the same as if a sra-aː plus niʔkəhət had been used.)

Combinations

័យ [U+17D0 KHMER SIGN SAMYOK SANNYA + U+1799 KHMER LETTER YO] is:

aj after an ɑː class consonant, eg. សម័យ saʔ.ˈmaj era ស្រៈនិស្ស័យ sraʔ nih.ˈsaj dependent vowel

ɨj after an ɔː class consonant, eg. ជ័យ cɨj victory

Gilbert & Hang say that the sound is e usually, when over ʋɔː or nɔː .

–័រ [U+17D0 KHMER SIGN SAMYOK SANNYA + U+179A KHMER LETTER RO] is:
ɔə (these are always ɔː class consonants), eg. ជ័រ cɔə resin

U+17D1 KHMER SIGN VIRIAM​

• mostly obsolete, a 'killer'
• indicates that the base character is the final consonant of a word without its inherent vowel sound

Used by: Khmer

Khmer

viramainfrequent ʋiriəmវិរាមˣ̣  Mn

The sanskrit virama, sometimes used in Sanskrit words to indicate that a final consonant has no vowel sound, eg. អាត្មន៑ ʔaːt.man soul

Sometimes this appears as a bar over the consonant, sometimes as a wide, inverted v-shape under the consonant.

The Cambodian way of spelling such words uses the sanjoːksaɲɲaː អ័.

U+17D2 KHMER SIGN COENG​

• functions to indicate that the following Khmer letter is to be rendered subscripted
• shape shown is arbitrary and is not visibly rendered

Used by: Khmer

Khmer

coeng signcəːŋជើង͞  Mn

Serves to indicate in Unicode text that the following consonant should be rendered as a subscript. The shape is arbitrary, since it is never visible in Khmer (unlike Devanagari etc.).

cəːŋ (transcribed in Unicode as COENG) is actually the name given to the subscripted consonants themselves, and this should more accurately be called a cəːŋ generator.

This virama-based model used by Unicode is consistent with the approach to other Indic scripts, however Cambodian people regard the subscripted consonants as different entities to the normal consonant characters, and need to be taught to use a cəːŋ sign to type in Unicode.

U+17D3 KHMER SIGN BATHAMASAT​

• originally intended as part of lunar date symbols
• use of this character is discouraged in favor of the complete set of lunar date symbols
→ (khmer symbol pathamasat - 19E0)

Khmer

lunar date signdeprecated °  Mn

Use discouraged in favor of the complete set of lunar date symbols.

U+17D4 KHMER SIGN KHAN

• functions as a full stop, period
→ (thai character paiyannoi - 0E2F)
→ (myanmar sign little section - 104A)

Used by: Khmer

Khmer

full stopkʰanខ័ណ្ឌ.  Po

UNGEN: .

Equivalent of a period, placed at the end of a sentence.

Also used in the following combination to mean etcetera ។ល។ ។ល។ lanəŋla

U+17D5 KHMER SIGN BARIYOOSAN

• indicates the end of a section or a text
→ (thai character angkhankhu - 0E5A)
→ (myanmar sign section - 104B)

Used by: Khmer

Khmer

section terminatorɓɑːrijaosaːnបរិយោសាន|  Po

Used at the end of a chapter or an entire text.

U+17D6 KHMER SIGN CAMNUC PII KUUH

• functions as colon
• the preferred transliteration is camnoc pii kuuh
→ (division sign - 00F7)
→ (tibetan mark gter tsheg - 0F14)

Used by: Khmer

Khmer

coloncɑmnoc piː kuːhចំណុចពីរគូស  Po

Used much like a colon in English.

It is typically used after the quotative particle ថា tʰaː that; to say and the copulative kɔː mute

U+17D7 KHMER SIGN LEK TOO

• repetition sign
→ (thai character maiyamok - 0E46)

Used by: Khmer

Khmer

repetition signstuən / leːktoːស្ទួន / លេខទោ&  Lm

Repetition sign. Repeats the word directly before. A common way of providing emphasis, eg. ខ្លាំង ៗ klaŋ klaŋ very strong គាត់មានផ្ទះថ្មី ៗ kaːtʰ miən pʰteə̯h tʰməitʰməi he has a brand new house

Sometimes repeats a phrase rather than a word, eg. បន្តិចម្ដង ៗ ɓɑn.təc mɗɑːŋ ɓɑn.təc mɗɑːŋ little by little

It is also occasionally used to repeat the word at the end of a sentence for the beginning of a new sentence (Huffman), eg. ខ្ញុំទៅផ្ទះខ្ញុំ ៗ នៅជិតផ្សារ kɲomtɨwpteə̆hkɲom pteə̆hkɲom nɨwcɨtpsaː

The sign is usually separated from the text by a space.

U+17D8 KHMER SIGN BEYYAL

• et cetera
• use of this character is discouraged; other abbreviations for et cetera also exist
• preferred spelling: 17D4 179B 17D4

Used infrequently by: Khmer

Khmer

etc.best avoided lanəŋla / laʔ។ល។ / ល៉ៈ⋯̇  Po

UNGEN:

Means et cetera. Use of this character is discouraged. The preferred representation uses the individual characters, eg. ។ល។ ។ល។

Other spellings for et cetera also exist. These include:

  • ។បេ។
  • –បេ–
  • –ល–

Pronounced lɑʔ.

U+17D9 KHMER SIGN PHNAEK MUAN

• indicates the beginning of a book or a treatise
• the preferred transliteration is phnek moan
→ (thai character fongman - 0E4F)

Used by: Khmer

Khmer

text startpnɛːkmoə̆nភ្នែកមាន់  Po

Marks the beginning of literary and religious texts.

Forms a pair with , which ends a text.

Means cock's eye. It is said to represent the trunk of the elephant-god Ganesha.

U+17DA KHMER SIGN KOOMUUT

• indicates the end of a book or treatise
• this forms a pair with 17D9
• the preferred transliteration is koomoot
→ (thai character khomut - 0E5B)

Used by: Khmer

Khmer

text endkoːmoutគោមូត្រ  Po

Marks the absolute end of a text. Usually used for poetic or religious texts.

Forms a pair with , which starts a text.

Sometimes used in combination as ។៚.

Means cow's urine.

U+17DB KHMER CURRENCY SYMBOL RIEL

Used by: Khmer

Khmer

currency symbolsaɲ ɲaː riəlសញ្ញារៀល¤  Sc

Placed after the amount, eg. ៣០០០ ៛ ɓej poan riəl 3000 riel

U+17DC KHMER SIGN AVAKRAHASANYA

• rare, shows an omitted Sanskrit vowel, like an apostrophe
• the preferred transliteration is avakraha sannya
→ (devanagari sign avagraha - 093D)

Khmer

marker for omitted Sanskrit vowelunused avakraha sannya˟  Lo

U+17DD KHMER SIGN ATTHACAN​

• mostly obsolete
• indicates that the base character is the final consonant of a word with its inherent vowel sound
→ (khmer sign viriam - 17D1)

Used by: Khmer

Khmer

final vowel retainerinfrequent   Mn

A rarely used sign that indicates that the consonant retains its inherent vowel sound.

U+17E0 KHMER DIGIT ZERO

Used by: Khmer

Khmer

digitsounសូន  Nd

0 digit.

U+17E1 KHMER DIGIT ONE

Used by: Khmer

Khmer

digitmuəjមួយ  Nd

1 digit.

U+17E2 KHMER DIGIT TWO

Used by: Khmer

Khmer

digitpiːពីរ  Nd

2 digit.

U+17E3 KHMER DIGIT THREE

Used by: Khmer

Khmer

digitɓəjបី  Nd

3 digit.

U+17E4 KHMER DIGIT FOUR

Used by: Khmer

Khmer

digitɓuənបួន  Nd

4 digit.

U+17E5 KHMER DIGIT FIVE

Used by: Khmer

Khmer

digitpramប្រាំ  Nd

5 digit.

U+17E6 KHMER DIGIT SIX

Used by: Khmer

Khmer

digitprammuəjប្រាំមួយ  Nd

6 digit.

U+17E7 KHMER DIGIT SEVEN

Used by: Khmer

Khmer

digitprampiː / prampɨlប្រាំពីរ  Nd

7 digit.

U+17E8 KHMER DIGIT EIGHT

Used by: Khmer

Khmer

digitpramɓəjប្រាំបី  Nd

8 digit.

U+17E9 KHMER DIGIT NINE

Used by: Khmer

Khmer

digitpramɓuənប្រាំបួន  Nd

9 digit.

Khmer Symbols

U+19E0 KHMER SYMBOL PATHAMASAT

• represents the first Ashadha (eighth month of the lunar calendar)

U+19F0 KHMER SYMBOL TUTEYASAT

• represents the second Ashadha in the lunar calendar during the Adhikameas leap year

References & sources

1Peter T. Daniels and William Bright, The World's Writing Systems, Oxford University Press, ISBN 0-19-507993-0

2Franklin Huffman, Cambodian System of Writing

3Martin Hosken (2021), Khmer Encoding (retr. Dec 2021)

4Richard Gilbert & Sovandy Hang, Cambodian for Beginners, Paiboon Publishing, ISBN 1-887521-35-6

5Omniglot, Bamum syllabary

6Richard Ishida, Ready-made Counter Styles

7SIL, The order of components in Khmer orthographical syllables

8Unicode Consortium, The Unicode Standard, Version 13.0, Chapter 16.4: Southeast Asia, Khmer, 653-664, ISBN 978-1-936213-16-0.

9Unicode Consortium, Unicode Line Breaking Algorithm (UAX#14)

10Wikipedia, Khmer phonology

11Wikipedia, Khmer alphabet

See recent changes.  •  Make a comment.  •  Licence CC-By © r12a.