Eastern Cham

orthography notes

Updated 27 October, 2025

This page brings together basic information about the Cham script and its use for the Eastern Cham language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Eastern Cham using Unicode.

It was difficult to find comprehensive sources that map the letters and marks to sounds using unambiguous IPA transcriptions, and it was often difficult to interpret what sound was meant by the varying transcriptions used. The sources tended to vary and seemingly contradict each other in some areas. What is included here is what was felt to be reasonably sure, but further review and research is needed.

Referencing this document

Richard Ishida, Eastern Cham Orthography Notes, 27-Oct-2025, https://r12a.github.io/scripts/cham/cjm

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size:   36px

ꨢꨯꨮ ꨖꩌꨠꨩ ꨨꨯꨮꨓꨭꩇꨚꨞꨥꨩ ꨓꨯꨮꨧꩌ ꨨꨯꨮꨓꨭꩌ ꨓꨔꨩꨈꨓꨯ ꨀꨩꨨ ꩝ ꨓꨯꨮꨧꩆꨌ ꨢꨯ ꨘꨪꨣꨯꨖꨯ ꨃꨥꩌ ꨥꨩꨕꨫ ꨠꨨꨩꨧꨠꨘꨯ ꩞

Source: Aksharamukha : Script Converter

Usage & history

Speakers of the Eastern Cham language number about 132,000 in Bình Thuận, Ninh Thuận, and Đồng Nai provinces in southern Vietnam, as well as in Hồ Chí Minh City. The Ethnologue estimates the L1 literacy rate to be 5%–10%. The Cham script ( ꨀꨇꩉ ꨌꩌ ) is the primary orthography for the Eastern Cham, but the largely muslim Western Cham peoples of Cambodia prefer the Arabic script.eth Historically, the Eastern Cham script was learned by boys once they reached a certain age, but not by women and girls.ws

Coming to Southeast Asia with the expansion of Indian religions, Cham was one of the first scripts to develop from the Pallava script some time around 200 CE.ws

Unicode 17 has 1 dedicated block, comprising 83 characters.

More information: WikipediaEndangered Alphabets

Basic features

The Cham script is an abugida, ie. each consonant contains an inherent vowel sound. See the table to the right for a brief overview of features for the Eastern Cham orthography.

Cham text runs left-to-right in horizontal lines. There is no case distinction. Words are separated by spaces.

❯ consonantSummary

Cham represents native consonant sounds using 35 basic letters. The orthography has 4 medial consonants, some of which can be combined. Syllable codas are written using one of 15 final letters (distinguished from the original by a lengthened stroke to the right), or one of 3 combining marks.

Consonant clusters only occur when a syllable with a coda is followed by another syllable, and there are no conjunct forms or stacking involved.

❯ basicV

Cham has 2 inherent vowel sounds, usually pronounced a but by default pronounced ɨ after nasals. An extra set of nasal consonant letters is available to write a syllable with an inherent a vowel (each having a special stroke below the main glyph), and an ɨ vowel sign is available to represent that sound after non-nasals.

Other vowels are written using one or more of 10 combining vowel signs per consonant (this page lists 7 composite vowel signs). There is 1 pre-base vowel, but no circumgraphs (although composite vowel signs may add glyphs to more than one side of the base).

Standalone vowels are written in one of 2 ways. The orthography has 6 independent vowel letters, and any other vowels are written by appending a vowel sign to the letter AA00.

Cham uses native digits, although ASCII digits are also used. There are 4 native punctuation marks in addition to ASCII punctuation.

Notable features

Character index

Letters

Show

Basic consonants

ꨆ,ꨇ,ꨈ,ꨉ,ꨊ,ꨋ,ꨌ,ꨍ,ꨎ,ꨏ,ꨐ,ꨑ,ꨒ,ꨓ,ꨔ,ꨕ,ꨖ,ꨗ,ꨘ,ꨙ,ꨚ,ꨛ,ꨜ,ꨝ,ꨞ,ꨟ,ꨠ,ꨡ,ꨢ,ꨣ,ꨤ,ꨥ,ꨦ,ꨧ,ꨨ

Vowel letters

ꨀ,ꨁ,ꨂ,ꨃ,ꨄ,ꨅ

Final letters

ꩀ,ꩁ,ꩂ,ꩄ,ꩅ,ꩆ,ꩇ,ꩈ,ꩉ,ꩊ,ꩋ

Combining marks

Show

Vowel marks

ꨩ,ꨪ,ꨫ,ꨬ,ꨭ,ꨮ,ꨯ,ꨰ,ꨱ,ꨲ

Medials

ꨶ,ꨴ,ꨵ,ꨳ

Final marks

ꩌ,ꩃ,ꩍ

Numbers

Show
꩐,꩑,꩒,꩓,꩔,꩕,꩖,꩗,꩘,꩙

Punctuation

Show
꩜,꩝,꩞,꩟

ASCII

:,?
Items to show in lists

Phonology

The following represents the repertoire of the Eastern Cham language.

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Vowel sounds

Plain vowels

i ɨ ɨ u e o ə ə ɛ ɔ a

Complex vowels

ia iɯʔ ua
ea oa
ɛə ɔə
auʔ

Observation: The above long list of diphthongs is from Wikipedia. Other descriptions have different lists. For example , Smithas describes 4 which don't appear in the above list (uy, yu/iw, əy, ɛy), but only includes 2 of those that are there (aw, ay).

Consonant sounds

labial alveolar palatal velar glottal
stop p b t d c k ɡ͡ɣ ʔ
  ɡʰ  
  ɓ ɗ      
affricate     d͡ʑ    
      d͡ʑʰ    
fricative   s     h
nasal m n ɲ ŋ
approximant w l j  
trill/flap   ɾ

Tone

Cham is not a tonal language.

Structure

tbd

Vowels

Vowel summary table

This table summarises only basic vowel to character assignments. Click on the phonetic transcriptions for more detail.

ⓘ represents the inherent vowel. A number of diphthongs listed in the phonology section are missing from this table.

  Post-consonant Standalone
Simple
ꨪ,ꨫ,,ꨲ,ⓘ,ꨲꨩ,ꨩ,,ꨭ,ꨭꨩ
ꨁ,ꨀꨲ,ꨂ
ꨯ,ꨯꨩ
ꨮ,ꨱ
ꨀꨮ
ꨯꨮ,ꨯꨮꨩ,ꨯꨱ
ꨃ,ꨀꨯꨱ
ⓘ,ꨩ
Complex
 
ꨰ,ꨮꨭ

ɨ is an inherent vowel only after nasals (see just below).

Observation: The phonology sources say that the Cham language distinguishes between the phones e, ə, and ɛ, but it isn't clear how to write e. There is a letter than appears to be transcribed as , which is close - could this be a plain vowel rather than a diphthong?

Inherent vowel

ka

Cham has 2 inherent vowels.

Most consonants have an inherent vowel a.

eg.

ꨓꨊꨪꩆ

ꨓ,ꨊꨪ,ꩆ

For nasals, however, the default inherent vowel is ɨ or ə.

eg.

ꨀꨗꩀ

ꨀ,ꨗ,ꩀ

ꨟꨓꨩ

ꨟ,ꨓꨩ

To produce a nasal followed by a Cham appends a special mark, called kai, below the nasal (see fig_na). In Unicode these combinations are fused and become atomic code points.

eg.

ꨘꩌ

Inherent ɨInherent a
The two types of nasal letter.

Since Cham consonants normally include an inherent vowel, the orthography has ways to indicate a consonant that is not followed by a vowel sound. See novowel.

Post-consonant vowels

Vowels that follow consonants are written using one or more of 10 combining vowel signs per consonant (this page lists 7 composite vowel signs). There is 1 pre-base vowel, but no circumgraphs (although composite vowel signs may add glyphs to more than one side of the base).

There are no circumgraphs, but there are pre-base vowel signs and composite vowel signs.

All vowel signs are typed and stored after the base consonant, whether or not they precede it when displayed. The glyph rendering system takes care of the positioning at display time. Onsets followed by medial consonants are treated as indivisible units when it comes to rendering vowel signs, meaning that pre-base vowel signs are rendered before the onset as a whole (see prebase).

Two of the vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

Simple vowels

ꨆꨪ ki

The simple Cham vowels are written as follows. All sounds are represented using one or more combining marks (see also compositeV).

ꨪ,ꨫ,ꨲ,ꨲꨩ,ꨭ,ꨭꨩ,ꨯ,ꨯꨩ,ꨮ,ꨱ,ꨯꨮ,ꨯꨮꨩ,ꨯꨱ,ꨩ
eg.

The unique components, which may be used on their own or in combination with others, are as follows.

ꨩ,ꨪ,ꨫ,ꨭ,ꨮ,ꨯ,ꨱ,ꨲ

The position and size of may need to be changed if it follows a medial consonant.

eg.

ꨚꨵꨭꩍ

Diphthongs

ꨆꨬ keɪ

Cham diphthongs are written as follows. There are 2 new combining marks, and one composite vowel sign combining the vowel signs used for ə and u.

ꨬ,ꨰ,ꨮꨭ
eg.

ꨖꨬ

ꨆꨵꨮꨭ

Pre-base vowel signs

ꨆꨯ ko

Cham has 2 characters whose glyphs, when rendered, appear before the consonant letter. They may be used alone, or as a component in 4 more composites.

ꨯ,ꨰ
eg.

ꨝꨯꨱꩍ

ꨚꨈꨯꨮꨩ

This combining mark is always typed and stored after the base consonant. The rendering process places the glyph before the base consonant. The first code point just above can be used on its own, but is also used in conjunction with other vowel combining marks as shown in the examples just above (see compositeV). In this case, it should be typed and stored immediately after the consonant and the other combining marks follow it.

When an orthographic syllable begins with a consonant cluster, the vowel sign is placed before the start of the onset, ie. to the left of the syllable as a whole. This is neatly illustrated in the following example, where the pre-base part of the vowel sign appears before the medial consonant, which itself appears before the base consonant. The encoded sequence, however, follows the pronounciation order.

eg.

ꨚꨴꨯꨱꩃ

ꨚ,ꨴ,ꨯꨱ,ꩃ
ꨚꨴꨯꨱꩃ
A pre-base vowel glyph (light highlight colour) placed before both consonants in an onset cluster, rather than just before the consonant pronounced immediately before it.

Composite vowel signs

ꨆꨯꨮꨩ kɛː

The following 7 vowel sounds are represented using more than one combining mark:

ꨲꨩ,ꨭꨩ,ꨯꨩ,ꨯꨮ,ꨯꨮꨩ,ꨯꨱ,ꨮꨭ

Although vowel-related glyphs can appear on multiple sides of the base consonant, Cham has no circumgraphs, ie. no single code points that can simultaneously produce multiple glyphs.

Vowel length

and əː are represented by a dedicated code points, but the other long vowels are indicated by adding the combining mark AA29 after the short vowel.

ꨫ,ꨲꨩ,ꨭꨩ,ꨯꨩ,ꨱ,ꨯꨮꨩ,ꨩ

Standalone vowels

ʔi

Cham has several ways of writing standalone vowel sounds. IPA transcriptions show a glottal stop before word-initial standalone vowels.

Independent vowels

Six standalone vowels can be written using dedicated independent vowel letters.

ꨁ,ꨂ,ꨃ,ꨅ,ꨀ,ꨄ
eg.

ꨀꨗꩀ

ꨁꨗꨤꩃ

ꨂꨣꩃ

The independent vowels may also appear inside a word at a syllable break.

eg.

ꨓꨂꩀ ta.ʔuk

Vowel signs

Other standalone vowel sounds can be written using dependent vowels attached to AA00. In fact, that arrangement can also sometimes be used for the sounds normally represented by independent vowels.

Medial glides

AA00 can also support medial consonants -j and -we,1.

eg.

ꨀꨳꨩ

ꨀ,ꨳ,ꨩ

Independent vowels with vowel signs

Independent vowels may support combining vowels, which may be redundant or may alter the sound of the vowel.

Vowel sounds to characters

This section maps Eastern Cham vowel sounds to common graphemes in the Cham orthography.

NOTE: Some areas of this list need examples to provide greater accuracy. Use with care.

Plain vowels

i

dependent

standalone

dependent

ɨ

inherent vowel after the default set of nasal letters, eg. ꨈꨤꨪꨟꩃ spider ɡ͡ɣa.li.mɨŋ

dependent

standalone ꨀꨲ

ɨː

dependent after the default set of nasal letters.

dependent ꨲꨩ

u

dependent

standalone

dependent ꨭꨩ

o

dependent

standalone

dependent ꨯꨩ

ə

dependent

standalone ꨀꨮ

əː

dependent

ɛ

dependent ꨯꨮ

standalone

ɛː

dependent ꨯꨮꨩ

ɔ

dependent ꨯꨱ

standalone ꨀꨯꨱ

a

inherent vowel eg. ꨧꨤꨚꩆ nine sa.la.pʌn

standalone

dependent

Complex vowels

dependent

dependent

standalone

dependent ꨮꨭ

Vowel absence

Cham has no code point to kill the inherent vowel, and no conjuncts.

When consonant clusters occur in onsets, the medial consonants are represented using combining marks (see onsets).

Syllable codas are also written using dedicated code points, both letters and combining marks (see finals).

Consonants

Consonant summary table

This table summarises only basic consonant to character assignments. Click on the phonetic transcriptions for more detail.

Onsets
ꨚ,ꨛ,ꨝ,ꨡ,ꨓ,ꨕ,ꨙ,ꨌ,ꨎ,ꨆ,ꨈ
ꨜ,ꨞ,ꨔ,ꨖ,ꨍ,ꨏ,ꨇ,ꨉ
ꨦ,ꨧ,ꨨ
ꨟ,ꨠ,ꨗ,ꨘ,ꨐ,ꨑ,ꨊ,ꨋ
ꨥ,ꨣ,ꨤ,ꨒ,ꨢ
Medials
ꨶ,ꨴ,ꨵ,ꨳ
Codas
ꩇ,ꩅ,ꩄ,ꩀ,ꩁ,ꩀ,ꩋ,ꩍ,ꩌ,ꩆ,ꩂ,ꩃ,ꩉ,ꩊ,ꩈ

For additional details see consonant_mappings.

Basic consonants

These are the basic consonant letters in Cham.

Click on each letter for more details and for examples of usage.

ꨚ,ꨛ,ꨜ,ꨝ,ꨞ,ꨡ,ꨓ,ꨔ,ꨕ,ꨖ,ꨙ,ꨌ,ꨍ,ꨎ,ꨏ,ꨆ,ꨇ,ꨈ,ꨉ,ꨦ,ꨧ,ꨨ,ꨟ,ꨠ,ꨗ,ꨘ,ꨐ,ꨑ,ꨊ,ꨋ,ꨥ,ꨣ,ꨤ,ꨒ,ꨢ

Observation: It's not clear why there are 2 letters for p.

There are 2 sets of nasal letters. The first has the inherent vowel ɨ, which is the default for nasals, and the second, with the extra strokes below, has the normal default inherent vowel a (see inherent).

Although they look complicated, many of the Cham letters share similar shapes. See similar for examples.

Onsets

ꨶ,ꨴ,ꨵ,ꨳ

Cham has the 4 medial consonants shown just above. They are all combining marks, and should be typed and stored immediately after the initial consonant. They carry an inherent vowel, and kill the inherent vowel of the initial consonant.

eg.

ꨕꨶ

ꨆꨴꨩ

ꨆꨵꨮꨭ

ꨟꨣꨳꩍ

Medials can be combined for the following cases: -rj-, -rw-, -lj-, and -lw-.e,2

Medials can also be used after (see standalone_medial).

Codas

Syllable codas are not followed by a vowel and, since Cham has no virama, they are written using a dedicated set of letters, or in 3 cases combining marks.

ꩇ,ꩅ,ꩄ,ꩀ,ꩁ,ꩌ,ꩆ,ꩂ,ꩃ,ꩋ,ꩍ,ꨥ,ꩉ,ꩊ,ꩈ

The final consonant letters are generally similar to the non-final consonants, but they have a lengthened stroke to the right side of the glyph.

eg.

ꨀꨆꨯꨱꩀ

ꨔꩆꨕꨩ

The significant exception is , which uses the same code point and glyph whether it appears as an initial or final consonant.

eg.

ꨎꨥꨩ

ꨟꨢꨥ

Examples of the combining marks.

eg.

ꨁꨗꨤꩃ

ꨚꨵꨭꩍ

ꨌꩌ

Consonant sounds to characters

This section maps Eastern Cham consonant sounds to common graphemes in the Cham orthography.

p

onset

onset

coda Coda. Sound is unreleased, ie .

onset

b

onset

onset

ɓ

onset

t

onset

coda Coda. Sound is unreleased, ie .

onset

d

onset

onset

ɗ

onset

c

onset

coda Coda. Sound is unreleased, ie .

onset

d͡ʑ

onset

d͡ʑʰ

onset

k

onset

coda Coda.

onset

ɡ͡ɣ

onset

ɡ

coda Coda.

ɡʰ

onset

ʔ

coda Coda.

s

onset

onset

coda Coda.

h

onset

 

coda Coda.

m

onset

onset

coda Coda.

n

onset

onset

coda Coda.

ɲ

onset

onset

ŋ

onset

coda Coda.

coda Coda.

ŋʰ

onset

w

onset

medial Medial.

ɾ

onset

medial Medial.

coda Coda.

l

onset

medial Medial.

coda Coda.

j

onset

onset

medial Medial.

coda Coda.

Encoding choices

Codepoint sequences

Characters in a syllable, if they occur, should always occur in the following order.

  1. A consonant or independent vowel.
  2. One or 2 medial consonants.
  3. A pre-base dependent vowel.
  4. A non-prebase dependent vowel.
  5. A vowel lengthener.
  6. A final consonant letter or combining mark.

Numbers

Cham has a set of native digits

꩐,꩑,꩒,꩓,꩔,꩕,꩖,꩗,꩘,꩙

ASCII digits are also used because of the influence of Vietnamese.u

Text direction

Cham text runs left to right in horizontal lines.

Show default bidi_class properties for characters in the Cham orthography described here.

Glyph shaping & positioning

Experiment with examples using the Eastern Cham character app.

Context-based shaping & positioning

Cham letters don't form conjuncts and don't normally involve significant shaping in different contexts, but there are a number of combining marks, which often occur in multiples for a given base letter. The positioning of these combining marks may involve adaptations to fit the space available, as well as the need to position them correctly around the base letter.

For instance, the small wavy line below the base is positioned and sized differently in the following 2 words.

ꨔꨮꨭ

ꨆꨵꨮꨭ

Another example of contextual shaping occurs at the end of the following word. Compared to the code point sequence, the glyph sequence switches the order of the 2 rightmost combining marks, and ligates them (in addition to moving the pre-base vowel sign before the consonant).

ꨚꨈꨯꨮꨩ

ꨚ,ꨈ,ꨯ,ꨮ,ꨩ

Similarities in letter shapes

Although they look complicated, many of the Cham letters share similar shapes on the left side. The following lists informally group together a number of letters that begin with similar looking shapes.

ꨆ,ꩀ,ꨇ
ꨚ,ꨧ,ꩇ,ꨨ,ꨣ,꩖,ꩉ,꩗,ꨜ,ꨏ
ꨈ,ꨤ,ꩁ,ꩊ,꩓,ꨁ,ꨄ,ꨐ,ꨑ,ꨒ,ꨓ,ꩅ,꩔
ꨌ,ꨗ,ꩄ,ꩆ,ꨙ,ꨘ,꩘,ꨍ
ꨝ,ꨔ,ꨞ,꩙
ꨖ,ꨟ,ꨢ,ꩈ,ꨉ,ꨠ,ꨥ,ꨡ,ꨃ
ꨦ,ꩋ
ꨎ,ꨛ,ꨊ,ꨋ,꩕
꩑,ꩍ,

Typographic units

Word boundaries

Words are separated by spaces.

Graphemes

tbd

Phrase, sentence, and section delimiters are described in phrase.

Punctuation & inline features

Phrase & section boundaries

Cham uses a mixture of ASCII and native punctuation.u

phrase

:

sentence

?

paragraph

general divider

often begins a section of text. It may also occur in conjunction with a danda.u

Line & paragraph layout

Line breaking & hyphenation

Line break opportunities occur at inter-word spaces or after a danda, but also between syllables.u e,3

Line-edge rules

As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.

Show line-breaking properties for characters in the Eastern Cham orthography.

The following list gives examples of typical behaviours for some of the characters used in modern Bangla. Context may affect the behaviour of some of these and other characters.

Click/tap on the characters to show what they are.

  • “ ‘ (   should not be the last character on a line.
  • ” ’ ) . , ; ! ? ꩝ ꩞ ꩟ %   should not begin a new line.

Line breaking should not move a danda or double danda to the beginning of a new line even if they are preceded by a space character.

Page & book layout

Online resources

  1. Austronesidan Basic Vocabulary Database Phan Rang Cham (Eastern Cham)

References