Chakma orthography notes

Sample


𑄙𑄢 𑄷 𑄝𑄬𑄇𑄴 𑄟𑄚𑄪𑄌𑄴 𑄚𑄨𑄢𑄨𑄞𑄨𑄣𑄨 𑄥𑄧𑄁 𑄃𑄨𑄌𑄴𑄎𑄮𑄖𑄴 𑄃𑅅 𑄃𑄇𑄴𑄇𑄥𑄁 𑄚𑄨𑄚𑄬𑄭 𑄎𑄧𑄚𑄴𑄟𑄚𑄴𑅁 𑄖𑄢𑄢𑄴 𑄃𑄬𑄘 𑄃𑅅 𑄝𑄪𑄖𑄴𑄙𑄨 𑄃𑄊𑄬; 𑄥𑄬𑄚𑄧𑄖𑄳𑄠𑄴 𑄝𑄬𑄇𑄴𑄅𑄚𑄧𑄢𑄴 𑄃𑄬𑄇𑄴𑄎𑄧𑄚𑄴 𑄃𑄢𑄬𑄇𑄴 𑄎𑄧𑄚𑄧𑄢𑄴 𑄛𑄳𑄢𑄧𑄖𑄨 𑄉𑄧𑄟𑄴 𑄘𑄮𑄣𑄴 𑄌𑄨𑄘𑄳𑄠𑄬 𑄚𑄨𑄚𑄬𑄭 𑄌𑄧𑄣𑄚 𑄅𑄪𑄌𑄨𑄖𑄴𑅁

𑄙𑄢 𑄸 𑄃𑄬 𑄣𑄬𑄊𑄨 𑄇𑄧𑄠𑄬 𑄘𑄬𑄉𑄬𑄠𑄬 𑄥𑄙𑄩𑄚𑄧𑄖 𑄃𑅅 𑄃𑄇𑄴𑄇𑄥𑄧𑄁𑄢𑄴 𑄉𑄪𑄖𑄨𑄨, 𑄙𑄧𑄢𑄴𑄟𑄧, 𑄝𑄧𑄢𑄴𑄚𑄧, 𑄥𑄨𑄇𑄴𑄈, 𑄞𑄌𑄴, 𑄢𑄎𑄧𑄚𑄨𑄖𑄨𑄇𑄴 𑄝 𑄚𑄚𑄇𑄳𑄦𑄴𑄚𑄴 𑄟𑄧𑄖𑄴, 𑄎𑄖𑄩𑄠𑄴 𑄝 𑄥𑄟𑄎𑄨𑄇𑄴 𑄜𑄪𑄢𑄨𑄅𑄪𑄖𑄳𑄠𑄴, 𑄎𑄧𑄚𑄴𑄟𑄧, 𑄥𑄧𑄟𑄴𑄛𑄧𑄖𑄨𑄨 𑄝 𑄃𑄧𑄚𑄳𑄠𑄧 𑄇𑄧𑄚𑄧 𑄃𑄨𑄌𑄴𑄎𑄮𑄖𑄴 𑄝𑄌𑄴𑄝𑄨𑄎𑄬𑄢𑄴 𑄍𑄢 𑄝𑄬𑄇𑄴𑄅𑄚𑄬 𑄥𑄧𑄁 𑄃𑄇𑄴𑄇𑄥𑄁 𑄗𑄬𑄝𑄧𑅁 𑄇𑄧𑄚𑄧 𑄘𑄬𑄌𑄴 𑄝𑄧 𑄟𑄘𑄨𑄞𑄨𑄘𑄬𑄢𑄴 𑄢𑄎𑄧𑄚𑄨𑄖𑄨𑄇𑄴, 𑄥𑄨𑄟𑄨𑄚𑄬 𑄝 𑄛𑄨𑄖𑄨𑄨𑄟𑄨𑄢𑄴 𑄃𑄨𑄌𑄴𑄎𑄮𑄘𑄧 𑄅𑄫𑄉𑄪𑄢𑄬 𑄖𑄢𑄴 𑄇𑄧𑄚𑄧 𑄃𑄧𑄙𑄨𑄝𑄥𑄩𑄢𑄴 𑄛𑄳𑄢𑄧𑄖𑄨 𑄇𑄧𑄚𑄧𑄢𑄧𑄉𑄧𑄟𑄴 𑄖𑄬𑄢𑄧𑄌𑄴 𑄟𑄬𑄢𑄧𑄌𑄴 𑄉𑄧𑄢 𑄚𑄧 𑄦𑄧𑄝𑄧; 𑄥𑄬 𑄘𑄬𑄌𑄴 𑄝 𑄟𑄘𑄨𑄞𑄨𑄘𑄬 𑄥𑄙𑄩𑄚𑄴 𑄦𑄮𑄇𑄴, 𑄦𑄮𑄇𑄴 𑄃𑄧𑄍𑄨𑄞𑄪𑄇𑄧𑄧, 𑄃𑄧𑄥𑄠𑄨𑄖𑄧𑄧𑄥𑄥𑄨𑄖𑄧 𑄇𑄨𑄁𑄝 𑄥𑄢𑄴𑄝𑄧𑄞𑄯𑄟𑄧𑄖𑄧𑄧𑄢𑄴 𑄃𑄧𑄚𑄳𑄠𑄧 𑄇𑄧𑄚𑄧 𑄥𑄨𑄟𑄨𑄚𑄬𑄢𑄴 𑄞𑄨𑄘𑄨𑄢𑄬𑅁

Source: Universal Declaration of Human Rights - Chakma, articles 1 & 2

Basic features

The Chakma script is an abugida, ie. each consonant contains an inherent vowel sound.

Vowels The inherent vowel is pronounced aː.

Plain post-consonant vowels are written using 7 combining marks (vowel signs), and 3 more are used for diphthongs.

There is 1 pre-base glyph and 2 circumgraphs. Chakma has no composite vowel signs.

Nasalisation is indicated using 𑄀, which can be combined with either an anusvara or a visarga diacritic.

Four standalone vowel sounds are written using independent vowels. Others are written using 𑄃 (U+11103) with an attached vowel sign.

Consonants 32 consonant letters are used for native consonant sounds, supplemented by a couple more for specialised orthographies.

Vowel absence Vowel absence is usually indicated in modern text by an explicit combining mark. However some conjunct forms are also possible.

𑄴 (U+11134, maayyaa) is used to kill the inherent vowel, but not to form conjuncts. It is always visible.

The always invisible 𑄳 (U+11133, virama) is used with 5 consonants (and occasionally more) to create conjunct forms, which occur as stacked consonants or a conjoined pair, although a more old-fashioned alternative is to create ligatures rather than stacks.

𑄴 is also used to indicate geminated consonants, in which case the base consonant typically supports this diacritic plus a vowel sign.

Medial consonants are written using 3 dedicated combining marks.

Syllable codas are most commonly written using 𑄴 (U+11134) to kill the vowel of a syllable-final consonant letter, but the diacritics 𑄁 and 𑄂 may be used for -ŋ and -h, respectively.

Numbers Chakma has a set of native digits, but sometimes Bengali digits may be used.
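Because either digit set may turn up in the same kind of text, an application that needs numeric values can fold both to ASCII. The following is a minimal sketch, not a prescribed method: the function name `to_ascii_digits` is hypothetical, and it relies on Chakma digits (U+11136 to U+1113F) and Bengali digits (U+09E6 to U+09EF) both being decimal digits in the Unicode Character Database.

```python
import unicodedata


def to_ascii_digits(text):
    """Replace every Unicode decimal digit in text with its ASCII equivalent."""
    out = []
    for ch in text:
        value = unicodedata.decimal(ch, None)  # None for anything that is not a decimal digit
        out.append(ch if value is None else str(value))
    return "".join(out)


chakma_one_two = "\U00011137\U00011138"  # CHAKMA DIGIT ONE, CHAKMA DIGIT TWO
bengali_one_two = "\u09E7\u09E8"         # BENGALI DIGIT ONE, BENGALI DIGIT TWO
print(to_ascii_digits(chakma_one_two), to_ascii_digits(bengali_one_two))
```

Letters and punctuation pass through unchanged, so the function can be applied to whole lines of text.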

Layout Chakma text runs left to right in horizontal lines. Words are separated by spaces. There is no case distinction.

It has a mixture of ASCII and Chakma code points for punctuation marks.

Notable features

the inherent vowel is long
there is a simplified orthography, which is in increasing use by the Chakma community, and a traditional orthography (which has more complicated vowel representations)
diphthongs may be written after a consonant using a combination of virama and independent vowel, and may be followed by an additional vowel sign
consonant clusters may be indicated either by an always visible diacritic, or as stacks formed using a virama
where a consonant cluster has a geminated consonant followed by another consonant, both the visible vowel killer and the virama are used for the same stack

Vowels

	Post-consonant	Standalone
Simple	𑄨,𑄩,𑄪,𑄫	𑄄,𑄅
	𑄬,𑄮	𑄆
	𑄬,𑄧
	ⓘ	𑄃
Diphthongs	𑄰,𑄭,𑄯

ⓘ represents the inherent vowel. Diacritics are added to the vowels to indicate nasalisation (not shown here).

Inherent vowel

𑄇 ka

The inherent vowel for Chakma is aː (longer than the inherent vowels in Bangla and Hindi). So kaː is written by simply using the consonant letter, eg.

𑄇𑄋𑄢

𑄇,𑄋,𑄢

Since Chakma consonants normally include an inherent vowel, the orthography has ways to indicate a consonant that is not followed by a vowel sound. See Vowel absence.

Post-consonant vowels

𑄇𑄨 ki

Chakma has two orthographies, simplified and traditional. The traditional orthography has a more complex approach to writing vowels, and includes a number of composite vowel signs. The current Unicode block may require some additional code points for proper support of the traditional orthography, but the various vowels are achievable. According to Bivuti Chakma, the general populace is adopting the simplified approach for modern Chakma.

The descriptions of the traditional vowels rely on an interpretation of the list of vowels in 𑄌𑄋𑄴𑄟𑄳𑄦 𑄃𑄨𑄇𑄴𑄳𑄠𑄬 𑄝𑄪𑄙𑄨 (by the District School Education Board, CADC). It needs to be tweaked somewhat, and should be treated as suggestive rather than authoritative. That document uses English approximations, rather than IPA, to indicate sound equivalents. The code point sequences used are those that, through trial and error, appear to produce the necessary shapes, given that the original document is not Unicode-encoded.

Simplified plain post-consonant vowel sounds are written using 7 combining marks and 3 more are used for diphthongs. Chakma has 1 pre-base vowel sign and 2 circumgraphs.

Two of the vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

All vowel signs are typed and stored after the base consonant, and the glyph rendering system takes care of the positioning at display time. Conjuncts (stacks) are treated as indivisible units when it comes to rendering vowel signs, meaning that pre-base vowel signs and left-side glyphs of circumgraphs are rendered before the conjunct as a whole (see Pre-base vowel sign and Circumgraphs).

Plain vowels

Simplified orthography. Chakma uses the following dedicated combining marks for plain vowels in the simplified orthography. They are all vowel signs.

𑄨,𑄩,𑄪,𑄫,𑄬,𑄮,𑄧,𑅅

Traditional orthography. Chakma appears to use the following vowel signs for basic vowels in the traditional orthography. (See the caveat.)

𑄨,𑄩,𑄪,𑄫,𑄬,𑄬𑄬,𑄧,𑄱,𑄧𑅅,𑄲,𑄯,𑅅

Diphthongs

Simplified orthography. The following are used to write diphthongs in the simplified orthography. There may be more or fewer than shown.

𑄰,𑄭,𑄯,𑅆,𑄪𑄭,𑄬𑄭

Traditional orthography. Chakma appears to use the following for diphthongs in the traditional orthography. (See the caveat.) Some of these diphthongs are written using the virama between the consonants and the vowel sign. To make things clearer, the diphthongs are shown after 𑄃 in the list below.

𑄃𑄮,𑄃𑅆,𑄃𑄬𑄳𑄆,𑄃𑄬𑅆,𑄃𑄳𑄆,𑄃𑄳𑄆𑄧𑄤𑄴,𑄃𑄳𑄅𑄧,𑄃𑄮𑅅,𑄃𑄯𑅅,𑄃𑄭,𑄃𑄳𑄆𑄧,𑄃𑄧𑅆

Observation: It's not clear how to write the third sound in the list, since the font doesn't support the expected sequence.

Observation: It appears to be an inconsistency in the encoding that a single combining mark is available to write 𑅆, but the shape 𑄳𑄆 has to be written using a (highly unusual) combination of virama plus independent vowel.

Nasalisation

Nasalisation is indicated using 𑄀 (U+11100).

This can also be used in syllables that end with an anusvara or a visarga [mh §2]. For example, 𑄃𑄂𑄀.

Since both diacritics have the same combining class, the order in typing and storage should reflect the increasing distance from the base character.
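The practical consequence is that normalisation will not repair a wrong order: marks with equal combining classes are never swapped. The sketch below checks this with Python's `unicodedata` module. The helper name `survives_normalisation` is hypothetical, and the result depends on the Unicode data shipped with the Python version in use.

```python
import unicodedata

AA = "\U00011103"           # CHAKMA LETTER AA
CANDRABINDU = "\U00011100"  # nasalisation mark
ANUSVARA = "\U00011101"
VISARGA = "\U00011102"


def survives_normalisation(text):
    """True if neither NFC nor NFD changes the code point order of text."""
    return all(unicodedata.normalize(form, text) == text for form in ("NFC", "NFD"))


recommended = AA + VISARGA + CANDRABINDU     # the example above
reversed_marks = AA + CANDRABINDU + VISARGA  # marks typed in the other order

# The three marks share one canonical combining class ...
same_class = len({unicodedata.combining(m) for m in (CANDRABINDU, ANUSVARA, VISARGA)}) == 1
# ... so both orders pass through normalisation untouched.
print(same_class, survives_normalisation(recommended), survives_normalisation(reversed_marks))
```

Since both strings are stable, software that compares or searches text has to deal with the ordering itself if it wants the two treated alike.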

Vowel length

Dedicated vowel signs are available for long vowel sounds.

Standalone vowels

At the beginning of a word standalone vowels can be written using either one of four independent vowels or using combinations of vowel signs with 𑄃.

The independent vowels are the following.

𑄄,𑄅,𑄆,𑄃

Other standalone vowels are written using vowel signs attached to 𑄃, but there is also a modern trend to represent the sounds covered by the independent vowels, too, using combinations. The following list shows just a few examples.

𑄃𑄨,𑄃𑄩,𑄃𑄪,𑄃𑄫,𑄃𑄬,𑄃𑄰
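Because the same standalone sound may be stored either as an independent vowel or as 𑄃 plus a vowel sign, a search or comparison routine may want to fold one spelling into the other. This is only a sketch: the pairings are an assumption read off the vowel charts in these notes, the two spellings are not canonically equivalent, and the names `INDEPENDENT_TO_COMBINATION` and `fold_standalone_vowels` are hypothetical.

```python
AA = "\U00011103"  # CHAKMA LETTER AA, the carrier for vowel signs

# Independent vowel -> AA + vowel sign for the same sound (assumed pairing).
INDEPENDENT_TO_COMBINATION = {
    "\U00011104": AA + "\U00011128",  # LETTER I -> AA + VOWEL SIGN I
    "\U00011105": AA + "\U0001112A",  # LETTER U -> AA + VOWEL SIGN U
    "\U00011106": AA + "\U0001112C",  # LETTER E -> AA + VOWEL SIGN E
}


def fold_standalone_vowels(text):
    """Rewrite independent vowels as AA + vowel sign, for comparison purposes only."""
    return "".join(INDEPENDENT_TO_COMBINATION.get(ch, ch) for ch in text)


print([f"U+{ord(ch):04X}" for ch in fold_standalone_vowels("\U00011104")])
```

The folded form is meant for matching, not for rewriting an author's text.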

Examples in use:

𑄃𑄘𑄢

𑄃𑄧𑄏𑄛𑄖𑄴

Vowel components

This section describes various vowel components and behaviours associated with this orthography.

Pre-base vowel sign

𑄇𑄬 ke

Chakma has one pre-base vowel sign.

𑄬

eg.

𑄛𑄬𑄇𑄴

This is a combining mark that is always typed and stored after the base consonant(s), ie. the codepoints follow the order in which the items are pronounced. The rendering process places the glyph before the base consonant without changing the code points. The following shows the sequence of code points that make up the word just above.

𑄛,𑄬,𑄇,𑄴

Because conjuncts are never split, the vowel sign is placed before the start of a stack. The vowel sign is typed and stored after the second consonant in the cluster but is displayed before the first consonant, eg.

𑄝𑄬𑄌𑄴𑄳𑄦𑄬𑄉

𑄌𑄴𑄳𑄦𑄬,𑄌,𑄴,𑄳,𑄦,𑄬
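The relationship between storage order and display order can be sketched in code. The function below, whose name `visual_anchor` is hypothetical, walks back from a stored pre-base vowel sign to the consonant it is drawn in front of, treating a virama-joined stack as one unit. It assumes well-formed input and is not a description of how any shaping engine is implemented.

```python
VIRAMA = "\U00011133"   # invisible stacker
MAAYYAA = "\U00011134"  # visible killer / gemination mark
SIGN_E = "\U0001112C"   # the pre-base vowel sign

# Stored order of the two examples above.
simple = "\U0001111B" + SIGN_E + "\U00011107" + MAAYYAA       # PAA, E, KAA, MAAYYAA
stack = "\U0001110C" + MAAYYAA + VIRAMA + "\U00011126" + SIGN_E  # CAA, MAAYYAA, VIRAMA, HAA, E


def visual_anchor(text, sign_index):
    """Index of the consonant that the pre-base sign at sign_index is drawn before."""
    i = sign_index - 1  # the consonant the sign is stored after
    while i >= 2 and text[i - 1] == VIRAMA:
        i -= 2  # step over the virama to the previous item in the stack
        if i > 0 and text[i] == MAAYYAA:
            i -= 1  # a gemination mark sits between that consonant and the virama
    return i


print(visual_anchor(simple, simple.index(SIGN_E)))
print(visual_anchor(stack, stack.index(SIGN_E)))
```

In both cases the sign is stored after the consonant(s) but anchored visually at the first consonant, which is index 0 in each string.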

Circumgraphs

𑄇𑄮 ko

𑄮,𑄯

Chakma has 2 circumgraphs. These are single combining marks that are always stored after the base consonant. When rendered, the single code point produces multiple glyphs, which are placed on different sides of the base consonant, eg.

𑄦𑄮𑄢𑄮𑄋

𑄦,𑄮,𑄢,𑄮,𑄋

Like pre-base glyphs, these do not split a conjunct, but instead they treat the conjunct as a single unit and place glyphs either side of it.

These circumgraphs have canonically equivalent decomposed forms (see Encoding choices).

𑄮,𑄯

The code point 𑄧 is commonly used alone to represent the sound ɔ, but the 𑄱 and 𑄲 code points are not usually found in text.

Composite vowel signs

Composite vowels are only produced when the 2 circumgraphs are decomposed (see Encoding choices).

Vowel sounds to characters

This section maps Chakma vowel sounds to common graphemes in the Chakma orthography.

The left column shows dependent vowels, and the right column independent vowel letters.

Plain vowels

dependent 𑄨

standalone 𑄄

iː

dependent 𑄩

dependent 𑄪

standalone 𑄅

uː

dependent 𑄫

dependent 𑄬

standalone 𑄆

dependent 𑄮

dependent 𑄬

dependent 𑄧

diphthong 𑄬𑄭

aː

inherent vowel eg. 𑄇𑄋𑄢.

dependent 𑅅 Used by the Baarah Maatraa orthography.

standalone 𑄃

Complex vowels

ui̯

diphthong 𑄪𑄃𑄨

eːi̯

diphthong 𑅆 Used by the Baarah Maatraa orthography.

oi̯

diphthong 𑄰

ou̯

diphthong 𑄯

ai̯

diphthong 𑄭

◌̃

nasalisation marker 𑄀

Vowel absence

Vowel absence principally occurs either when a consonant is a syllable coda, or when a consonant is part of a consonant cluster.

Given that consonants normally include an inherent vowel, the orthography needs a way to indicate when a consonant is not followed by a vowel.

The absence of an inherent vowel is usually indicated in modern text by the explicit diacritic 𑄴 (U+11134, maayyaa). However, 5 consonants (and occasionally more) may be subjoined to indicate a consonant cluster. A more old-fashioned alternative is to create ligatures rather than stacks.

The available approaches are summarised here.

Vowel killer: Use 𑄴 above the consonant that has no following vowel.
Create a conjunct. There are a couple of possibilities here:
1. Stack or conjoin characters. The non-initial consonant in a cluster is reduced in size and positioned below or alongside the first.
2. Create a ligature. A fusion of the letter shapes of consonants in a cluster, where it may be difficult to identify one or more of the components.
Medials: Special combining marks exist for non-initial consonants in a syllable onset that kill the vowel of the previous consonant.
Coda marks: Combining marks used for codas have no following vowel sound.

Vowel killer

This is the most common way of indicating vowel absence in modern Chakma writing [mh §3]. 𑄴 (U+11134) is a combining mark typed after and appearing above the first consonant in a cluster or above a coda. It is always visible, and no shaping is applied to consonants.

eg.

𑄌𑄖𑄴

𑄖𑄨𑄚𑄴

𑄈𑄧𑄢𑄴𑄉𑄧𑄌𑄴

𑄞𑄌𑄴𑄟𑄖𑄴

𑄴 is also used to kill the inherent vowel when no cluster is involved (as shown at the end of the examples above).

Note, however, that it is also used to indicate gemination when combined with a vowel sign. When it appears above a stack or conjoined form it indicates gemination of the initial consonant; it is not being used as a vowel killer.

eg.

𑄞𑄌𑄴𑄳𑄦𑄪𑄢𑄨

𑄝𑄧𑄖𑄴𑄳𑄠

Conjuncts

As a rule, consonant clusters only involve 2 consonants [mh §5].

Stacking & conjoining

Consonant clusters can also be indicated by stacking the consonants. To tell the font to stack the letters, use the invisible character 𑄳 (U+11133) between them.

In 2001 an orthographic reform was proposed that would limit conjuncts to just 5 subjoined letters [mh §3], shown below in combination with 𑄇.

𑄇𑄳𑄤,𑄇𑄳𑄢,𑄇𑄳𑄣,𑄇𑄳𑄠,𑄇𑄳𑄚
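As a sketch of how these sequences are built, the code below joins two consonants with the virama and checks whether the result falls within the 5-letter set. The letter identities are this editor's reading of the list above, and the names `REFORM_SUBJOINED`, `stack` and `is_reform_conjunct` are hypothetical.

```python
VIRAMA = "\U00011133"  # invisible stacker
KAA = "\U00011107"

# The five letters shown above in subjoined position (assumed identities).
REFORM_SUBJOINED = {
    "\U00011124": "WAA",
    "\U00011122": "RAA",
    "\U00011123": "LAA",
    "\U00011120": "YYAA",
    "\U0001111A": "NAA",
}


def stack(first, second):
    """Return the stored sequence for a two-consonant stack."""
    return first + VIRAMA + second


def is_reform_conjunct(cluster):
    """True if cluster is consonant + virama + one of the five subjoined letters."""
    return len(cluster) == 3 and cluster[1] == VIRAMA and cluster[2] in REFORM_SUBJOINED


for letter, name in REFORM_SUBJOINED.items():
    print(name, [f"U+{ord(ch):04X}" for ch in stack(KAA, letter)])
```

A stack with any other second letter, such as the subjoined HA discussed below, would fall outside the set and be reported as `False`.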

The 'subjoined' form of 𑄠 is actually conjoined, as in:

𑄌𑄚𑄴𑄘𑄳𑄠 t͡ʃaːndjɛ cāndẏā

Observation: The letter HA commonly appears in subjoined form, but it isn't clear whether this indicates an aspirated onset or a final -h.

𑄛𑄉𑄢𑄳𑄦

Ligated forms

Ligated forms are now considered old-fashioned [mh §3]. In this style of writing, the second consonant in the cluster is often alongside the first, and both are shaped so that they join together.

Examples of ligated conjunct forms:

𑄘𑄳𑄙

𑄇𑄳𑄑

More examples of these conjunct forms can be found in Everson & Hosken, p4.

Consonants

	unaspirated	aspirated
	𑄛,𑄝,𑄖,𑄘,𑄑,𑄓,𑄇,𑄉	𑄜,𑄞,𑄗,𑄙,𑄒,𑄔,𑄈,𑄊
	𑄌,𑄎	𑄍,𑄏
	𑄞,𑄥,𑄌,𑄍,𑄡,𑄥,𑄦,𑄇,𑄈,𑄂
	𑄟,𑄚,𑄕,𑄐,𑄋,𑄁
	𑄤,𑄢,𑄢,𑄣,𑅄,𑄠

The pronunciation of a few letters is not entirely clear, and some sources appear to contradict others. Treat the above as summarising the information found so far, principally using the few IPA transcriptions found, and the pronunciations provided by Chave-Rong Chakma.

Basic consonants

These are the basic consonant letters in Chakma. The pronunciation of a few letters is not entirely clear, and some sources appear to contradict others. Treat the following as summarising the information found so far, principally using pronunciations given in the few IPA transcriptions found, and the pronunciations provided by Chave-Rong Chakma.

𑄛,𑄜,𑄝,𑄞,𑄖,𑄗,𑄘,𑄙,𑄑,𑄒,𑄇,𑄈,𑄉,𑄊,𑄌,𑄍,𑄎,𑄏,𑄜,𑄞,𑄥,𑄡,𑄥,𑄦,𑄇,𑄈,𑄟,𑄚,𑄕,𑄐,𑄋,𑄤,𑄢,𑄣,𑄠

Ganguly et al. say that native speakers don't distinguish between s and ʃ, and that there is also much interchangeability between s and t͡ʃ. The following 2 examples with IPA transcriptions in Wikipedia appear to illustrate both this and an ambivalence between kʰ and h, but more research is needed to completely map out the correspondences between written letters and sounds, and for now we will stick with the correspondences conventionally ascribed in the resources seen.

𑄍𑄮𑄣𑄉𑄧𑄢𑄴

𑄍𑄮,𑄍,𑄮

𑄈𑄧𑄢𑄴𑄉𑄧𑄌𑄴

𑄍𑄮,𑄍,𑄮

Observation: It is worth noting, however, that recordings on YouTube by Bivuti Chakma pronounce 𑄇 and 𑄈 as haː. He also tends to pronounce 𑄌 and 𑄍 as saː. It isn't clear whether this is a dialect, or idiolect, or standard pronunciation.

Observation: Bivuti also appears to pronounce 𑄛 and 𑄜 as faː.

Other consonants

The following consonants were introduced for use with specialised orthographies.

𑅇,𑅄

𑅇 is used for the sound v when writing Pali.

𑅄 is used for the aspirated sound lʰ in the Baarah Maatraa orthography.

Onsets

Medial consonants

It would appear that Chakma uses 3 medial consonants in onsets: w, r, and j. These may be written after a virama, so that they are rendered as part of a stack, eg.

𑄟𑄳𑄢𑄨𑄖𑄴𑄨𑄇

The shapes of 2 of these medial consonants are significantly different when subjoined.

𑄢,𑄳𑄢

𑄠,𑄳𑄠

Subjoined HA

Another seemingly common stack of consonants in an onset involves a subjoined 𑄦. It's not clear from the sources consulted whether a subjoined HA represents a way of indicating an aspirated or breathy consonant, or a syllable-initial h, or even a syllable-final h. In the following example, the stack with subjoined HA is contrasted with the use of an atomic letter for the sound bh.

𑄌𑄋𑄴𑄟𑄳𑄦 𑄞𑄌𑄴

Observation: In the word for Chakma above the HA looks like it is syllable-initial. However, there are other occurrences of a subjoined HA with IPA transcriptions that put the h at the end of the syllable, eg.

𑄟𑄟𑄳𑄦

Codas

General vowel suppression The dropping of the inherent vowel for syllable codas in Chakma is marked using 𑄴.

eg.

𑄌𑄖𑄴

𑄖𑄨𑄚𑄴

𑄈𑄧𑄢𑄴𑄉𑄧𑄌𑄴

The same diacritic is also used to signal consonant clusters and gemination.

Syllable codas are generally marked using 𑄴 over an ordinary consonant letter, but some are indicated by stacking (or in older texts ligation) of consonant glyphs (see Conjuncts).

eg.

𑄉𑄧𑄖𑄴

Marks for codas Final ŋ and h can also be marked using the anusvara and visarga diacritics, 𑄁 and 𑄂, respectively.

eg.

𑄦𑄨𑄠𑄧𑄁

𑄥𑄧𑄁𑄥𑄧𑄞

Consonant length

Gemination is indicated using 𑄴. Usually, this use can be distinguished from the use for consonant clusters because a vowel sign is combined with the same base consonant [gc].

eg.

𑄇𑄟𑄇𑄴𑄭

𑄇𑄨𑄖𑄴𑄬

When the maayyaa appears with a stacked consonant cluster, it is used in this role, ie. not to kill the vowel, but to lengthen the initial consonant, eg.

𑄞𑄌𑄴𑄳𑄦𑄪𑄢𑄨

𑄞𑄣𑄧𑄇𑄴𑄳𑄦𑄚𑄨

The Noto and RibengUni fonts allow maayyaa to appear immediately after the initial consonant in a stack, or after the final consonant, with no difference in the rendered result, and it is possible to find examples encoded in both ways. Everson and the Unicode Standard (whose text is derived from Everson's proposal) seem to assume that both the virama and the maayyaa are present to kill a vowel, and their texts indicate that there is no justification for having both combining marks side by side in storage. However, since the maayyaa doesn't have the role of killing the vowel here, but instead indicates gemination of the initial character in the cluster, it is logical to use the order:

C𑄴𑄳C

This order is also confirmed as the appropriate one by Glass [cldt §177].
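The distinction described in this section can be approximated mechanically. The following sketch labels each maayyaa as a gemination mark when it is immediately followed by a vowel sign or by the virama, and as a vowel killer otherwise. It assumes the maayyaa is stored directly after the initial consonant; text stored with the maayyaa after the final consonant of a stack would be mislabelled. The function name `maayyaa_roles` is hypothetical.

```python
VIRAMA = "\U00011133"
MAAYYAA = "\U00011134"
# Vowel signs U+11127..U+11132 (including the two decomposition marks), plus U+11145 and U+11146.
VOWEL_SIGNS = frozenset(chr(cp) for cp in [*range(0x11127, 0x11133), 0x11145, 0x11146])


def maayyaa_roles(text):
    """Return (index, role) for every maayyaa in text, using the next code point as the cue."""
    roles = []
    for i, ch in enumerate(text):
        if ch != MAAYYAA:
            continue
        following = text[i + 1] if i + 1 < len(text) else ""
        if following == VIRAMA or following in VOWEL_SIGNS:
            roles.append((i, "gemination"))
        else:
            roles.append((i, "vowel killer"))
    return roles


geminated = "\U00011107\U00011128\U00011116" + MAAYYAA + "\U0001112C"  # KAA, I, TAA, MAAYYAA, E
coda = "\U0001110C\U00011116" + MAAYYAA                                # CAA, TAA, MAAYYAA
print(maayyaa_roles(geminated), maayyaa_roles(coda))
```

The word "usually" in the description above matters: this heuristic follows the stated cue and will not catch every case.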

Consonant sounds to characters

This section maps Chakma consonant sounds to common graphemes in the Chakma orthography.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

consonant 𑄛

pʰ

consonant 𑄜

consonant 𑄝

bʰ

consonant 𑄞

consonant 𑄖

tʰ

consonant 𑄗

t͡ʃ

consonant 𑄌

t͡ʃʰ

consonant 𑄍

consonant 𑄘

dʰ

consonant 𑄙

d͡ʒ

consonant 𑄎

d͡ʒʰ

consonant 𑄏

consonant 𑄑

ʈʰ

consonant 𑄒

consonant 𑄓

ɖʰ

consonant 𑄔

consonant 𑄇

kʰ

consonant 𑄈

consonant 𑄉

ɡʰ

consonant 𑄊

consonant 𑄛

fʰ

consonant 𑄜

consonant 𑅇 Used for Pali.

consonant 𑄥

consonant 𑄌

consonant 𑄍 A possible pronunciation according to Wiktionary.

consonant 𑄡

consonant 𑄥

consonant 𑄦

final aspiration 𑄂 Final aspiration.

consonant 𑄇

consonant 𑄈

consonant 𑄟

consonant 𑄚

consonant 𑄕

consonant 𑄐

consonant 𑄋

final nasal 𑄁 Coda.

consonant 𑄤

consonant 𑄢

consonant 𑄣

lʰ

consonant 𑅄 Used by the Baarah Maatraa orthography.

consonant 𑄠

consonant 𑄡 (Confirmation needed.)

Encoding choices

This section offers advice about characters or character sequences to avoid, and what to use instead. It takes into account the relevance of Unicode Normalisation Form D (NFD) and Unicode Normalisation Form C (NFC).

Although usage is recommended here, content authors may well be unaware of such recommendations. Therefore, applications should look out for the non-recommended approach and treat it the same as the recommended approach wherever possible.

Canonically equivalent encodings

Two letters can be represented as an atomic character (the norm), or as a sequence of combining marks. The parts are separated in Unicode Normalisation Form D (NFD), and atomic in Unicode Normalisation Form C (NFC), so both approaches should be treated as canonically equivalent.

Atomic (recommended)	Decomposed (NOT recommended)
𑄮	U+11131 U+11127
𑄯	U+11132 U+11127

Normally, text will use the atomic form, and this is generally recommended by the Unicode Standard.
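The equivalence can be checked with Python's standard `unicodedata` module, as in the sketch below. The helper name `same_text` is hypothetical, and the results depend on the Python build carrying Unicode data that includes the Chakma block.

```python
import unicodedata

KAA = "\U00011107"
ATOMIC_O = "\U0001112E"                  # CHAKMA VOWEL SIGN O
ATOMIC_AU = "\U0001112F"                 # CHAKMA VOWEL SIGN AU
DECOMPOSED_O = "\U00011131\U00011127"    # O MARK + VOWEL SIGN A
DECOMPOSED_AU = "\U00011132\U00011127"   # AU MARK + VOWEL SIGN A


def same_text(a, b):
    """Compare two strings after bringing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# NFD splits the atomic characters; NFC puts them back together.
print(unicodedata.normalize("NFD", ATOMIC_O) == DECOMPOSED_O)
print(unicodedata.normalize("NFC", DECOMPOSED_AU) == ATOMIC_AU)
print(same_text(KAA + ATOMIC_O, KAA + DECOMPOSED_O))
```

Normalising both sides before comparison is one way for an application to treat the non-recommended form the same as the recommended one.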

False friends

The following atomic characters look as if they could be composed of parts, but in fact there is no equivalence during normalisation, and so the atomic characters only should be used.

Atomic	Sequence (DO NOT use!)
𑄰	U+1112D U+11127
𑄮	U+11127 U+11133 U+11124
𑄫	U+1112A U+1112A
𑄂	U+11101 U+11101
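Because normalisation does not help with these look-alikes, they have to be found explicitly. The sketch below scans text for the sequences in the table and reports the atomic character that was probably intended. The code points of the atomic characters are this editor's reading of the table, and the names `FALSE_FRIENDS` and `find_false_friends` are hypothetical. Hits are reported rather than replaced, since a human should confirm the intent.

```python
import unicodedata

# Look-alike sequence -> atomic character it resembles (from the table above).
FALSE_FRIENDS = {
    "\U0001112D\U00011127": "\U00011130",            # AI + A        resembles OI
    "\U00011127\U00011133\U00011124": "\U0001112E",  # A + VIRAMA + WAA resembles O
    "\U0001112A\U0001112A": "\U0001112B",            # U + U         resembles UU
    "\U00011101\U00011101": "\U00011102",            # two ANUSVARA  resemble VISARGA
}


def find_false_friends(text):
    """Return sorted (index, look-alike sequence, suggested atomic character) tuples."""
    hits = []
    for sequence, atomic in FALSE_FRIENDS.items():
        start = text.find(sequence)
        while start != -1:
            hits.append((start, sequence, atomic))
            start = text.find(sequence, start + 1)
    return sorted(hits)


# Normalisation leaves every look-alike distinct from its atomic counterpart.
unified = any(unicodedata.normalize("NFC", s) == a for s, a in FALSE_FRIENDS.items())
sample = "\U00011107\U0001112A\U0001112A"  # KAA followed by two U signs
print(unified, len(find_false_friends(sample)))
```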

Codepoint order

Combining marks always follow the base character.

Where present, characters in an orthographic syllable should always occur in the following order.

1. A consonant or independent vowel.
2. 𑄴
3. 𑄳 followed by another consonant.
4. One of 𑄱 or 𑄲 (in decomposed text only!).
5. A dependent vowel.
6. 𑄁 or 𑄂.
7. 𑄀.
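The order listed above can be expressed as a regular expression for checking a single orthographic syllable. This is a sketch of the list as stated, under some assumptions: the character ranges are this editor's reading of the Chakma block, only one stacked consonant is allowed, and traditional-orthography sequences (such as virama plus independent vowel, or two vowel signs) are deliberately not covered. The name `follows_order` is hypothetical.

```python
import re

CONSONANT = "[\U00011107-\U00011126\U00011144\U00011147]"
BASE = "[\U00011103-\U00011126\U00011144\U00011147]"  # independent vowels and consonants
VOWEL_SIGN = "[\U00011127-\U00011130\U00011145\U00011146]"

SYLLABLE = re.compile(
    BASE
    + "\U00011134?"                        # maayyaa
    + "(?:\U00011133" + CONSONANT + ")?"   # virama followed by another consonant
    + "[\U00011131\U00011132]?"            # O MARK or AU MARK (decomposed text only)
    + VOWEL_SIGN + "?"                     # a dependent vowel
    + "[\U00011101\U00011102]?"            # anusvara or visarga
    + "\U00011100?"                        # candrabindu
)


def follows_order(syllable):
    """True if the code points of one orthographic syllable occur in the listed order."""
    return SYLLABLE.fullmatch(syllable) is not None


good = "\U0001110C\U00011134\U00011133\U00011126\U0001112A"  # CAA, MAAYYAA, VIRAMA, HAA, U
bad = "\U00011116\U00011133\U00011120\U00011134"             # TAA, VIRAMA, YYAA, MAAYYAA
print(follows_order(good), follows_order(bad))
```

A checker like this is better suited to flagging text for review than to rejecting it, given that both orderings of maayyaa and virama occur in real content.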

Adjacent maayyaa and virama

A number of words contain both 𑄴 and 𑄳 in the same consonant cluster. It is possible to find both of the following sequences of characters in online text:

C𑄳C𑄴

C𑄴𑄳C

The Noto and RibengUni fonts support either ordering, with no difference in the rendered result.

Everson and the Unicode Standard (whose text is derived from Everson's proposal) seem to assume that both the virama and the maayyaa are present to kill a vowel, and they have text to indicate that there is no justification for having both combining marks side by side in storage. However, since the maayyaa doesn't have the role of killing the vowel here, but instead indicates gemination of the initial character in the cluster, it is logical to use the order:

C𑄴𑄳C

The second consonant is usually 𑄠 or 𑄦. The following are examples found in a single page.

With YA: 𑄆𑄙𑄮𑄇𑄴𑄳𑄠𑄚𑄴 • 𑄉𑄧𑄖𑄴𑄳𑄠 • 𑄝𑄚𑄬𑄝𑄖𑄴𑄳𑄠 • 𑄞𑄣𑄧𑄇𑄴𑄳𑄦𑄚𑄨 • 𑄟𑄧𑄖𑄴𑄳𑄠 • 𑄥𑄧𑄇𑄴𑄳𑄠
With HA: 𑄆𑄇𑄴𑄳𑄦𑄚𑄴 • 𑄇𑄧𑄙𑄞𑄇𑄴𑄳𑄦𑄚𑄨 • 𑄇𑄧𑄙𑄞𑄇𑄴𑄳𑄦𑄚𑄴 • 𑄇𑄩𑄝𑄮𑄖𑄴𑄳𑄦𑄚𑄴 • 𑄑𑄬𑄇𑄴𑄌𑄴𑄳𑄦𑄚𑄴

It is worth noting that the maayyaa is rendered over the initial letter in the conjunct, regardless of the code point sequence in memory.
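Since both orders render identically, an application that searches or compares text may want to bring them to one form first. The sketch below rewrites consonant, virama, consonant, maayyaa as consonant, maayyaa, virama, consonant. It rests on a strong assumption: that every such sequence is the geminated-stack case discussed here, rather than a stack whose second consonant genuinely has its vowel killed. The function name `prefer_maayyaa_first` and the consonant range are this editor's own.

```python
import re

VIRAMA = "\U00011133"
MAAYYAA = "\U00011134"
CONSONANT = "[\U00011107-\U00011126\U00011144\U00011147]"

# consonant, virama, consonant, maayyaa
_TRAILING_MAAYYAA = re.compile(f"({CONSONANT}){VIRAMA}({CONSONANT}){MAAYYAA}")


def prefer_maayyaa_first(text):
    """Move a maayyaa stored after a stack to just after the stack's first consonant."""
    return _TRAILING_MAAYYAA.sub(
        lambda m: m.group(1) + MAAYYAA + VIRAMA + m.group(2), text
    )


TAA, YYAA = "\U00011116", "\U00011120"
trailing = TAA + VIRAMA + YYAA + MAAYYAA
preferred = TAA + MAAYYAA + VIRAMA + YYAA
print(prefer_maayyaa_first(trailing) == preferred)
```

Text already in the preferred order is left unchanged, so the function can be applied repeatedly without harm.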

Glyph shaping & positioning

Experiment with examples using the Chakma character workbench.

Context-based shaping & positioning

The glyphs used for Chakma in India and Bangladesh differ slightly in roundness (similar to variation in the Tai Tham script as used in Northern Thai and Tai Khün) [mh §1].

Base characters can carry multiple combining marks. For example, in addition to a vowel sign a base consonant may carry one or more of the following diacritics: 𑄴, 𑄁, 𑄂, 𑄀. In some cases the glyphs for multiple combining marks need to be positioned side by side or carefully positioned relative to each other, as shown in the examples just below.

𑄇𑄟𑄇𑄴𑄭

𑄇,𑄟,𑄇,𑄴,𑄭

𑄇𑄨𑄠𑄮𑄁

𑄇,𑄨,𑄠,𑄮,𑄁

Generally speaking, there is no interaction between consonant characters, but where consonant characters are stacked or ligated then it becomes necessary for the font to apply the needed shaping and placement of glyphs.

eg.

𑄌𑄋𑄴𑄟𑄳𑄦

Most subjoined letters are just smaller versions of the original consonant letter, but significantly different shapes are used for subjoined r and y. Compare the following:

components	rendered
𑄇𑄳𑄢	𑄇𑄳𑄢
𑄇𑄳𑄠	𑄇𑄳𑄠

eg.

𑄟𑄳𑄢𑄨𑄖𑄴𑄨𑄇

Observation: The Noto Sans Chakma font has various special glyph forms. Among those are forms such as the following, which appear to represent sounds such as ʈjeːi̯ and ʈje.

Complex glyph shapes in the Noto Sans Chakma font.

𑄑𑄳𑄠𑅆,𑄑,𑄳,𑄠,𑅆

𑄑𑄳𑄠𑄳𑄆,𑄑,𑄳,𑄠,𑄳,𑄆

Punctuation & inline features

Phrase & section boundaries

phrase	, ;
sentence	𑅁 𑅃 𑅂
section	𑅀

Bracketed text

	start	end
standard	(	)

Notes, footnotes, etc

See inlinenotes for purely inline annotations, such as ruby or warichu. This section is about annotation systems that separate the reference marks and the content of the notes.

	labial	labio- dental	alveolar	post- alveolar	retroflex	palatal	velar	glottal
stop	p b		t d		ʈ ɖ		k ɡ
	pʰ bʰ		tʰ dʰ		ʈʰ ɖʰ		kʰ ɡʰ
affricate				t͡ʃ d͡ʒ
				t͡ʃʰ d͡ʒʰ
fricative		v	s z	ʃ				h
nasal	m	ɱ	n		ɳ	ɲ	ŋ
approximant	w		l			j
trill/flap			r		ɽ
