Kirat Rai, Bantawa

orthography notes

Updated 23 October, 2025

This page brings together basic information about the Kirat Rai script and its use for the Bantawa language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Bantawa using Unicode.

Referencing this document

Richard Ishida, Kirat Rai (Bantawa) Orthography Notes, 23-Oct-2025, https://r12a.github.io/scripts/krai/bap

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size:   28

š–µ¢š–µ£š–µˆš–µœš–µ€š–µ›š–µ¤ š–µƒš–µ‘š–µ„š–µ–š–µ£š–µƒš–µ© š–µƒš–µ¢š–µ£š–µˆš–µžš–µ£š–µ…š–µ¤š–µ› š–µ›š–µ¤š–µ–š–µ«š–µš–µ©š–µ†š–µ¬š–µœš–µ£š–µˆ-š–µ”š–µ£š–µžš–µ¤š–µ š–µ£š–µƒš–µ© ā€œš–µƒš–µ©ā€ š–µš–µ„š–µ—š–µžš–µ£š–µ”š–µ£ š–µ™š–µ§š–µˆš–µŸš–µ£-𖵙𖵄𖵠𖵄𖵢𖵩𖵖 š–µ¢š–µ©š–µˆš–µ…š–µ£š–µ›š–µ”š–µ£ š–µœš–µˆš–µ…š–µ£š–µ– š–µ–š–µˆ š–µƒš–µ§š–µ–š–µ…š–µ£š–µ›š–µ‰š–µ¤ š–µ—š–µ©š–µœš–µ£š–µ„š–µžš–µ£ š–µ›š–µ„š–µ›š–µ£š–µ…š–µ£š–µš–µ«š–µ›š–µ£š–µƒš–µ© 𖵠𖵣𖵖𖵀 š–µ”š–µ„š–µˆš–µ”š–µ£ š–µ‰š–µ„š–µˆš–µ¢š–µ£š–µˆ-𖵛𖵣 𖵛𖵄𖵛𖵣 š–µžš–µ¤š–µ š–µ£š–µƒš–µ© š–µœš–µ£š–µƒš–µ£š–µƒš–µ®

Source: Sikkim Herald article

Usage & history

Kirat Rai is a South Asian abugida used in the Indian state of Sikkim to write the Bantawa language. It is sometimes called ā€œKhambu Rai Lipiā€ in West Bengal. The script is used for poetry, newspapers, educational materials, and government records, among others. It was devised developed by Late Kripasalyan Rai in 1981-1982 from the Devanagari script. The Bantawa language is taught in schools up to the primary level ever since it was recognized as one of the official languages of Sikkim in 1997.

Kirat Rai doesn't have the conjunct forms, reordering, or combining marks of most Brahmi-derived scripts. Words are separated using spaces.

More information: Unicode Proposal • Unicode

Basic features

The Kirat Rai script is an abugida, ie. each consonant contains an inherent vowel sound. See the table to the right for a brief overview of the Kirat Rai script's features (not specific to this orthography).

Bantawa text runs left to right in horizontal lines. There is no case distinction. Words are separated by spaces.

āÆ consonantSummary

Bantawa represents native consonant sounds using 29 basic letters, but 2 more consonants are used to represent sounds in borrowed words.

Given that consonants normally include an inherent vowel, the orthography needs a way to indicate when a consonant is not followed by a vowel.

All consonant clusters are written using standard letters with a virama letter between them. The virama is a letter, rather than a combining mark. Onset clusters use 𖵬 as a virama, whereas other locations use 𖵫. Geminated consonants are also written by repeating the consonant letter with a virama letter between them. Kirat Rai has no conjunct shaping.

Syllable codas can be written with ordinary consonant letters, or using one of 2 dedicated letters resembling Sanskrit's bindu and visarga marks.

āÆ basicV

Kirat Rai consonants carry the inherent vowel a, or in some regions ʌ.

The 'vowel signs' used to write all vowels in Kirat Rai are all free-standing letters. There are no combining marks. All vowel letters follow the consonant letter; there are no pre-base glyphs or circumgraphs.

Vowels can be nasalised using the punctuation-like letter 𖵁 after the vowel.

Kirat Rai has no independent vowel letters, but instead uses š–µƒ as a carrier for standalone vowels. The vowel to be pronounced is indicated by following it with a vowel letter.

Kirat Rai has a set of native digits. It uses ASCII code points for punctuation marks, but also has its own dandas for sentence and section delimiters.

Character index

Letters

Show

Basic consonants

š–µƒ,𖵄,š–µ…,𖵆,𖵇,š–µˆ,𖵉,š–µŠ,𖵋,š–µŒ,š–µ,š–µŽ,š–µ,𖵐,𖵑,š–µ’,𖵓,š–µ”,𖵕,š–µ–,š–µ—,š–µ˜,š–µ™,š–µš,š–µ›,š–µœ,š–µ,š–µž,š–µŸ,š–µ ,š–µ”,š–µ¢

Vowels

š–µ£,𖵤,𖵄,𖵦,š–µ§,𖵨,𖵩,š–µŖ

Other

š–µ€,𖵁,𖵂,𖵫,𖵬

Numbers

Show
š–µ°,š–µ±,š–µ²,š–µ³,𖵓,š–µµ,š–µ¶,š–µ·,š–µø,š–µ¹

Punctuation

Show
š–µ­,š–µ®,𖵯,ā€˜,’,ā€œ,ā€

ASCII

!,(,),␣,.,:,;,?
Items to show in lists

Phonology

The following represents the repertoire of the Bantawa language.

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Vowel sounds

Plain vowels

ĩ i ɨ ɨ u uː ũ ẽ e o oː ʌ ʌ a aː ã a aː ã

Complex vowels

aɪ aʊ

Consonant sounds

labial labio-
dental
dental alveolar post-
alveolar
retroflex palatal velar glottal
stop p b     t d   ʈ ɖ c ɟ k É” Ź”
  pʰ bʰ     tʰ dʰ   ʈʰ ɖʰ   kʰ ɔʰ  
affricate       tĶ”s dĶ”z          
        tĶ”sʰ dĶ”zʰ          
fricative       s ʃ       h
nasal m     n     ɲ ŋ
approximant w     l     j  
trill/flap       r    
  

Tone

Bantawa is not a tonal language.

Structure

tbd

Vowels

Vowel summary table

This table only summarises basic vowel to character assignments.

ā“˜ represents the inherent vowel. The letter 𖵁 follows a vowel to indicate nasalisation (not shown here).

  post-consonant standalone
Simple
𖵤,𖵦,𖵄
š–µƒš–µ¤,š–µƒš–µ¦,š–µƒš–µ„
š–µ§,𖵩
š–µƒš–µ§,š–µƒš–µ©
ā“˜,š–µ£
š–µƒ,š–µƒš–µ£
Diphthongs
𖵨,š–µŖ
š–µƒš–µØ,š–µƒš–µŖ

For additional details click on the phonemes above to see equivalents in vowel_mappings.

Inherent vowel

𖵄 ka

The inherent vowel for Kirat Rai is pronounced a or in some regions ʌ. So ka is written by simply using the consonant letter.

eg.

š–µ„š–µ¤š–µš–µ’ š–µš–µØ

𖵄,𖵤,š–µ,š–µ’

Since Kirat Rai consonants normally include an inherent vowel, the orthography has ways to indicate a consonant that is not followed by a vowel sound. See novowel.

Post-consonant vowels

𖵄𖵤 ki

The 'vowel signs' used to write all vowels in Kirat Rai are all free-standing letters. Even though this is an abugida, there are no combining marks. All vowel letters follow the consonant letter; there are no pre-base glyphs or circumgraphs.

Although several of these characters are clearly built from the same components, it is recommended to use the atomic code points.

Vowels can be nasalised using a punctuation-like letter after the vowel.

Plain vowels

The plain vowels of Kirat Rai are written using the following letters.

𖵤,𖵦,𖵄,š–µ§,𖵩,š–µ£
eg.

𖵠𖵄𖵅𖵫𖵅𖵤𖵛

𖵗𖵦

Diphthongs

Kirat Rai has the following dedicated letters for post-consonant diphthongs.

𖵨,š–µŖ
eg.

š–µŠš–µŖš–µš–µŸš–µ£

Nasalisation

Vowel nasalisation is indicated using 𖵁 after the vowel.

eg.

š–µœš–µ£š–µ š–µ£š–µš–µŸš–µ£

š–µŠš–µŖš–µš–µŸš–µ£

Standalone vowels

š–µƒš–µ¤ i

Kirat Rai has no independent vowel letters, but instead uses š–µƒ as a carrier for standalone vowels. This can represent a zero or glottal stop onset. The vowel to be pronounced is indicated by following it with the appropriate vowel letter.

eg.

š–µƒš–µ¤š–µ›š–µ£

š–µƒš–µ¤,š–µ›,š–µ£

š–µŠš–µ§š–µš–µƒš–µ¤š–µŸš–µ£

š–µŠš–µ§š–µ,š–µƒš–µ¤,š–µŸš–µ£

Used alone, this letter represents the standalone vowel a.

eg.

š–µƒš–µˆš–µ«š–µ„š–µ£

The following list shows how to write each of the standalone vowels.

š–µƒš–µ¤,š–µƒš–µ¦,š–µƒš–µ„,š–µƒš–µ§,š–µƒš–µ©,š–µƒ,š–µƒš–µ£, ,š–µƒš–µØ,š–µƒš–µŖ

Vowel sounds to characters

This section maps Bantawa vowel sounds to common graphemes in the Kirat Rai orthography.

Plain vowels

i

vowel 𖵤

ÉØ

vowel 𖵦

u

vowel 𖵄

e

vowel š–µ§

o

vowel 𖵩

a

inherent vowel

vowel carrier š–µƒ Pronounced /a/ when used as a standalone vowel in a syllable onset.

aː

vowel š–µ£

Complex vowels

aÉŖ

vowel 𖵨

aʊ

vowel š–µŖ

Nasalisation

ā—ŒĢƒ

nasalisation 𖵁

Vowel absence

Vowel absence principally occurs either when a consonant is a syllable coda, or when a consonant is part of a consonant cluster.

Follow these links for more information.

  1. A visible virama after the consonant with no following vowel.
  2. Unmarked vowel absence (word finally).
  3. Coda diacritics. Combining marks for syllable codas.

Visible virama

Word-internally, suppression of the inherent vowel is marked, but Kirat Rai is unusual in that it has 2 vowel killers:

𖵫,𖵬

𖵬 is only used to mute the inherent vowel of the first letter of the word, whereas 𖵫 is the most frequently used and appears in all other positions. Also unusual, these vowel killers are letters, rather than combining marks.

eg.

š–µšš–µ¬š–µœš–µ£š–µ„š–µ«š–µ š–µ¤š–µ–

𖵛𖵬𖵢𖵄𖵛𖵣

š–µ…š–µ£š–µŽš–µ«š–µ›š–µ£

š–µ‰š–µ£š–µƒš–µ«š–µŸš–µ£

Unmarked vowel absence

The inherent vowel is not usually pronounced at the end of a word, though the absence is not marked in writing.

eg.

š–µ›š–µ£š–µ€š–µ„š–µ©š–µžš–µ§š–µ–

𖵛𖵣𖵀,𖵄𖵩,š–µžš–µ§š–µ–

Consonants

Consonant summary table

This table only summarises basic consonant to character assignments.

Ordinary consonant letters can also appear as a syllable codas; we only list under ā€˜Codas’ the dedicated code points that are not used in onsets.

Onsets & codas
š–µ—,š–µ™,š–µ’,š–µ”,š–µŽ,𖵐,𖵄,𖵆,š–µƒ
š–µ˜,š–µš,𖵓,𖵕,š–µ,𖵑,š–µ…,𖵇
𖵉,𖵋,š–µŠ,š–µŒ
š–µ ,š–µ”,š–µ¢
š–µ›,š–µ–,š–µ,š–µˆ
š–µŸ,š–µ,š–µž,š–µœ
Coda only
𖵂,š–µ€,š–µ€

For additional details see consonant_mappings.

Basic consonants

The following are the basic consonant letters used for writing Bantawa.

Click on each letter for more details and for examples of usage.

š–µ—,š–µ˜,š–µ™,š–µš,š–µ’,𖵓,š–µ”,𖵕,š–µŽ,š–µ,𖵐,𖵄,š–µ…,𖵆,𖵇,š–µƒ,𖵉,š–µŠ,𖵋,š–µŒ,š–µ ,š–µ”,š–µ¢,š–µ›,š–µ–,š–µˆ,š–µŸ,š–µ,š–µž,š–µœ

š–µƒ represents Ź” as a syllable coda, and also as part of a standalone vowel sequence. Alone as a syllable onset it represents a. See standalone.

Repertoire extension

In addition, Bantawa texts may contain the following letters, but they are used for borrowed words.up,4

š–µ,𖵑

Onsets

Bantawa does have some consonant clusters at the beginning of a syllable, but these are indicated using 𖵬 between consonant letters. There are no dedicated code points for medial consonants, but it's worth noting that the SAAT character is only used for killing the vowel in the syllable onset.

eg.

š–µ¢š–µ¬š–µœš–µ„š–µ–š–µ¤

𖵛𖵬𖵢𖵄𖵛𖵣

See also standalone.

Codas

Syllable codas are generally written using ordinary consonant letters (followed by a virama if word-internal).

eg.

š–µšš–µ¬š–µœš–µ£š–µ„š–µ«š–µ š–µ¤š–µ–

š–µ…š–µ£š–µŽš–µ«š–µ›š–µ£

Kirat Rai does have a couple of code points dedicated to syllable codas that look familiar to users of Indic scripts, but all of the glyphs representing final consonants have the general category of letter, rather than combining mark.

Nasal codas. š–µ€ is used for nasal codas. The phonetic value, n or ŋ, depends on the articulation of the following consonant.

eg.

š–µ›š–µ£š–µ€š–µ„š–µ©š–µžš–µ§š–µ–

š–µ›š–µ£š–µ€š–µ š–µ£š–µŸš–µ£

Glottal stop codas. A syllable-final glottal stop can be written in one of two ways: (a) using 𖵂, or (b) using š–µƒ. The same word can be written in either way.

eg.

š–µƒš–µ£š–µ‚š–µŸš–µ£

š–µƒš–µ£š–µƒš–µ«š–µŸš–µ£

š–µ‰š–µ£š–µƒš–µ«š–µŸš–µ£

š–µ‰š–µ£š–µ‚š–µŸš–µ£

Note that word-medially a virama is needed after the vowel, but not after the visarga.

Consonant length

Geminated consonants occur in Kirat Rai, but they are simply written using 2 instances of the letter with an intervening 𖵫.

eg.

𖵠𖵄𖵅𖵫𖵅𖵤𖵛

Consonant sounds to characters

This section maps Bantawa consonant sounds to common graphemes in the Kirat Rai orthography.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

p

consonant š–µ—

pʰ

consonant š–µ˜

b

consonant š–µ™

bʰ

consonant š–µš

t

consonant š–µ’

tʰ

consonant 𖵓

t͔s

consonant 𖵉

t͔sʰ

consonant š–µŠ

d

consonant š–µ”

dʰ

consonant 𖵕

d͔z

consonant 𖵋

d͔zʰ

consonant š–µŒ

ʈ

consonant š–µŽ

ʈʰ

consonant š–µ

ɖ

consonant 𖵐

ɖʰ

consonant 𖵑

k

consonant 𖵄

kʰ

consonant š–µ…

É”

consonant 𖵆

ɔʰ

consonant 𖵇

Ź”

glottal stop 𖵂

vowel carrier š–µƒ As a vowel carrier. (Pronounced /a/ when used as a standalone vowel in a syllable onset.)

s

consonant š–µ 

ʃ

consonant š–µ” Used in borrowed words only.

h

consonant š–µ¢

m

consonant š–µ›

n

consonant š–µ–

nasal coda š–µ€

ɲ

consonant š–µ Used in borrowed words only.

ŋ

consonant š–µˆ

nasal coda š–µ€

w

consonant š–µŸ

r

consonant š–µ

l

consonant š–µž

j

consonant š–µœ

Encoding choices

This section offers advice about characters or character sequences to avoid, and what to use instead. It takes into account the relevance of Unicode Normalisation Form D (NFD) and Unicode Normalisation Form C (NFC)..

Although usage is recommended here, content authors may well be unaware of such recommendations. Therefore, applications should look out for the non-recommended approach and treat it the same as the recommended approach wherever possible.

Canonically equivalent encodings

Several vowel letters can be visually analysed as an atomic character (the norm), or as a sequence of component parts. The Unicode Standard recommends that authors use the atomic code points, rather than a sequence of component parts.

Atomic (recommended) Decomposed ( NOT recommended )
𖵩 𖵩
𖵨 š–µ§š–µ§
š–µŖ 𖵪
š–µŖ 𖵪

Inappropriate sequence

One additional vowel letter was sometimes composed from 2 code points in pre-Unicode encodings. In Unicode there is no equivalence between the atomic character and the sequence shown below, so the atomic character must be used. This also preserves better semantics.

Use this Do NOT use
𖵦 𖵄𖵭

Numbers

Digits

Kirat Rai has a set of native, decimal digits

š–µ°,š–µ±,š–µ²,š–µ³,𖵓,š–µµ,š–µ¶,š–µ·,š–µø,š–µ¹

Text direction

Kirat Rai text runs left to right in horizontal lines.

Show default bidi_class properties for characters in the Kirat Rai orthography described here.

Glyph shaping & positioning

Experiment with examples using the Kirat Rai character app.

Context-based shaping & positioning

Kirat Rai letters don't interact, so no special shaping is needed.

The are no combining marks, so no context-sensitive shaping is needed, either.

Typographic units

Word boundaries

Words are separated by spaces.

Some words are hyphenated.

eg.

š–µ š–µ£š–µœš–µ£-š–µ™š–µ„š–µˆ

Graphemes

Graphemes in Kirat Rai consist of single letters or letters with no combining marks. This means that text can be segmented into typographic units using grapheme clusters.

Phrase, sentence, and section delimiters are described in phrase.

Punctuation & inline features

Phrase & section boundaries

Kirat Rai uses ASCII punctuation marks for the most part.

phrase

,

;

:

sentence

š–µ®

.

?

!

section

𖵯

It also has its own dandas, used to end a sentence and a section, respectively.

Bracketed text

See type samples.

Wancho commonly uses ASCII parentheses to insert parenthetical information into text.

  start end
standard

(

)

Quotations & citations

See type samples.

Kirat Rai texts use the following punctuation around quotations. Of course, due to keyboard design, quotations may also be surrounded by ASCII double and single quote marks.

  start end
initial

ā€œ

ā€

nested

ā€˜

’

The row labelled 'initial' indicates the usual default quote marks. When an additional quote is embedded within the first, the quote marks are those in the 'nested' row.

Abbreviation, ellipsis & repetition

Kirat Rai uses š–µ­ to indicate abbreviation.

eg.

𖵐𖵭𖵋𖵭𖵗𖵤𖵭

Line & paragraph layout

Line breaking & hyphenation

Lines are generally broken between words.

Text alignment & justification

Full justification is achieved by adjusting the spaces between words.

Page & book layout

Online resources

  1. Universal Declaration of Human Rights - Assyrian Neo-Aramaic
  2. The Bible in Assyrian Neo-Aramaic

References