Mundari orthography notes

Usage & history

The Mundari language is spoken in northeast India, primarily in the states of Jharkhand, West Bengal, and Odisha, by around 1.1 million people (Ethnologue, https://www.ethnologue.com/language/unr/). Mundari may be written in the Devanagari, Bengali, and Oriya scripts, as well as in Nag Mundari (also known as Mundari Bani). The Unicode proposal describes a 'huge' surge of interest in the script in recent years, with schools teaching Mundari Bani and workshops held in many Munda-inhabited districts. The state government of Odisha runs the Multilingual Education (MLE) Programme to teach tribal children their mother tongue, and Mundari is one of the languages covered [m]. It is estimated that approximately 10% of Mundari speakers can read Mundari Bani [wm].

𞓧𞓟𞓨𞓜𞓕𞓣𞓚

The script was invented by Rohidas Singh Nag (1934-2012) in the latter half of the 20th century. A significant reform of the script took place in 2008, and all material printed today uses that version of the orthography. The reform simplified or changed the shape of some letters, and added symbols for ɳ and w. The Unicode proposal [wm,7-8] has a table showing the differences.

Basic features

The Nag Mundari script is an alphabet. Both consonants and vowels are indicated by letters, and the script is mostly quite straightforward. See the table to the right for a brief overview of features for the modern Mundari orthography.

Mundari text runs from left to right in horizontal lines. Words are separated by spaces. The orthography is unicameral.

See also: Consonant summary table.

Mundari represents consonants using 22 basic letters, plus one diacritic.

An unusual feature of Nag Mundari is that the sound w is written using a diacritic below the letter representing the following vowel. This applies for standalone vowels as well as for syllable onset clusters.

Another unusual feature of Mundari is that word-final b and d may be pronounced as checked sounds ˀb̥(ᵐ) or ˀd̥(ⁿ). This can be signalled by placing the special letter 𞓫 before the consonant.

A diacritic can be used to extend the repertoire when close transcription of words from neighbouring languages is desired.

There are no conjunct forms or ligatures. Gemination is not a feature of the Mundari language, but 𞓫 may be used to indicate gemination in close transcriptions of neighbouring languages.

See also: Vowels.

The Mundari orthography is an alphabet that writes vowels using 5 vowel letters. Two diacritics indicate vowel length and nasalisation; they are placed above and to the right side of the base, and may overlap following letters. The use of these diacritics varies from one author to another.

Standalone vowel sounds are written using the normal vowel letters.

Punctuation includes that used for the Latin script. Mundari has its own set of digits.
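As a rough illustration of what a separate digit set means for software, the sketch below maps Nag Mundari digits to ASCII digits so that a number can be parsed. It assumes the digits zero to nine are encoded in order at U+1E4F0 to U+1E4F9, as in the Unicode Nag Mundari block; the code points are not given in these notes, and the function name is hypothetical.

```python
# Assumption: Nag Mundari digits zero to nine sit at U+1E4F0..U+1E4F9, in order.
NAG_MUNDARI_ZERO = 0x1E4F0

# Translation table from each Nag Mundari digit to its ASCII counterpart.
DIGIT_TABLE = {NAG_MUNDARI_ZERO + i: str(i) for i in range(10)}


def to_ascii_digits(text: str) -> str:
    """Replace Nag Mundari digits with ASCII digits; leave everything else untouched."""
    return text.translate(DIGIT_TABLE)


sample = "\U0001E4F2\U0001E4F0\U0001E4F0\U0001E4F8"  # digits two, zero, zero, eight
print(int(to_ascii_digits(sample)))  # 2008
```

An explicit table is used here rather than relying on a runtime's built-in Unicode data, since older runtimes may predate the encoding of the script.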

Line-breaking and justification are primarily based on inter-word spaces.

Phonology

The following represents the general repertoire of the Mundari language.


Phones in a lighter colour are non-native or allophones. Source: Wikipedia.

Vowel sounds

Vowels can be long or short, but length is not phonemically contrastive [o,100].

Vowels following a nasal, following d͡z (optional), or preceding ɳ are normally nasalised. Nasalisation is not contrastive either [o,100].

Consonant sounds

	labial	alveolar	retroflex	palatal	velar	glottal
stop	p b	t d	ʈ ɖ		k ɡ	ʔ
aspirated	pʰ	tʰ	ʈʰ		kʰ
affricate		t͡ɕ d͡ʑ
aspirated		d͡ʑʰ
fricative		s				h
nasal	m	n	ɳ	ɲ	ŋ
approximant	w	l		j
trill/flap		r	ɽ

Aspiration only occurs in the Naguri and Kera dialects [wl].

Checked consonants

Checked consonants are a special feature of Mundari phonology. They may occur in morpheme-final position as ˀb̥(ᵐ) or ˀd̥(ⁿ). They are pronounced by closing the glottis while articulating the stop; the glottal stop is then released (in monosyllabic words only), optionally with a nasal release.

Tone

Mundari has no tones.

Structure

See page 101 of Osada for detailed information about where sounds appear in words.

w and j never occur in word-initial position [o,101].

Vowels

Vowel summary table

This table summarises basic vowel to character assignments.

Another diacritic is added to the vowels to indicate nasalisation (not shown here).

	𞓚␣𞓚𞓭␣ ␣𞓟␣𞓟𞓭
	𞓤␣𞓤𞓭␣ ␣𞓐␣𞓐𞓭
	𞓕␣𞓕𞓭

For additional details see Vowel sounds to characters.

Post-consonant vowels

Nag Mundari writes vowels that follow consonants using 5 vowel letters. Two diacritics indicate vowel length and nasalisation; they are placed above and to the right side of the base, and may overlap following letters. The use of these diacritics varies from one author to another.

There are no special mechanisms to indicate the absence of a vowel.

Vowel letters

The standard vowel sounds for Mundari are written as follows.

𞓚␣𞓟␣𞓤␣𞓐␣𞓕

Diacritics used with vowels

𞓭␣𞓬

𞓭 indicates vowel length (see Long vowels).

𞓬 indicates nasalisation (see Nasalisation).

These diacritics are placed above and to the right side of the base, and may overlap following letters. Their use varies from one author to another.
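Because the use of these marks varies between authors, software that compares or searches Mundari text may want to treat marked and unmarked spellings as equivalent. The following is a minimal sketch of that idea, assuming such loose matching is acceptable for the task at hand; the function names are hypothetical.

```python
LENGTH_MARK = "\U0001E4ED"  # vowel length mark
NASAL_MARK = "\U0001E4EC"   # nasalisation mark

# Deletion table: both optional marks are removed when building a search key.
_OPTIONAL_MARKS = str.maketrans("", "", LENGTH_MARK + NASAL_MARK)


def search_key(word: str) -> str:
    """Return the word with length and nasalisation marks removed."""
    return word.translate(_OPTIONAL_MARKS)


def loosely_equal(a: str, b: str) -> bool:
    """Compare two spellings while ignoring the optional vowel marks."""
    return search_key(a) == search_key(b)


marked = "\U0001E4D7\U0001E4DA\U0001E4ED\U0001E4D4"  # spelling with the length mark
plain = "\U0001E4D7\U0001E4DA\U0001E4D4"             # same letters, mark omitted
print(loosely_equal(marked, plain))  # True
```

The original text is left untouched; only the comparison key is simplified, so no author's spelling is altered.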

Long vowels

𞓭

𞓭 (U+1E4ED) is used to indicate long vowels, but not all long vowels are so marked.

𞓐𞓪𞓕𞓭

𞓗𞓚𞓭𞓔

Nasalisation

𞓬

Nasalisation of vowels is indicated using 𞓬 (U+1E4EC).

𞓧𞓟𞓬

Standalone vowels

Standalone vowel sounds are simply represented using ordinary vowel letters.

Vowel sounds to characters

This section maps Mundari vowel sounds to common graphemes in the Nag Mundari orthography.

sound	grapheme
i	𞓚
iː	𞓚𞓭
ĩː	𞓚𞓬
u	𞓟
uː	𞓟𞓭
ũː	𞓟𞓬
e	𞓤
eː	𞓤𞓭
ẽː	𞓤𞓬
o	𞓐
oː	𞓐𞓭
õː	𞓐𞓬
a	𞓕
aː	𞓕𞓭
ãː	𞓕𞓬

Consonants

Consonant summary table

This table summarises basic consonant to character assignments.

Additional sounds can be produced to match those of surrounding languages using 𞓯 (U+1E4EF). Those combinations are not shown here.

Onsets	𞓑␣𞓗␣𞓝␣𞓡␣𞓩␣𞓜␣𞓢␣𞓦␣𞓙
	𞓠␣𞓖
	𞓛␣𞓞
	𞓧␣𞓨␣𞓘␣𞓥␣𞓔
	𞓮␣𞓣␣𞓪␣𞓒␣𞓓
Finals	𞓫𞓗␣𞓫𞓡

For additional details see Consonant sounds to characters.

Basic consonants

Basic consonant sounds in Nag Mundari are written using the following letters.


𞓑␣𞓗␣𞓝␣𞓡␣𞓩␣𞓜␣𞓢␣𞓦␣𞓙␣𞓠␣𞓖␣𞓛␣𞓞␣𞓧␣𞓨␣𞓘␣𞓥␣𞓔␣𞓮␣𞓣␣𞓪␣𞓒␣𞓓

The sound w

The sound w is written by adding the combining mark 𞓮 (U+1E4EE) to the vowel that follows. It is used this way for onset consonant clusters such as 𞓢𞓕𞓮 kʷa, 𞓢𞓚𞓮 kʷi, etc. The sources imply that it is also used for sounds such as 𞓕𞓮 wa, 𞓚𞓮 wi, etc., although Osada [o,101] says that w never occurs word-initially.

𞓨𞓕𞓐𞓬𞓕𞓮 𞓢𞓕𞓧𞓚𞓨𞓕𞓒𞓕

Observation: This doesn't appear to be common. Most sources show this diacritic centred below the vowel letter, rather than placed to the right as it appears in the Noto font used for this page.
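Since the w mark is stored after the vowel it attaches to, the stored order of characters differs from the spoken order. The sketch below shows one way a transliteration step might restore spoken order by moving a placeholder w in front of the vowel. It is only an illustration: the function name and the Latin placeholder are hypothetical, and the code tolerates the length and nasalisation marks appearing in any order alongside the w mark.

```python
import re

VOWELS = "\U0001E4D0\U0001E4D5\U0001E4DA\U0001E4DF\U0001E4E4"  # o, a, i, u, e
W_MARK = "\U0001E4EE"                 # combining mark that writes w
OTHER_MARKS = "\U0001E4EC\U0001E4ED"  # nasalisation and length marks

# A vowel letter followed by any run of the marks that can attach to it.
_VOWEL_CLUSTER = re.compile(f"([{VOWELS}])([{W_MARK}{OTHER_MARKS}]*)")


def spoken_order(text: str, w: str = "w") -> str:
    """Move each w mark in front of its vowel, written as the placeholder `w`."""
    def fix(match: re.Match) -> str:
        vowel, marks = match.group(1), match.group(2)
        if W_MARK not in marks:
            return match.group(0)  # no w on this vowel: keep it as written
        return w + vowel + marks.replace(W_MARK, "")
    return _VOWEL_CLUSTER.sub(fix, text)


kwa = "\U0001E4E2\U0001E4D5\U0001E4EE"  # k + a + w mark, i.e. kʷa
print(spoken_order(kwa) == "\U0001E4E2w\U0001E4D5")  # True
```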

Repertoire extension

𞓯

If a writer wants to closely transliterate Devanagari, Bengali, or Odia text, they may extend the Mundari Bani repertoire using 𞓯 (U+1E4EF). For example, 𞓛𞓯 can be used to represent Devanagari ष U+0937 DEVANAGARI LETTER SSA, Bengali ষ U+09B7 BENGALI LETTER SSA, or Oriya ଷ U+0B37 ORIYA LETTER SSA [wm,5].
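A lookup table is one simple way to model this kind of extension in software. The sketch below holds only the single pairing cited above; the table layout, the script labels, and the function name are hypothetical, and a real table would need entries drawn from the Unicode proposal.

```python
EXTENSION_MARK = "\U0001E4EF"  # repertoire-extension mark
LETTER_S = "\U0001E4DB"        # the base letter used in the example above

# Only the one pairing mentioned in the text is listed here.
EXTENDED = {
    LETTER_S + EXTENSION_MARK: {
        "Devanagari": "\u0937",  # DEVANAGARI LETTER SSA
        "Bengali": "\u09B7",     # BENGALI LETTER SSA
        "Oriya": "\u0B37",       # ORIYA LETTER SSA
    },
}


def extended_target(cluster: str, script: str):
    """Return the letter an extended cluster stands for, or None if it is not listed."""
    return EXTENDED.get(cluster, {}).get(script)


print(extended_target(LETTER_S + EXTENSION_MARK, "Bengali") == "\u09B7")  # True
```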

Onsets

Mundari syllables can begin with a consonant cluster, but they are just written using the relevant letters, except for CʷV, as described just above, where the medial glide is written using a combining mark on the vowel letter.

Finals

Syllable-final consonants are also generally just represented by ordinary letters, but again there is an exception.

Checked finals

𞓫

Final b or d in Mundari may be pronounced 'checked'. These checked sounds are written using 𞓫 before the consonant.

𞓒𞓕𞓩𞓕𞓫𞓗

Some authors don't use this [wm,5].

An alternative shape for this sign can be found in recent texts.
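The checked-final convention can be detected mechanically. The sketch below assumes the sign 𞓫 is U+1E4EB and that the letters for b and d are U+1E4D7 and U+1E4E1, as in the Unicode Nag Mundari block. The function names are hypothetical, and the second function simply mirrors the spelling of authors who leave the sign out.

```python
CHECK_SIGN = "\U0001E4EB"  # sign written before a checked final
LETTER_B = "\U0001E4D7"
LETTER_D = "\U0001E4E1"


def has_checked_final(word: str) -> bool:
    """True if the word ends in the check sign followed by b or d."""
    return (
        len(word) >= 2
        and word[-2] == CHECK_SIGN
        and word[-1] in (LETTER_B, LETTER_D)
    )


def unmark_checked_final(word: str) -> str:
    """Drop the check sign before a word-final b or d; return other words unchanged."""
    if has_checked_final(word):
        return word[:-2] + word[-1]
    return word


example = "\U0001E4D2\U0001E4D5\U0001E4E9\U0001E4D5\U0001E4EB\U0001E4D7"
print(has_checked_final(example))  # True
```

Only the word-final position is examined, so uses of the same sign to show gemination inside a word are left alone.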

Consonant clusters

Mundari has no conjunct forms for consonant clusters.

𞓧𞓟𞓨𞓜𞓕𞓣𞓚

Gemination

Consonant gemination isn't common in Mundari, but may occur in words from other, neighbouring languages when they are transcribed. 𞓫 can be used for this.

Consonant sounds to characters

This section maps Mundari consonant sounds to common graphemes in the Nag Mundari orthography.

consonant 𞓑

consonant 𞓗

consonant 𞓫𞓗 when checked in final position.

consonant 𞓝

t͡ʃ

consonant 𞓠

consonant 𞓡

consonant 𞓫𞓡 when checked in final position.

d͡ʒ

consonant 𞓖

consonant 𞓩

consonant 𞓜

consonant 𞓢

consonant 𞓦

consonant 𞓙

checked final mark 𞓫 Glottalisation mark, applied by some authors to represent 'checked' forms of word-finals.

consonant 𞓛

consonant 𞓞

consonant 𞓧

consonant 𞓨

consonant 𞓥

consonant 𞓘

consonant 𞓔

w~ʷ

labialisation mark 𞓮 placed below the letter representing the following vowel sound.

consonant 𞓣

consonant 𞓪

consonant 𞓒

consonant 𞓓

Glyph shaping & positioning

You can experiment with examples using the Mundari character app.

Context-based shaping & positioning

There are no conjuncts in Mundari, and no shaping is needed for letters.

Mundari does have a few combining marks, and these need to be combined with the base appropriately. Some marks may be far enough to the right to slightly overlap the following letter as well.

𞓩𞓭𞓖 𞓩𞓬𞓖 𞓩𞓯𞓖 — Three combining marks over Mundari letters in the Noto Sans Nag Mundari font. They slightly overlap the following letter also.

Observation: Phonetic transcriptions of words show many that have long, nasalised vowels. It is not clear whether the script shows 2 combining marks together in such a case – no such combinations were spotted in the online resources linked to below.
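One way to investigate the open question above is to scan sample text for vowels that carry both marks at once. The following sketch groups each base character with the combining marks that follow it, then filters for clusters containing both the length and the nasalisation mark. The function names are hypothetical, and the mark list is hard-coded from the four combining marks described in these notes rather than taken from a Unicode database.

```python
LENGTH_MARK = "\U0001E4ED"
NASAL_MARK = "\U0001E4EC"
# The four combining marks: nasalisation, length, w, and repertoire extension.
COMBINING_MARKS = {NASAL_MARK, LENGTH_MARK, "\U0001E4EE", "\U0001E4EF"}


def clusters(text: str) -> list:
    """Split text into units of one base character plus any marks that follow it."""
    units = []
    for ch in text:
        if ch in COMBINING_MARKS and units:
            units[-1] += ch   # attach the mark to the preceding base
        else:
            units.append(ch)  # start a new unit
    return units


def long_nasal_clusters(text: str) -> list:
    """Return the clusters that carry both the length and the nasalisation mark."""
    return [u for u in clusters(text) if LENGTH_MARK in u and NASAL_MARK in u]


sample = "\U0001E4E9\U0001E4ED\U0001E4D6"  # base + length mark, then a second letter
print(len(clusters(sample)))  # 2
```

Running `long_nasal_clusters` over a corpus would return an empty list if no stacked marks occur in it, which is what the observation above leads one to expect for the resources examined.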

Punctuation & inline features

Phrase & section boundaries

See type samples.

,␣;␣:␣.␣?␣!

Phrase and section boundaries in Mundari use ASCII punctuation.

phrase	, ; :
sentence	. ? !


Bracketed text

(␣)

Mundari commonly uses ASCII parentheses to insert parenthetical information into text.

	start	end
standard	(	)

Quotations & citations

See type samples.

“␣”␣‘␣’

Mundari texts may use quotation marks around quotations. Of course, due to keyboard design, quotations may also be surrounded by ASCII double and single quote marks.

	start	end
initial	“	”
nested	‘	’

Notes, footnotes, etc

See inlinenotes for purely inline annotations, such as ruby or warichu. This section is about annotation systems that separate the reference marks and the content of the notes.
