Balinese

orthography notes

Updated 27 April, 2026

This page brings together basic information about the Balinese script and its use for the Balinese language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Balinese using Unicode.

Referencing this document

Richard Ishida, Balinese Orthography Notes, 27-Apr-2026, https://r12a.github.io/scripts/bali/ban

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size: 28px

ᬫᬓᬲᬫᬶᬫᬦᬸᬲᬦᬾᬓᬳᭂᬫ᭄ᬩᬲᬶᬦ᭄ᬫᬳᬃᬤᬶᬓᬮᬦ᭄ᬧᬢᭂᬄᬲᬚ᭄ᬭᭀᬦᬶᬂᬓᬳᬦᬦ᭄ᬮᬦ᭄ᬓ᭄ᬯᬲ᭟ ᬳᬶᬧᬸᬦ᭄ᬓᬵᬦᬸᬕ᭄ᬭᬳᬶᬦ᭄ᬯᬶᬯᬾᬓᬮᬦ᭄ᬩᬸᬤ᭄ᬥᬶ᭞ ᬧᬦ᭄ᬢᬭᬦᬶᬂᬫᬦᬸᬲᬫᬗ᭄ᬤᬦᬾᬧᬭᬲ᭄ᬧᬭᭀᬲ᭄ᬫᬲᭂᬫᭂᬢᭀᬦᬦ᭄᭟

Source: UDHR, article 1 in Omniglot

Usage & history

Origins of the Balinese script, 11thC – today.

Phoenician

└ Aramaic

└ Brahmi

└ Pallava

└ Old Kawi

└ Balinese

+ Batak

+ Baybayin

+ Javanese

+ Lontara

+ Makasar

+ Old Sundanese

+ Recong

+ Rejang

The Balinese script ( ᬅᬓ᭄ᬱᬭᬩᬮᬶ ạk͓ṡ̂rbli aksara bali Balinese script ) is used for writing the Balinese language spoken on the Indonesian islands of Java and Bali. It may also be used for Old Javanese, and liturgical Sanskrit. With some additions, it is also used to write Sasak in the neighbouring island of Lombok.

Everyday use of the script has largely been eclipsed by the Latin alphabet, but Balinese has a significant presence in traditional ceremonies and texts of the Hindu religion. It is also used for signage on roads, at the entrances to villages, and on government buildings. Traditional literature is published on a small scale, but little modern literature. Sekaha Pesantian community groups gather to read the Balinese script in a social context, commonly in song form.

Balinese script is derived from Old Kawi, and ultimately from Brahmi. Historically, Balinese was written on palm leaves or inscribed in stone. Its similarity to the Javanese script in form and behaviour leads some to propose that they are typological variants of each other.

Unicode 17 has 1 dedicated block, comprising 127 characters, around 30 of which are musical symbols.

More information: Scriptsource • Wikipedia

See the comparison table

Basic features

The Balinese script is an abugida, ie. each consonant contains an inherent vowel sound.

❯ basicV

Vowels The inherent vowel is generally pronounced a, but ə when word-final or in some affixes.

Post-consonant vowels are written using 11 combining marks (vowel signs).

There are 2 pre-base glyphs and 6 circumgraphs. In principle, Balinese has no composite vowel signs, however the 6 circumgraphs can also be decomposed into 2 parts. Those can involve up to 2 glyphs, and glyphs can surround the base consonant(s) on up to 3 sides.

Standalone vowel sounds are written using independent vowels at the beginning of a word. Inside a word they are written using ᬳ with an attached vowel sign.

Balinese has vocalics, and their use is required for certain consonant-vowel combinations.

❯ consonantSummary

Consonants 18 consonant letters are used for pure Balinese words, supplemented by 15 more used for Sanskrit and Kawi loanwords. Some of these letters are used as honorifics, a little like capital letters in English proper nouns.

Vowel absence Vowel absence is indicated using conjunct forms, a visible ᭄, or dedicated syllable-final characters. There are no dedicated medial consonant characters.

᭄ is used to kill the inherent vowel and to form conjuncts. Usually the adeg adeg is invisible, but it is rendered visibly when no other consonant follows, or occasionally in special circumstances, when it can be forced to appear using an invisible formatting character.

Conjunct forms occur as stacked consonants or conjoined pairs. The shape of many subjoined consonant glyphs differs from the normal shape.

Medial consonants are written using 1B44 followed by one of 4 ordinary consonants or a special vocalic combining mark.

Syllable codas are most commonly written using an ordinary consonant followed by 1B44. If another consonant follows, the consonant shapes are combined into a conjunct form — even if the consonants represent the end of one word and the beginning of another ! Alternatively, three codas may be represented by one of 3 diacritics, two of which only occur word-finally.

Numbers Balinese has a set of native digits.

Layout Balinese text runs left to right in horizontal lines. Words are not separated by spaces, however syllables may be separated by ZWSP, as long as they don't fall inside a stack. There are no case distinctions.

It uses native punctuation marks.

Stacked consonants and conjoined pairs span word boundaries. This means that text must be wrapped at orthographic syllable boundaries, and not at word boundaries. Hyphenation occurs, using 1B60 at the line end to indicate the break.

Notable features

circumgraphs exist (unlike Javanese)
conjuncts form across word boundaries
independent vowels used word-initially only (unlike Javanese)
vocalics have both letters and vowel signs (unlike Javanese)
consonants classed as basic or aksara sualalita
no dedicated medial consonants (unlike Javanese)

Character index

Letters

Show

Basic consonants

ᬧ,ᬩ,ᬢ,ᬤ,ᬘ,ᬚ,ᬓ,ᬕ,ᬲ,ᬳ,ᬫ,ᬦ,ᬗ,ᬜ,ᬯ,ᬭ,ᬮ,ᬬ

Honorifics

ᬨ,ᬝ,ᬣ,ᬥ,ᬖ,ᬰ,ᬱ,ᬡ,ᬪ

Extended consonants

ᬞ,ᬟ,ᬠ,ᬔ,ᬛ,ᬙ

Vowels

ᬇ,ᬈ,ᬉ,ᬊ,ᬏ,ᬑ,ᬅ,ᬆ,ᬐ,ᬒ

Vocalics

ᬋ,ᬌ,ᬍ,ᬎ

Not used for contemporary Balinese

ᭅ,ᭆ,ᭇ,ᭈ,ᭉ,ᭊ,ᭋ

Combining marks

Show

Vowel signs

ᬾ,ᬿ,ᭀ,ᭃ,ᭁ,ᬶ,ᬷ,ᬸ,ᬹ,ᭂ,ᬵ

Vocalics

ᬺ,ᬻ,ᬼ,ᬽ

Bindu

ᬀ,ᬁ,ᬂ

Finals

ᬃ

Nukta

᬴

Virama

᭄

Visarga

ᬄ

Numbers

Show

᭐,᭑,᭒,᭓,᭔,᭕,᭖,᭗,᭘,᭙

Punctuation

Show

᭚,᭛,᭜,᭝,᭞,᭟,᭠,᭽,᭾,᭿,᭎,᭏

Other

Show

‌,

To be investigated

!,%,(,),-,;,?,[,],«,»,ʼ,͏,‍,‑,‒,–,—,‘,’,“,”,…,‰,‹,›

Items to show in lists

Codepoint

IPA

LOC

Transliteration

Phonology

The following represents the repertoire of the Balinese language.

Click on the sounds to see where else in the document they are referred to.

Phones in a lighter colour are non-native or allophones .

Vowel sounds

Plain vowels

Diphthongs

The sources are not very clear about Balinese vowel length. Wiktionary IPA transcriptions make no distinction in pronunciation between the long and short vowel graphemes, and this is backed up in some sources. One study describes Balinese speakers reduce long vowels to short when speaking English. ClynesClynes: Topics in the Phonology and Morphosyntax of Balinese§https://openresearch-repository.anu.edu.au/bitstream/1885/10744/5/Clynes%20Thesis%201996.pdf argues that some apparently long vowels are parts of separate syllables and split into different sounds under morphological changes.

On the other hand, sources including Ida Bagus Adi Sudewasb and Wikipediawl indicate that there is a difference in vowel length.

Consonant sounds

	labial	dental	alveolar	post- alveolar	palatal	velar	pharyngeal	glottal
stop	p b	t d				k ɡ
affricate				t͡ʃ d͡ʒ
fricative	f v		s z			x ɣ	ħ ʕ	h
nasal	m		n		ɲ	ŋ
approximant	w		l		j
trill/flap			r

Tone

Balinese is not a tonal language.

Vowels

	Post-consonant	Standalone
Plain	ᬶ,ᬷ, ,ᬸ,ᬹ	ᬇ,ᬳᬶ,ᬈ,ᬳᬷ, ,ᬉ,ᬳᬸ,ᬊ,ᬳᬹ
	ᬾ, ,ᭀ	ᬏ,ᬳᬾ, ,ᬑ,ᬳᭀ
	ᭂ,ᭃ	ᬳᭂ,ᬳᭃ
	ᬾ, ,ᭀ	ᬏ,ᬳᬾ, ,ᬑ,ᬳᭀ
	ⓘ,ᬵ	ᬅ,ᬆ,ᬳᬵ
Diphthongs	ᬿ,ᭁ	ᬐ,ᬳᬿ,ᬒ,ᬳᭁ
Vocalics	ᬺ,ᬻ,ᬼ,ᬽ	ᬋ,ᬌ,ᬍ,ᬎ

ⓘ represents the inherent vowel. Multipart forms are not shown here because all vowels and diphthongs are normally represented using one of the atomic characters listed here.

Inherent vowel

ᬓ ka

The inherent vowel for Balinese is pronounced a, so ka is written by simply using the consonant letter. However, it is pronounced ə at the end of a word and also in prefixes ma-, pa- and da-.

eg.

ᬫᬦᬯ

ᬫ,ᬦ,ᬯ

Since Balinese consonants normally include an inherent vowel, the orthography has ways to indicate a consonant that is not followed by a vowel sound. See novowel.

Vowels after consonants

Post-consonant vowels are written using 11 combining marks (vowel signs). There are 2 pre-base glyphs and 6 circumgraphs.

In principle, Balinese has no multipart vowels, however the 6 circumgraphs can also be decomposed into 2 parts. Those can involve up to 2 glyphs, and glyphs can surround the base consonant(s) on up to 3 sides.

Six of the vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

All vowel signs are stored after the base consonant, and the rendering process puts them in the correct place for display. Conjuncts are treated as indivisible units when it comes to rendering vowel signs, meaning that pre-base vowel signs and left-side glyphs of circumgraphs are rendered before the conjunct as a whole (see prebase and circumgraphs).

Simple vowels

ᬓᬶ ki

Balinese uses the following dedicated combining marks for vowels. They are all vowel signs.

ᬶ,ᬷ,ᬸ,ᬹ,ᬾ,ᭀ,ᭂ,ᭃ,ᬵ

To represent the sounds rə or lə, Balinese uses vocalic letters. A sequence such as *ᬭᭂ U+1B2D LETTER RA + U+1B42 VOWEL SIGN PEPET is not used. See vocalics.

Diphthongs

Balinese uses the following additional vowel signs for diphthongs.

ᬿ,ᭁ

One is a pre-base vowel sign, and the other is a circumgraph.

eg.

ᬤᬿᬢ᭄ᬬ

Vowel length

The sources are not very clear about whether Balinese vowels vary in length during pronunciation (see phonemesV). The Balinese vowel sign repertoire does, however, contain glyphs that distinguish between short and long vowels (see basicV).

Nasalisation

If Balinese nasalises any vowel sounds, it is not explicitly marked in the orthography.

Standalone vowels

Balinese has 2 ways to represent standalone vowels: using independent vowels, or using vowel signs.

Independent vowels

ᬇ,ᬈ,ᬉ,ᬊ,ᬏ,ᬑ,ᬅ,ᬆ, ,ᬐ,ᬒ

At the beginning of a word, most standalone vowels are represented using one of the 10 independent vowel characters. The set includes a character to represent the inherent vowel sound.

eg.

ᬉᬱᬥ

ᬆᬤᬶ

The vowel signs for ə (1B42) and əː (1B43) don't have an independent form, and have to be used after ᬳ at the beginning of a word, ie. 1B33 1B42 and 1B33 1B43, respectively.

eg.

ᬳᭂᬫ᭄ᬧᬢ᭄

In Sasak, independent vowel ᬅ can be followed by an explicit 1B44 in word- or syllable-final position, where it indicates the glottal stop. Other consonants can also be subjoined to it.

eg.

ᬳᬫᬅ᭄ hmạ͓ amaʔ

Vowel signs

ᬳᬶ,ᬳᬷ,ᬳᬸ,ᬳᬹ,ᬳᬾ,ᬳᭀ,ᬳᭂ,ᬳᭃ,ᬳᬵ, ,ᬳᬿ,ᬳᭁ

Typically, a standalone vowel is represented by a vowel sign attached to ᬳ, which acts as a carrier.

eg.

ᬳᬶᬩᬶ

ᬤᬳᬾᬭᬄ

Without a vowel sign the letter ᬳ may represent a.

eg.

ᬳᬮᬲ᭄

However, it may be unclear from the written text whether ᬳ represents the sound h or is used as a carrier for a vowel, eg. compare

ᬩᬳᬸᬂ

ᬩᬳᬸ

Vowel components

Pre-base vowel signs

ᬓᬾ ke

ᬾ,ᬿ

Two vowel signs appear to the left of the base consonant letter or cluster.

eg.

ᬘᬾᬮᬾᬂ

These are combining marks that are always stored after the base consonant or conjunct, ie. the code points follow the order in which the items are pronounced. The rendering process places the glyph before the base consonant without changing the order of the code points. The following shows the sequence of code points that make up the word just above.

ᬘ,ᬾ,ᬮ,ᬾ,ᬂ

Conjuncts are treated as indivisible units when it comes to rendering vowel signs, meaning that pre-base vowel signs are rendered before the conjunct, even though pronounced after the consonants.

eg.

ᬩᭂᬦ᭄ᬤᬾᬰ

ᬦ᭄ᬤᬾ,ᬦ,᭄,ᬤ,ᬾ

show composition

ᬩᭂᬦ᭄ᬤᬾᬰ

Circumgraphs

ᬓᭀ ko

This section includes some vowel signs described in the section vocalics.

ᭀ,ᭃ,ᭁ,ᬻ,ᬼ,ᬽ

Five vowel or vocalic sounds are represented by a vowel sign that is a single code point in memory, but when displayed it has visually separate parts that appear on different sides of the preceding consonant or cluster.

eg.

ᬢᭀᬕᭀᬕ᭄

ᬢ,ᭀ,ᬕ,ᭀ,ᬕ,᭄

Like pre-base glyphs, these are combining marks that are always stored after the base consonant or conjunct. The rendering process places the glyphs around the base consonant(s), as needed.

show composition

ᬤᭀᬦ᭄

Glyphs can appear on up to 3 sides of the base. Some of the glyphs merge with the base character's glyph (see context).

These circumgraphs have canonically equivalent decomposed forms (see vs_encoding).

Composite vowel signs

Composite vowel signs are only produced when text is decomposed; 5 of the circumgraphs split off the 1B35 glyph, to create the following pairs:

ᭀ,ᭃ,ᭁ,ᬻ,ᬽ

Vowel sign placement

Show details about vowel glyph positioning.

The following list shows where vowel signs, including vocalics, are positioned around a base consonant to produce vowels, and how many instances of that pattern there are.

2 pre-base, eg. ᬓᬾ ᬓᬿ
1 post-base, eg. ᬓᬵ kɑ̄
3 above-base, eg. ᬓᬶ ᬓᬷ ᬓᭂ
3 below-base, eg. ᬓᬸ ᬓᬹ ᬓᬺ
2 pre+post-base, eg. ᬓᭀ ᬓᭁ
1 below+post-base, eg. ᬓᬻ
1 below+above-base, eg. ᬓᬼ
1 below+above+post-base, eg. ᬓᬽ
1 above+post-base, eg. ᬓᭃ kə̄

At maximum, vowel components can occur concurrently on 3 sides of the base.

Vowel sounds to characters

This section maps Balinese vowel sounds to common graphemes in the Balinese orthography. Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.

Vowel signs are post-consonant, dependent vowels. Independent vowels are usually only used in word-initial position. Word-internal standalone vowels (and word-initial in the case of ə and əː) use the vowel sign over a silent 1B33. Vowel signs that decompose are shown only in precomposed form.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

Plain vowels

vowel sign ᬶ

independent ᬇ

medial standalone ᬳᬶ

iː

vowel sign ᬷ

independent ᬈ

medial standalone ᬳᬷ

vowel sign ᬸ

independent ᬉ

medial standalone ᬳᬸ

uː

vowel sign ᬹ

independent ᬊ

medial standalone ᬳᬹ

vowel sign ᬾ

independent ᬏ

medial standalone ᬳᬾ

vowel sign ᭀ

independent ᬑ

medial standalone ᬳᭀ

inherent vowel at the end of a word and also in prefixes ma-, pa- and da-.

vowel sign ᭂ

medial standalone ᬳᭂ

əː

vowel sign ᭃ

medial standalone ᬳᭃ

vowel sign ᬾ

independent ᬏ

medial standalone ᬳᬾ

vowel sign ᭀ

independent ᬑ

medial standalone ᬳᭀ

inherent vowel eg. ᬅᬯᬢᬵᬭ awatarə avatar

independent ᬅ

ɑː

vowel sign ᬵ

independent ᬆ

medial standalone ᬳᬵ

Diphthongs and other combinations

aːi

vowel sign ᬿ

independent ᬐ

medial standalone ᬳᬿ

aːu

vowel sign ᭁ

independent ᬒ

medial standalone ᬳᭁ

Vocalics

ᬋ,ᬌ,ᬍ,ᬎ,ᬺ,ᬻ,ᬼ,ᬽ

At the beginning of a syllable following a vowel the standalone form of the vocalic is used.

eg.

ᬓᭂᬋᬂ

ᬢᬍᬃ

As a second component in a consonant cluster, the vocalic has a postfixed form and a subjoined form. The examples that follow are for the sound rə.

When the sound occurs directly after a syllable-final consonant, ie. as the onset of a new syllable, the sequence of Unicode characters is C᭄ᬋ. This produces the conjoined (postfix) form ᭄ᬋ.

eg.

ᬧᬓ᭄ᬋᬋᬄ

When the sound occurs after a syllable-initial consonant, ie. when it occurs as a medial consonant within the same syllable, the sequence of characters is simply Cᬺ, using the vowel sign. This produces the subjoined form 1B3A.

eg.

ᬓᬺᬰ᭄ᬡ

Vowel absence

Vowel absence principally occurs either when a consonant is a syllable coda, or when a consonant is part of a consonant cluster.

Given that consonants normally include an inherent vowel, the orthography needs a way to indicate when a consonant is not followed by a vowel.

Follow these links for more information.

Conjuncts. There are a couple of possibilities here:
1. Stacked consonants, where the non-initial (subjoined) consonant appears below the initial, often with a different shape from normal.
2. Conjoined consonants, where consonants sit side-by-side but the non-initial consonant has a slightly different form than usual.
A visible adeg adeg following the initial consonant.
Coda marks. These are dedicated combining marks used to write syllable codas.

Conjuncts

Conjunct formation

Stacked and conjoined consonant clusters are referred to as conjuncts.

In Unicode, the stacking and conjoining behaviour is achieved by adding 1B44 between the consonants. The font hides the glyph automatically when a conjunct is formed.

ᬲ,᭄,ᬢ,ᬲ᭄ᬢ

See a table of 2-consonant clusters.
The table allows you to test results for various fonts.

Word boundaries. Conjuncts span word boundaries. Because there are no spaces between words, a cluster is created when a consonant with no following vowel at the end of a word is followed by a consonant at the beginning of the next word.

ᬓᬳᬦᬦ᭄ᬮᬦ᭄ᬓ᭄ᬯᬲ — In the sequence of words kahanan lan kwasa the initial consonant of each word is subjoined below the final consonant of the preceding word.

Stacks and conjoined sequences are not normally split at line ends (see word and linebreak for the ramifications of this).

Stacking

To represent consonant sounds without intervening vowels, the non-initial consonant letter is typically drawn below the initial consonant letter, and with a slightly different shape. These subjoined forms are called gantungan (ᬕᬦ᭄ᬢᬸᬗᬦ᭄).

Many of the subjoined forms are just slightly smaller versions of the original, but several have very different shapes altogether, most of which ligate with the cluster initial consonant by joining strokes.

ᬦ,᭄,ᬤ,ᬦ᭄ᬤ

ᬭ,᭄,ᬬ,ᬭ᭄ᬬ

ᬜ,᭄,ᬚ,ᬜ᭄ᬚ

Examples of stacked conjuncts.

There can be up to 3 consonants combined in this way, but the third consonant must be one of ya, ra, la or wa, eg.

ᬲ,᭄,ᬢ,᭄,ᬭ,ᬲ᭄ᬢ᭄ᬭ

ᬫ,᭄,ᬩ,᭄,ᬮ,ᬫ᭄ᬩ᭄ᬮ

Stacked conjuncts with 3 components.

Show more stacked conjuncts.

The lists below show consonants in their normal and subjoined forms

Native letters

ᬩ᭄ᬩ,ᬢ᭄ᬢ,ᬤ᭄ᬤ,ᬘ᭄ᬘ,ᬚ᭄ᬚ,ᬓ᭄ᬓ,ᬕ᭄ᬕ,ᬳ᭄ᬳ,ᬫ᭄ᬫ,ᬦ᭄ᬦ,ᬜ᭄ᬜ,ᬗ᭄ᬗ,ᬯ᭄ᬯ,ᬭ᭄ᬭ,ᬮ᭄ᬮ,ᬬ᭄ᬬ

Sanskrit letters

ᬞ᭄ᬞ,ᬟ᭄ᬟ,ᬠ᭄ᬠ,ᬔ᭄ᬔ,ᬙ᭄ᬙ,ᬛ᭄ᬛ

Kawi letters

ᬝ᭄ᬝ,ᬣ᭄ᬣ,ᬥ᭄ᬥ,ᬖ᭄ᬖ,ᬰ᭄ᬰ,ᬡ᭄ᬡ,ᬪ᭄ᬪ

Conjoined consonants

In conjoined clusters, the consonant glyphs remain side by side, but the non-initial consonant is reduced on the left side. These conjoined forms are called gempelan (ᬕᬾᬫ᭄ᬧᬾᬮᬦ᭄).

ᬫ,᭄,ᬧ,ᬫ᭄ᬧ

ᬓ,᭄,ᬱ,ᬓ᭄ᬱ

ᬓ,᭄,ᬲ,ᬓ᭄ᬲ

Examples of stacked conjuncts.

The conjoined ᬲ is unusual in that it also adds a stroke below the initial consonant (see fig_conjoined_sa). This helps distinguish it from the conjoined p.

show composition

ᬅᬓ᭄ᬱᬭ

show composition

ᬧᬓ᭄ᬲ

Show more conjoined conjuncts.

This list shows consonants in their normal and conjoined forms

native letters

ᬧ᭄ᬧ,ᬲ᭄ᬲ,ᬋ᭄ᬋ

Kawi letters

ᬨ᭄ᬨ,ᬱ᭄ᬱ

Visible adeg adeg

Balinese uses ᭄ (the Balinese equivalent of the Sanskrit virama) to kill the inherent vowel after a consonant.

The adeg adeg is always used and visible at the end of a word that ends in a consonant and isn't followed by another consonant.

eg.

ᬘᬭᬶᬓ᭄

In consonant clusters, it is used to produce a conjunct (see the word 'membership' just below). When used this way the adeg adeg becomes invisible (see clusters).

Sometimes it is used to clarify the distinction between a word-final consonant and a medial consonant by preventing the stacking of the final consonant in the previous word and the first consonant in the next. To create this effect, add 200C or 200B immediately after the adeg adeg, eg. compare:

ᬧᬓ᭄ᬭᬫᬦ᭄

ᬧ,ᬓ,᭄,ᬭ,ᬫ,ᬦ,᭄

ᬧᬓ᭄‌ᬭᬫᬦ᭄

ᬧ,ᬓ,᭄,‌,ᬭ,ᬫ,ᬦ,᭄

Because there is no word separator, consonants at the end of one word and beginning of the following word are normally stacked, too.

In some cases this leads to ambiguity about whether this is one or two words. If you really want to make clear which is which, you can use an explicit adeg-adeg, eg. compare ᬧᬓ᭄ᬭᬫᬦ᭄ ᬧᬓ᭄‌ᬭᬫᬦ᭄

The Unicode Standard recommends the use of 200C (ZWNJ) after the adeg-adeg in order to prevent conjunct formation. However, not many people understand the function of ZWNJ or can access it easily from the keypad. It also doesn't introduce line-break opportunities. A better solution may be to use 200B (ZWSP). This character is needed anyway on most systems in order to allow line-breaking, and it appears to work equally well for this.

A somewhat ambiguous situation arises where conventions prevent certain combinations stacking. For example, the name of the village tamblung should not stack the mbl. Compare the default conjunct (top) with the desired village name (bottom):

ᬢᬫ᭄ᬩ᭄ᬮᬂ

ᬢᬫ᭄‌ᬩ᭄ᬮᬂ

The Unicode Standard advises to use a zero-width non-joiner after ma, to achieve this.

Observation: Note that this may also be achieved by intelligence in the font, as was actually the case when I generated this example (click on it to see). It's not clear to me what is the preferred approach: put ZWNJ in only when the font doesn't do what you want, or use it always. The latter may lead to more consistent content where different fonts are applied to the text (eg. after cut and paste). In theory, this shouldn't affect searching and sorting, although some applications may not ignore the ZWNJ as they should.

Consonants

	Native Balinese sounds	Kawi, Sanskrit, etc. loan words	With rerekan
Onsets & codas	ᬧ,ᬩ,ᬢ,ᬘ,ᬤ,ᬚ,ᬓ,ᬕ	ᬨ,ᬪ,ᬞ,ᬝ,ᬣ,᭄ᬙ,ᬟ,ᬠ,ᬥ,ᬛ,ᬔ,ᬖ	ᬤ᬴,ᬗ᬴
	ᬲ,ᬳ	ᬰ,ᬱ	ᬧ᬴,ᬯ᬴,ᬚ᬴,ᬓ᬴,ᬕ᬴,ᬳ᬴
	ᬫ,ᬦ,ᬜ,ᬗ	ᬡ
	ᬯ,ᬭ,ᬮ,ᬬ
Medials	᭄ᬯ,᭄ᬭ,ᬺ,᭄ᬮ,᭄ᬬ
Finals	ᬂ,ᬃ,ᬄ

Consonants used for native Balinese words are shown in the left-hand column. On the right are consonants used for words from Kawi, Sanskrit, and other languages.

Basic consonants

Balinese uses 18 basic consonants known as aksara wreṣāstra (ᬅᬓ᭄ᬱᬭᬯᬺᬱᬵᬲ᭄ᬢ᭄ᬭ).

Click on each letter for more details and for examples of usage, especially where more than one sound is indicated.

ᬧ,ᬩ,ᬢ,ᬤ,ᬓ,ᬕ,ᬘ,ᬚ,ᬲ,ᬳ,ᬫ,ᬦ,ᬗ,ᬜ,ᬯ,ᬭ,ᬮ,ᬬ

The characters listed here (and in the following sections) also have subjoined/conjoined shapes, which may differ significantly from those shown here. See clusters for a list of glyph shapes.

ᬳ at the beginning of a word or after a preceding vowel is mostly used as a support for a vowel sign (see standalone), and is not pronounced or transcribed. Word finally with a suffix vowel, however, it is transcribed.loc

Additional/honorific consonants

These are called aksara sualalita (ᬅᬓ᭄ᬱᬭᬰ᭄ᬯᬮᬮᬶᬢ).

Many of the additional consonants are commonly used in words originating from Arabic and Dutch, and are most common in north Bali and Lombok. When used in pure Balinese words, they are similar to capital letters and are used to create an honorific effect. There are similar characters in Javanese.

They don't add any consonant sounds to the Balinese repertoire. In words originating from Sanskrit, Old Javanese, or Old Balinese, they represent aspirated or other consonants.loc

Additional consonants used for Sanskrit words.

ᬞ,ᬟ,ᬠ,ᬔ,᭄ᬙ,ᬛ

Additional consonants used for words from Kawi.

ᬨ,ᬪ,ᬝ,ᬣ,ᬥ,ᬖ,ᬰ,ᬱ,ᬡ

The following are particularly noteworthy points about certain characters listed above. More details for each character can be revealed by clicking on the lists above. See also the sound to character mapping table.

Two consonants, ᬔ and ᬙ, are considered very rare, and one other, ᬛ, seems to be known from only one word:

ᬦᬶᬃᬛᬭ

(It is possible that an original ai may have been lost in Balinese, to be replaced by the glyph for jʰa.)

A number of the Sanskrit or Kawi consonants are rather poorly attested. The letter ᬙ is only found in non-initial position following ᬘ.

ie.

ᬘ᭄ᬙ c͓C

Most of the series that originally represented retroflex sounds is often omitted in books about the script.

Rerekan

The combining mark ᬴ is used, as is a similar sign in Javanese, to extend the character repertoire for foreign sounds. However, according to Perdanaabp§13 the use of this sign is specific to Lombok texts, and even there its use is sporadic and inconsistent. While the sign can theoretically be used in Balinese settings, common Balinese users would not be familiar with the sign and normally render foreign consonants using the nearest sounding native sound without any additional markings.

See Perdana p13 for many more details.

The first 7 of the 8 combinations listed below are attested in Library of Congress transliterations and in earlier Sasak orthography. The 8th, ᬤ᬴ could be used for one-to-one transliteration for Javanese ɖ.

ᬧ᬴,ᬯ᬴,ᬚ᬴,ᬓ᬴,ᬕ᬴,ᬳ᬴,ᬗ᬴,ᬤ᬴

eg.

ᬩᬮᬶᬕ᭄ᬭᬧ᬴ᬶ

In rendering, the dots of these letters appear above the top character, which can cause some ambiguity in reading. The following are all visually indistinguishable: ᬓ᬴᭄ᬚ kˑ͓ʤ xja ᬓ᭄ᬚ᬴ k͓ʤˑ kza ᬓ᬴᭄ᬚ᬴ kˑ͓ʤˑ xza

In practice these combinations are probably rather rare.

Sasak

In recent times, Sasak users abandoned the use of the Javanese-influenced rerekan in favour of a series of modified letters (see above), making use, in addition, of some of unused Kawi letters for the Arabic sounds. In place of ᬓ᬴ x and ᬕ᬴ ɣ, for instance, the new fusion of KA and HA,ᭆ and the Kawi letter ᬖ are used.

See Perdana p15 for many more details.

(Does the fact that these relate to aspirated or retroflex forms originally affect the pronunciation?)

ᭅ,ᭆ,ᭇ,ᭈ,ᭉ,ᭊ,ᭋ

Onsets

᭄ᬯ,᭄ᬭ,ᬺ,᭄ᬮ,᭄ᬬ

The medial consonants ya, ra, la and wa regularly appear immediately after the initial consonant in a syllable. Unlike Javanese, Balinese has no special characters for these medial sounds (other than the vocalics mentioned earlier); they are just written using the normal approach for dealing with consonant clusters. These shapes are called pangangge aksara (ᬧᬗ᭢ᬗ᭄ᬕᬅᬓ᭄ᬱᬭ).

eg.

ᬓ᭄ᬭᬫ

ᬓ᭄ᬭ,ᬓ,᭄,ᬭ

Multiple medials can occur: r or l can be followed by w or y.

eg.

ᬩ᭄ᬭ᭄ᬬᬕ᭄

ᬩ᭄ᬭ᭄ᬬ,ᬩ,᭄,ᬭ,᭄,ᬬ

In addition, the vocalics can produce consonant sounds (tied to a specific vowel) in medial position.

eg.

ᬓᬺᬰ᭄ᬡ

ᬓᬺ,ᬓ,ᬺ

See clusters for more details on shaping of glyphs.

Codas

Normally, syllable and word-final consonant sounds with no following consonant are represented using an ordinary consonant character followed by ᭄.

eg.

ᬢᬧᭂᬮ᭄

ᬳᬮᬲ᭄

If the consonant is followed by another consonant, either in the middle or at the end of a word, the adeg adeg code point remains, but becomes invisible as the consonant shapes combine vertically or horizontally (see clusters).

eg.

ᬢᬦ᭄ᬤᬸᬓ᭄

Combining marks

However, there is also a set of combining marks for syllable-final consonants that don't need to be followed by the adeg adeg.

ᬄ,ᬂ,ᬃ

1B02 and 1B04 normally only appear at the end of a word.

eg.

ᬩᬶᬤᬂ

ᬩᬭᬄ

unless the word involves repetition.

eg.

ᬘᬾᬂᬘᬾᬂ

1B03 can appear at the end of any syllable.

ᬓᬃᬡ

A syllable-final diacritic may appear above a stack. It is typed and stored after the other components in the stack.

eg.

ᬩᬗ᭄ᬓᬸᬂ

ᬗ᭄ᬓᬸᬂ,ᬗ,᭄,ᬓ,ᬸ,ᬂ

When the syllable has a spacing vowel sign, any above-base final-consonant mark appears over the base character, rather than over the vowel sign. This is positioned by the font; the final consonant mark is still typed and stored after the other syllable components.

eg.

ᬕᭂᬤᭀᬂ

When one of these diacritics appears over a consonant that already has a vowel sign above it, the two combining marks appear side by side.

eg.

ᬳᬸᬓᬶᬃ

ᬧᭂᬢᭂᬂ

Consonant sounds to characters

This section maps Balinese consonant sounds to common graphemes in the Balinese orthography orthography.

The table distinguishes between native Balinese letters and letters borrowed from Sanskrit or Kawi, or extended with rerekan. The right-hand edge shows how conjuncts look by doubling up the letter with an adeg adeg between.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

ᬧ᭄ᬧ basic ᬧ

ᬨ᭄ᬨ kawi ᬨ in Kawi loan words.

ᬩ᭄ᬩ basic ᬩ

ᬪ᭄ᬪ kawi ᬪ in Kawi loan words.

ᬢ᭄ᬢ basic ᬢ

ᬞ᭄ᬞ honorific ᬞ

ᬝ᭄ᬝ kawi ᬝ in Kawi loan words.

ᬣ᭄ᬣ kawi ᬣ in Kawi loan words.

t͡ʃ

ᬘ᭄ᬘ basic ᬘ

ᬙ᭄ᬙ honorific ᭄ᬙ Very rare. Only found in subjoined form.

ᬤ᭄ᬤ basic ᬤ

ᬥ᭄ᬥ kawi ᬥ in Kawi loan words.

ᬟ᭄ᬟ honorific ᬟ

ᬠ᭄ᬠ honorific ᬠ

d͡ʒ

ᬚ᭄ᬚ basic ᬚ

ᬛ᭄ᬛ honorific ᬛ Used in one word only.

extension ᬤ᬴ Used in Lombok texts, but even then only sporadically.

ᬓ᭄ᬓ basic ᬓ

ᬔ᭄ᬔ honorific ᬔ Very rare.

ᬕ᭄ᬕ basic ᬕ

ᬖ᭄ᬖ kawi ᬖ in Kawi loan words.

extension ᬗ᬴

extension ᬧ᬴ Used in Lombok texts, but even then only sporadically.

extension ᬯ᬴ Used in Lombok texts, but even then only sporadically.

ᬲ᭄ᬲ basic ᬲ

ᬰ᭄ᬰ kawi ᬰ in Kawi loan words.

ᬱ᭄ᬱ kawi ᬱ in Kawi loan words.

extension ᬚ᬴ Used in Lombok texts, but even then only sporadically.

extension ᬓ᬴ Used in Lombok texts, but even then only sporadically.

extension ᬕ᬴ Used in Lombok texts, but even then only sporadically.

extension ᬳ᬴ Used in Lombok texts, but even then only sporadically.

ᬳ᭄ᬳ basic ᬳ

coda —ᬄ

ᬫ᭄ᬫ basic ᬫ

coda —ᬀ Holy letter, only used in Sanskrit texts.

ᬦ᭄ᬦ basic ᬦ

ᬡ᭄ᬡ kawi ᬡ in Kawi loan words.

ᬜ᭄ᬜ basic ᬜ

ᬗ᭄ᬗ basic ᬗ

coda —ᬂ

coda —ᬁ Holy letter, only used in Sanskrit texts.

ᬯ᭄ᬯ basic ᬯ

᭄ᬯ medial ᭄ᬯ Medial consonant.

ᬭ᭄ᬭ basic ᬭ

᭄ᬭ medial ᭄ᬭ Medial consonant.

coda —ᬃ

rə

ᬋ᭄ᬋ vocalic ᬋ

medial ᬺ

rəː

vocalic ᬌ

medial ᬌ

medial ᬻ

ᬮ᭄ᬮ basic ᬮ

᭄ᬮ medial ᭄ᬮ Medial consonant.

lə

vocalic ᬍ

vocalic ᬼ

ləː

vocalic ᬎ

vocalic ᬽ

ᬬ᭄ᬬ basic ᬬ

᭄ᬬ medial ᭄ᬬ Medial consonant.

Symbols

Modre symbols

Two combining marks have a specialist usage related to (usually religious) Sanskrit words.

ᬀ,ᬁ

ᬀ when combined with certain syllables becomes part of the Aksara Modre, or holy letters, which are used to write words in Sanskrit, usually part of prayers. This character only appears in Sanskrit texts.

eg.

ᬰᬶᬤ᭄ᬥᬀsiddham

ᬁ appears only in holy letters.

eg.

ᬫᬁ mŋ̽ (Mang)

When combined with independent vowel ạʷ it becomes a special symbol called omkara and is pronounced m. In this form it is used to represent god.

eg.

ᬒᬁᬱᬦ᭄ᬢᬶ᭞ᬱᬦ᭄ᬢᬶ᭞ᬱᬦ᭄ᬢᬶ᭞ᬒᬁ

Musical marks and symbols

The other symbols in the Balinese block are all musical symbols, and are not described here.

᭡,᭢,᭣,᭤,᭥,᭦,᭧,᭨,᭩,᭪,᭴,᭵,᭶,᭷,᭸,᭹,᭺,᭻,᭼

There is also a set of musical diacritical marks, which are not described here.

᭫,᭬,᭭,᭮,᭯,᭰,᭱,᭲,᭳

For an in-depth look at musical symbols in Balinese see Perdana.

Encoding choices

Balinese is a script where different sequences of Unicode characters may produce the same visual result. Here we look at those related to vowels.

Encoding vowel signs

Five of the circumgraphs can be written as a single character, or as two characters, the second being ᬵ [U+1B35 BALINESE VOWEL SIGN TEDUNG] in all cases.

Atomic	Decomposed
ᭀ	1B3E 1B35
ᭃ	1B42 1B35
ᭁ	1B3F 1B35
ᬻ	1B3A 1B35
ᬽ	1B3C 1B35

The single code point per vowel sign is preferred, however the parts are separated in Unicode Normalisation Form D (NFD), and recomposed in Unicode Normalisation Form C (NFC), so both approaches are canonically equivalent.

Whichever approach is used, the vowel signs must be typed and stored after the consonant characters they surround, and in left to right order.

Encoding independent vowels

Four of the independent vowels can be written as a single character, or as two. The alternatives are regarded as canonically equivalent in Unicode. Again, this always involves ᬵ.

Atomic	Decomposed
ᬈ	1B07 1B35
ᬊ	1B09 1B35
ᬒ	1B11 1B35
ᬆ	1B05 1B35

The precomposed characters decompose in NFD, and reform again in NFC. It is generally recommended to use the precomposed character.

Combining mark order

The following indicates the expected ordering of Unicode characters within a Burmese combining character sequence. The labels are those used for the Unicode Indic Syllabic Categories. Follow the links to see what characters are represented by a given label.

Burmese has 2 types of combining character sequence (CCS).

The first type is a base plus Virama. This is the non-final part of a consonant cluster or a consonant with a killed vowel, and consists of just the base and the virama.

The general CCS type uses the following preferred ordering after a base.

Ordering characters as shown above avoids potential ambiguities and maximises the likelihood of success when rendering the text.

Numbers

There is a set of Balinese digits, and they are used in the same way as ASCII digits in Latin text.

᭑,᭒,᭓,᭔,᭕,᭖,᭗,᭘,᭙,᭐

However, because many of the digit symbols are indistinguishable from other Balinese letters, numbers are typically surrounded by ᭞, so that they are clearly distinguished.

eg.

ᬩᬮᬶ᭞᭓᭞ᬚᬸᬮᬶ᭞᭑᭙᭘᭒᭟

Text direction

Balinese text is written horizontally, left to right.

Show default bidi_class properties for characters in the Balinese orthography described here.

Glyph shaping & positioning

You can experiment with examples using the Balinese workbench.

Context-based shaping & positioning

Balinese text relies on OpenType rules to correctly position glyphs and shape them according to the surrounding text.

One major area where this applies is in the use of conjunct forms for consonant clusters. See the relevant sections for lists of stacked and conjoined shapes.

show composition

ᬒᬁᬲ᭄ᬯᬲ᭄ᬢ᭄ᬬᬲ᭄ᬢᬸ

The following is a selection of other examples of contextual shaping and positioning.

After a stacked consonant, the vowel signs that would normally appear below a base are moved to the side, and the shape is modified.

	Composition	Example
ᬓ᭄ᬭᬸ	1B44 1B2D 1B38	ᬓ᭄ᬭᬸᬦ
ᬓ᭄ᬬᬹ	1B44 1B2C 1B39

ᬵ and the right side of ᭁ combine with several of the consonants. The table below shows 2 examples.

	Composition	Example
ᬳᬵ	ᬳᬵ
ᬭᬵ	ᬭᬵ	ᬢᬭᬵ

When a vowel sign and a syllable-final consonant mark appear over the same base, they are typically drawn side by side. Combinations such as rerekan and above-base vowels are typically stacked.^§

	Composition	Example
ᬓᬷᬃ	ᬷᬃ	ᬢᬷᬃᬢ
ᬰᬶᬁ	ᬶᬁ

Typographic units

Word boundaries

Words are not separated by spaces, and in fact some word boundaries occur between stacked consonants. This means that segmentation for line-breaking, etc. uses orthographic syllables as a unit (see graphemes).

Graphemes

Grapheme clusters alone are not sufficient to represent typographic units in Balinese. Stacks and conjoined sequences are very common and must not be split apart by edit operations that visually change the text (such as letter-spacing, first-letter highlighting, and line breaking). For those operations one needs to segment the text using orthographic syllables, which string grapheme clusters together with ᭄, which has an Indic Syllabic Category of Virama.

The adeg-adeg is rendered visibly if it is not part of a consonant cluster, for example at the end of a word followed by a space.

Balinese doesn't use word boundaries for text segmentation, relying instead on grapheme boundaries because consonant clusters that span word boundaries are combined into stacks or conjoined forms.

Grapheme clusters

Base Combining_mark* Joiner?

Combining marks may include zero or more of the following types of character:

Nukta (see extendedC)
Dependent vowels (see plainV and vocalics)
Final consonants (see finals)
Virama (adeg adeg) (see clusters and novowel)

Any of the above may occur after a consonant base. Independent vowel bases usually only have final consonant marks.

The following examples show a variety of grapheme clusters:

Click on the text version of these words to see more detail about the composition.

	ᬢᬷᬃᬢ
	ᬅᬃᬣ
	ᬓᬺᬰ᭄ᬡ
	ᬤᬍᬫ᭄
	ᬤᬦ᭄ᬢ

Note how grapheme clusters break up the conjuncts. This is not usually desirable (see orthographicS just below).

Larger typographic units

(Consonant Rerekan? Adeg_adeg)* Grapheme_cluster

Balinese commonly stacks or conjoins glyphs, to form conjuncts. The conjuncts represent consonant clusters, which can arise (a) where one phonetic syllable ends in a consonant letter and the following syllable begins with a consonant, or (b) when most medial consonants are written, since Balinese uses conjunct forms for sequences such as Cr-, Cy-, Cw-, Cry-, etc. The cluster of consonants that make up the conjunct are all encoded with adeg adeg between them (see clusters).

Balinese is unusual in that these conjuncts occur across word boundaries, so the word-final consonant of the first word may be stacked above the word-initial consonant of the second. See fig_kahananlankwasa2 for an example.

Grapheme clusters terminate after a sequence of marks containing an adeg adeg, but editorial operations that change the visual appearance of the text, such as letter-spacing, first-letter highlighting, line-breaking, and justification, should never split conjunct forms apart. For this reason, an alternative way of segmenting graphemes is needed. This may not apply, however, for some other operations such as cursor movement or backwards delete.

Where conjuncts appear, a typographic unit contains multiple grapheme clusters. The non-final grapheme clusters all end with ᭄, and the final grapheme cluster begins with a consonant.

The following are examples.

Click on the text version of these words to see more detail about the composition.

	ᬤᬦ᭄ᬢ
	ᬢᬶᬫ᭄ᬧᬮ᭄
	ᬩ᭄ᬭ᭄ᬬᬕ᭄
	ᬰᬵᬲ᭄ᬢ᭄ᬭ

Note that one of the characteristic features of the Indic category of Virama is that the adeg adeg is visible when not followed by a consonant, but invisible when a consonant does follow (creating a stack). This means that the adeg adeg sometimes participates in a simple grapheme cluster, but when followed by a consonant it becomes the 'glue' that creates an orthographic syllable.

On the infrequent occasions when an adeg adeg needs to be visible even though it is followed by another base, an invisible character must be added to prevent it joining with the following base. A zero-width space can achieve that.

ᬧᬓ᭄ᬭᬫᬦ᭄

Browser behaviour

Test in your browser. The words test units that equate to grapheme clusters only, and others that include conjuncts. First, the text is displayed in a contenteditable paragraph, then in a textarea. Results are reported for Gecko (Firefox), Blink (Chrome), and WebKit (Safari) on a Mac.

ᬢᬷᬃᬢ ᬓᬺᬰ᭄ᬡ ᬧᬾᬜ᭄ᬚᭀᬃ ᬢᬶᬫ᭄ᬧᬮ᭄ᬩ᭄ᬭ᭄ᬬᬕ᭄

Cursor movement. Move the cursor through the text.
Gecko steps through the whole text using grapheme clusters. It takes 2 or more steps (depending on the number of GCs) to get through the stacks, one grapheme cluster at a time. Blink and WebKit step through all words using the orthographic syllables described here (ie. they step over a stack and all associated combining characters in one jump).

Selection. Place the cursor next to a character and hold down shift while pressing an arrow key.
The behaviour is the same as for cursor movement.

Deletion. Forward deletion works in the same way as cursor movement. The backspace key deletes code point by code point, except for WebKit, which deletes one grapheme cluster at a time.

Line-break. See this test. The CSS sets the value of the line-break property to anywhere. Change the size of the box to slowly move the line break point.
Gecko appears to segment on orthographic syllable, per the description here, except for one case where the complex stack is split. WebKit and Blink appear to sometimes wrap inside stacks and other times not. It's not obvious why, but both segment in the same way.

Punctuation & inline features

Phrase & section boundaries

See type samples.

Balinese has its own punctuation marks.

phrase	᭞ 1B4E 1B4F ᭝
sentence	᭟
section start	᭚ ᭛ 1B7F
section end	᭞᭜᭞ ᭟᭜᭟ ᭚᭜᭚ ᭛᭜᭛
end of text	᭽ ᭾ ᭽᭜᭽ ᭚᭜᭽

᭝ is used as a colon, and ᭞ and ᭟ are used as comma and full stop respectively. 1B4E and 1B4F were introduced in Unicode v16 to express finer distinctions than the former, used in some manuscripts. u§#G27140

Both ᭚ and ᭛ are used to begin a section in text. 1B7F was introduced in Unicode v16 to represent finer subdivisions in some manuscripts.

At the end of a section, ᭜ is usually used between two other punctuation marks that vary according to the section opener. Typical sequences include carik siki ᭞᭜᭞, carik pareren ᭟᭜᭟ (sometimes called pasalinan), panti ᭚᭜᭚, and carik agung ᭛᭜᭛.u§#G27140

End of text markers include ᭽ and ᭾, or a combination of those or their shorter counterparts with ᭜, such as ᭽᭜᭽ or ᭚᭜᭽.u§#G27140

Line & paragraph layout

Line breaking & hyphenation

Because there are no spaces between words, and because the end of one word and the beginning of another often form conjuncts (see fig_kahananlankwasa2), Balinese doesn't wrap at word boundaries. See graphemes for a description of the typographic units that are used for line break opportunities.

Unfortunately, modern browsers are often unable to detect appropriate break points for Balinese, so in the sample text at the beginning of this page 200B is used at places where the line could be broken. Otherwise, the line would continue, unbroken off the right side of the page.

Pameneng

In lontar texts where a word must be broken at the end of a line (always after a full syllable), the sign ᭠ is inserted. This sign is not used as a word-joining hyphen; it is used only in linebreaking.

Observation: The images appear to show a gap before the pameneng.

A compacted image of a lontar showing a pameneng at the end of a line, with the beginning of the following line below. (Click to see more.)

In online use, an application would need to insert the pameneng, rather than the content author. As line-length is changed by stretching a window, or as content is added earlier in the same paragraph, the location of the word relative to the line edge will change. The insertion of pameneng is only appropriate at those instants when the appropriate sequence of characters appears at the line end.

For an application to use this correctly, it would need to know where the word boundaries are in the text, and then put this character at the end of the line only when a multisyllabic word is broken. This would require a dictionary to be applied to the text, since it would not be appropriate to insert the pameneng at the boundary of 2 words.

Observation: Aditya Bayu Perdana has found instances in lontar where ᬄ is moved to the beginning of a line, alone, while a pameneng appears at the end of the previous line. If this is not just a scribal inconsistency (eg. it's not clear why you wouldn't put the bisah at the end of the line if there's space for a pameneng), it may indicate that this letter should not be a combining mark in Unicode; however, the usage needs to be verified first. See pictures.

Line-edge rules

As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.

Show (default) line-breaking properties for characters in the Balinese orthography.

The following list gives examples of typical behaviours for characters used in contemporary Balinese. Context may affect the behaviour of some of these and other characters.

Click on the Balinese characters to show what they are.

᭚ ᭛ ᭝ ᭞ ᭟ ᭠ should not begin a new line

Text alignment & justification

According to Sudewa, full justification is not a feature of Balinese text in traditional palm-leaf manuscripts, and only left, or occasionally centred or right alignment is relevant.

Baselines, line height, etc.

Balinese uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

fig_baselines shows glyphs from the Noto Serif fonts. The basic height of Balinese letters is the same as the Latin x-height, however extenders and combining marks, extend well beyond the Latin ascenders and descenders, creating a need for larger line heights.

qhx᭛᭄ᬐᬓᬿᬲᬺᬧᬷᬲᬸᬃᬭᬼ — Font metrics for Latin text in the Noto Serif font compared with Balinese glyphs in the Noto Serif Balinese font.

Page & book layout

General page layout & progression

Traditionally, Balinese was written on thin, landscape palm-leaf manuscripts, called lontar.

Picture of a palm leaf manuscript. — Example of a palm-leaf manuscript from Wikipedia.

The text was packed in without paragraph breaks.

Terminology

ᬅᬓ᭄ᬱᬭ aksara letter

ᬯ᭄ᬬᬜ᭄ᬚᬦ wianjana consonant

ᬅᬓ᭄ᬱᬭᬯ᭄ᬬᬜ᭄ᬚᬦ aksara wianjana consonant

ᬯᬺᬱᬵᬲ᭄ᬢ᭄ᬭ wreṣāstra 18 consonants used to write basic Balinese words

ᬰ᭄ᬯᬮᬮᬶᬢ sualalita consonants used used for writing Sanskrit and Kawi loanwords

ᬅᬮ᭄ᬧᬧ᭄ᬭᬵᬡ alpaprāṇa unaspirated

ᬫᬵᬳᬵᬧ᭄ᬭᬵᬡ mahāprāṇa aspirated