Buginese orthography notes

Usage & history

The Buginese or Lontara script was traditionally used to write the Bugis, Makassarese and Mandar languages of Sulawesi in Indonesia. During the period of Dutch colonisation, it was largely replaced by the Latin alphabet, and is now considered to be endangered. However, it still persists for ceremonial purposes, such as weddings, and for writing personal documents such as letters and notes. There is also some use for printing traditional Buginese literature.

ᨒᨚᨈᨑ

ᨅᨔ ᨕᨘᨁᨗ

The script is originally derived from the Brahmi script, but has evolved considerably along similar lines to other scripts in the Indonesian archipelago. Wikipedia relates that the term Lontara "is derived from the Malay name for palmyra palm, lontar, whose leaves are traditionally used for manuscripts. In Buginese, this script is called urupu sulapa eppa which means "four-cornered letters", referencing the Bugis-Makasar belief of the four elements that shaped the universe: fire, water, air and earth."

Sources: Scriptsource, Wikipedia.

Basic features

The Buginese script, also known as Lontara (ᨒᨚᨈᨑ), is an abugida. Consonants carry an inherent vowel which can be modified by appending vowel signs to the consonant. See the table to the right for a brief overview of features for the modern Buginese orthography.

Modern Buginese runs left to right in horizontally stacked lines.

Words can be separated by spaces, or spaces can be used to separate units longer than words. In some texts words are not separated at all.

Buginese has 18 basic consonants, but is a defective script, meaning that it doesn't represent all the sounds of the spoken language. ❯ consonants

With the exception of prenasalised onsets, no consonant clusters or gemination are indicated by the Buginese orthography. Single code points are, however, provided for 4 syllable-initial prenasalised clusters. ❯ onsets

Syllable-final consonant sounds are also not written. ❯ finals

The Buginese orthography has an inherent vowel a, and represents vowels using 5 vowel signs (including 1 pre-base vowel). All vowel signs are combining marks, and are stored after the base character. ❯ vowels

There are no composite vowels, but repetition can be shown by duplication of the vowel sign.

Standalone vowel sounds are written using vowel signs applied to ᨕ [U+1A15 BUGINESE LETTER A]. ❯ standalone

There are no native numbers.

Vowels

The Buginese orthography has an inherent vowel a, and represents vowels using 5 vowel signs (including 1 pre-base vowel). All vowel signs are combining marks, and are stored after the base character.

There are no composite vowels, but repetition can be shown by duplication of the vowel sign.

Standalone vowel sounds are written using vowel signs applied to ᨕ [U+1A15 BUGINESE LETTER A].

For a mapping of sounds to graphemes see vowel_mappings.

Inherent vowel

a following a consonant is not written, but is seen as an inherent part of the consonant letter, so ka is written by simply using the consonant letter.

ᨀ ka [U+1A00 BUGINESE LETTER KA]

Vowel signs

Non-inherent vowel sounds that follow a consonant can be represented using vowel signs, eg.

ᨀᨗ ki [U+1A00 BUGINESE LETTER KA + U+1A17 BUGINESE VOWEL SIGN I]

Buginese vowel signs are all combining characters. A single character is used per base consonant. All vowel signs are typed and stored after the base consonant, whether or not they precede it when displayed. The font takes care of the glyph positioning.

An orthography that uses vowel signs is different from one that uses simple diacritics or letters for vowels in that the vowel signs are generally attached to the orthorgraphic syllable, rather than just applied to the letter of the immediately preceding consonant. This means that pre-base vowel signs and the left glyph of circumgraphs appear before a whole consonant cluster if the cluster is rendered as a conjunct (see prescript_vowels).

Two of the vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

Combining marks used for vowels

Buginese uses the following dedicated combining marks for vowels. They may be used on their own, or in combination with others (see composite_vowels).

ᨗ␣ᨘ␣ᨙ␣ᨚ␣ᨛ

Pre-base vowel sign

ᨙ

One vowel sign appears to the left of the base consonant letter or cluster, eg. ᨀᨙ ke.

This is a combining mark that is always stored after the base consonant. The font places the glyph before the base consonant.

ᨕᨔᨙ — The pre-base e vowel appears to the left of the base character after which it is pronounced.

Because a vowel sign is associated with a syllable, rather than the immediately preceding consonant, the vowel sign doesn't need to split letters that represent pre-nasalised consonants (see clusters), eg. ᨋᨙ nre

Standalone vowels

ᨕᨗ␣ᨕᨘ␣ᨕᨙ␣ᨕᨚ␣ᨕᨛ␣ᨕ

Buginese uses ᨕ [U+1A15 BUGINESE LETTER A] to represent the sound a when there is no preceding consonant.

Other syllable-initial vowel sounds are written using the same character with vowel signs attached.

ᨕᨛᨊᨛ — A word that begins with BUGINESE LETTER A supporting a standalone ə vowel.

Vowel absence

Other than the characters representing consonant clusters mentioned in the section clusters, Buginese has no way to indicate missing vowels between consonants or at the end of a word.

Vowel sounds to characters

This section maps Bugis vowel sounds to common graphemes in the Buginese orthography, grouped by dependant ( d ), or standalone ( s ) types. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

ᨗ [U+1A17 BUGINESE VOWEL SIGN I]

ᨕᨗ [U+1A15 BUGINESE LETTER A + U+1A17 BUGINESE VOWEL SIGN I]

ᨘ [U+1A18 BUGINESE VOWEL SIGN U]

ᨕᨘ [U+1A15 BUGINESE LETTER A + U+1A18 BUGINESE VOWEL SIGN U]

ᨙ [U+1A19 BUGINESE VOWEL SIGN E]

ᨕᨙ [U+1A15 BUGINESE LETTER A + U+1A19 BUGINESE VOWEL SIGN E]

ᨚ [U+1A1A BUGINESE VOWEL SIGN O]

ᨕᨚ [U+1A15 BUGINESE LETTER A + U+1A1A BUGINESE VOWEL SIGN O]

ᨛ [U+1A1B BUGINESE VOWEL SIGN AE]

ᨕᨛ [U+1A15 BUGINESE LETTER A + U+1A1B BUGINESE VOWEL SIGN AE]

Inherent vowel

ᨕ [U+1A15 BUGINESE LETTER A]

Consonants

Buginese has 18 basic consonants, but is a defective script, meaning that it doesn't represent all the sounds of the spoken language.

Syllable-final consonant sounds are also not written.

For a mapping of sounds to graphemes see consonant_mappings.

Basic consonants

ᨄ␣ᨅ␣ᨈ␣ᨉ␣ᨀ␣ᨁ

ᨌ␣ᨍ

ᨔ␣ᨖ

ᨆ␣ᨊ␣ᨎ␣ᨂ

ᨓ␣ᨑ␣ᨒ␣ᨐ

ᨖ [U+1A16 BUGINESE LETTER HA] was introduced to represent an Arabic sound.

Geminated consonants

Similarly, geminated consonants sounds (which are distinctive and frequent in Buginese) are not written, eg. the following sequence can be read as lapa lava or lappa joint ᨒᨄ lp

These omissions can lead to ambiguities in the written text that are exploited for Buginese word games.

Onset consonants

ᨇ␣ᨏ␣ᨃ␣ᨋ␣

Four dedicated characters are used to represent pre-nasalised consonant clusters in Buginese (but not Makassarese).

Final consonants

Syllable-final consonant sounds (normally ʔ and ŋ) are not written, eg. the final consonant sound is not written at the end of ᨕᨙᨔᨙᨊᨙ

Consonant clusters

Other than the prenasalised onsets mentioned just above, Buginese doesn't mark clusters.u There is no virama-like character.

Consonant sounds to characters

This section maps Bugis consonant sounds to common graphemes in the Buginese orthography. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.

Stops

ᨄ [U+1A04 BUGINESE LETTER PA]

ᨅ [U+1A05 BUGINESE LETTER BA]

ᨇ [U+1A07 BUGINESE LETTER MPA]

ᨈ [U+1A08 BUGINESE LETTER TA]

ᨉ [U+1A09 BUGINESE LETTER DA]

ᨀ [U+1A00 BUGINESE LETTER KA]

ᨁ [U+1A01 BUGINESE LETTER GA]

ŋk

ᨃ [U+1A03 BUGINESE LETTER NGKA]

Occurs in syllable-final position, but not written.

Affricates

ᨌ [U+1A0C BUGINESE LETTER CA]

ᨍ [U+1A0D BUGINESE LETTER JA]

ɲc

ᨏ [U+1A0F BUGINESE LETTER NYCA]

Fricatives

ᨔ [U+1A14 BUGINESE LETTER SA]

ᨖ [U+1A16 BUGINESE LETTER HA]

Nasals

ᨆ [U+1A06 BUGINESE LETTER MA]

ᨊ [U+1A0A BUGINESE LETTER NA]

ᨎ [U+1A0E BUGINESE LETTER NYA]

ᨂ [U+1A02 BUGINESE LETTER NGA]

Not written when syllable-final.

Other

ᨓ [U+1A13 BUGINESE LETTER VA]

ᨑ [U+1A11 BUGINESE LETTER RA]

ᨋ [U+1A0B BUGINESE LETTER NRA]

ᨒ [U+1A12 BUGINESE LETTER LA]

ᨐ [U+1A10 BUGINESE LETTER YA]

Glyph shaping & positioning

This section brings together information about the following topics: writing styles; cursive text; context-based shaping; context-based positioning; baselines, line height, etc.; font styles; case & other character transforms.

You can experiment with examples using the Buginese character app.

Buginese text is not cursive (ie. joined up like Arabic).

The orthography has no case distinction, and no special transforms are needed to convert between characters.

Context-based shaping & positioning

The only glyph shaping seems to be in connection with the iya ligature, ᨕᨗ‍ᨐ [U+1A15 BUGINESE LETTER A + U+1A17 BUGINESE VOWEL SIGN I + U+200D ZERO WIDTH JOINER + U+1A10 BUGINESE LETTER YA]. The initial ᨕ [U+1A15 BUGINESE LETTER A] is hidden in the ligature ᨕᨗ‍ᨐ

The positioning of diacritics depends on the related base components, eg. compare ᨊᨗᨊ nin ninᨕᨗ‍ᨐ ǝ̣i‍y iya ᨐᨗ yi yi

One vowel sign, ᨙ [U+1A19 BUGINESE VOWEL SIGN E], is displayed to the left of the base consonant, although it is typed after the consonant, eg. ᨕᨙᨔᨙᨊᨙ

Font styling & weight

tbd

Punctuation & inline features

Word boundaries

Words can be separated by spaces, or spaces can be used to separate units longer than words. In some texts words are not separated at all.

Phrase & section boundaries

᨞␣᨟

Buginese uses very little punctuation, and it is native.

phrase/sentence	U+0020 SPACE ᨞ [U+1A1E BUGINESE PALLAWA]
section end	᨟ [U+1A1F BUGINESE END OF SECTION]

phrase/sentence

U+0020 SPACE

᨞ [U+1A1E BUGINESE PALLAWA]

section end

᨟ [U+1A1F BUGINESE END OF SECTION]

For separators at the sentence level and below, Buginese typically uses only one punctuation mark, ᨞ [U+1A1E BUGINESE PALLAWA], which is equivalent to both comma and full stop in Latin transcriptions of Buginese.

The end of a section may be shown with ᨟ [U+1A1F BUGINESE END OF SECTION].

Everson lists some additional punctuation marks, but these don't appear to be supported by Unicode.e

Bracketed text

tbd

Quotations & citations

tbd

Emphasis

tbd

Abbreviation, ellipsis & repetition

᨞␣ꧏ

᨞ [U+1A1E BUGINESE PALLAWA] can also be used to indicate the doubling of a word or its root, according to Wikipedia. The Unicode Standard says that ꧏ [U+A9CF JAVANESE PANGRANGKEP] may be used for this purpose.

Alternatively, the Unicode Standard says, repetition can be shown by duplication of the vowel sign, especially ᨙ [U+1A19 BUGINESE VOWEL SIGN E] and ᨚ [U+1A1A BUGINESE VOWEL SIGN O].

Inline notes & annotations

tbd

Other punctuation

tbd

Other inline text decoration

tbd

Line & paragraph layout

Line breaking & hyphenation

tbd

Show (default) line-breaking properties for characters in the modern Buginese orthography.

Hyphenation/word-breaking

According to Everson, hyphenation can occur after any full orthographic syllable, but there are no details about how that works, or whether he actually means line-breaking, rather than hyphenation per-se.

Text alignment & justification

tbd

Text spacing

tbd

This section looks at ways in which spacing is applied between characters over and above that which is introduced during justification.

Baselines, line height, etc.

tbd

Buginese uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

Counters, lists, etc.

tbd

Styling initials

tbd

	labial	dental	alveolar	palatal	velar	glottal
stops	p b	t d		c ɟ	k ɡ	ʔ
pre-nasalised	ᵐp			ⁿc	ᵑk
fricatives			s			h
nasals	m		n	ɲ	ŋ
approximants	w		l	j
trills/flaps			r
pre-nasalised			ⁿr

Bugis (draft) Buginese

Sample

Usage & history

Basic features

Character index

Letters

Basic consonants

Prenasalised consonants

Vowels

Other

Combining marks

Vowels

Punctuation

Other

Phonology

Vowel sounds

Consonant sounds

Vowels

Inherent vowel

Vowel signs

Combining marks used for vowels

Pre-base vowel sign

Standalone vowels

Vowel absence

Vowel sounds to characters

Consonants

Basic consonants

Geminated consonants

Onset consonants

Final consonants

Consonant clusters

Consonant sounds to characters

Stops

Affricates

Fricatives

Nasals

Other

Numbers

Odds and ends

Text direction

Glyph shaping & positioning

Context-based shaping & positioning

Font styling & weight

Graphemes

Grapheme clusters

Punctuation & inline features

Word boundaries

Phrase & section boundaries

Bracketed text

Quotations & citations

Emphasis

Abbreviation, ellipsis & repetition

Inline notes & annotations

Other punctuation

Other inline text decoration

Line & paragraph layout

Line breaking & hyphenation

Hyphenation/word-breaking

Text alignment & justification

Text spacing

Baselines, line height, etc.

Counters, lists, etc.

Styling initials

Page & book layout

References

Bugis (draft)
Buginese