Buginese orthography notes v32

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size: 36px

ᨔᨗᨊᨗᨊ ᨑᨘᨄ ᨈᨕᨘ ᨑᨗ ᨍᨍᨗᨕᨂᨗ ᨑᨗᨒᨗᨊᨚᨕᨙ ᨊᨄᨘᨊᨕᨗ ᨆᨊᨙᨂᨗ ᨑᨗᨕᨔᨙᨂᨙ ᨕᨒᨙᨅᨗᨑᨙ᨞ ᨊᨄᨘᨊᨕᨗ ᨑᨗᨕᨔᨙᨂᨙ ᨕᨀᨒᨙ᨞ ᨊᨄᨘᨊᨕᨗ ᨑᨗᨕᨔᨙᨂᨙ ᨕᨈᨗ ᨆᨑᨙᨊᨗ ᨊ ᨔᨗᨅᨚᨒᨙ ᨅᨚᨒᨙᨊ ᨄᨉ ᨔᨗᨄᨀᨈᨕᨘ ᨄᨉ ᨆᨔᨒᨔᨘᨑᨙ᨞

Source: Unicode UDHR, article 1

Usage & history

Origins of the Buginese script, 14thC – today.

Phoenician

└ Aramaic

└ Brahmi

└ Tamil-Brahmi

└ Pallava

└ Old Kawi

└ Buginese

+ Balinese

+ Batak

+ Baybayin

+ Javanese

+ Makasar

+ Old Sundanese

+ Rencong

+ Rejang

The Buginese or Lontara script ( ᨒᨚᨈᨑ ) was traditionally used to write the Bugis, Makassarese and Mandar languages of Sulawesi in Indonesia. During the period of Dutch colonisation, it was largely replaced by the Latin alphabet, and is now considered to be endangered. However, it still persists for ceremonial purposes, such as weddings, and for writing personal documents such as letters and notes. There is also some use for printing traditional Buginese literature.

The script is originally derived from the Brahmi script, but has evolved considerably along similar lines to other scripts in the Indonesian archipelago. Wikipedia relates that the term Lontara "is derived from the Malay name for palmyra palm, lontar, whose leaves are traditionally used for manuscripts. In Buginese, this script is called urupu sulapa eppa which means "four-cornered letters", referencing the Bugis-Makasar belief of the four elements that shaped the universe: fire, water, air and earth."

Unicode 17 has 1 dedicated block, comprising 30 characters.

Sources: Scriptsource, Wikipedia.

Basic features

The Buginese script, also known as Lontara (ᨒᨚᨈᨑ), is an abugida, ie. each consonant contains an inherent vowel sound.

❯ basicV

Vowels The inherent vowel is pronounced a.

Post-consonant vowels are written using 5 combining marks (vowel signs).

There is 1 pre-base glyph but no circumgraphs, nor composite vowel signs, but repetition can be shown by duplication of the vowel sign.

Standalone vowel sounds are written using ᨕ with an attached vowel sign.

❯ consonantSummary

Consonants Buginese has 18 basic consonants, but is a defective script, meaning that it doesn't represent all the sounds of the spoken language. Specifically, syllable codas are not written.

Vowel absence Because syllable codas are not written in the Buginese orthography, vowel absence is not indicated using conjunct forms, or dedicated syllable-final characters. There are also no dedicated medial consonant characters.

Single code points are, however, provided for 4 syllable-initial prenasalised clusters.

Numbers There are no native numbers.

Layout Modern Buginese text runs left to right in horizontal lines. Words can be separated by spaces, or spaces can be used to separate units longer than words. In some texts words are not separated at all. There are no case distinctions.

Character index

Letters

Show

Basic consonants

ᨄ,ᨅ,ᨈ,ᨉ,ᨌ,ᨍ,ᨀ,ᨁ,ᨔ,ᨖ,ᨆ,ᨊ,ᨎ,ᨂ,ᨓ,ᨑ,ᨒ,ᨐ

Prenasalised consonants

ᨇ,ᨏ,ᨃ,ᨋ

Vowels

ᨕ

Other

ꧏ

Combining marks

Show

Vowels

ᨗ,ᨘ,ᨙ,ᨚ,ᨛ

Punctuation

Show

᨞,᨟

Other

Show

‍

To be investigated

,‌

Phonology

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Vowel sounds

There's something odd here and later about a vs ə. Need to figure out which is right.

Consonant sounds

	labial	alveolar	palatal	velar	glottal
stops	p b	t d	c ɟ	k ɡ	ʔ
pre-nasalised	ᵐp		ⁿc	ᵑk
fricatives		s			h
nasals	m	n	ɲ	ŋ
approximants	w	l	j
trills/flaps		r
pre-nasalised		ⁿr

Tone

Buginese is not a tonal language.

Vowels

	Post-consonant	Standalone
	◌ᨗ,◌ᨘ	ᨕᨗ,ᨕᨘ,ᨕᨗ‍ᨐ
	◌ᨙ,◌ᨚ	ᨕᨙ,ᨕᨚ
	◌ᨛ	ᨕᨛ
	ⓘ	ᨕ

ⓘ represents the inherent vowel. Diacritics are added to the vowels to indicate nasalisation (not shown here).

Inherent vowel

ᨀ ka

The inherent vowel for Buginese is pronounced a. So ka is written by simply using the consonant letter.

eg.

ᨀᨆᨗᨔᨗ

ᨀ,ᨆᨗ,ᨔᨗ

Post-consonant vowels

Post-consonant vowels are written using 5 vowel signs. All vowel signs are combining marks, and are stored after the base character.

There is 1 pre-base vowel sign but no circumgraphs, and no composite vowel signs. However, repetition can be shown by duplication of the vowel sign.

All vowel signs are typed and stored after the base consonant, whether or not they precede it when displayed. The glyph rendering system takes care of the positioning at display time.

Two vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.

Plain vowels

ᨀᨗ ki

Buginese uses the following dedicated combining marks for vowels.

◌ᨗ,◌ᨘ,◌ᨙ,◌ᨚ,◌ᨛ

eg.

ᨅᨛᨒᨙᨑ

ᨀᨆᨗᨔᨗ

According to the Unicode Standard, word repetition can be shown by duplication of a vowel sign, especially 1A19 and 1A1A.

Standalone vowels

Buginese uses 1A15 to represent the sound a when there is no preceding consonant.

eg.

ᨕᨔᨘ

ᨕᨑᨘᨕ

Other syllable-initial vowel sounds are written using the same character with vowel signs attached.

ᨕᨗ,ᨕᨘ,ᨕᨙ,ᨕᨚ,ᨕᨛ,ᨕ

eg.

ᨕᨗᨊ

ᨕᨛᨊᨛ

ᨓᨕᨙ

show composition

ᨕᨛᨊᨛ

Vowel components

Pre-base vowel sign

ᨀᨙ ke

The vowel sign ᨙ appears to the left of the base consonant letter.

eg.

ᨕᨙᨔᨙᨊᨙ

This is a combining mark that is always typed and stored after the base consonant, ie. the codepoints follow the order in which the items are pronounced. The rendering process places the glyph before the base consonant without changing the code points. The following shows the sequence of code points that make up the word just above.

eg.

ᨕ,ᨙ,ᨔ,ᨙ,ᨊ,ᨙ

show composition

ᨕᨔᨙ

Because a vowel sign is associated with a syllable, rather than the immediately preceding consonant, the vowel sign doesn't need to split letters that represent pre-nasalised consonants (see onsets).

eg.

ᨋᨙ nre

Vowel sounds to characters

This section maps Bugis vowel sounds to common graphemes in the Buginese orthography.

dependent ᨗ

standalone ᨕᨗ

ija

ligature ᨕᨗ‍ᨐ

dependent ᨘ

standalone ᨕᨘ

dependent ᨙ

standalone ᨕᨙ

dependent ᨚ

standalone ᨕᨚ

dependent ᨛ

standalone ᨕᨛ

inherent vowel eg. ᨅᨗᨌᨑ

standalone ᨕ

Vowel absence

Vowel absence principally occurs either when a consonant is a syllable coda, or when a consonant is part of a consonant cluster.

Because syllable codas are not written in the Buginese orthography, it doesn't indicate consonant clusters other than the atomic letters for prenasalised onsets mentioned in onsets.u The latter are indicated using atomic code points.

There is no virama-like character, and Buginese has no way to indicate missing vowels between consonants or at the end of a word.

Consonants

Onsets	ᨄ,ᨅ,ᨈ,ᨉ,ᨌ,ᨍ,ᨀ,ᨁ
	ᨔ,ᨖ
	ᨆ,ᨊ,ᨎ,ᨂ
	ᨓ,ᨑ,ᨒ,ᨐ
Onset clusters	ᨇ,ᨏ,ᨃ,ᨋ

The glottal stop sound only occurs in codas, and those are not written.

Basic consonants

These are the basic consonant letters in Buginese.

Click on each letter for more details and for examples of usage.

ᨄ,ᨅ,ᨈ,ᨉ,ᨌ,ᨍ,ᨀ,ᨁ,ᨔ,ᨖ,ᨆ,ᨊ,ᨎ,ᨂ,ᨓ,ᨑ,ᨒ,ᨐ

ᨖ was introduced to represent an Arabic sound.

Buginese also has 4 letters for pre-nasalised consonants (see onsets).

Onsets

Four dedicated characters are used to represent pre-nasalised consonant clusters in Buginese (but not Makassarese).

ᨇ,ᨏ,ᨃ,ᨋ,

eg.

ᨌᨛᨇ

ᨌᨛ,ᨇ

ᨒᨘᨋ

ᨒᨘ,ᨋ

Codas

Syllable-final consonant sounds (normally ʔ and ŋ) are not written. For example, the final consonant sound is not written at the end of words like the following.

eg.

ᨕᨘᨒᨛ

ᨔᨈᨘ

These omissions can lead to ambiguities in the written text that are exploited for Buginese word games.

In recent times there have been various suggestions for indicating codas, but none have been used in practice. Read more in Wikipedia.

Consonant length

Geminated consonants sounds (which are contrastive and frequent in the Buginese language) are not written.

eg.

ᨒᨀᨕᨗ

ᨈᨚᨒᨚᨒ

This can lead to some ambiguity. For example, the following word doesn't distinguish between lapa, meaning lava, or lappa, meaning joint.

ᨒᨄ

An idea of the frequency is given by this extract which is the first phrase in article 1 of the UDHR. Six words have at least one geminate.

eg.

ᨔᨗᨊᨗᨊ ᨑᨘᨄ ᨈᨕᨘ ᨑᨗ ᨍᨍᨗᨕᨂᨗ ᨑᨗᨒᨗᨊᨚᨕᨙ ᨊᨄᨘᨊᨕᨗ ᨆᨊᨙᨂᨗ ᨑᨗᨕᨔᨙᨂᨙ ᨕᨒᨙᨅᨗᨑᨙ᨞ sininna rupa tau ri ɟaɟiaŋŋi rilinoe nappunnai maneŋŋi riaseŋŋe alebbireŋ

Consonant sounds to characters

This section maps Bugis consonant sounds to common graphemes in the Buginese orthography.

consonant ᨄ

ᵐp

digraph ᨇ

consonant ᨅ

consonant ᨈ

consonant ᨉ

consonant ᨌ

ᶮc

digraph ᨏ

consonant ᨍ

consonant ᨀ

ᵑk

digraph ᨃ

consonant ᨁ

consonant Occurs in syllable-final position, but is not written, eg. ᨒᨚᨈᨑ

consonant ᨔ

consonant ᨖ

consonant ᨆ

consonant ᨊ

consonant ᨎ

consonant ᨂ Not written when syllable-final.

consonant ᨓ

consonant ᨑ

ⁿr

digraph ᨋ

consonant ᨒ

consonant ᨐ

Other features

Odds and ends

A photo on Flickr shows a sign (associated with Makassar) with consonant clusters using what looks like 1A18 between the characters. There is also a syllable-final -r represented by ᨑ.

Still in the same picture is ᨄ, being used to represent the sound f.

Wikipedia mentions some recent proposals for diacritics to represent virama, anusvara, and glottal stop. The Unicode document repository has several documents proposing the addition of a virama character.

Everson also lists 6 punctuation marks, based on western semantics but with Buginese shapes, that are not in the Unicode block.

A number of proposals for extending the Buginese script to cover additional languages were raised in 2016.

Numbers

Buginese has no native digits.

Text direction

Buginese runs left to right in horizontally stacked lines.

In older journal text when space runs out on a page, scribes used to run the text into the margin and continue in bostrophedon arrangements.

Show default bidi_class properties for characters in the Buginese orthography described here.

Glyph shaping & positioning

You can experiment with examples using the Buginese character workbench.

Context-based shaping & positioning

The only glyph shaping seems to be in connection with the optional iya ligature, 1A15 1A17 200D 1A10. The initial 1A15 is hidden in the following ligature.

ᨕᨗ‍ᨐ

ᨕ,ᨗ,‍,ᨐ

The positioning of diacritics depends on the related base components, eg. compare

cf.

ᨊᨗᨊ nin nin

ᨕᨗ‍ᨐ ǝ̣i‍y iya

ᨐᨗ yi yi

One vowel sign, 1A19, is displayed to the left of the base consonant, although it is typed after the consonant.

eg.

ᨕᨙᨔᨙᨊᨙ

Typographic units

Word boundaries

Words can be separated by spaces, or spaces can be used to separate units longer than words. In other text , following tradition, words are not separated at all.

Graphemes

tbd

Punctuation & inline features

Phrase & section boundaries

Buginese uses very little punctuation, and it is native.

phrase/sentence	᨞
section end	᨟

For separators at the sentence level and below, Buginese typically uses only one punctuation mark, ᨞, which is equivalent to both comma and full stop in Latin transcriptions of Buginese.u

Where spaces are not routinely used after every word, they also constitute a phrasal boundary.

The end of a section may be shown with ᨟.u

Everson lists some additional punctuation marks, but these are not yet supported by Unicode.e

Abbreviation, ellipsis & repetition

Repetition

᨞ can also be used to indicate the doubling of a word or its root, according to Wikipedia. The Unicode Standard says that ꧏ may be used for this purpose.u

Alternatively, the Unicode Standard says, repetition can be shown by duplication of the vowel sign, especially 1A19 and 1A1A.u

Line & paragraph layout

Line breaking & hyphenation

tbd

Show (default) line-breaking properties for characters in the modern Buginese orthography.

In-word line-breaks

If all words are separated by spaces it seems that the space is a likely candidate for the line break opportunity.

Otherwise, according to Everson, in-word line-breaks can occur after any syllable.e

The major web browsers currently break lines at spaces, only.

Baselines, line height, etc.

tbd

Buginese uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

Buginese

Sample

Usage & history

Basic features

Character index

Letters

Basic consonants

Prenasalised consonants

Vowels

Other

Combining marks

Vowels

Punctuation

Other

To be investigated

Phonology

Vowel sounds

Consonant sounds

Tone

Vowels

Inherent vowel

Post-consonant vowels

Plain vowels

Standalone vowels

Vowel components

Pre-base vowel sign

Vowel sounds to characters

Vowel absence

Consonants

Basic consonants

Onsets

Codas

Consonant length

Consonant sounds to characters

Other features

Odds and ends

Numbers

Text direction

Glyph shaping & positioning

Context-based shaping & positioning

Typographic units

Word boundaries

Graphemes

Punctuation & inline features

Phrase & section boundaries

Abbreviation, ellipsis & repetition

Repetition

Line & paragraph layout

Line breaking & hyphenation

In-word line-breaks

Baselines, line height, etc.

Page & book layout

Online resources

References