Updated 18 December, 2022
This page brings together basic information about the Lao script and its use for the Lao language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Lao using Unicode.
ມາດຕາ 1: ມະນຸດເກີດມາມີສິດເສລີພາບ ແລະ ສະເໝີໜ້າກັນໃນທາງກຽດຕິສັກ ແລະ ທາງສິດດ້ວຍມະນຸດມີສະຕິສຳປັດຊັນຍະ(ຮູ້ດີຮູ້ຊົ່ວ)ແລະມີມະໂນທຳຈື່ງຕ້ອງປະພຶດຕົນຕໍ່ກັນໃນທາງພີ່ນ້ອງ.
ມາດຕາ 2: ຂໍ້ 1.ຄົນຜູ້ໃດກໍ່ອ້າງຕົນໄດ້ວ່າ:ມີສິດ ແລະ ເສລີພາບທຸກຢ່າງທີ່ໄດ້ປ່າວຮ້ອງຢູ່ໃນປະກາດສະບັບນີ້ໂດຍບໍ່ເລືອກໜ້າ ບໍ່ຈຳກັດເຊື້ອຊາດ,ຜິວເນື້ອ,ເພດ,ສາສະໜາ ຄວາມຄິດເຫັນໃນດ້ານການເມືອງ ຫຼື ອື່ນໆ ກຳເນີດແຫ່ງຊາດຫຼື ສັງຄົມຖານະການມີຊັບສົມບັດມາກ ຫຼື ນ້ອຍ,ມີຕະກຸນ ຫຼື ຖານະອື່ນໆ. ຂໍ້ 2.ອີກປະການໜື່ງ ຈະບໍ່ຈຳກັດຢ່າງໃດໃນການແຕກຕ່າງກັນອັນເນື່ອງມາຈາກລະບຽບການເມືອງການປົກຄອງ ຫຼື ລະຫວ່າງຊາດຂອງປະເທດ ຫຼື ດິນແດນ ຊື່ງບຸກຄົນຜູ້ໃດຜູ້ໜື່ງສັງກັດຢູ່;ດິນແດນນັ້ນຈຳເປັນເອກະລາດຢູ່ໃນຄວາມອາລັກຂາຂອງມະຫາອຳນາດ ຫຼື ບໍ່ມີອິດສະຫຼະ ຫຼື ຖືກລົດອະທິປະໄຕລົງໂດຍຈຳກັດກໍ່ຕາມ.
The Lao script is used for writing the Lao language, and is also the official script of a number of minority languages in Laos. There is a considerable Lao-speaking population in Thailand who write their language with the Thai script.
ອັກສອນລາວ ʔáksɔ̌ːn láːw
The Lao alphabet was adapted from the Khmer script, and is a sister system to the Thai script, with which it shares many similarities and roots. However, Lao has fewer characters and is formed in a more curvilinear fashion than Thai. Further distancing from the Thai script occurred via a number of reforms. In 1975, the latest spelling reform simplified and standardised the script.
Sources: Scriptsource, Wikipedia.
The script was originally an abugida, but since the script reforms leading up to 1960 it has been alphabetic. When the communist Pathet Lao overthrew the Lao government in 1975, they implemented a final spelling reform which simplified and standardized the script.
Observation. I'm not aware of any variant spellings or confusable characters.
Lao is an alphabet. This means that both consonants and vowels are indicated. See the table to the right for a brief overview of features for the modern Lao orthography.
Lao text runs left to right in horizontal lines.
Spaces separate phrases, rather than words.
Each onset consonant is associated with a high, mid, or low class related to tone. Tone is indicated by a combination of the consonant class, the syllable type (checked/unchecked), plus any tone mark.
No conjuncts are used for consonant clusters, except for one subjoined consonant, used in combination only with HA.
Syllable-initial clusters and syllable-final consonant sounds are all written with ordinary consonant letters. However, because all vowels are written, it is not difficult to algorithmically detect syllable boundaries.
Unlike its close relative, Thai, the Lao orthography has no inherent vowel, but still represents vowels using 18 vowel signs (including 5 pre-base), and 2 consonants. Only vowel signs that appear above or below the consonant are combining marks; the others are ordinary characters that are typed in the order seen. Vowels are often written differently when they appear in a closed vs. open syllable.
There are no independent vowels, and standalone vowel sounds are written using vowel signs applied to ອ [U+0EAD LAO LETTER O].
This page lists 27 composite vowels (made from 12 vowel signs, and 2 consonants). Composite vowels can involve up to 4 glyphs, and glyphs can surround the base consonant(s) on up to 3 sides.
Click on the sounds to reveal locations in this document where they are mentioned.
Phones in a lighter colour are non-native or allophones.
labial | dental | alveolar | post- alveolar |
palatal | velar | glottal | |
---|---|---|---|---|---|---|---|
stops | p b | t d | k | ʔ | |||
aspirated | pʰ | tʰ | kʰ | ||||
affricates | t͡ɕ | ||||||
fricatives | f | s | x | h | |||
nasals | m | n | ɲ | ŋ | |||
approximants | ʋ/w | l | j | ||||
trills/flaps | r | ||||||
labial | dental | alveolar | post- alveolar |
palatal | velar | glottal | |
---|---|---|---|---|---|---|---|
stop | p | t | k | ʔ | |||
nasal | m | n | ŋ | ||||
approximant | w | j |
Lao has 6 phonological tones in unchecked syllables, and 4 in checked syllables.
Name | Vowel | Final | Unchecked? | Checked? |
---|---|---|---|---|
Rising | ˨˦ or ˨˩˦ | ě | ✓ | |
High | ˦ | é | ✓ | ✓ |
High falling | ˥˧ | ê | ✓ | |
Mid | ˧ | ē | ✓ | ✓ |
Low | ˩ | è | ✓ | ✓ |
Low falling | ˧˩ | e᷆ | ✓ | ✓ |
The tone depends on the class of the initial consonant in a syllable, the structure of the syllable, and whether or not a tone mark is applied to override the default.
Tone values vary depending on location in Laos. There is some disagreement about whether there are 5 or 6 tones in Vientiane, and the tables below show that different sources disagree on the tones produced. According to some, most dialects of Lao and Isan have six tones, those of Luang Prabang have five.wl
The following tables present different descriptions of tone values in Lao for the Vientiane dialect. The first and third tables basically agree on the tone value, although the names of tones vary. The middle table shows some different tone values altogether. See a list of studies for Vientiane tones.
This diagram shows 5 tones with names corresponding to a mixture of the first two tables below.
Tone marks are normally used only on open syllables, and modify the default tone value. Two of the four tone marks are only used with Class 1 consonants. Tone marks tend to be placed directly over the consonant (or superscript vowel), unlike Thai which tends to place them slightly to the right.
Open or live syllables are those that end with a long vowel or sonorant (eg. ງນມຍວ). Closed or dead syllables end with a stop consonant (eg. ກດບ) or short vowel.
Open | Closed short vowel |
Closed long vowel |
Tone mai eːk |
Tone mai toː |
Tone mai tiː |
Tone mai cat-ta-waː |
|
---|---|---|---|---|---|---|---|
Class 1 | low | ˊ high | ˆ low falling | ˉ mid | ˋ high falling | ˋ high falling | ˇ low rising |
Class 2 | ˇ low rising | ˊ high | ˆ low falling | ˉ mid | ˆ low falling | - | - |
Class3 | ˊ high | ˉ mid | ˋ high falling | ˉ mid | ˋ high falling | - | - |
Refs: Daniels
Live | Dead short vowel |
Dead long vowel |
Tone mai eːk |
Tone mai toː |
Tone mai tiː |
Tone mai cat-ta-waː |
|
---|---|---|---|---|---|---|---|
Class 1 | ˋ low | ˇ rising | ˇ rising | mid | ˆ falling | ˊ high | ˇ rising |
Class 2 | ˇ rising | ˇ rising | ˋ low | mid | ˋ low | - | - |
Class3 | ˊ high | mid | ˆ falling | mid | ˆ falling | - | - |
Refs: Simmala
Live | Dead short vowel |
Dead long vowel |
Tone mai eːk |
Tone mai toː |
Tone mai tiː |
Tone mai cat-ta-waː |
|
---|---|---|---|---|---|---|---|
Class 1 | low rising | high rising | low falling | high-mid | high falling | ||
Class 2 | low rising | high rising | low falling | high-mid | low falling | ||
Class3 | high rising | high-mid | high falling | high-mid | high falling |
Refs: SEAlang
The Simmala chart appears a little suspect, since they say in the text that the rising tone doesn't occur in dead syllables, and yet the book has examples of dead syllables with long vowels with a low tone.
The syllable is a basic element of the Lao language, and many words are monosyllabic. All syllables begin with a written consonant. Syllables that begin with a vowel sound are written with a silent base consonant.
The phonological structure of a syllable is (C)w?V(C)
.
Unlike Thai, its close neighbour linguistically, Lao doesn't naturally support onset clusters of consonants other than with ʷ, and then not before rounded vowels.wl,#Syllables Onset consonants followed by labialisation include: tʷʰ tɕʷ kʷ kʷʰ ʔʷ sʷ ŋʷ lʷ.
Only a small set of consonants occur at the end of a syllable. ʔ occurs after short vowels.
Lao also has 6 phonological tones in unchecked syllables (ie. ending in a vowel, or m, n, ŋ, w or j), and 4 in checked syllables (ie. ending in p, t, k, or ʔ).
Unlike its close relative, Thai, the Lao orthography has no inherent vowel, but still represents vowels using 18 vowel signs (including 5 pre-base), and 2 consonants. Only vowel signs that appear above or below the consonant are combining marks; the others are ordinary characters that are typed in the order seen. Vowels are often written differently when they appear in a closed vs. open syllable.
There are no independent vowels, and standalone vowel sounds are written using vowel signs applied to ອ [U+0EAD LAO LETTER O].
This page lists 27 composite vowels (made from 12 vowel signs, and 2 consonants). Composite vowels can involve up to 4 glyphs, and glyphs can surround the base consonant(s) on up to 3 sides.
For a mapping of sounds to graphemes see vowel_mappings.
Dashes are used to indicate whether the character represents a vowel sound in a closed or an open syllable.
Although modern Lao orthography is alphabetic, and therefore has no inherent vowel, vowel signs are still used to write the vowel sounds. For example, ki is written:
ກິ ki [U+0E81 LAO LETTER KO + U+0EB4 LAO VOWEL SIGN I]
What's different is that ka, which would in an abugida be written just using a consonant, is also spelled out using a vowel sign.
ກະ ka [U+0E81 LAO LETTER KO + U+0EB0 LAO VOWEL SIGN A]
Vowels in Lao are written with a mixture of combining characters and ordinary spacing characters. Only the superscript and subscript vowel signs are combining characters. It is also common to use some consonants to represent vowel sounds (see consonant_vowels).
Vowel signs are arranged around a syllable onset, which are rare in Lao but may include 2 consonants, rather than just around the immediately preceding consonant. For example, vowel characters that are rendered to the left of the base will appear before all the consonants in a syllable onset (see prebase_vowels). And where a vowel sign is composed of components that are rendered before and after the base, they will surround the syllable onset (see composite_vowels).
Lao uses the following combining marks for vowels.
Unlike its close neighbour Thai, Lao consonants do not carry an inherent vowel. The short vowel a has to be written explicitly, using ະ [U+0EB0 LAO VOWEL SIGN A] in open syllables, and ັ [U+0EB1 LAO VOWEL SIGN MAI KAN] in closed syllables.
When used in conjunction with other vowels, ະ [U+0EB0 LAO VOWEL SIGN A] and ັ [U+0EB1 LAO VOWEL SIGN MAI KAN] are used to indicate short vowels for open and closed syllables, respectively, eg. ລະດັບ
This can be seen by comparing the long and short vowels in vowel_mappings. In common with other languages, long i, ɯ and u vowels have dedicated characters for long sounds, and there are a few unusual sequences, but many composite vowels use the aforementioned signs as shorteners. The following provides one example of the general pattern.
The following additional, vowel-specific characters are ordinary spacing characters, with the general category of 'letter'. Many of these are typed and stored before the base consonant (see prebase_vowels).
ຳ [U+0EB3 LAO VOWEL SIGN AM] is classed as a vowel, but also contains the final consonant m, represented by a built-in nikhahit (cf. -ໍ [U+0ECD LAO NIGGAHITA]). It is a spacing combining character, but the Unicode Standard classifies it as a letter.
Five vowel signs appear to the left of the onset consonant. See an example in fig_prebase.
Like Thai, Lao uses a visual encoding model, so these characters are not combining characters, and are typed and stored before the base.
Note that ແ [U+0EC1 LAO VOWEL SIGN EI] should not be typed as two successive ເ [U+0EC0 LAO VOWEL SIGN E] characters.
These vowel signs are placed before the start of the syllable onset. This means that in a word with more than one consonant at the start (such as for shifting the tone) the pre-base vowel is placed to the left of the syllable-initial consonant, rather than to the left of the consonant after which it is pronounced. Tone marks and post-base vowel signs are however attached to the latter. For the following examples, click on the Lao text to see the order of characters.
ແກວ່ງ ໃຫຍ່ ເຫຼືອງ
On its own, in the middle of a closed syllable, ອ [U+0EAD LAO LETTER O] is pronounced as the vowel -ɔː- and ວ [U+0EA7 LAO LETTER WO] is pronounced -uːə̯-, eg. ຈອກ tɕɔːk˧˩ glass ບວມ buːəm to swell
ອ [U+0EAD LAO LETTER O] is also used as a vowel carrier for standalone vowels (see standalone).
ວ [U+0EA7 LAO LETTER WO], ຍ [U+0E8D LAO LETTER NYO], and ຽ [U+0EBD LAO SEMIVOWEL SIGN NYO] also represent the glide w or j at the end of a diphthong (see composite_vowels).
ຽ [U+0EBD LAO SEMIVOWEL SIGN NYO] was originally an alternate form of non-initial ຍ [U+0E8D LAO LETTER NYO], but is now used for diphthongs, either alone as iːə̯ or as the semi-vowel j, eg. ປຽກ piːə̯k˧˩ wet ຊາຽ saːj˦ sand
All of these characters also appear in the composite vowels described below.
In the lists below, hyphens represent consonants. Vowels used in closed syllables are indicated by a trailing hyphen in the IPA transcription.
Vowels in Lao are commonly written using a combination of code points, which may produce glyphs on more than one side of the base. See fig_composite.
Some composite vowels represent plain vowel sounds:
The other composites represent diphthongs, which generally end in one of ə̯, i, or w.
In some cases, the spelling is straightforward because a glide consonant simply follows a plain vowel.
For others, the spelling doesn't closely follow the sound represented.
Observation: Simmala et al. list the composite vowel ເ-ັຍະ for -ia in (only) one location, but I have yet to identify words containing this sequence, and suspect that it may be a typographic error.
Characters that don't appear in the combinations:
The following list shows where vowel signs are positioned around a base consonant to produce vowels, and how many instances of that pattern there are. Numbers after + sign indicate multiple code points.
At maximum, vowel components can occur concurrently on 3 sides of the base.
Distribution of vowel elements is as follows:
-ັ -ິ -ີ -ຶ -ື -ໍ -ົ | -ຳ | ||
ເ ແ ໂ ໃ ໄ | ະ າ ອ ວ ຍ ຽ | ຍ ຽ ະ ວ | |
-ຸ -ູ |
Lao uses a silent ອ [U+0EAD LAO LETTER O] as a base (although it is often transcribed as ʔ), to which vowel signs are applied, eg. ໂອ ອຸ່ນ
Lao has no independent vowel letters.
Because the orthography is alphabetic, vowel absence after syllable-final consonants is not normally marked in any way. Nor is it marked in syllable-initial clusters.
໌ [U+0ECC LAO CANCELLATION MARK was previously used to indicate silenced consonants, but is now described as obsolete.wl,#Punctuation
This section maps Lao vowel sounds to common graphemes in the Lao orthography. The dash indicates the location of the consonant relative to the vowel sign; if there are 2 dashes, the vowel is used only in closed syllables. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.
ເ–ິ– [U+0EC0 LAO VOWEL SIGN E + U+0EB4 LAO VOWEL SIGN I]
ເ–ິະ [U+0EC0 LAO VOWEL SIGN E + U+0EB4 LAO VOWEL SIGN I + U+0EB0 LAO VOWEL SIGN A]* (modern vowel form)
The Unicode Lao block provides the following characters for indicating tone.
Tone marks should be typed and stored in memory immediately after the base consonant of the syllable, or after a superscript vowel sign if there is one. However, the tone mark should be typed before ຳ [U+0EB3 LAO VOWEL SIGN AM], and should be displayed above the nikhahit, see fig_tone_am.
Each onset consonant is associated with a high, mid, or low class related to tone. Tone is indicated by a combination of the consonant class, the syllable type (checked/unchecked), plus any tone mark.
No conjuncts are used for consonant clusters, except for one subjoined consonant, used in combination only with HA.
Syllable-initial clusters and syllable-final consonant sounds are all written with ordinary consonant letters. However, because all vowels are written, it is not difficult to algorithmically detect syllable boundaries.
For a mapping of sounds to graphemes see consonant_mappings.
A silent ຫ [U+0EAB LAO LETTER HO SUNG] can be added before the following onset characters to make their default tonal class high:
There are alternate forms for 3 of these. Two can be represented as ligatures, for which there are separate characters in Unicode: ໜ [U+0EDC LAO HO NO] and ໝ [U+0EDD LAO HO MO], d,462 eg. ໝາ A third can be represented by ຫຼ [U+0EAB LAO LETTER HO SUNG + U+0EBC LAO SEMIVOWEL SIGN LO],u,378 eg. ຫຼາຍ
ອ [U+0EAD LAO LETTER O] is silent when used as a base for vowels at the beginning of a syllable, eg. ໂອ When it appears after a base consonant it becomes the vowel ɔː, eg. ຈອກ It is also used in combination with other characters to produce additional vowel sounds (see composite_vowels).
One more letter was officially removed from the alphabet by the Ministry of Education, but it is still used occasionally to transliterate Indic or other foreign words into Lao, eg. ຝຣັ່ງ flaŋ foreigner It is generally used to represent the letter 'r': the sound r no longer exists in Lao.
Phonetically, unlike Thai, Lao doesn't have syllable-initial consonant clusters other than those followed by w plus a non-rounded vowel (see structure). Those are written using an ordinary ວ [U+0EA7 LAO LETTER WO], eg. ຄວາຍ
Otherwise, consonant letter clusters occur in the following circumstances:
No special characters or viramas are involved, except those described in highclass.
In a consonant cluster any tone marks or superscript vowels appear over the second consonant.
Lao doesn't have any code points dedicated to syllable final consonants, although consonants do appear in those positions, eg. ນົກ
Only the following consonants appear in syllable-final position. Note how the sound may change: where there is more than one phonetic realisation, the one with a preceding hyphen is the final.
Because Lao requires vowels to be written, there is not the ambiguity about syllable boundaries that one finds in Thai (caused by ambiguity about whether a consonant is syllable-final or a syllable in its own right).
A final m sound may be represented by ຳ [U+0EB3 LAO VOWEL SIGN AM].
This section maps Lao consonant sounds to common graphemes in the Lao orthography, grouped by high class ( h ), mid class ( m ), low class ( l ) and syllable-final ( f ).. Click on a grapheme to find other mentions on this page (links appear at the bottom of the page). Click on the character name to see examples and for detailed descriptions of the character(s) shown.
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.
Unicode 12 added 14 consonant letters and 1 combining mark for writing Pali.
Lao uses Western digits.
There is, however, a set of Lao digits.
Observation: Pending further clarification about how widespread the use of Lao digits is, note that Lao Wikipedia uses Lao digits for table of contents list numbering and for footnote references. See the relevant sections below.
The CLDR standard-decimal pattern is #,##0.###
. The standard-percent pattern is #,##0%
.cldr
Observation: Lao Wikipedia uses a French pattern, #.##0,###
, eg. ມີກຳລັງຕິດຕັ້ງ 7.207,24 ເມກາວັດ There are 7,207.24 megawatts installed
The CLDR standard format for currency is ¤#,##0.00;¤-#,##0.00
, and the symbol for the Lao currency, Kip, is ₭ [U+20AD KIP SIGN].cldr
Lao text runs left to right in horizontal lines.
Show default bidi_class
properties for characters by the modern Lao orthography.
This section brings together information about the following topics: writing styles; cursive text; context-based shaping; context-based positioning; baselines, line height, etc.; font styles; case & other character transforms.
You can experiment with examples using the Lao character app.
None of the characters require special shaping based on the visual context.. Nor is printed text cursive.
Lao has no special requirements for baseline alignment between mixed scripts or in general.
The orthography has no case distinction, and no special transforms are needed to convert between characters.
Prescript vowels are visually ordered, and therefore do not need to be positioned by the font.
Vowel signs, tones, and one consonant, on the other hand, are combining characters that need to be correctly positioned relative to the base character, and multiple marks can be combined with a single base character.
When using the vowel sign AM with a tone mark the small circle needs to push the tone mark upwards, even though the tone mark occurs before the vowel sign in memory (see fig_tone_am).
Observation: Italicised text used for a figure captions, and also for quotations.
tbd
Unicode grapheme clusters divide text into segments that contain a single base consonant plus any following combining characters: the latter include the 9 combining vowel signs, and all tone marks. Not included are free-standing vowel signs and consonants that make up other parts of a composite vowel, both pre-base and post-base. Also, syllable-initial consonant clusters with -ວ [U+0EA7 LAO LETTER WO] and ຫ- [U+0EAB LAO LETTER HO SUNG] are treated as 2 text units, but not ຫຼ [U+0EAB LAO LETTER HO SUNG + U+0EBC LAO SEMIVOWEL SIGN LO].
This implies that a pre-base vowel sign such as ເ- [U+0EC0 LAO VOWEL SIGN E] would be treated as a separate item from what follows, and in fact this can be seen in fig_drop_caps_2, where that character is the only thing highlighted in an initial letter selection. (On the other hand, initial letters followed by combining characters select the whole sequence, as seen in fig_drop_caps.)
This means that Lao typography is different from some other SE Asian scripts where pre-base vowel signs are selected with the base because they are combining characters, or syllable-initial consonant clusters form a unit because the 'medial' consonants are represented by combining characters.
Words are not separated by spaces, nontheless double-clicking or other selection methods are expected to identify word boundaries. There are 2 alternative approaches for managing this.
Unlike Thai or Khmer, it is fairly straightforward to parse individual syllables in Lao, because its alphabetic nature makes it possible to identify syllable-final consonants. Note that syllable-based segmentation must identify and keep together any syllable-initial clusters involving h or l, for example, the initial 2 letters in ຫມາ should wrap as a unit just like the ligated form, ໝາ mā.
What about kw etc?
While nearly all syllables can be argued to be words in their own right, there is still a preference for keeping multi-syllabic words together during word-based segmentation. eg. ປະເທດ For this, an application needs to use a dictionary to parse Lao text.
However, widely used software automatically inserts U+200B ZERO WIDTH SPACE (ZWSP) in Lao text at word or syllable boundaries, and many web pages use such inserted ZWSP characters to get browsers to wrap correctly.g3,#issuecomment-385847864
If a dictionary fails to keep two or more syllables together as needed, it should be possible to use the Unicode character U+2060 WORD JOINERbetween the two syllables. This is an invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks.
If dictionaries are used for segmentation, they should be selected based on the language, not the script. (See the list of languages using the Lao script.)
Lao uses ASCII punctuation, but also uses space as punctuation.
phrase |
, [U+002C COMMA] ; [U+003B SEMICOLON] : [U+003A COLON] |
---|---|
sentence |
. [U+002E FULL STOP] |
Spaces are used, but represent phrase or sentence boundaries.
Numbers are also normally surrounded by spaces.
In principle, periods are not used, though this appears to be changing.wl,#Punctuation
Observation: Lao Wikipedia uses periods at the end of sentences, and commas (see an example).l An online news site also consistently uses periods to end sentences.
Western punctuation is also used. Contemporary writing may include punctuation marks borrowed from French, such as the exclamation mark (!), and question mark (?). However, questions can be determined by question words within a sentence.wl,#Punctuation
Hyphens are also commonly found in modern writing.wl,#Punctuation
Lao commonly uses ASCII parentheses to insert parenthetical information into text.
start | end | |
---|---|---|
standard |
( [U+0028 LEFT PARENTHESIS] and ) [U+0029 RIGHT PARENTHESIS] are used for parentheses in contemporary writing.wl,#Punctuation
Lao texts use quotation marks around quotations. Of course, due to keyboard design, quotations may also be surrounded by ASCII double and single quote marks.
start | end | |
---|---|---|
initial | ||
nested | ’ [U+2019 RIGHT SINGLE QUOTATION MARK] |
The default quote marks for Lao are “ [U+201C LEFT DOUBLE QUOTATION MARK] at the start, and ” [U+201D RIGHT DOUBLE QUOTATION MARK] at the end.cldr
When an additional quote is embedded within the first, the quote marks are ‘ [U+2018 LEFT SINGLE QUOTATION MARK] and ’ [U+2019 RIGHT SINGLE QUOTATION MARK].cldr
Contemporary writing may also include « [U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK] and » [U+00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK] for quotation marks, borrowed from French.wl,#Punctuation
tbd
ຯ [U+0EAF LAO ELLIPSIS] is used to indicate ellipsis or abbreviation, as well as missing words.wl,#Punctuation
The ellipsis, … [U+2026 HORIZONTAL ELLIPSIS], is also commonly found in modern writing.wl,#Punctuation
Observation: Lao Wikipedia uses periods after date-related abbreviations,l eg. in ຄ.ສ. 1935 ສະບັບຄົ້ນ ḵʰ.s. 1935 sab̯äb̯ḵʰo²ṉ CE 1935 Edition It is also used in the abbreviated name of the country, eg. ສ.ປ.ປ.ລາວ s.p̯.p̯.ḻāw̱ Lao PDR
ໆ [U+0EC6 LAO KO LA] is used in ໆລໆ kʰɯaŋ-mǎːj-lɛ-ɯːn-ɯːn (ເຄຶ່ອງໝາຍ ແລະອຶ່ນໆ), with a meaning similar to etc. For example, ການສື່ສານ,ສື່ມວນຊົນ,ສື່ໂຄສະນາ...ໆລໆ Communication, media, advertising ... etc
Some sources use ຯລຯ and others ໆລໆ – check this out.
ໆ [U+0EC6 LAO KO LA] is used to indicate repetition of a preceding sound.
tbd
CLDR includes the following punctuation.
tbd
Although Lao doesn't use spaces or dividers between words, the expectation is that line-breaks occur at word boundaries. See word for a discussion of issues related to word-based segmentation.
As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.
Show (default) line-breaking properties for characters in the modern Lao orthography.
The following list gives examples of typical behaviours for some of the characters used in modern Lao. Context may affect the behaviour of some of these and other characters.
Click/tap on the Lao characters to show what they are.
Line breaking should not move a danda or double danda to the beginning of a new line even if they are preceded by a space character.
Since spaces aren't used to separate words, Lao has to use alternative strategies for justification of text.
tbd
This section looks at ways in which spacing is applied between characters over and above that which is introduced during justification.
Lao uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.
Lao places vowel and tone marks above base characters, one above the other, and can also add combining characters below the line. The complexity of these marks means that the vertical resolution needed for clearly readable Lao text is higher than for English, or most Latin text. In addition, Lao also tends to add more interline spacing than Latin text does.
To give an approximate idea, fig_baselines compares Latin and Lao glyphs from Noto fonts. The basic height of Lao letters is typically slightly above the Latin x-height, however extenders and combining marks reach well beyond the Latin ascenders and descenders, creating a need for larger line spacing.
fig_baselines_other shows similar comparisons for the Lao MN and DokChampa fonts.
You can experiment with counter styles using the Counter styles converter. Patterns for using these styles in CSS can be found in Ready-made Counter Styles, and we use the names of those patterns here to refer to the various styles.
The modern Lao orthography uses a numeric style.
The lao numeric style is decimal-based and uses these digits.rmcs
Examples:
Observation: Lao Wikipedia uses periods for suffixes.l
It is possible to find the first letter in a paragraph styled so that it is larger and sits alongside several lines of the continuing paragraph text.
Observation: All combining characters, including spacing ones, are included in the selections shown in fig_drop_caps.
Any punctuation such as opening quotes and opening parentheses should also be included in the initial styling. ?
Observation: In the figures shown, the alphabetic baseline of the highlighted letter(s) matches the bottom of the row that determines the size of the highlighted letter(s). Selections without diacritics above are somewhat shorter than the height of the lines alongside, whereas selections with multiple diacritics rise slightly higher than the first line of text.
Observation: In fig_drop_caps_2, the selection picks out only ຫ from the digraph ຫລ; and ເ from the syllable ເມຶ່ອ.
This section is for any features that are specific to Lao and that relate to the following topics: general page layout & progression; grids & tables; notes, footnotes, etc; forms & user interaction; page numbering, running headers, etc.
Observation: Lao Wikipedia uses Lao digits for footnote references.