Updated 8 February, 2024
This page brings together basic information about the Pollard script and its use for the A-Hmao language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write A-Hmao using Unicode.
The Pollard script has evolved over the course of its history, with 2 major milestones in 1936 and 1988. Although they agree that modern usage is generally closer to the 1936 approach, the sources used are not crystal clear which characters exactly are used in contemporary writing, so this page describes both the 1936 and 1988 versions of the orthography as described by Joakim Enwall.
Richard Ishida, A-Hmao (Pollard Miao) Orthography Notes, 08-Feb-2024, https://r12a.github.io/scripts/plrd/hmd
𖼊𖽠 𖼐𖼊𖽪𖾏 𖼮𖽑𖽦𖾐 𖼮𖽑𖽦𖾐 𖼊𖽔𖾏 𖼐𖼷𖽑𖽡𖽝𖾐 𖼐𖼷𖽑𖽡𖽝𖾐, 𖼄𖽘 𖽐𖼀𖽜 𖼐𖽻𖾐 𖼀𖽔 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼺𖽡𖽝𖾐. 𖼐𖽻𖾐 𖼀𖽔 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼎𖽺 𖼀𖽡𖾐 𖼖𖽜 𖼳𖽷𖾐 𖼮𖽡𖽙𖾏 𖼄𖽘𖾐, 𖼄𖽘𖾐 𖼎𖽻𖾐 𖼷𖽑𖽹 𖼺𖽡𖽝𖾏 𖼄𖽡𖾐 𖼇𖽪𖾐 𖼊𖽡𖾐. 𖼷𖽠 𖼐𖽡 𖽃𖽡𖾐 𖼷𖽻 𖼷𖽻𖾐 𖽐𖼀𖽔𖾐 𖽐𖼷𖽑𖽙𖾑, 𖼄𖽘 𖽐𖼷𖽑𖽙𖾑 𖼀𖽑𖽪 𖼀𖽪𖾑 𖼖𖽔 𖼷𖽻 𖼀𖽑𖽪. 𖼻𖽻𖾐 𖼮𖽡 𖼐𖽻𖾐 𖼀𖽔 𖼊𖽡𖽝 𖼮𖽑𖽡𖽠𖾐 𖼐𖽡 𖼳𖽻𖾐 𖼈𖽜𖾐, 𖼮𖽡𖽝𖾐 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼊𖽡𖽝 𖼮𖽑𖽡𖽠𖾐 𖼐𖽡 𖽐𖼊𖽡𖾐 𖼡𖽳𖾐. 𖼐𖽻𖾐 𖼀𖽔 𖼎𖽺 𖼀𖽡 𖼖𖽜 𖼳𖽷𖾐 𖼮𖽡𖽙𖾏 𖼄𖽘𖾐, 𖼄𖽔 𖼄𖽘𖾐 𖽃𖽻𖾐 𖼮𖽡 𖼐𖽡 𖼄𖽔𖾏 𖼮𖽢𖾏 𖼮𖽑𖽙𖾑, 𖼳𖽷𖾐 𖼷𖽻 𖼐𖽻𖾐 𖼀𖽔 𖼖𖽙 𖼊𖽘𖾏 𖼐𖽑𖽙𖾐. 𖼐𖽻𖾐 𖼽𖽦𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼎𖽺 𖼀𖽡𖾐 𖼖𖽜 𖼳𖽷𖾐 𖼮𖽡𖽙 𖼄𖽘𖾐, 𖼄𖽔 𖼄𖽘𖾐 𖽃𖽻𖾐 𖼮𖽡 𖼐𖽡 𖼄𖽔𖾏 𖼮𖽢𖾏 𖽐𖼮𖽡, 𖼳𖽷𖾐 𖼷𖽻 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼽𖽔𖾐 𖼡𖽠 𖼄𖽻 𖼊𖽘𖾏 𖼐𖽑𖽪𖾐. 𖼄𖽘 𖼷𖽙 𖽐𖼀𖽔𖾐 𖼮𖽡𖽙𖾏 𖼊𖽡𖾐 𖼐𖽘 𖼘𖽡, 𖽐𖼮𖽑𖽹𖾐 𖼚𖽻 𖼀𖽡𖾐 𖼚𖽻 𖽐𖼀𖽪𖾐 𖼘𖽙𖾐, 𖼐𖽻𖾐 𖼀𖽔 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼎𖽻𖾐 𖼳𖽷𖾐 𖼊𖽻𖾐, 𖼐𖽻𖾐 𖼀𖽔 𖽐𖼎𖽑𖽦𖾑 𖼮𖽡𖾑 𖼺𖽷𖾐, 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖽐𖼎𖽑𖽦𖾑 𖼮𖽡𖾑 𖼎𖽙𖾑. 𖼐𖽻𖾐 𖼀𖽔 𖼐𖽻𖾐 𖼽𖽘𖾐 𖼡𖽔𖾐 𖼡𖽠 𖼊𖽡 𖼀𖽡𖾐 𖼖𖽜 𖼎𖽻𖾐 𖼷𖽑𖽹𖾐 𖼐𖽡𖾑. 𖼮𖽑𖽡𖽘 𖼮𖽡 𖼣𖽪 𖼄𖽔𖾏 𖼳𖽷𖾐 𖽐𖼮𖽦𖾐 𖽐𖼚𖽡𖾏 𖼳𖽷𖾐 𖼀𖽡 𖼚𖽠, 𖼄𖽔𖾏 𖼮𖽑𖽡𖽠 𖼣𖽪 𖼄𖽔𖾏 𖼳𖽷𖾐 𖽐𖼮𖽦𖾐 𖽐𖼚𖽡𖾏 𖼳𖽷𖾐 𖼀𖽡 𖼚𖽠, 𖼄𖽔 𖼮𖽑𖽡𖽠 𖼣𖽪 𖼄𖽔𖾏 𖼳𖽷𖾐 𖽐𖼮𖽦𖾐 𖽐𖼚𖽡𖾏 𖼳𖽷𖾐 𖽐𖼎𖽺 𖼊𖽡𖽝. 𖼮𖽡 𖼞𖽡𖾐 𖼄𖽙 𖽐𖼎𖽔.
Source: Miao translated songs
The A-Hmao language has around 300,000 speakerseth, in northeast and central Yunnan and northwestern Guizhou provinces, however it is just one of a number of closely related languages and dialects spoken in China. Despite numerous attempts to define a romanised orthography, the Pollard or Miao script is still the primary script for writing the A-Hmao languageeth, and is also used or other dialects and languages such as Lisu and Yi. Literacy in the orthography described here is high, and in addition to Bible translations it has been used for a large number of hand-written traditional songs and storiesc, for bilingual educational materials, and elementary textbooks.pf
The script was invented by James Pollard and others, and was first used in 1905. This page describes two describes two snapshots of the orthography, one in 1936 and the other in 1988, as described by Joakim Enwall The 1936 version was the result of a number of revisions, but was then used to publish a translation of the New Testament. Beginning in the 1950s, significant revisions of the script produced an alternative version which Enwall dates to 1988 and calls the Chuxiong Reformed Pollard Script. It handles tone marks differently and provides additional consonants for a more phonetic version of the script. The contemporary Miao communities appear to prefer the older version of the orthography.
The Pollard script is an alphabet, ie. consonants and vowels are written separately. See the table to the right for a brief overview of features for the A-Hmao language.
The information in this page is based on the orthographies for the Miao script in 1936 and 1988, as described by Joakim Enwall. It is consistently reported that users of the Pollard script today prefer the 1936 orthography over the 1988 revision, however the sources consulted provided little reliable evidence as to which characters exactly are used in modern writing. This page, therefore, puts the spotlight on the 2 orthographies described by Enwall in his book A Myth Become Reality on pages 185ff, and presents both alternatives.
The Pollard script is strongly focused on writing syllables, and splits each syllable into an onset and a rhyme. We will describe the orthography in these terms here, rather than as simple consonants and vowels.
A-Hmao text runs left-to-right in horizontal lines. There is no case distinction. Words are typically monosyllabic, but may also be made up of more than one syllable. Spacing occurs around syllable boundaries, rather than word boundaries.
Syllable onsets in the 1936 orthography always contain one of 22 basic consonant letters. The consonant sound can be prenasalised or aspirated, or both. Prenasalisation is indicated using the letter 𖽐 immediately before the consonant. Aspiration is indicated using the combining mark 16F51 immediately after the consonant.
The 1988 (Chuxiong) orthography has more consonants (totalling 43) because it describes the sounds phonetically, rather than phonemically, and provides symbols that cover a wider range of voicing or devoicing for the consonants. These variants are separate code points, written with a dot in the upper right corner of the letter glyph. The dot is built in to the glyph for the consonant, and not a combining mark. The same code points can be written in alternative form using a 'wart' appended to the consonant shape, and is the default shape in the Unicode charts. The choice of dot or wart is a font-related distinction. The sources consulted don't make it clear whether these 'wart' characters are used by people writing in the Pollard script today.
The Pollard script is an alphabet where the rhymes are written using 24-25 combining marks. In some cases, dipthongs and rhymes with final consonants are written using a single combining mark, but in other cases 2 combining marks are used.
Tone is written differently in the 2 orthographies described here. The 1936 orthography uses 3 invisible characters after the vowel to place the vowel mark(s) in positions relative to the onset, and the position indicates the tone. In the 1988 Chuxiong revision, 7 explicit tone mark letters are used, in which case vowels and tone marks sit on the baseline.
Syllable-initial standalone vowels are preceded by 𖽃.
A-Hmao uses ASCII digits, and ASCII code points for punctuation marks.
The following represents the repertoire of the A-Hmao language.
Click on the sounds to reveal locations in this document where they are mentioned.
Phones in a lighter colour are non-native or allophones. Source Wikipedia.
For a list of diphthongs see the summary tables in vowels.
|t d tl̥ dl
|ⁿt ⁿd ⁿtl̥ ⁿdl
A-Hmao is a contour tone language, with 8 tones. It also has a particularly complex tone sandhi process by which tones interact with and alter the tones and other phonemes in other words.pj,2
The A-Hmao is strongly focused on syllables. Many words are monosyllabic, although there are also multi-syllabic words.
Onsets are based on consonants, which can be prenasalised, aspirated, voiced, devoiced, or a mixture of the foregoing. The rhyme may have a final nasal, but typically ends in a vowel or a diphthong.
The following table summarises the main rhyme to character assigments for the 1936 orthography, according to Enwall.
The following table summarises the main rhyme to character assigments for the 1988 Chuxiong revision, according to Enwall.
𖼞𖽡 ki U+1B13 MIAO LETTER KA + U+16F61 MIAO VOWEL SIGN I
All vowel diacritics are typed and stored after the base consonant, and the glyph rendering system takes care of the positioning relative to the onset at display time (see tones.
All of the vowel signs are spacing marks, meaning that they consume horizontal space when added to a base consonant.
A-Hmao uses the following dedicated combining marks for rhymes or their components. They may be used on their own, or in combination with others (see compositeV).
The Chuxiong orthography uses the following dedicated combining marks for rhymes or their components. They may also be used on their own, or in combination with others (see compositeV).
The differences between the 1936 orthography and the 1988 one are as follows:
𖼞𖽡𖽾 kin U+1B13 MIAO LETTER KA + U+16F61 MIAO VOWEL SIGN I + U+16F7E MIAO VOWEL SIGN NG
In many cases, a single combining mark expresses the rhyme, even if it contains multiple phones (see the list above). However, in some cases multiple combining marks are used. The following lists those found in the 1936 orthography.
Standalone vowels are written using 𖽃 as the base.
A-Hmao is reported to have 8 tones (see tone).
In the 1936 orthography, and most commonly used approach for modern text, the tone is indicated by the position of the vowel(s). The position is indicated using one of 3 invisible formatting characters, which must be used immediately after the vowel. An absence of tone mark indicates tone 3u,785, and the vowel remains on the baseline. These are illustrated here:
16F00 16F71 16F8F
16F00 16F71 16F90
16F00 16F71 16F91
If the rhyme is composed of more than one vowel diacritic, all the diacritics are moved together to the appropriate position. For example,
16F00 16F61 16F5D 16F90
Observation: The sources consulted don't indicate how these positions map to actual tones, nor why there are fewer of them than explicit tone letters used in the Chuxiong revision.
Unicode supports a fourth tone position, bottom, which is not used for A-Hmao.
16F00 16F71 16F92
One of the significant changes in the 1988 revision is that tones are expressed using dedicated glyphs, and both vowels and tone marks sit on the baseline. Again, the default tone is indicated by a tone mark absence. For example:
16F00 16F71 16F93
16F00 16F71 16F94
The tone marks are letters and they are stored immediately after the vowele,189.
No explanation is given as to why there are 2 sets of duplicates in the list.
Observation: Page 155 of Enwalle,155 has a similar table for the Chuxiong orthography, but the counter assignments are somewhat different, and don't involve duplicates.
The Unicode Standard lists the following tone marks as archaic (with no indication of the contours they map to).
The following table summarises the main consonant to character assigments of the 1936 orthography.
The right column contains alternative letters used to write phonetic sounds; they are not normally used.
|Lateral release stops
The following table summarises the main consonant to character assigments of the 1988 (chuxiong) orthography.
The right column contains aspirated phones.
|Lateral release stops
Whereas the table just above takes you from sounds to letters, the following simply lists the basic consonant letters.
The following consonant letters are those used for the 1936 orthography. The letters for stops and affricates represent both unvoiced and voiced sounds.
The 1988 orthography includes many more basic consonant letters because it provides distinct letters for the voiced stops and affricates, as well as some other new letters. It also has a number of letters that produce the same sound as another; the intended difference is no longer apparent.
The letters with dots alongside them are a new feature of the 1988 orthography. In fact, these are encoded using the same code points as letters rendered by default in the Unicode block chart with a 'wart'; the difference is only made by the font. For example, 𖼛U+16F1B MIAO LETTER DLHA and 𖼛U+16F1B MIAO LETTER DLHA are exactly the same letter.
The differences between the 1936 orthography and the 1988 one are as follows. The changes also percolate through into the aspirated, voiced, and prenasalised forms.
Most sources list the following letters as in use by A-Hmao. They are not listed in the 1936 repertoire by Enwall. Unfortunately, the sources don't usually indicate whether these are in regular use in modern A-Hmao writing. They cover allophonic sounds that occur in the language. Note the circle, which is typically referred to as a 'wart'. (In the 1988 revision, the wart becomes a dot at the right of the base shape.)
Although it is listed in the basic consonants above, 𖽂 is generally used for Chinese loan words, as is 𖼵 (not listed above) which is another letter for ʐ.
As can be seen in the charts above, all Pollard script onsets contain a consonant which may also carry graphemes for prenasalisation and/or aspiration/devoicing.
As shown in the summary tables above, A-Hmao stops and affricates can all be prenasalised. This is indicated by writing the letter 𖽐 before the consonant. This is not a combining mark.
A-Hmao letters can also be followed by the combining mark 16F51. This creates aspiration for stops and affricates, devoices nasals, and produces sound changes for other consonants (see the summary charts above). It can also be used with consonants that are prenasalised.
The few syllable-final consonant sounds in the Pollard script generally represent words from Chinese, and single-character rhymes exist that contain the required sound in addition to the vowel. They include the following.
In other cases, the rhyme contains 2 combining marks, one for the vowel and the other for the nasal.
Multiple combining marks are also used in the rhyme for several diphthongs (see compositeV).
This section offers advice about characters or character sequences to avoid, and what to use instead. It takes into account the relevance of Unicode Normalisation Form D (NFD) and Unicode Normalisation Form C (NFC)..
Although usage is recommended here, content authors may well be unaware of such recommendations. Therefore, applications should look out for the non-recommended approach and treat it the same as the recommended approach wherever possible.
Regardless of the final rendered position of characters, code points should be typed and stored in the following order.
Click on fig_sequence to see examples of sequences of characters that follow the rules just above, even though the components appear to be in a different order when rendered.
A-Hmao uses ASCII digits
A-Hmao text runs left to right in horizontal lines.
bidi_class properties for characters in the A-Hmao orthography described here.
Experiment with examples using the A-Hmao character app.
Apart from the difference in character repertoire, the 1936 and 1988 orthographies differ systematically in 2 major ways (see fig_style_1936 and fig_style_1988):
Any font fallback mechanisms need to pick a font with the appropriate glyphs and positioning rules.
The shaping of the letters doesn't typically change (the wart vs dot styles described just above are set for a whole font), but the figures in the previous section show clearly that positioning and kerning of glyphs is important in the Pollard script. Note particularly the final example, where the vowel and tone mark are tucked under the dot (which is part of the consonant) and the aspiration mark.
Base characters can carry 2 or 3 combining marks. In the 1936 orthography, one of the multiple combining marks is typically an invisible tone position marker.
Note how, in fig_gpos the position of the vowel not only changes in height, but also changes it's horizotal position relative to the onset glyphs.
Another common occurrence is for the aspiration marker to appear to the left of a nasal consonant, although the order of characters in memory remains unchanged. fig_gpos_nasal shows an example. Note, however, that the aspiration marker doesn't appear to the left of the prenasalisation marker.
A-Hmao separates syllables by spaces. Many A-Hmao words are monosyllabic, but there are also multi-syllabic words, and a space occurs after each syllable in such words (like Vietnamese).
A-Hmao uses ASCII punctuation, but may also use Chinese punctuationpf,2.
Lines are broken at the spaces between syllables, or after punctuation.pf,2
There is no breaking within syllables, or hyphenation.pf,2
As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.
Show line-breaking properties for characters in the modern A-Hmao orthography.
The following list gives examples of typical behaviours for some of the characters used in A-Hmao. Context may affect the behaviour of some of these and other characters.
Click/tap on the characters to show what they are.