This page lists characters in the following Unicode blocks and provides information about them.
This is not authoritative, peer-reviewed information – these are just notes I have gathered and copied from various places.
We only have usage data for the Sinhalese language.
If you click on any red example text, you will see at the bottom right of the page a list of the characters that make up the example.
To find a character by codepoint, type #char0000 at the end of the URL in the address bar, where 0000 is a four-figure, hex codepoint number, all in uppercase. Or type the character or the hex number in the Find control above.
To view this page as intended, you need a Sinhala font. This page comes with a Noto Serif Sinhala webfont. Click the blue vertical bar at the bottom right of the page to apply other fonts, if you have them on your system. For transcriptions I recommend the excellent and free Doulos SIL font. The large character in the box will not be rendered unless the webfont downloaded with the page or a system font has a glyph for it. (If there is no glyph and you want to see what it looks like, click on See in UniView.)
Information about languages that use these characters is taken from the list maintained for the Character Use app. The list is not exhaustive.
References are indicated by superscript characters. Wherever possible, those contain direct links to the source material. When such a pointer is alongside an arrow → it means that it's worth following the link for the additional information it provides. Digits refer to the main sources, which are listed at the bottom of a set of notes.
When you are using UniView and you turn on Show notes, UniView will pull in information about characters from this page.
U+0DB8 SINHALA LETTER MAYANNA
U+0DB9 SINHALA LETTER AMBA BAYANNA
U+0DBA SINHALA LETTER YAYANNA
U+0DBB SINHALA LETTER RAYANNA
U+0DC0 SINHALA LETTER VAYANNA
Part of the extended (miśra) repertoire (for Sanskrit, Pali, etc).
gn elsewhere two consonants, eg. පඥා pᵑǰā (paᵑǰā) pagɲa.
U+0DC6 SINHALA LETTER FAYANNA
Part of the extended (miśra) repertoire (for Sanskrit, Pali, etc). A new consonant for foreign sounds.
Sometimes this consonant is written using a grapheme that represents the Latin f combined with ප [U+0DB4 SINHALA LETTER ALPAPRAANA PAYANNA].
U+0D85 SINHALA LETTER AYANNA
U+0D86 SINHALA LETTER AAYANNA
U+0D87 SINHALA LETTER AEYANNA
U+0D88 SINHALA LETTER AEEYANNA
U+0D89 SINHALA LETTER IYANNA
U+0D8A SINHALA LETTER IIYANNA
U+0D8B SINHALA LETTER UYANNA
U+0D8C SINHALA LETTER UUYANNA
U+0D91 SINHALA LETTER EYANNA
U+0D92 SINHALA LETTER EEYANNA
U+0D94 SINHALA LETTER OYANNA
U+0D8D SINHALA LETTER IRUYANNA
U+0D8E SINHALA LETTER IRUUYANNA
Long vocalic r. Part of the extended (miśra) repertoire (for Sanskrit, Pali, etc). Not in contemporary use, however the corresponding vowel sign, ◌ෲ [U+0DF2 SINHALA VOWEL SIGN DIGA GAETTA-PILLA], is used, eg. කරතෲ krtr̥̄ (karatr̥̄).
U+0D93 SINHALA LETTER AIYANNA
U+0D96 SINHALA LETTER AUYANNA
U+0D8F SINHALA LETTER ILUYANNA
U+0DD9 SINHALA VOWEL SIGN KOMBUVA
Sinhala ගැටය සහිත ඇලපිල්ල gæṭa sahita ælapilla ri,ru
Vocalic r. Part of the extended (miśra) repertoire (for Sanskrit, Pali, etc).
This can be also be written with śuddha letters රු ru, which corresponds to the actual pronunciation.
Sinhala ගයනුකිත්ත gayanukitta li
Vocalic l. Part of the extended (miśra) repertoire (for Sanskrit, Pali, etc). Not in contemporary use.
This can be also be written with śuddha letters ලි li , which corresponds to the actual pronunciation.
U+0DCA SINHALA SIGN AL-LAKUNA
Sinhala හල් කිරිම hal kirīma
Indicates that the consonant is not followed by a vowel sound.
In most cases this is visible in Sinhalese text, appearing over the relevant consonant. It may have one of two different shapes, however, depending on the consonant it is associated with: with k you get ක්; with kʰ you get ඛ්.
It is also used to indicate that the font should use a conjunct shape and hide the virama. In this case it is used with ZWJ [U+200D ZERO WIDTH JOINER]. In modern text, the ZWJ follows the virama, and the result is a ligated shape or a reduced shape for one of the conjunct letters. The forms with r or y look like this when combined with k: ර්ක r͓₊k, ක්ර k͓₊r, and ක්ය k͓₊y. There is also a form using both: ක්ය්ර k͓₊y͓₊r (kyra). Wikipedia3 lists several more conjuncts, some of which are reproduced here:
If ZWJ is used before the virama the font should produce an older form of conjunct where the character glyphs are drawn together so that they touch (and the virama is hidden), eg. ම්ම mm (cf. මම).
U+0D82 SINHALA SIGN ANUSVARAYA
U+0DE6 SINHALA LITH DIGIT ZERO
U+0DE7 SINHALA LITH DIGIT ONE
U+0DE8 SINHALA LITH DIGIT TWO
U+0DE9 SINHALA LITH DIGIT THREE
U+0DEA SINHALA LITH DIGIT FOUR
U+0DEB SINHALA LITH DIGIT FIVE
U+0DEC SINHALA LITH DIGIT SIX
U+0DED SINHALA LITH DIGIT SEVEN
U+0DEE SINHALA LITH DIGIT EIGHT
U+111E1 SINHALA ARCHAIC DIGIT ONE
U+111E2 SINHALA ARCHAIC DIGIT TWO
U+111E3 SINHALA ARCHAIC DIGIT THREE
U+111E4 SINHALA ARCHAIC DIGIT FOUR
U+111E5 SINHALA ARCHAIC DIGIT FIVE
U+111E6 SINHALA ARCHAIC DIGIT SIX
U+111E7 SINHALA ARCHAIC DIGIT SEVEN
U+111E8 SINHALA ARCHAIC DIGIT EIGHT
U+111E9 SINHALA ARCHAIC DIGIT NINE
U+111EA SINHALA ARCHAIC NUMBER TEN
U+111EB SINHALA ARCHAIC NUMBER TWENTY
U+111EC SINHALA ARCHAIC NUMBER THIRTY
U+111ED SINHALA ARCHAIC NUMBER FORTY
U+111EE SINHALA ARCHAIC NUMBER FIFTY
U+111EF SINHALA ARCHAIC NUMBER SIXTY
U+111F0 SINHALA ARCHAIC NUMBER SEVENTY
U+111F1 SINHALA ARCHAIC NUMBER EIGHTY
U+111F2 SINHALA ARCHAIC NUMBER NINETY