/* */

var charDetails = {

// MAIN BLOCK
// Use _tools/generate_details_page_stubs.html to generate stubs to go here

'\u{11400}': `

𑐀

ə independent vowel.

𑐀𑐥𑐵

𑐀𑐩𑑂𑐥𑐶

Combinations

𑐀𑑅

aː is 𑐀𑑅

𑐀𑑃

ã is 𑐀𑑃

𑐀𑑄

ãː is 𑐀𑑄

𑐀𑑄

𑐀𑑄𑐐𑑅

`, '\u{11401}': `

𑐁

æ independent vowel.

𑐁𑐏𑑅

𑐁𑐮𑐸

Combinations

𑐁𑑅

æː is 𑐁𑑅

𑐁𑑃

æ̃ is 𑐁𑑃

𑐁𑑄

æ̃ː is 𑐁𑑄

`, '\u{11402}': `

𑐂

i independent vowel.

𑐂𑐏𑐸𑑄

𑐦𑐂

Combinations

𑐂𑑃

ĩ is 𑐂𑑃

𑐮𑐂𑑃𑐔𑐵

𑐎𑐸𑑃𑐂𑑃

𑐂𑑄

ĩː is 𑐂𑑄

𑐃𑑄

Note that the long nasalised vowel uses the short independent vowel.

`, '\u{11403}': `

𑐃

iː long independent vowel.

𑐃𑐮𑐔𑑄

𑐮𑐾𑐟𑐵𑐃𑐐𑐸

The long nasalised vowel uses the short independent vowel. See 11402.

`, '\u{11404}': `

𑐄

u independent vowel.

𑐄𑐳𑐵𑑃𑐫𑑂

𑐩𑐥𑐄

Combinations

𑐄𑑃

ũ is 𑐄𑑃

𑐄𑑄

ũː is 𑐄𑑄

𑐄𑑄𑐠

Note that the long nasalised vowel uses the short independent vowel.

`, '\u{11405}': `

𑐅

uː long independent vowel.

The long nasalised vowel uses the short independent vowel. See 11404.

`, '\u{11406}': `

𑐆

Infrequent.

ru independent vowel.

`, '\u{1140A}': `

𑐊

e independent vowel.

𑐊𑐮𑐵

Combinations

𑐊𑑅

eː is 𑐊𑑅

𑐊𑑃

ẽ is 𑐊𑑃

𑐊𑑄

ẽː is 𑐊𑑄

`, '\u{1140B}': `

𑐋

əi diphthong independent vowel.

`, '\u{1140C}': `

𑐌

o independent vowel.

𑐌𑐴𑐵𑐫𑑀

Combinations

𑐌𑑅

oː is 𑐌𑑅

𑐌𑑃

õ is 𑐌𑑃

𑐌𑑄

õː is 𑐌𑑄

`, '\u{1140D}': `

𑐍

əu diphthong independent vowel.

`, '\u{1140E}': `

𑐎

k consonant with inherent vowel a.

𑐎𑐮

𑐳𑑃𑐎𑑂𑐰𑑅

𑐟𑐠𑑂𑐫𑐵𑐒𑑂𑐎

Combinations

𑐎𑑂𑐲

t͡ʃ is 𑐎𑑂𑐲

𑐎𑑂𑐲𑐶𑐟𑐶𑐖

`, '\u{1140F}': `

𑐏

kʰ consonant with inherent vowel a.

𑐏𑐵𑐟𑐵

𑐏𑑂𑐰𑐧𑐶

`, '\u{11410}': `

𑐐

ɡ consonant with inherent vowel a.

𑐐𑐳𑐵

𑐐𑑂𑐰𑐵𑐮𑐶

𑐬𑐒𑑂𑐐

Shaping

This is one of 4 letters that use a rounded shape for 11438, eg. compare 𑐎𑐸 and 𑐐𑐸.

𑐡𑐸𑐐𑐸

`, '\u{11411}': `

𑐑

ɡʱ consonant with inherent vowel a.

𑐑𑑅𑐔𑐵

𑐮𑐑𑑂𑐰𑐵𑐟𑐸

`, '\u{11412}': `

𑐒

ŋ consonant with inherent vowel a.

𑐒

𑐬𑐒𑑂𑐐

`, '\u{11413}': `

𑐓

Infrequent.

ŋʰ consonant with inherent vowel a.

`, '\u{11414}': `

𑐔

t͡ɕ consonant with inherent vowel a.

𑐔𑐵𑐎𑑅

𑐔𑑂𑐰𑐫𑐾

𑐏𑑂𑐰𑐮𑑂𑐔𑐵𑐑𑐵𑑃𑐫𑑂

`, '\u{11415}': `

𑐕

t͡ɕʰ consonant with inherent vowel a.

𑐕𑐶𑐐𑐸

𑐕𑑂𑐫𑑄

𑐐𑐮𑑂𑐕𑐶

`, '\u{11416}': `

𑐖

d͡ʑ consonant with inherent vowel a.

𑐖𑐷𑐳𑑂𑐰𑐵𑑄

𑐴𑑂𑐰𑐖𑑂𑐫𑐵

𑐥𑐘𑑂𑐖𑐵𑐧𑐷

Combinations

𑐖𑑂𑐘

ɡj is 𑐖𑑂𑐘

`, '\u{11417}': `

𑐗

d͡ʑʱ consonant with inherent vowel a.

𑐗𑐵𑐳𑐸

𑐗𑑂𑐫𑐵𑑅

`, '\u{11418}': `

𑐘

Infrequent.

ɲ consonant with inherent vowel a.

𑐥𑐘𑑂𑐖𑐵𑐧𑐷

Combinations

𑐖𑑂𑐘

ɡj is 𑐖𑑂𑐘

`, '\u{11419}': `

𑐙

Infrequent.

ɲʰ consonant with inherent vowel a.

`, '\u{1141A}': `

𑐚

Infrequent.

ʈ consonant with inherent vowel a.

`, '\u{1141B}': `

𑐛

Infrequent.

ʈʰ consonant with inherent vowel a.

`, '\u{1141C}': `

𑐜

Infrequent.

ɖ consonant with inherent vowel a.

𑐰𑐶𑐜𑐹𑐬

𑐜𑑂𑐰𑐵𑐎𑐸

`, '\u{1141D}': `

𑐝

Infrequent.

ɖʱ consonant with inherent vowel a.

`, '\u{1141E}': `

𑐞

Infrequent.

ɳ consonant with inherent vowel a.

𑐀𑐬𑑂𑐠𑐥𑐹𑐬𑑂𑐞

`, '\u{1141F}': `

𑐟

t consonant with inherent vowel a.

𑐟𑐵𑐮𑑂𑐮𑐵

𑐟𑑂𑐰𑑃𑐳𑐵

𑐎𑐳𑑂𑐟𑐶

Shaping

This is one of 4 letters that use a rounded shape for 11438, eg. compare 𑐎𑐸 and 𑐟𑐸.

𑐟𑐸𑐫𑐸

`, '\u{11420}': `

𑐠

tʰ consonant with inherent vowel a.

𑐠𑐵𑐫𑑂

𑐠𑑂𑐰

𑐀𑐬𑑂𑐠𑐥𑐹𑐬𑑂𑐞

`, '\u{11421}': `

𑐡

d consonant with inherent vowel a.

𑐡𑐣𑐵𑐳𑐸

𑐣𑐸𑐐𑑅𑐡𑑂𑐫𑑅

𑐳𑐹𑐬𑑂𑐡𑑂𑐫

`, '\u{11422}': `

𑐢

dʱ consonant with inherent vowel a.

𑐢𑐬𑑂𑐩

𑐢𑑂𑐰𑑅

𑐁𑐣𑑂𑐢𑑂𑐬𑐥𑑂𑐬𑐡𑐾𑐱

`, '\u{11423}': `

𑐣

n consonant with inherent vowel a.

𑐣𑐐𑐬

𑐣𑑂𑐰𑐎𑐹

`, '\u{11424}': `

𑐤

nʰ consonant with inherent vowel a.

𑐤𑐾𑐥𑐸

𑐤𑑂𑐫𑐵𑐎𑑄

`, '\u{11425}': `

𑐥

p consonant with inherent vowel a.

𑐥𑐮𑐾𑐳𑑂𑐰𑐵𑑄

𑐥𑑂𑐰𑐮𑐵𑐏

𑐀𑐩𑑂𑐥𑐶

`, '\u{11426}': `

𑐦

pʰ consonant with inherent vowel a.

𑐦𑐫𑑂

𑐁𑐦𑑂𑐰𑑅

`, '\u{11427}': `

𑐧

b consonant with inherent vowel a.

𑐧𑐩𑐹

𑐧𑑂𑐰𑐴

𑐀𑐩𑑂𑐧𑑅

`, '\u{11428}': `

𑐨

bʱ consonant with inherent vowel a.

𑐨𑐵𑐬𑐟

𑐨𑑂𑐰𑐫𑑂

Shaping

This is one of 4 letters that use a rounded shape for 11438, eg. compare 𑐎𑐸 and 𑐨𑐸.

𑐨𑐸𑐫𑐸

The shape of the consonant itself also changes when used with 11438 or 11439, eg. compare:

𑐨
𑐨𑐸
𑐨𑐹

`, '\u{11429}': `

𑐩

m consonant with inherent vowel a.

𑐩𑐔𑐵

𑐀𑐩𑑂𑐧𑑅

𑐢𑐬𑑂𑐩

`, '\u{1142A}': `

𑐪

mʰ consonant with inherent vowel a.

𑐪𑐐𑑅𑐳

𑐪𑑂𑐫𑐵𑐫𑑂

`, '\u{1142B}': `

𑐫

j consonant with inherent vowel a.

𑐫𑐵𑐳𑐸

𑐎𑐵𑐫

𑐀𑐮𑑂𑐫𑐵𑐏

Combinations

𑐴𑑂𑐫

jʰ is 𑐴𑑂𑐫

𑐴𑑂𑐫𑐵𑑄𑐐𑐸

𑐫𑑂

ɛː is 𑐫𑑂 as a dependent vowel.

𑐎𑐫𑑂‌𑐩𑐶

𑐀𑐫𑑂

ɛː is 𑐀𑐫𑑂 as a standalone vowel.

𑐀𑐫𑑂‌𑐮𑐵𑑅

𑑃𑐫𑑂

ɛ̃ː is 𑑃𑐫𑑂

𑐎𑑃𑐫𑑂

𑐵𑐫𑑂

æː is 𑐵𑐫𑑂 as a dependent vowel.

𑐎𑐥𑐵𑐫𑑂

𑐁𑐫𑑂

æː is 𑐁𑐫𑑂 as a standalone vowel.

𑐵𑑃𑐫𑑂

æ̃ː is 𑐵𑑃𑐫𑑂

𑐄𑐳𑐵𑑃𑐫𑑂

`, '\u{1142C}': `

𑐬

r consonant with inherent vowel a.

𑐟𑐬𑐰𑐵𑐬

𑐢𑐬𑑂𑐩

𑐀𑑄𑐐𑑂𑐬𑐾𑐖𑐷

Shaping

Note the special glyph shaping for the consonant clusters above.

Also, this letter uses a special shape for 11438 and 11439, eg. compare the following:

𑐎𑐸

𑐬𑐸

𑐎𑐹

𑐬𑐹

`, '\u{1142D}': `

𑐭

rʰ consonant with inherent vowel a.

`, '\u{1142E}': `

𑐮

l consonant with inherent vowel a.

𑐮𑐵𑐎𑐵𑑄

𑐐𑐮𑑂𑐕𑐶

𑐟𑐵𑐮𑑂𑐮𑐵

`, '\u{1142F}': `

𑐯

lʰ consonant with inherent vowel a.

𑐯𑐵𑑅

𑐯𑑂𑐰𑐎𑐦𑐵𑐎

`, '\u{11430}': `

𑐰

w consonant with inherent vowel a.

𑐰𑐵𑑄𑐐𑐸

𑐣𑑂𑐰𑐎𑐹

𑐳𑐏𑑂𑐰𑐵𑑅𑐮𑑂𑐰𑐴𑑄

Combinations

𑐴𑑂𑐰

wʰ is 𑐴𑑂𑐰

𑐴𑑂𑐰𑐖𑑂𑐫𑐵

`, '\u{11431}': `

𑐱

Infrequent. Generally used for loan words.

s consonant with inherent vowel a.

𑐱𑐣𑐶𑐧𑐵𑑅

𑐁𑐣𑑂𑐢𑑂𑐬𑐥𑑂𑐬𑐡𑐾𑐱

Shaping

This is one of 4 letters that use a rounded shape for 11438, eg. compare 𑐎𑐸 and 𑐱𑐸.

`, '\u{11432}': `

𑐲

Infrequent. Generally used for loan words.

ʂ consonant with inherent vowel a.

Combinations

𑐎𑑂𑐲

t͡ɕʰ is 𑐎𑑂𑐲.

𑐎𑑂𑐲𑐶𑐟𑐶𑐖

`, '\u{11433}': `

𑐳

s consonant with inherent vowel a.

𑐳𑐣𑑂𑐟𑑂𑐬𑐵𑐳𑐶

𑐎𑐳𑑂𑐟𑐶

𑐔𑐶𑐎𑐶𑐟𑑂𑐳𑐎

`, '\u{11434}': `

𑐴

h consonant with inherent vowel a.

𑐴𑐮𑐶𑐩

𑐥𑐵𑐴𑐵𑑄

Combinations

Aspirated/murmured consonant sounds typically have dedicated, atomic characters. However, two don't.

𑐴𑑂𑐰

wʰ is 𑐴𑑂𑐰

𑐴𑑂𑐰𑐖𑑂𑐫𑐵

𑐴𑑂𑐫

jʰ is 𑐴𑑂𑐫

𑐴𑑂𑐫𑐵𑑄𑐐𑐸

Shaping

The shape of the consonant changes when used with 11438 or 11439. For example, compare:

𑐴

𑐴𑐸

𑐴𑐹

`, '\u{11435}': `

𑐵

a~æ vowel sign.

𑐎𑐵𑐫

𑐐𑐳𑐵

𑐧𑐵𑐖𑐵𑑅

Combinations

𑐵𑐫𑑂

æː is 𑐵𑐫𑑂

𑐎𑐥𑐵𑐫𑑂

𑐟𑐵𑐫𑑂‌𑐐𑑅𑐳𑐶𑐩𑐵

𑐵𑑃𑐫𑑂

æ̃ː is 𑐵𑑃𑐫𑑂

𑐄𑐳𑐵𑑃𑐫𑑂

𑐳𑐸𑐥𑐵𑑃𑐫𑑂

𑐵𑑅

aː is 𑐵𑑅

𑐮𑐸𑐳𑐵𑑅

𑐖𑐰𑐵𑑅𑐖𑑂𑐫𑐵

𑐵𑑃

ã is 𑐵𑑃

𑐩𑐵𑑃

𑐎𑐟𑐵𑑃𑐩𑐬𑐷

𑐵𑑄

ãː is 𑐵𑑄

𑐠𑐵𑑄

𑐖𑐷𑐳𑑂𑐰𑐵𑑄

`, '\u{11436}': `

𑐶

i vowel sign.

𑐎𑐶𑐳𑐶

𑐡𑐶𑐮𑑂𑐮𑐷

Combinations

𑐶𑑃

ĩ is 𑐶𑑃

𑐟𑐶𑑃𑐢𑐸𑑃

𑐶𑑄

ĩː is 𑐶𑑄

𑐣𑐎𑐶𑑄

`, '\u{11437}': `

𑐷

iː long vowel-sign.

𑐮𑐐𑐵𑐣𑐷

𑐖𑐷𑐳𑑂𑐰𑐵𑑄

`, '\u{11438}': `

𑐸

u vowel sign.

𑐟𑐸𑐫𑐸

𑐡𑐸𑐐𑐸

Combinations

𑐸𑑃

ũ is 𑐸𑑃

𑐧𑐸𑑃𑐕𑐸𑑃

𑐸𑑄

ũː is 𑐸𑑄

𑐐𑐸𑑄

Shaping

The shape is rounded in the following four combinations.

𑐐𑐸

𑐟𑐸

𑐨𑐸

𑐱𑐸

It emerges from the side of the consonant in the combination:

𑐬𑐸

And it sometimes ligates with the consonant base, eg.

𑐖𑐸

With 11428 and 11434 it alters the shape of the consonant letter itself, eg. compare the following:

𑐨

𑐨𑐸

𑐴

𑐴𑐸

`, '\u{11439}': `

𑐹

uː long vowel-sign.

𑐂𑐩𑐹

𑐨𑐸𑐮𑐹𑐏𑐵

Shaping

It emerges from the side of the consonant in the following combination.

𑐬𑐹

It sometimes ligates with a character, eg.

𑐖𑐹

With 11428 and 11434 it alters the shape of the consonant letter itself, eg. compare the following:

𑐨

𑐨𑐹

𑐴

𑐴𑐹

`, '\u{1143E}': `

𑐾

e vowel sign.

𑐖𑐸𑐫𑐾

𑐟𑐣𑐾𑐖𑑂𑐫𑐵

Combinations

𑐾𑑅

eː is 𑐾𑑅

𑐾𑑃

ẽ is 𑐾𑑃

𑐕𑐾𑑃

𑐾𑑄

ẽː is 𑐾𑑄

𑐕𑐾𑑄𑐐𑐹

Shaping

When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare:

𑐎

𑐎𑐾

When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare

𑐎𑐾

𑐐𑐾

`, '\u{1143A}': `

𑐺

Used for non-native sounds in loan words.

ɾi vowel sign.

𑐳𑑄𑐳𑑂𑐎𑐺𑐟

`, '\u{1143F}': `

𑐿

əi diphthong vowel sign.

𑐩𑐟𑐿𑐎𑑂𑐫

Combinations

𑐿𑑄

əĩ is 𑐿𑑄

Shaping

When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare

𑐎

𑐎𑐿

When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare

𑐎𑐿

𑐐𑐿

`, '\u{11440}': `

𑑀

o vowel sign.

𑐌𑐴𑐵𑐫𑑀

𑐨𑐹𑐐𑑀𑐮

Combinations

𑑀𑑅

oː is 𑑀𑑅

𑑀𑑃

õ is 𑑀𑑃

𑑀𑑄

õː is 𑑀𑑄

Shaping

When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare

𑐎

𑐎𑑀

When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare

𑐎𑑀

𑐐𑑀

`, '\u{11441}': `

𑑁

əu diphthong vowel sign.

𑐧𑐸𑐏𑑃𑐥𑑁

𑐨𑑁

Combinations

𑑁𑑄

əũ is 𑑁𑑄

Shaping

When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare

𑐎

𑐎𑑁

When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare

𑐎𑑁

𑐐𑑁

`, '\u{11442}': `

𑑂

Vowel-killer.

Combinations

𑐎𑑂𑐲

t͡ʃ is 𑐎𑑂𑐲

𑐖𑑂𑐘

ɡj is 𑐖𑑂𑐘

𑐴𑑂𑐰

wʰ is 𑐴𑑂𑐰

𑐴𑑂𑐫

jʰ is 𑐴𑑂𑐫

`, '\u{11443}': `

𑑃

◌̃ indicates nasalisation of short vowels.m§5-6

𑐎𑐸𑑃𑐂𑑃

ə̃ when used with the inherent vowel.

𑐏𑑃𑐟𑑂𑐰𑐵

Combinations

𑐶𑑃

ĩ is 𑐶𑑃

𑐸𑑃

ũ is 𑐸𑑃

𑐾𑑃

ẽ is 𑐾𑑃

𑑀𑑃

õ is 𑑀𑑃

𑐵𑑃

æ̃ is 𑐵𑑃

𑐂𑑃

ĩ is 𑐂𑑃

𑐄𑑃

ũ is 𑐄𑑃

𑐊𑑃

ẽ is 𑐊𑑃

𑐌𑑃

õ is 𑐌𑑃

𑐀𑑃

ã is 𑐀𑑃

𑐁𑑃

æ̃ is 𑐁𑑃

`, '\u{11444}': `

𑑄

◌̃ long vowel nasalisation.m§5-6

𑐎𑐸𑐮𑐵𑑄𑐗𑑄𑐐𑑅

It is placed on the right edge of the character with which it combines, whether that is a consonant or a spacing vowel-sign.p§11

ə̃ː when used with the inherent vowel.

𑐃𑐮𑐔𑑄

Combinations

𑐶𑑄

ĩː is 𑐶𑑄

𑐸𑑄

ũː is 𑐸𑑄

𑐾𑑄

ẽː is 𑐾𑑄

𑑀𑑄

õː is 𑑀𑑄

𑐵𑑄

æ̃ː is 𑐵𑑄

𑐿𑑄

əĩ is 𑐿𑑄

𑑁𑑄

əũ is 𑑁𑑄

𑐂𑑄

ĩː is 𑐂𑑄

𑐄𑑄

ũː is 𑐄𑑄

𑐊𑑄

ẽː is 𑐊𑑄

𑐌𑑄

õː is 𑐌𑑄

𑐀𑑄

ãː is 𑐀𑑄

𑐁𑑄

æ̃ː is 𑐁𑑄

`, '\u{11445}': `

𑑅

Used to lengthen vowels (with the exception of i and u).m§5-6

𑐩𑐵𑐖𑐵𑑅

əː when used with the inherent vowel.

𑐁𑐏𑑅

Also used to represent post-vocalic aspiration (h).p§11

Combinations

𑐾𑑅

eː is 𑐾𑑅

𑐊𑑅

eː is 𑐊𑑅

𑑀𑑅

oː is 𑑀𑑅

𑐌𑑅

oː is 𑐌𑑅

𑐀𑑅

əː is 𑐀𑑅

𑐵𑑅

aː is 𑐵𑑅

𑐁𑑅

aː is 𑐁𑑅

`, '\u{11446}': `

𑑆

Used to transcribe sounds for which there are no existing characters, such as those in loan words.p§11

`, '\u{11447}': `

𑑇

Used to elide an initial A in Sanskrit as a result of sandhi.p§11

`, '\u{11448}': `

𑑈

Represents nasalisation in some manuscripts. In other sources, a form of punctuation.p§11

`, '\u{11449}': `

𑑉

Represents the sacred syllable om.

`, '\u{1144A}': `

𑑊

Represents the Sanskrit invocation सिद्धिरस्तु siddhirastu 'may there be success'. It is written at the beginning of a text, often in the combination 𑑊𑑉. It corresponds to the sign ঀ [U+0980 BENGALI ANJI] in related scripts such as Bengali.p§11

`, '\u{1144B}': `

𑑋

Sentence delimiter. It has a number of variant shapes.p§11

`, '\u{1144C}': `

𑑌

Indicates the end of a larger block of text than a sentence.p§11

`, '\u{1144D}': `

𑑍

Phrase separator. [Usage not clear.]p§11

`, '\u{1144E}': `

𑑎

Used for marking breaks and filling gaps in a line at a margin. It has a number of variant shapes. [Usage not clear.]p§11

`, '\u{1144F}': `

𑑏

Indicates an abbreviation. [Usage not clear.]p§11

`, '\u{1145A}': `

𑑚

Marks the end of a sentence. [Usage not clear.]p§11

`, '\u{1145B}': `

𑑛

Used for filling gaps in a line and as a mark for end of text.p§11

`, '\u{11450}': `

𑑐

0 digit.

`, '\u{11451}': `

𑑑

1 digit.

`, '\u{11452}': `

𑑒

2 digit.

`, '\u{11453}': `

𑑓

3 digit.

`, '\u{11454}': `

𑑔

4 digit.

`, '\u{11455}': `

𑑕

5 digit.

`, '\u{11456}': `

𑑖

6 digit.

`, '\u{11457}': `

𑑗

7 digit.

`, '\u{11458}': `

𑑘

8 digit.

`, '\u{11459}': `

𑑙

9 digit.

`,

// COMMON PUNCTUATION

// "..
'\u{201C}': ` `,

// .."
'\u{201D}': ` `,

// '..
'\u{2018}': ` `,

// ..'
'\u{2019}': ` `,

// «
'\u{00AB}': ` `,

// »
'\u{00BB}': ` `,

// ;
'\u{003B}': ` `,

// :
'\u{003A}': ` `,

// .
'\u{002E}': ` `,

// ?
'\u{003F}': ` `,

// !
'\u{0021}': ` `,

// (
'\u{0028}': ` `,

// )
'\u{0029}': ` `,

// …
'\u{2026}': ` `,

// –
'\u{2013}': ` `,

// —
'\u{2014}': ` `,

// §
'\u{00A7}': ` `,

'\u{2020}': `

Called dagger, but also known as obelisk, obelus, or long cross.b321

A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321

In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321

`, '\u{2021}': `

Called double dagger, but also known as diesis, or double obelisk.b321

A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

`, '\u{2032}': `

Abbreviation for feet (1′ = 12″).b330

Also used for minutes of arc (eg. 60′=1°).b330

`, '\u{2033}': `

Abbreviation for inches (1′ = 12″).b321

Also used for seconds of arc (eg. 3600″=1°).b321

`,

// FORMATTING CHARACTERS

// zwsp
'\u{200B}': `

An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.

Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษรไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).

`,

// zwj
'\u{200D}': `

Creates glyph joining behaviour in the absence of normal joining contexts.

`,

// zwnj
'\u{200C}': `

Prevents glyph joining behaviour.

`,

// word-break
'\u{2060}': `

An invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks, eg. it can be used around the + sign in base⁠+delta⁠ to prevent a line break occurring in that sequence of characters. It has no effect on word segmentation.

It can also be used to bracket other characters to turn them into non-breaking characters, such as U+2009 THIN SPACE or ― [U+2015 HORIZONTAL BAR].

Not to be confused with U+200D ZERO WIDTH JOINER or U+034F COMBINING GRAPHEME JOINER, since it has no effect on shaping.

This functionality is also provided by U+FEFF ZERO WIDTH NO-BREAK SPACE, but since that character also represents the byte-order mark, the use of this word joiner character (added in Unicode 3.2) is strongly preferred over the latter.

`,

// rli
'\u{2067}': `

Sets the base direction for the following text to RTL, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`,

// lri
'\u{2066}': `

Sets the base direction for the following text to LTR, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`,

// fsi
'\u{2068}': `

Sets the base direction for the following text to the direction of the first strong directional character, per Unicode Bidirectional Algorithm rules, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`,

// pdi
'\u{2069}': `

Ends the range of text that started with RLI, LRI, or FSI.

`,

// rle
'\u{202B}': `

Sets the base direction for the following text to RTL, with no isolation. The Unicode Standard recommends use of RLI, instead.

`,

// lre
'\u{202A}': `

Sets the base direction for the following text to LTR, with no isolation. The Unicode Standard recommends use of LRI, instead.

`,

// pdf
'\u{202C}': `

Ends the range of text that started with RLE, or LRE.

`,

// rlm
'\u{200F}': `

An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`,

// lrm
'\u{200E}': `

An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`,

// cgj
'\u{034F}': `

Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.

`,

// alm
'\u{061C}': `

Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.

`, }