/*
*/
var charDetails = {

// MAIN BLOCK
// Use _tools/generate_details_page_stubs.html to generate stubs to go here

'\u{11400}': `𑐀
ษ independent vowel.
๐๐ฅ๐ต
๐๐ฉ๐๐ฅ๐ถ
Combinations
๐๐
aː is ๐๐
๐๐
ã is ๐๐
๐๐
ãː is ๐๐
๐๐
๐๐๐๐
`, '\u{11401}': `𑐁
æ independent vowel.
๐๐๐
๐๐ฎ๐ธ
Combinations
๐๐
æː is ๐๐
๐๐
æ̃ is ๐๐
๐๐
æ̃ː is ๐๐
`, '\u{11402}': `
𑐂
i independent vowel.
๐๐๐ธ๐
๐ฆ๐
Combinations
๐๐
ĩ is ๐๐
๐ฎ๐๐๐๐ต
๐๐ธ๐๐๐
๐๐
ĩː is ๐๐
๐๐
Note that the long nasalised vowel uses the short independent vowel.
`, '\u{11403}': `𑐃
iː long independent vowel.
๐๐ฎ๐๐
๐ฎ๐พ๐๐ต๐๐๐ธ
The long nasalised vowel uses the short independent vowel. See 11402.
`, '\u{11404}': `𑐄
u independent vowel.
๐๐ณ๐ต๐๐ซ๐
๐ฉ๐ฅ๐
Combinations
๐๐
ũ is ๐๐
๐๐
ũː is ๐๐
๐๐๐
Note that the long nasalised vowel uses the short independent vowel.
`, '\u{11405}': `𑐅
uː long independent vowel.
The long nasalised vowel uses the short independent vowel. See 11404.
`, '\u{11406}': `𑐆
Infrequent.
ru independent vowel.
`, '\u{1140A}': `
𑐊
e independent vowel.
๐๐ฎ๐ต
Combinations
๐๐
eː is ๐๐
๐๐
ẽ is ๐๐
๐๐
ẽː is ๐๐
`, '\u{1140B}': `
𑐋
ษi diphthong independent vowel.
`, '\u{1140C}': `
𑐌
o independent vowel.
๐๐ด๐ต๐ซ๐
Combinations
๐๐
oː is ๐๐
๐๐
õ is ๐๐
๐๐
õː is ๐๐
`, '\u{1140D}': `
𑐍
ษu diphthong independent vowel.
`, '\u{1140E}': `
𑐎
k consonant with inherent vowel a.
๐๐ฎ
๐ณ๐๐๐๐ฐ๐
๐๐ ๐๐ซ๐ต๐๐๐
Combinations
๐๐๐ฒ
tอกส is ๐๐๐ฒ
๐๐๐ฒ๐ถ๐๐ถ๐
`, '\u{1140F}': `𑐏
kʰ consonant with inherent vowel a.
๐๐ต๐๐ต
๐๐๐ฐ๐ง๐ถ
`, '\u{11410}': `
𑐐
ɡ consonant with inherent vowel a.
๐๐ณ๐ต
๐๐๐ฐ๐ต๐ฎ๐ถ
๐ฌ๐๐๐
Shaping
This is one of 4 letters that uses a rounded shape for 11438, eg. compare ๐๐ธ and ๐๐ธ.
๐ก๐ธ๐๐ธ
`, '\u{11411}': `𑐑
ɡʱ consonant with inherent vowel a.
๐๐ ๐๐ต
๐ฎ๐๐๐ฐ๐ต๐๐ธ
`, '\u{11412}': `𑐒
ŋ consonant with inherent vowel a.
๐
๐ฌ๐๐๐
`, '\u{11413}': `
𑐓
Infrequent.
ŋʰ consonant with inherent vowel a.
`, '\u{11414}': `
𑐔
t͡ɕ consonant with inherent vowel a.
๐๐ต๐๐
๐๐๐ฐ๐ซ๐พ
๐๐๐ฐ๐ฎ๐๐๐ต๐๐ต๐๐ซ๐
`, '\u{11415}': `𑐕
t͡ɕʰ consonant with inherent vowel a.
๐๐ถ๐๐ธ
๐๐๐ซ๐
๐๐ฎ๐๐๐ถ
`, '\u{11416}': `𑐖
d͡ʑ consonant with inherent vowel a.
๐๐ท๐ณ๐๐ฐ๐ต๐
๐ด๐๐ฐ๐๐๐ซ๐ต
๐ฅ๐๐๐๐ต๐ง๐ท
Combinations
๐๐๐
ษกj is ๐๐๐
`, '\u{11417}': `
𑐗
d͡ʑʱ consonant with inherent vowel a.
๐๐ต๐ณ๐ธ
๐๐๐ซ๐ต๐
`, '\u{11418}': `
𑐘
Infrequent.
ɲ consonant with inherent vowel a.
๐ฅ๐๐๐๐ต๐ง๐ท
Combinations
๐๐๐
ษกj is ๐๐๐
`, '\u{11419}': `
𑐙
Infrequent.
ɲʰ consonant with inherent vowel a.
`, '\u{1141A}': `
𑐚
Infrequent.
ʈ consonant with inherent vowel a.
`, '\u{1141B}': `
𑐛
Infrequent.
ʈʰ consonant with inherent vowel a.
`, '\u{1141C}': `
𑐜
Infrequent.
ɖ consonant with inherent vowel a.
๐ฐ๐ถ๐๐น๐ฌ
๐๐๐ฐ๐ต๐๐ธ
`, '\u{1141D}': `
𑐝
Infrequent.
ɖʱ consonant with inherent vowel a.
`, '\u{1141E}': `
𑐞
Infrequent.
ɳ consonant with inherent vowel a.
๐๐ฌ๐๐ ๐ฅ๐น๐ฌ๐๐
`, '\u{1141F}': `
𑐟
t consonant with inherent vowel a.
๐๐ต๐ฎ๐๐ฎ๐ต
๐๐๐ฐ๐๐ณ๐ต
๐๐ณ๐๐๐ถ
Shaping
This is one of 4 letters that uses a rounded shape for 11438, eg. compare ๐๐ธ and ๐๐ธ.
๐๐ธ๐ซ๐ธ
`, '\u{11420}': `𑐠
tʰ consonant with inherent vowel a.
๐ ๐ต๐ซ๐
๐ ๐๐ฐ
๐๐ฌ๐๐ ๐ฅ๐น๐ฌ๐๐
`, '\u{11421}': `𑐡
d consonant with inherent vowel a.
๐ก๐ฃ๐ต๐ณ๐ธ
๐ฃ๐ธ๐๐ ๐ก๐๐ซ๐
๐ณ๐น๐ฌ๐๐ก๐๐ซ
`, '\u{11422}': `𑐢
dʱ consonant with inherent vowel a.
๐ข๐ฌ๐๐ฉ
๐ข๐๐ฐ๐
๐๐ฃ๐๐ข๐๐ฌ๐ฅ๐๐ฌ๐ก๐พ๐ฑ
`, '\u{11423}': `𑐣
n consonant with inherent vowel a.
๐ฃ๐๐ฌ
๐ฃ๐๐ฐ๐๐น
`, '\u{11424}': `
𑐤
nʰ consonant with inherent vowel a.
๐ค๐พ๐ฅ๐ธ
๐ค๐๐ซ๐ต๐๐
`, '\u{11425}': `
𑐥
p consonant with inherent vowel a.
๐ฅ๐ฎ๐พ๐ณ๐๐ฐ๐ต๐
๐ฅ๐๐ฐ๐ฎ๐ต๐
๐๐ฉ๐๐ฅ๐ถ
`, '\u{11426}': `𑐦
pʰ consonant with inherent vowel a.
๐ฆ๐ซ๐
๐๐ฆ๐๐ฐ๐
`, '\u{11427}': `
𑐧
b consonant with inherent vowel a.
๐ง๐ฉ๐น
๐ง๐๐ฐ๐ด
๐๐ฉ๐๐ง๐
`, '\u{11428}': `𑐨
bʱ consonant with inherent vowel a.
๐จ๐ต๐ฌ๐
๐จ๐๐ฐ๐ซ๐
Shaping
This is one of 4 letters that uses a rounded shape for 11438, eg. compare ๐๐ธ and 𑐨𑐸.
𑐨𑐸𑐫𑐸
The shape of the consonant itself also changes when used with 11438 or 11439, eg. compare:
𑐨
𑐨𑐸
𑐨𑐹
`, '\u{11429}': `𑐩
m consonant with inherent vowel a.
๐ฉ๐๐ต
๐๐ฉ๐๐ง๐
๐ข๐ฌ๐๐ฉ
`, '\u{1142A}': `𑐪
mʰ consonant with inherent vowel a.
๐ช๐๐ ๐ณ
๐ช๐๐ซ๐ต๐ซ๐
`, '\u{1142B}': `
𑐫
j consonant with inherent vowel a.
๐ซ๐ต๐ณ๐ธ
๐๐ต๐ซ
๐๐ฎ๐๐ซ๐ต๐
Combinations
𑐴𑑂𑐫
jʰ is 𑐴𑑂𑐫
๐ด๐๐ซ๐ต๐๐๐ธ
๐ซ๐
ษห is ๐ซ๐ as a dependent vowel.
๐๐ซ๐โ๐ฉ๐ถ
๐๐ซ๐
ษห is ๐๐ซ๐ as a standalone vowel.
๐๐ซ๐โ๐ฎ๐ต๐
๐๐ซ๐
ษฬห is ๐๐ซ๐
๐๐๐ซ๐
๐ต๐ซ๐
æː is ๐ต๐ซ๐ as a dependent vowel.
๐๐ฅ๐ต๐ซ๐
๐๐ซ๐
æː is ๐๐ซ๐ as a standalone vowel.
๐ต๐๐ซ๐
æ̃ː is ๐ต๐๐ซ๐
๐๐ณ๐ต๐๐ซ๐
`, '\u{1142C}': `𑐬
r consonant with inherent vowel a.
๐๐ฌ๐ฐ๐ต๐ฌ
๐ข๐ฌ๐๐ฉ
๐๐๐๐๐ฌ๐พ๐๐ท
Shaping
Note the special glyph shaping for the consonant clusters above.
Also, this letter uses a special shape for 11438 and 11439, eg. compare the following:
๐๐ธ
𑐬𑐸
๐๐น
𑐬𑐹
`, '\u{1142D}': `𑐭
rʰ consonant with inherent vowel a.
`, '\u{1142E}': `
𑐮
l consonant with inherent vowel a.
๐ฎ๐ต๐๐ต๐
๐๐ฎ๐๐๐ถ
๐๐ต๐ฎ๐๐ฎ๐ต
`, '\u{1142F}': `𑐯
lʰ consonant with inherent vowel a.
๐ฏ๐ต๐
๐ฏ๐๐ฐ๐๐ฆ๐ต๐
`, '\u{11430}': `
𑐰
w consonant with inherent vowel a.
๐ฐ๐ต๐๐๐ธ
๐ฃ๐๐ฐ๐๐น
๐ณ๐๐๐ฐ๐ต๐ ๐ฎ๐๐ฐ๐ด๐
Combinations
𑐴𑑂𑐰
wʰ is 𑐴𑑂𑐰
๐ด๐๐ฐ๐๐๐ซ๐ต
`, '\u{11431}': `𑐱
Infrequent. Generally used for loan words.
s consonant with inherent vowel a.
๐ฑ๐ฃ๐ถ๐ง๐ต๐
๐๐ฃ๐๐ข๐๐ฌ๐ฅ๐๐ฌ๐ก๐พ๐ฑ
Shaping
This is one of 4 letters that uses a rounded shape for 11438, eg. compare ๐๐ธ and ๐ฑ๐ธ. `, '\u{11432}': `
𑐲
Infrequent. Generally used for loan words.
ส consonant with inherent vowel a.
Combinations
๐๐๐ฒ
tอกษสฐ is ๐๐๐ฒ.
๐๐๐ฒ๐ถ๐๐ถ๐
`, '\u{11433}': `𑐳
s consonant with inherent vowel a.
๐ณ๐ฃ๐๐๐๐ฌ๐ต๐ณ๐ถ
๐๐ณ๐๐๐ถ
๐๐ถ๐๐ถ๐๐๐ณ๐
`, '\u{11434}': `𑐴
h consonant with inherent vowel a.
๐ด๐ฎ๐ถ๐ฉ
๐ฅ๐ต๐ด๐ต๐
Combinations
Aspirated/murmured consonant sounds typically have dedicated, atomic characters. However, two don't.
𑐴𑑂𑐰
wʰ is 𑐴𑑂𑐰
๐ด๐๐ฐ๐๐๐ซ๐ต
𑐴𑑂𑐫
jʰ is 𑐴𑑂𑐫
๐ด๐๐ซ๐ต๐๐๐ธ
Shaping
The shape of the consonant changes when used with 11438 or 11439. For example, compare:
𑐴
𑐴𑐸
𑐴𑐹
`, '\u{11435}': `𑐵
a~æ vowel sign.
๐๐ต๐ซ
๐๐ณ๐ต
๐ง๐ต๐๐ต๐
Combinations
๐ต๐ซ๐
æː is ๐ต๐ซ๐
๐๐ฅ๐ต๐ซ๐
๐๐ต๐ซ๐โ๐๐ ๐ณ๐ถ๐ฉ๐ต
๐ต๐๐ซ๐
æ̃ː is ๐ต๐๐ซ๐
๐๐ณ๐ต๐๐ซ๐
๐ณ๐ธ๐ฅ๐ต๐๐ซ๐
๐ต๐
aː is ๐ต๐
๐ฎ๐ธ๐ณ๐ต๐
๐๐ฐ๐ต๐ ๐๐๐ซ๐ต
๐ต๐
ã is ๐ต๐
๐ฉ๐ต๐
๐๐๐ต๐๐ฉ๐ฌ๐ท
๐ต๐
ãː is ๐ต๐
๐ ๐ต๐
๐๐ท๐ณ๐๐ฐ๐ต๐
`, '\u{11436}': `𑐶
i vowel sign.
๐๐ถ๐ณ๐ถ
๐ก๐ถ๐ฎ๐๐ฎ๐ท
Combinations
๐ถ๐
ĩ is ๐ถ๐
๐๐ถ๐๐ข๐ธ๐
๐ถ๐
ĩː is ๐ถ๐
๐ฃ๐๐ถ๐
`, '\u{11437}': `𑐷
iː long vowel-sign.
๐ฎ๐๐ต๐ฃ๐ท
๐๐ท๐ณ๐๐ฐ๐ต๐
`, '\u{11438}': `𑐸
u vowel sign.
๐๐ธ๐ซ๐ธ
๐ก๐ธ๐๐ธ
Combinations
๐ธ๐
ũ is ๐ธ๐
๐ง๐ธ๐๐๐ธ๐
๐ธ๐
ũː is ๐ธ๐
๐๐ธ๐
Shaping
The shape is rounded when used with 11410, 1141F, 11428, and 11431:
𑐐𑐸
𑐟𑐸
𑐨𑐸
𑐱𑐸
It emerges from the side of the consonant in the combination:
𑐬𑐸
And it sometimes ligates with the consonant base, eg.
๐๐ธ
With 11428 and 11434 it alters the shape of the consonant letter itself, eg. compare the following:
𑐨
𑐨𑐸
𑐴
𑐴𑐸
`, '\u{11439}': `𑐹
uː long vowel-sign.
๐๐ฉ๐น
๐จ๐ธ๐ฎ๐น๐๐ต
Shaping
It emerges from the side of the consonant in the following combination.
𑐬𑐹
It sometimes ligates with a character, eg.
๐๐น
With 11428 and 11434 it alters the shape of the consonant letter itself, eg. compare the following:
𑐨
𑐨𑐹
𑐴
𑐴𑐹
`, '\u{1143E}': `𑐾
e vowel sign.
๐๐ธ๐ซ๐พ
๐๐ฃ๐พ๐๐๐ซ๐ต
Combinations
๐พ๐
eː is ๐พ๐
๐พ๐
ẽ is ๐พ๐
๐๐พ๐
๐พ๐
ẽː is ๐พ๐
๐๐พ๐๐๐น
Shaping
When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare:
๐
๐๐พ
When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare
๐๐พ
๐๐พ
`, '\u{1143A}': `𑐺
Used for non-native sounds in loan words.
ɾi vowel sign.
๐ณ๐๐ณ๐๐๐บ๐
`, '\u{1143F}': `
𑐿
ษi diphthong vowel sign.
๐ฉ๐๐ฟ๐๐๐ซ
Combinations
๐ฟ๐
ษฤฉ is ๐ฟ๐
Shaping
When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare
๐
๐๐ฟ
When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare
๐๐ฟ
๐๐ฟ
`, '\u{11440}': `𑑀
o vowel sign.
๐๐ด๐ต๐ซ๐
๐จ๐น๐๐๐ฎ
Combinations
๐๐
oː is ๐๐
๐๐
õ is ๐๐
๐๐
õː is ๐๐
Shaping
When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare
๐
๐๐
When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare
๐๐
๐๐
`, '\u{11441}': `𑑁
ษu diphthong vowel sign.
๐ง๐ธ๐๐๐ฅ๐
๐จ๐
Combinations
๐๐
ษลฉ is ๐๐
Shaping
When combined with a base character that has a headstroke, this replaces that headstroke, eg. compare
๐
๐๐
When combined with a 'headless' consonant letter this becomes a circumgraph, eg. compare
๐๐
๐๐
`, '\u{11442}': `𑑂
Vowel-killer.
Combinations
๐๐๐ฒ
tอกส is ๐๐๐ฒ
๐๐๐
ษกj is ๐๐๐
𑐴𑑂𑐰
wʰ is 𑐴𑑂𑐰
𑐴𑑂𑐫
jʰ is 𑐴𑑂𑐫
`, '\u{11443}': `
𑑃
◌̃ indicates nasalisation of short vowels.m,5-6
๐๐ธ๐๐๐
ษฬ when used with the inherent vowel.
๐๐๐๐๐ฐ๐ต
Combinations
𑐶𑑃
ĩ is 𑐶𑑃
𑐸𑑃
ũ is 𑐸𑑃
𑐾𑑃
ẽ is 𑐾𑑃
๐๐
õ is ๐๐
𑐵𑑃
æ̃ is 𑐵𑑃
๐๐
ĩ is ๐๐
๐๐
ũ is ๐๐
๐๐
ẽ is ๐๐
๐๐
õ is ๐๐
๐๐
ã is ๐๐
๐๐
æ̃ is ๐๐
`, '\u{11444}': `
𑑄
◌̃ long vowel nasalisation.m,5-6
๐๐ธ๐ฎ๐ต๐๐๐๐๐
It is placed on the right edge of the character with which it combines, whether that is a consonant or a spacing vowel-sign.p,11
ษฬห when used with the inherent vowel.
๐๐ฎ๐๐
Combinations
𑐶𑑄
ĩː is 𑐶𑑄
𑐸𑑄
ũː is 𑐸𑑄
𑐾𑑄
ẽː is 𑐾𑑄
๐๐
õː is ๐๐
𑐵𑑄
æ̃ː is 𑐵𑑄
๐ฟ๐
ษฤฉ is ๐ฟ๐
๐๐
ษลฉ is ๐๐
๐๐
ĩː is ๐๐
๐๐
ũː is ๐๐
๐๐
ẽː is ๐๐
๐๐
õː is ๐๐
๐๐
ãː is ๐๐
๐๐
æ̃ː is ๐๐
`, '\u{11445}': `
𑑅
Used to lengthen vowels (with the exception of i and u),m,5-6.
๐ฉ๐ต๐๐ต๐
ษห when used with the inherent vowel.
๐๐๐
Also used to represent post-vocalic aspiration (h).p,11
Combinations
𑐾𑑅
eː is 𑐾𑑅
๐๐
eː is ๐๐
๐๐
oː is ๐๐
๐๐
oː is ๐๐
๐๐
ษห is ๐๐
𑐵𑑅
aː is 𑐵𑑅
๐๐
aː is ๐๐
`, '\u{11446}': `
𑑆
Used to transcribe sounds for which there are no existing characters, such as those in loan words.p,11
`, '\u{11447}': `𑑇
Used to elide an initial A in Sanskrit as a result of sandhi.p,11
`, '\u{11448}': `𑑈
Represents nasalisation in some manuscripts. In other sources, a form of punctuation.p,11
`, '\u{11449}': `𑑉
Represents the sacred syllable om.
`, '\u{1144A}': `𑑊
Represents the Sanskrit invocation सिद्धिरस्तु siddhirastu 'may there be success'. It is written at the beginning of a text, often in the combination ๐๐. It corresponds to the sign ঀ [U+0980 BENGALI ANJI] in related scripts such as Bengali.p,11
`, '\u{1144B}': `𑑋
Sentence delimiter. It has a number of variant shapes.p,11
`, '\u{1144C}': `𑑌
Indicates the end of a larger block of text than a sentence.p,11
`, '\u{1144D}': `𑑍
Phrase separator. [Usage not clear.]p,11
`, '\u{1144E}': `𑑎
Used for marking breaks and filling gaps in a line at a margin. It has a number of variant shapes. [Usage not clear.]p,11
`, '\u{1144F}': `𑑏
Indicates an abbreviation. [Usage not clear.]p,11
`, '\u{1145A}': `𑑚
Marks the end of a sentence. [Usage not clear.]p,11
`, '\u{1145B}': `𑑛
Used for filling gaps in a line and as a mark for end of text.p,11
`, '\u{11450}': `𑑐
0 digit.
`, '\u{11451}': `𑑑
1 digit.
`, '\u{11452}': `𑑒
2 digit.
`, '\u{11453}': `𑑓
3 digit.
`, '\u{11454}': `𑑔
4 digit.
`, '\u{11455}': `𑑕
5 digit.
`, '\u{11456}': `𑑖
6 digit.
`, '\u{11457}': `𑑗
7 digit.
`, '\u{11458}': `𑑘
8 digit.
`, '\u{11459}': `𑑙
9 digit.
`,

// COMMON PUNCTUATION

// "..
'\u{201C}': ` `,
// .."
'\u{201D}': ` `,
// '..
'\u{2018}': ` `,
// ..'
'\u{2019}': ` `,
// «
'\u{00AB}': ` `,
// »
'\u{00BB}': ` `,
// ;
'\u{003B}': ` `,
// :
'\u{003A}': ` `,
// .
'\u{002E}': ` `,
// ?
'\u{003F}': ` `,
// !
'\u{0021}': ` `,
// (
'\u{0028}': ` `,
// )
'\u{0029}': ` `,
// …
'\u{2026}': ` `,
// en dash
'\u{2013}': ` `,
// em dash
'\u{2014}': ` `,
// §
'\u{00A7}': ` `,

'\u{2020}': `Called dagger, but also known as obelisk, obelus, or long cross.b321
A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68
Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321
In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321
`, '\u{2021}': `Called double dagger, but also known as diesis, or double obelisk.b321
A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68
`, '\u{2032}': `Abbreviation for feet (1′ = 12″).b330
Also used for minutes of arc (eg. 60′ = 1°).b330
`, '\u{2033}': `Abbreviation for inches (1′ = 12″).b321
Also used for seconds of arc (eg. 3600″ = 1°).b321
`,

// FORMATTING CHARACTERS

// zwsp
'\u{200B}': `An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.
Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษร\u200Bไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).
`,
// zwj
'\u{200D}': `Creates glyph joining behaviour in the absence of normal joining contexts.
`,
// zwnj
'\u{200C}': `Prevents glyph joining behaviour.
`,
// word-break
'\u{2060}': `An invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks, eg. it can be used around the + sign in base+delta to prevent a line break occurring in that sequence of characters. It has no effect on word segmentation.
It can also be used to bracket other characters to turn them into non-breaking characters, such as U+2009 THIN SPACE or ― [U+2015 HORIZONTAL BAR].
Not to be confused with U+200D ZERO WIDTH JOINER or U+034F COMBINING GRAPHEME JOINER, since it has no effect on shaping.
This functionality is also provided by U+FEFF ZERO WIDTH NO-BREAK SPACE, but since that character also represents the byte-order mark, the use of this word joiner character (added in Unicode 3.2) is strongly preferred over the latter.
`,
// rli
'\u{2067}': `Sets the base direction for the following text to RTL, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).
`,
// lri
'\u{2066}': `Sets the base direction for the following text to LTR, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).
`,
// fsi
'\u{2068}': `Sets the base direction for the following text to the direction of the first strong directional character, per Unicode Bidirectional Algorithm rules, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).
`,
// pdi
'\u{2069}': `Ends the range of text that started with RLI, LRI, or FSI.
`,
// rle
'\u{202B}': `Sets the base direction for the following text to RTL, with no isolation. The Unicode Standard recommends using RLI instead.
`,
// lre
'\u{202A}': `Sets the base direction for the following text to LTR, with no isolation. The Unicode Standard recommends using LRI instead.
`,
// pdf
'\u{202C}': `Ends the range of text that started with RLE or LRE.
`,
// rlm
'\u{200F}': `An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.
`,
// lrm
'\u{200E}': `An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.
`,
// cgj
'\u{034F}': `Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.
`,
// alm
'\u{061C}': `Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.
`,
}
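
// ILLUSTRATIVE SKETCH (not part of the original data)
// The notes above for U+2060 and for the isolate controls (U+2066 to U+2069)
// describe behaviour that is easy to get wrong by hand, so the two helpers
// below show one possible way to apply it. Both function names and the
// 'rtl' / 'ltr' / 'auto' keywords are hypothetical, chosen for this example
// only; nothing here is known to be used by the pages that load charDetails.

const WORD_JOINER = '\u{2060}'
const PDI = '\u{2069}'
const ISOLATE_OPENERS = new Map([
	['rtl', '\u{2067}'],  // RLI
	['ltr', '\u{2066}'],  // LRI
	['auto', '\u{2068}'], // FSI: direction taken from the first strong character
])

// Puts a word joiner on both sides of every occurrence of ch (assumed to be
// a non-empty string), so that no line break can occur next to it,
// eg. around the + sign in base+delta.
function guardWithWordJoiner (text, ch) {
	return text.split(ch).join(WORD_JOINER + ch + WORD_JOINER)
}

// Wraps text in an isolate: RLI, LRI or FSI at the start and PDI at the end.
function isolate (text, dir) {
	const opener = ISOLATE_OPENERS.get(dir)
	if (opener === undefined) throw new RangeError('unknown direction: ' + dir)
	return opener + text + PDI
}

// Possible usage, under the same assumptions:
//   guardWithWordJoiner('base+delta', '+')   wraps the + in U+2060 on each side
//   isolate(someArabicText, 'rtl')           yields RLI + text + PDI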