/*
*/ var charDetails = { // MAIN BLOCK // Use _tools/generate_details_page_stubs.html to generate stubs to go here '\u{0622}': `آ
ʔaː initial long vowel. Unlike other word-initial vowels, which use a combining mark with alef, this sound is represented by a single character (although it does decompose in normalisation form D).
آکِمُ
آجࣺ
`, '\u{0627}': `ا
aː long vowel (not word-initial).
بَاتْ
Word-initially, this sound is written using 0622.
ʔ as a word-initial vowel carrier, although phonemic transcriptions may not show anything.
اَکَرَ
اِتَمْ
اُودࣹ
Combinations
Used to indicate the following long vowels.
aː is 064E 0627.
جَايْ
`, '\u{0628}': `ب
b consonant.
بَاتْ
نࣴبْ
نࣴبّ
Prenasalised consonant
ᵐb is 0645 0628.
مبَارْ
`, '\u{062A}': `ت
t consonant.
تَارْ
بࣴتْ
بࣴتّ
`, '\u{062C}': `ج
ɟ consonant.
جَايْ
دࣴجْ
لَجّ
Prenasalised consonant
ᶮɟ is 0646 062C.
نجࣷولْ
`, '\u{062E}': `خ
x consonant.
خَمْ
رَخَسْ
بَاخْ
`, '\u{062F}': `د
d consonant.
دَانُ
كࣵادُّ
Prenasalised consonant
ⁿd is 0646 062F.
ندَوْ
`, '\u{0631}': `ر
r consonant.
رَخَسْ
فَارْ
`, '\u{0633}': `س
s consonant.
سࣹنࣹگَالْ
رَخَسْ
`, '\u{0639}': `ع
∅ when used as a carrier for non-initial standalone vowels.
اِسْرَعࣹلْ
Combination
ëe is 08F4 0639 08F4.
دࣴعࣴلْ
`, '\u{0641}': `ف
f consonant.
فَارْ
وࣺلࣺفْ
`, '\u{0642}': `ق
q consonant.
نَقَرْ
سࣷقْ
Prenasalised consonant
ᵑq is 0646 0756.
جࣵنقّ
This never appears in word-initial position.
`, '\u{0643}': `ك
`, '\u{0644}': `ل
l consonant.
لࣺينِ
گَالْ
گَلّ
`, '\u{0645}': `م
m consonant.
مَامْ
ݝَامَمْ
Prenasalised consonants
ᵐp is 0645 0752 (never word-initial).
سࣵمݒّ
ᵐb is 0645 0628.
مبَارْ
`, '\u{0646}': `ن
n consonant.
نَانْ
لࣹينْ
فࣹنّ
Prenasalised consonants
ⁿd is 0646 062F.
ندَوْ
ᶮc is 0646 0756 (never word-initial).
سࣵنݖّ
ᶮɟ is 0646 062C.
نجࣷولْ
ᵑk is 0646 06A9.
دࣺنکّ
ᵑɡ is 0646 06AF.
نگَ
ᵑq is 0646 0756 (never word-initial).
جࣵنقّ
`, '\u{0647}': `ه
h consonant. Not common.
هَارْ
سࣴيهْ
جَاهگࣺرࣺ
`, '\u{0648}': `و
w consonant.
وَاوْ
جَوّ
Combinations
Used to indicate the following long vowels.
ɔː is 08F7 0648.
وࣷورْ
oː is 08F8 0648.
وࣸورْ
uː is 064F 0648.
بُورْ
`, '\u{064A}': `ي
y consonant.
وَاوْ
جَوّ
Combinations
Used to indicate the following long vowels.
ɛː is 08F9 064A.
لࣹينْ
eː is 08FA 064A.
لࣺينِ
iː is 08F7 0650.
نجِيتْ
`, '\u{064E}': `َ
a short vowel.
رَخَسْ
خَمْ
In word-initial position this is 0627 064E.
اَکَرَ
Before a geminated or prenasalised consonant the diacritic 08F5 is used, instead.
aː is 064E 0627.
جَايْ
وَاوْ
`, '\u{064F}': `ُ
u consonant.
دُگُبْ
دَانُ
In word-initial position this is 0627 064F.
اُݒُّکَاي
uː is 064F 0648.
بُورْ
`, '\u{0650}': `ِ
i short vowel.
ݒِݖّ
لࣺينِ
In word-initial position this is 0627 0650.
اِتَمْ
iː is 0650 064A.
نجِيتْ
`, '\u{0651}': `ّ
Indicates a lengthened consonant sound. This is phonemically significant, but only occurs at the end of a word, or before a suffix.
بࣴتْ
بࣴتّ
A consonant carrying this mark will not carry 0652 as well.ww,#Consonant_Diacritics
`, '\u{0652}': `ْ
Indicates a consonant that is not followed by a vowel sound.
گࣴمْ
اِتَمْ
Unlike Arabic, this is always used, unless either (a) the consonant carries 0651, or (b) the consonant is a nasal indicating prenasalisation, or (c) the consonant indicates a long vowel sound.ww,#Consonant_Diacritics
جࣵوّ
سࣵنتْ
نجِيتْ
`, '\u{065C}': `ٜ
Obsolete vowel diacritic, previously used for ɛ and e.
`, '\u{065D}': `ٝ
Obsolete vowel diacritic, previously used for ɔ.
`, '\u{067E}': `پ
`, '\u{0686}': `چ
`, '\u{068E}': `ڎ
`, '\u{06A9}': `ک
k consonant.
کࣹݒُّ
لࣵکّ
Prenasalised consonant
ᵑk is 0646 06A9.
دࣺنکّ
`, '\u{06AD}': `ڭ
`, '\u{06AF}': `گ
ɡ consonant.
گَالْ
مَگْ
دَگّ
Prenasalised consonant
ᵑɡ is 0646 06AF.
نگَ
`, '\u{06DB}': `ۛ
`, '\u{0751}': `ݑ
`, '\u{0752}': `ݒ
p consonant.
ݒِݖّ
کࣹݒُّ
Prenasalised consonant
ᵐp is 0645 0752.
سࣵمݒّ
This never appears in word-initial position.
`, '\u{0756}': `ݖ
c consonant.
ݖَابِ
ݒِݖّ
Prenasalised consonant
ᶮc is 0646 0756.
سࣵنݖّ
This never appears in word-initial position.
`, '\u{075D}': `ݝ
ŋ consonant.
ݝَامَمْ
ݝࣷنْ
ݝࣷنّ
`, '\u{0763}': `ݣ
`, '\u{0764}': `ݤ
`, '\u{0767}': `ݧ
ɲ consonant.
ݧَانْ
وࣷݧْ
وࣷݧّ
`, '\u{08F4}': `ࣴ
ə short vowel.
گࣴمْ
دࣴجْ
In word-initial position this is 0627 08F4.
əː is 08F4 0639 08F4.
سُفࣴلࣴعࣴر
`, '\u{08F5}': `ࣵ
a before a geminated or prenasalised consonant.
كࣵادُّ
سࣵنݖّ
In word-initial position this is 0627 08F5.
`, '\u{08F7}': `
ࣷ
ɔ short vowel.
وࣷلࣷفْ
آجࣷ
In word-initial position this is 0627 08F7.
اࣷوتࣷ
ɔː is 08F7 0648.
وࣷورْ
`, '\u{08F8}': `ࣸ
o consonant.
نࣸبْ
لࣸرِيࣺ
In word-initial position this is 0627 08F8.
oː is 08F8 0648.
وࣸورْ
`, '\u{08F9}': `ࣹ
ɛ short vowel.
سࣹنࣹگَالْ
اُودࣹ
تࣺيرࣹ
In word-initial position this is 0627 08F9.
ɛː is 08F9 064A.
لࣹينْ
`, '\u{08FA}': `ࣺ
e short vowel.
دࣺنکّ
لࣸرِيࣺ
In word-initial position this is 0627 08FA.
eː is 08FA 064A.
لࣺينِ
`, // COMMON PUNCTUATION // § '\u{00A7}': `§
`, // « '\u{00AB}': `«
`, // » '\u{00BB}': `»
`, // danda '\u{0964}': `।
`, // double danda '\u{0965}': `॥
`, // – '\u{2010}': `‐
`, // – '\u{2013}': `–
`, // — '\u{2014}': `—
`, // '.. '\u{2018}': `‘
`, // ..' '\u{2019}': `’
`, // ".. '\u{201C}': `“
`, // .." '\u{201D}': `”
`, // ! '\u{0021}': `!
`, // … '\u{2026}': `…
`, // ( '\u{0028}': `(
`, // ) '\u{0029}': `)
`, // , '\u{002C}': `,
`, // . '\u{002E}': `.
`, // : '\u{003A}': `:
`, // ; '\u{003B}': `;
`, // ? '\u{003F}': `?
`, // cgj '\u{034F}': `͏
Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.
`, // alm '\u{061C}': `
Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.
`, // FORMATTING CHARACTERS // zwsp '\u{200B}': `
An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.
Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษรไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).
`, // zwnj '\u{200C}': `
Prevents glyph joining behaviour.
`, // zwj '\u{200D}': `
Creates glyph joining behaviour in the absence of normal joining contexts.
`, // rlm '\u{200F}': `
An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.
`, // lrm '\u{200E}': `
An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.
`, // ‘ '\u{2018}': `‘
`, // ’ '\u{2019}': `’
`, // “ '\u{201C}': `“
`, // ” '\u{201D}': `”
`, '\u{2020}': `†
Called dagger, but also known as obelisk, obelus, or long cross.b321
A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68
Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321
In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321
`, '\u{2021}': `‡
Called dagger, but also known as diesis, or double obelisk.b321
A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68
`, // … '\u{2026}': `…
`, '\u{002D}': `-
Hyphen.
`, '\u{060C}': `،
Comma.
`, '\u{061B}': `؛
Semicolon.
`, '\u{061F}': `؟
Question mark.
`, // zwnj '\u{200C}': `
Zero-width non-joiner (ZWNJ).
An invisible character, that prevents two adjacent letters forming a visual connection with each other when rendered. Especially useful for educational illustrations, but also has real-world applications.
It is used to interrupt the shaping of joining glyphs in cursive scripts, and also used to manage the visual interactions of glyphs in other scripts, eg. to prevent the formation of conjuncts, position diacritics, etc.
More details:
`, // zwj '\u{200D}': `
Zero-width joiner (ZWJ).
An invisible character, that permits a letter to form a cursive connection without a visible neighbour. Especially useful for educational illustrations, but also has some real-world applications.
Also used with complex scripts to manage the visual representation of glyphs that normally interact, eg. to form conjuncts, position diacritics, etc.
More details:
`, // LRM '\u{200E}': `An invisible character with strong LTR directional properties that can be used to produce the correct ordering of text, especially where there is a risk of spillover effects while the Unicode Bidirectional Algorithm is at work.
Generally referred to as LRM.
`, // RLM '\u{200F}': `An invisible character with strong RTL directional properties that can be used to produce the correct ordering of text, especially where there is a risk of spillover effects while the Unicode Bidirectional Algorithm is at work.
Generally referred to as RLM.
`, // LRE '\u{202A}': `Sets the start point for a range of inline text when applying a base direction of left-to-right. The range is terminated by 202C (PDF).
Use 2066 (LRI) rather than this character.
`, // RLE '\u{202B}': `Sets the start point for a range of inline text when applying a base direction of right-to-left. The range is terminated by 202C (PDF).
Use 2067 (RLI) rather than this character.
`, // PDF '\u{202C}': `Sets the end point for a range of inline text when applying a base direction. The range is started with either 202A (LRE) or 202B (RLE).
Use 2069 (PDI) and its associated range starters rather than this character.
`, // LRI '\u{2066}': `Sets the start point for a range of inline text when applying a base direction of left-to-right, and isolates the text within that range from text outside it. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
This character should be used rather than 202A (LRE).
`, // RLI '\u{2067}': `Sets the start point for a range of inline text when applying a base direction of right-to-left, and isolates the text within that range from text outside it. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
This character should be used rather than 202B (RLE).
`, // FSI '\u{2068}': `Sets the start point for a range of inline text when applying a base direction, and isolates the text within that range from text outside it. The base direction set is determined by that of the first strong directional character in the range. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
`, // PDI '\u{2069}': `Sets the end point for a range of inline text when applying a base direction. The range is started with either 2066 (LRI), 2066 (RLI) or 2068 (FSI).
This character should be used rather than 202C (PDF).
`, // CGJ '\u{034F}': `͏
Combining grapheme joiner.
Used to produce special ordering of diacritics. The name is a misnomer, as it is generally used to break the normal sequence of diacritics.
More details:
`, } //