/* */ var charDetails = { // MAIN BLOCK '\u{0021}': `

Exclamation mark.

`, '\u{0028}': `

(

Parenthesis.

`, '\u{0029}': `

)

Parenthesis.

`, '\u{002E}': `

Full stop.

`, '\u{003A}': `

Colon.

`, '\u{060C}': `

Comma.

`, '\u{061B}': `

Semicolon.

`, '\u{061F}': `

Question mark.

`, '\u{0622}': `

ɑ~ɑː standalone vowel.

آره

آيات

آيرلنډ

`, '\u{0626}': `

əi diphthong, used word-finally only, and typically used for grammatical endings, such as the 2nd person plural of verbs.

وئ

اخلئ پخلئ

Compare with ۍ.

`, '\u{0627}': `

a or i vowel at the beginning of a word. The latter can be distinguished using the diacritic 0650 when needed.wpsa

ابادول

اسلام

See also آ.

∅ when acting as a word-initial vowel carrier.

ايغلو

اوولس

اوبدل

اېموجي

ɑ vowel when word-medial or final.wpsa

ښامار

سنډا

Always creates a ligature after ل.

لامبو

`, '\u{0628}': `

b consonant.

برابر

رېبز

طالب

`, '\u{062A}': `

t consonant.

توت

ستر

نشت

`, '\u{062B}': `

Used in loan words.

s consonant.

ثانيه

مثل

بعث

`, '\u{062C}': `

d͡ʒ consonant.

جکړ

نجلۍ

بنج

`, '\u{062D}': `

Used in loan words.

h consonant.

رحم

بحر

مدح

This is generally silent when word-final.

اصلاح

`, '\u{062E}': `

x consonant.

ختل

سخی

اړخ

`, '\u{062F}': `

d̪ consonant.

دروند

باندې

تود

`, '\u{0630}': `

Used in loan words.

z consonant.

ذکر

مذهب

`, '\u{0631}': `

r consonant.

رسنۍ

بلغاريه

ټوکر

`, '\u{0632}': `

z consonant.

زمری

ارزان

لمريز

`, '\u{0633}': `

s consonant.

سنډا

لمسول

اتلس

`, '\u{0634}': `

ʃ consonant.

شپېته

مېشت

دېرش

`, '\u{0635}': `

Used in loan words.

s consonant.

صفحه

مصر

`, '\u{0636}': `

Used in loan words.

z consonant.

حضرت

حوض

`, '\u{0637}': `

Used in loan words.

t consonant.

طالب

قطر

غلط

`, '\u{0638}': `

Used in loan words.

z consonant.

تلفظ

محافظ

`, '\u{0639}': `

Used in loan words.

ɑ vowel.

انعام

دعوه

شروع

`, '\u{063A}': `

ɣ consonant.

غنم

ايغلو

مرغ

`, '\u{0641}': `

Used in loan words.

f consonant.

فرانس

افغان

مفهوم

هدف

Often replaced by پ p.wpsa

`, '\u{0642}': `

Used in loan words.

q consonant.

فقط

موسيقي

Often replaced by ک k.wpsa

`, '\u{0643}': `

Best avoided.

k consonant.

`, '\u{0644}': `

l consonant.

لس

ايسلنډ

اوبدل

الوتل

This always ligates with a following ا.

بلا

`, '\u{0645}': `

m consonant.

مېلمه

اپيم

جام

`, '\u{0646}': `

n consonant.

نمونځ

نيوکه

کنوه

بړستن

`, '\u{0647}': `

Used for word-final ə in Peshawar. Afghan uses this for word-final ə and a.

h consonant.

هلک

ګهيځ

a vowel when word-final.

مننه

منډه

`, '\u{0648}': `

w consonant as a syllable onset, or as part of a diphthong.

لېوۀ

غويی

اورول

u vowel after a consonant or word-initial aleph.

اومامي

ايغلو

o vowel also after a consonant or word-initial aleph.

اوبدل

لامبو

No orthographic distinction is made between u and o.

`, '\u{064A}': `

j consonant as a syllable onset.

يونی

اتيا

i vowel after a consonant or word-initial aleph.

ايغلو

موسيقي

-i̯ or -j as part of a diphthong.

بايلل

`, '\u{064B}': `

Used in loan words. Found in Arabic words.

a vowel diacritic.

`, '\u{064E}': `

a vowel diacritic.

`, '\u{064F}': `

u vowel diacritic.

`, '\u{0650}': `

i vowel diacritic.

`, '\u{0652}': `

Rare.

Vowel absence indicator.

`, '\u{0653}': `

Rare. Orphan. Only found in decomposed text with ALEF.

Combining madd.

`, '\u{0654}': `

Rare. Orphan. Only found in decomposed text with YEH.

Combining hamza.

زهٔ

پسهٔ

`, '\u{0659}': `

ə vowel diacritic.

`, '\u{066A}': `

Percent sign.

`, '\u{066C}': `

Thousands separator.

۱۹٬۴۷۹

The rendered shape depends on the font.

`, '\u{067C}': `

ʈ consonant.

ټماټر

سټک

برغټ

`, '\u{067E}': `

p consonant.

پښتنه

غپل

`, '\u{0681}': `

d͡z consonant.

ځنګل

لمانځل

ږمنځ

`, '\u{0685}': `

t͡s consonant.

څنګ

هڅکه

`, '\u{0686}': `

t͡ʃ consonant.

چيچل

مېچن

بچ

`, '\u{0689}': `

ɖ consonant.

ډنډ

شډل

`, '\u{0693}': `

ɽ consonant.

ړوند

تړل

سوړ

`, '\u{0696}': `

The pronunciation of this letter varies according to regional dialect. It is pronounced in one of the following ways.

ʐ consonant in Kandahar.

ږيره

سږى

غوږ

ɡ consonant in Kabul, giving ˈɡira, sə.ˈɡaɪ, ɣwaɡ.

ʒ consonant in the southeast, giving ˈʒira, sə.ˈʒaɪ, ɣwaʒ.

`, '\u{0698}': `

ʒ consonant.

ژېړ

پېژندل

`, '\u{069A}': `

The pronunciation of this letter varies according to regional dialect. It is pronounced in one of the following ways.

ʂ consonant in Kandahar.

ښامار

مېښه

سرېښ

x consonant in Kabul, giving xɑ.ˈmɑr, me.ˈxa, srex.

ʃ consonant in the southeast, giving ʃɑ.ˈmɑr, me.ˈʃa, sreʃ.

`, '\u{06A9}': `

k consonant.

کلک

بکاڼه

زبرځواک

`, '\u{06AB}': `

Pakistan preferred form for ɡ. The preferred form of Afghan officials is گ.

ɡ consonant.

ګبين

مګر

دېګ

`, '\u{06AF}': `

Afghan official's preferred form for ɡ. The Pakistan preferred form is ګ.

ɡ consonant.

مگر

آهنگ

`, '\u{06BA}': `

◌̃ vowel nasalisation marker, used only in certain dialects such as Banisi/Banuchi and Waṇetsi.wpsa

`, '\u{06BC}': `

ɳ consonant.

ګڼل

بڼ

`, '\u{06C0}': `

Word-final only, and only in Peshawar.

ə vowel.

مرغۀ

مېړۀ

`, '\u{06CC}': `

ai̯ word-final diphthong.

سخی

زمری

غويی

-i̯ when following a vowel.

ځای

دوی

لوی

This letter has no dots below in isolated and final shapes, but does have them in initial and medial positions.

In Pakistan this letter may be replaced by ے.

`, '\u{06CD}': `

əi diphthong, only used word-finally, typically used as a grammatical ending, such as to indicate a feminine noun.

نړۍ

هګۍ

Compare with ئ.

`, '\u{06D0}': `

e vowel.

اېموجي

تېروتل

تاسې

`, '\u{06D2}': `

This letter is used in Pakistan in place of ی.

ai diphthong.

j semivowel when following a vowel.

`, '\u{06D4}': `

Arabic full stop.

`, '\u{06D5}': `

Rare. Orphan. Found only where text containing ۀ has been decomposed. Should not be used otherwise.

06D5 0654 is converted back to ۀ during NFC normalisation.

`, '\u{06F0}': `

0 digit.

`, '\u{06F1}': `

1 digit.

`, '\u{06F2}': `

2 digit.

`, '\u{06F3}': `

3 digit.

`, '\u{06F4}': `

4 digit.

`, '\u{06F5}': `

5 digit.

`, '\u{06F6}': `

6 digit.

`, '\u{06F7}': `

7 digit.

`, '\u{06F8}': `

8 digit.

`, '\u{06F9}': `

9 digit.

`, '\u{201C}': `

“

Quotation mark.

`, '\u{201D}': `

”

Quotation mark.

`, '\u{204F}': `

⁏

Semicolon.

`, '\u{2E41}': `

⹁

Comma.

`, // COMMON PUNCTUATION // § '\u{00A7}': `

`, // « '\u{00AB}': `

`, // » '\u{00BB}': `

`, // danda '\u{0964}': `

।

`, // double danda '\u{0965}': `

॥

`, // – '\u{2010}': `

‐

`, // – '\u{2013}': `

–

`, // — '\u{2014}': `

—

`, // '.. '\u{2018}': `

‘

`, // ..' '\u{2019}': `

’

`, // ".. '\u{201C}': `

“

`, // .." '\u{201D}': `

”

`, // ! '\u{0021}': `

`, // … '\u{2026}': `

…

`, // ( '\u{0028}': `

(

`, // ) '\u{0029}': `

)

`, // , '\u{002C}': `

`, // . '\u{002E}': `

`, // : '\u{003A}': `

`, // ; '\u{003B}': `

;

`, // ? '\u{003F}': `

`, // cgj '\u{034F}': `

Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.

`, // alm '\u{061C}': `

Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.

`, // FORMATTING CHARACTERS // zwsp '\u{200B}': `

An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.

Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษรไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).

`, // zwnj '\u{200C}': `

‌

Prevents glyph joining behaviour.

`, // zwj '\u{200D}': `

‍

Creates glyph joining behaviour in the absence of normal joining contexts.

`, // rlm '\u{200F}': `

‏

An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // lrm '\u{200E}': `

‎

An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // ‘ '\u{2018}': `

‘

`, // ’ '\u{2019}': `

’

`, // “ '\u{201C}': `

“

`, // ” '\u{201D}': `

”

`, '\u{2020}': `

†

Called dagger, but also known as obelisk, obelus, or long cross.b321

A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321

In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321

`, '\u{2021}': `

‡

Called dagger, but also known as diesis, or double obelisk.b321

A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

`, // … '\u{2026}': `

…

`, // rle '\u{202B}': `

‫

Sets the base direction for the following text to RTL, with no isolation. The Unicode Standard recommends use of RLI, instead.

`, // lre '\u{202A}': `

‪

Sets the base direction for the following text to LTR, with no isolation. The Unicode Standard recommends use of LRI, instead.

`, // pdf '\u{202C}': `

‬

Ends the range of text that started with RLE, or LRE.

`, '\u{2032}': `

′

Abbreviation for feet (1′ = 12″).b330

Also used for minutes of arc (eg. 60′=1°).b330

`, '\u{2033}': `

″

Abbreviation for inches (1′ = 12″).b321

Also used for seconds of arc (eg. 360″=1°).b321

`, // word-break '\u{2060}': `

An invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks, eg. it can be used around the + sign in base⁠+delta⁠ to prevent a line break occuring in that sequence of characters. It has no effect on word segmentation.

It can also be used to bracket other characters to turn them into non-breaking characters, such as U+2009 THIN SPACE or ― [U+2015 HORIZONTAL BAR].

Not to be confused with U+200D ZERO WIDTH JOINER or U+034F COMBINING GRAPHEME JOINER, since it has no effect on shaping.

This functionality is also provided by U+FEFF ZERO WIDTH NO-BREAK SPACE, but since that character also represents the byte-order mark, the use of this word joiner character (added in Unicode 3.2) is strongly preferred over the latter.

`, // rli '\u{2067}': `

⁧

Sets the base direction for the following text to RTL, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // lri '\u{2066}': `

⁦

Sets the base direction for the following text to LTR, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, //fsi '\u{2068}': `

⁨

Sets the base direction for the following text to the direction of the first strong directional character, per Unicode Bidirectional Algorithm rules, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // pdi '\u{2069}': `

⁩

Ends the range of text that started with RLI, LRI, or FSI.

`, } //