/*
*/ var charDetails = { '\u{0600}': `
`, '\u{0601}': `
`, '\u{0602}': `
`, '\u{0603}': `
`, '\u{0604}': `
`, '\u{0605}': `
`, '\u{0606}': `؆
`, '\u{0607}': `؇
`, '\u{0608}': `؈
`, '\u{0609}': `؉
`, '\u{060A}': `؊
`, '\u{060B}': `؋
`, '\u{060C}': `،
`, '\u{060D}': `؍
`, '\u{060E}': `؎
`, '\u{060F}': `؏
`, '\u{0610}': `ؐ
`, '\u{0611}': `ؑ
`, '\u{0612}': `ؒ
`, '\u{0613}': `ؓ
`, '\u{0614}': `ؔ
`, '\u{0615}': `ؕ
`, '\u{0616}': `ؖ
`, '\u{0617}': `ؗ
`, '\u{0618}': `ؘ
`, '\u{0619}': `ؙ
`, '\u{061A}': `ؚ
`, '\u{061B}': `؛
`, '\u{061C}': `
`, '\u{061E}': `؞
`, '\u{061F}': `؟
`, '\u{0620}': `ؠ
`, '\u{0621}': `ء
ʔ, though sometimes silent, eg.
علاءالدین
امضاء
See also 0654, which is much more common.
`, '\u{0622}': `آ
ɒː is written with the following forms, which include this character in initial position in precomposed text. The precomposed character decomposes and recomposes during normalisation.
initial | 0622آنجا |
---|---|
medial | 0627 |
final | 0627 |
final, unjoined | 0627 |
The alef+madd combination may also appear without a space before it in compound words and words that begin with prefixes, eg. درآمد درآوردن رنگآمیزی
It is also used in the word
قرآن
`, '\u{0623}': `أ
ʔ eg. رأی تأکید
`, '\u{0624}': `ؤ
ʔ eg. مؤمن مؤثر
Observation: Appears to be used between two vowels when the first is o, eg. سؤال
ؤ [U+0624 ARABIC LETTER WAW WITH HAMZA ABOVE] and ؤ [U+0648 ARABIC LETTER WAW + U+0654 ARABIC HAMZA ABOVE] are canonically equivalent.
`, '\u{0625}': `إ
`, '\u{0626}': `ئ
Eg. قائمشهر Qā’emšahr.
Note that this character decomposes into ئ [U+064A ARABIC LETTER YEH + U+0654 ARABIC HAMZA ABOVE], rather than a combining hamza with ی [U+06CC ARABIC LETTER FARSI YEH].
`, '\u{0627}': `ا
In word-initial position, one of:
∅, but indicating an initial vowel sound, when followed by either 0648 or 06CC, eg. او این
Note, however, that words beginning in this way may contain a consonant, rather than a vowel, eg. اولی ایالت
ɒː in the middle or at the end of a word, eg. مار گرگها
`, '\u{0628}': `ب
b eg. بچه سبز قلب
`, '\u{0629}': `ة
Arabic fem. t
`, '\u{062A}': `ت
t eg. تیر گیتی گوشت
`, '\u{062B}': `ث
s eg. ثبت کثیف بحث وارث
`, '\u{062C}': `ج
d͡ʒ eg. جگر کجا پنج واج
`, '\u{062D}': `ح
h eg. حیوان تلویحی فصیح قوس و قزح
`, '\u{062E}': `خ
x eg. خیس ضخیم نسخ سرخ
`, '\u{062F}': `د
d eg. دود کندن چند
`, '\u{0630}': `ذ
z eg. ذهن نبیذ شاذ
`, '\u{0631}': `ر
r eg. ریشه کویر کنکور
`, '\u{0632}': `ز
z eg. زمین یازده قرمز روز
`, '\u{0633}': `س
s eg. سیلو نشستن خیس خرس
`, '\u{0634}': `ش
ʃ eg. شیک گنجشک شپش گوش
`, '\u{0635}': `ص
s eg. صاف تصویر ناقص قرص
`, '\u{0636}': `ض
z eg. ضخیم عضو نبض عوض
`, '\u{0637}': `ط
t eg. طبقه مطلب وسط
`, '\u{0638}': `ظ
z eg. ظهر نظام حافظ
`, '\u{0639}': `ع
Preceding V → Vː
ʔ eg. علف بعضی
Silent, eg. سریع نعناع
`, '\u{063A}': `غ
Between vowels q, ɢ, x
ɢ ɣ eg. غژگاو نغمه میغ مرغ
`, '\u{063B}': `ػ
`, '\u{063C}': `ؼ
`, '\u{063D}': `ؽ
`, '\u{063E}': `ؾ
`, '\u{063F}': `ؿ
`, '\u{0640}': `ـ
`, '\u{0641}': `ف
f eg. فراخ گفتن علف صاف
`, '\u{0642}': `ق
ɢ قو چقدر موفق قاچاق
`, '\u{0643}': `ك
`, '\u{0644}': `ل
l eg. لیمو هلو گل سال
`, '\u{0645}': `م
m eg. مینو یاسمن چشم کرم
`, '\u{0646}': `ن
n eg. نیو نهنگ گفتن گردن
`, '\u{0647}': `ه
Consonant
h word-finally after a vowel, and in monosyllabic words, eg. هلو خواهش ماه نه
Vowels
e in word-final position after a consonant sound, eg. کهنه نامه
æ but only rarely in the educated Tehran dialect, and in word-final position, eg. کرایه
In rural and regional dialects æ is more common.
Non-left-joining forms
Certain suffixes don't join with this character when it appears finally in a morpheme, eg. خانهها تکیهگاه
To achieve this, you need to use U+200C ZERO WIDTH NON-JOINER. Other word-final letters typically show the same behaviour, however it's also possible to see the suffix written after a space or joined to the end of the word. When this letter ends the word, however, those alternatives are not available.§
`, '\u{0648}': `و
Functions as either a consonant or a vowel, and often it can be ambiguous as to which in unvowelled text, eg. شوید
Consonants
v as a consonant, eg. وقت ایوان سرو
-w glide (not common), when preceded by another vowel (which may not be marked), eg. اُوج روشن
Vowels
uː is written with the following forms, which include this character in all positions. No vowel diacritic is used in vowelled text.
initial | 0627 0648 | اورمزد |
---|---|---|
medial | 0648 | نور |
final | 0648 | زانو |
final, unjoined | 0648 | بازو |
o is written with the following forms, which include this character in the final position. The vowel diacritic is not normally shown.
initial | 0627 064F | |
---|---|---|
medial | 064F | |
final | 064F 0648 | یهو |
final, unjoined | 064F 0648 | کادو |
In a few words, in word-final position this is pronounced ow or oʊ̯, eg. پلو چو
The fact that vowel diacritics are not normally used means that uː and o (and sometimes ow) are indistinguishable in final forms. In word-initial form it is also indistinguishable from the other short vowels.
Silent use
∅ after خ and before a long vowel, eg. خوابیدن خواهر
This is a spelling holdover from classical Persian.
`, '\u{0649}': `ى
`, '\u{064A}': `ي
This character is not normally used in Persian, however ئ [U+064A ARABIC LETTER YEH + U+0654 ARABIC HAMZA ABOVE] decomposes into ئ [U+064A ARABIC LETTER YEH + U+0654 ARABIC HAMZA ABOVE], than a combining hamza with ی [U+06CC ARABIC LETTER FARSI YEH]. The Arabic letter therefore forms part of the Persian repertoire.
`, '\u{064B}': `ً
æn Doubled vowel diacritic. Not common. Used at the ends of certain Arabic-derived adverbs in vowelled text. It is usually written over an alif, although the vowel sound is short.
لزوماً
اصلاً
`, '\u{064C}': `ٌ
un Doubled vowel diacritic.
Not used in Persian, but taught to support education in the Qur'anwpa,#Tanvin_(nunation).
`, '\u{064D}': `ٍ
in Doubled vowel diacritic.
Not used in Persian, but taught to support education in the Qur'anwpa,#Tanvin_(nunation).
`, '\u{064E}': `َ
Rarely used; only where pronunciation needs to be spelled out.
æ is written with the following forms in vowelled text. In non-vowelled text, only the letters are written, so a medial form is not written at all.
initial | 0627 064E | اسب |
---|---|---|
medial | 064E | تبر |
The only common word in the Tehran dialect of Persian that ends with æ, which is represented by 0647, is نه næ no.
(Note, however, that in many rural and regional dialects, words that end with e in Persian are pronounced with an a ending.pm,xxxv)
`, '\u{064F}': `ُ
Rarely used; only where pronunciation needs to be spelled out.
o is written with the following forms in vowelled text. In non-vowelled text, only the letters are written, so a medial form is not written at all.
اُ ◌ُ ◌ُو ◌ُو
initial | 0627 064F | امید |
---|---|---|
medial | 064F | شتر |
final | 064F 0648 | یهو |
final, unjoined | 064F 0648 | کادو |
ow or oʊ̯ is a common pronunciation for this sound in word-final position, eg. چو نو چلو
The word-final version of this vowel is written in exactly the same way as uː in unvowelled text.
`, '\u{0650}': `ِ
Rarely used; only where pronunciation needs to be spelled out.
e is written with the following forms in vowelled text. In non-vowelled text, only the letters are written, so a medial form is not written at all.
initial | 0627 0650 | اسم |
---|---|---|
medial | 0650 | کِرم |
final | 0650 0647 | کهنه |
final, unjoined | 0650 0647 | میوه |
ّ
Rarely used. Doubles the sound of the base consonant, eg. اوّلی
`, '\u{0652}': `ْ
Rarely used. Indicates absence of a vowel between consonants, eg. نَسْتَعْلیق
`, '\u{0653}': `ٓ
Only occurs in initial positions in decomposed text over 0627. Usually, however, that combination is represented using the atomic character0622. It is not used in any other circumstances.
`, '\u{0654}': `ٔ
ʔ, normally written over a carrier letter. In most cases, that is the combination ـیٔـ [U+06CC ARABIC LETTER FARSI YEH + U+0654 ARABIC HAMZA ABOVE], eg. مسئول
However, if the glottal stop is preceded by the sound o, the combination is ـؤـ [U+0624 ARABIC LETTER WAW WITH HAMZA ABOVE], eg. مؤمن
On more rare occasions the hamza may appear over ا [U+0627 ARABIC LETTER ALEF], eg. رأی تأکید
See also the standalone isolated form of hamza, ء [U+0621 ARABIC LETTER HAMZA].
Standalone vowels
Although it may not be pronounced, the hamza and its carrier also appears between vowels. For example, فائده زئوس مؤثر سؤال سوئیس
In the examples above you can see that the hamza+carrier provides a place to put a vowel diacritic for short vowels in vowelled text. Between two long vowels it represents a nominal glottal stop.
The hamza is not always written. See for example these alternate forms for the i sound of the indefinite ending: پایٔی پایی مویٔی مویی
Ezafe
je when used to express the ezafe over ه [U+0647 ARABIC LETTER HEH], when that character is used to represent a word-final, short e, eg.
خانهٔ بزرگ
چلهٔ تابستان
Encoding
A number of precomposed combinations of base letter and hamza are encoded in Unicode. Many of these decompose and recompose under normalisation as canonical alternatives, but a few do not and need to be treated with care. Get more information about which precomposed characters are used or not used.
`, '\u{0655}': `ٕ
`, '\u{0656}': `ٖ
`, '\u{0657}': `ٗ
`, '\u{0658}': `٘
`, '\u{0659}': `ٙ
`, '\u{065A}': `ٚ
`, '\u{065B}': `ٛ
`, '\u{065C}': `ٜ
`, '\u{065D}': `ٝ
`, '\u{065E}': `ٞ
`, '\u{065F}': `ٟ
`, '\u{0660}': `٠
`, '\u{0661}': `١
`, '\u{0662}': `٢
`, '\u{0663}': `٣
`, '\u{0664}': `٤
`, '\u{0665}': `٥
`, '\u{0666}': `٦
`, '\u{0667}': `٧
`, '\u{0668}': `٨
`, '\u{0669}': `٩
`, '\u{066A}': `٪
`, '\u{066B}': `٫
`, '\u{066C}': `٬
`, '\u{066D}': `٭
`, '\u{066E}': `ٮ
`, '\u{066F}': `ٯ
`, '\u{0670}': `ٰ
`, '\u{0671}': `ٱ
`, '\u{0672}': `ٲ
`, '\u{0673}': `ٳ
`, '\u{0674}': `ٴ
`, '\u{0675}': `ٵ
`, '\u{0676}': `ٶ
`, '\u{0677}': `ٷ
`, '\u{0678}': `ٸ
`, '\u{0679}': `ٹ
`, '\u{067A}': `ٺ
`, '\u{067B}': `ٻ
`, '\u{067C}': `ټ
`, '\u{067D}': `ٽ
`, '\u{067E}': `پ
p eg. پنج شپش چپ
`, '\u{067F}': `ٿ
`, '\u{0680}': `ڀ
`, '\u{0681}': `ځ
`, '\u{0682}': `ڂ
`, '\u{0683}': `ڃ
`, '\u{0684}': `ڄ
`, '\u{0685}': `څ
`, '\u{0686}': `چ
t͡ʃ eg. چپ بیچاره پیچ نوچ
`, '\u{0687}': `ڇ
`, '\u{0688}': `ڈ
`, '\u{0689}': `ډ
`, '\u{068A}': `ڊ
`, '\u{068B}': `ڋ
`, '\u{068C}': `ڌ
`, '\u{068D}': `ڍ
`, '\u{068E}': `ڎ
`, '\u{068F}': `ڏ
`, '\u{0690}': `ڐ
`, '\u{0691}': `ڑ
`, '\u{0692}': `ڒ
`, '\u{0693}': `ړ
`, '\u{0694}': `ڔ
`, '\u{0695}': `ڕ
`, '\u{0696}': `ږ
`, '\u{0697}': `ڗ
`, '\u{0698}': `ژ
ʒ eg. ژنو بژ توژ
`, '\u{0699}': `ڙ
`, '\u{069A}': `ښ
`, '\u{069B}': `ڛ
`, '\u{069C}': `ڜ
`, '\u{069D}': `ڝ
`, '\u{069E}': `ڞ
`, '\u{069F}': `ڟ
`, '\u{06A0}': `ڠ
`, '\u{06A1}': `ڡ
`, '\u{06A2}': `ڢ
`, '\u{06A3}': `ڣ
`, '\u{06A4}': `ڤ
`, '\u{06A5}': `ڥ
`, '\u{06A6}': `ڦ
`, '\u{06A7}': `ڧ
`, '\u{06A8}': `ڨ
`, '\u{06A9}': `ک
k eg. کوچک کنجکاو کودک
`, '\u{06AA}': `ڪ
`, '\u{06AB}': `ګ
`, '\u{06AC}': `ڬ
`, '\u{06AD}': `ڭ
`, '\u{06AE}': `ڮ
`, '\u{06AF}': `گ
ɡ eg. گیتی سنگین گرگ
`, '\u{06B0}': `ڰ
`, '\u{06B1}': `ڱ
`, '\u{06B2}': `ڲ
`, '\u{06B3}': `ڳ
`, '\u{06B4}': `ڴ
`, '\u{06B5}': `ڵ
`, '\u{06B6}': `ڶ
`, '\u{06B7}': `ڷ
`, '\u{06B8}': `ڸ
`, '\u{06B9}': `ڹ
`, '\u{06BA}': `ں
`, '\u{06BB}': `ڻ
`, '\u{06BC}': `ڼ
`, '\u{06BD}': `ڽ
`, '\u{06BE}': `ھ
`, '\u{06BF}': `ڿ
`, '\u{06C0}': `ۀ
`, '\u{06C1}': `ہ
`, '\u{06C2}': `ۂ
`, '\u{06C3}': `ۃ
`, '\u{06C4}': `ۄ
`, '\u{06C5}': `ۅ
`, '\u{06C6}': `ۆ
`, '\u{06C7}': `ۇ
`, '\u{06C8}': `ۈ
`, '\u{06C9}': `ۉ
`, '\u{06CA}': `ۊ
`, '\u{06CB}': `ۋ
`, '\u{06CC}': `ی
Functions as either a consonant or a vowel, and often it can be ambiguous as to which in unvowelled text, eg. شوید
Consonants
j as a consonant, eg. یازده سیاه
-j glide, when preceded by another vowel (which may not be marked), eg. اِیوان خَیلی چایدان موی
Vowels
iː is written with the following forms, which include this character in all positions. No vowel diacritic is used in vowelled text.
initial | 0627 06CC | این |
---|---|---|
medial | 06CC | تیر |
final | 06CC | گیتی |
final, unjoined | 06CC | بسیاری |
Ezafe
je if this is the ezafe (adjectival joiner) used after a vowel, eg.
صحرای غربی اسب زیبای تندروی من asbe zibāye tundruye min my beautiful, fast horse
`, '\u{06CD}': `ۍ
`, '\u{06CE}': `ێ
`, '\u{06CF}': `ۏ
`, '\u{06D0}': `ې
`, '\u{06D1}': `ۑ
`, '\u{06D2}': `ے
`, '\u{06D3}': `ۓ
`, '\u{06D4}': `۔
`, '\u{06D5}': `ە
`, '\u{06D6}': `ۖ
`, '\u{06D7}': `ۗ
`, '\u{06D8}': `ۘ
`, '\u{06D9}': `ۙ
`, '\u{06DA}': `ۚ
`, '\u{06DB}': `ۛ
`, '\u{06DC}': `ۜ
`, '\u{06DD}': `
`, '\u{06DE}': `۞
`, '\u{06DF}': `۟
`, '\u{06E0}': `۠
`, '\u{06E1}': `ۡ
`, '\u{06E2}': `ۢ
`, '\u{06E3}': `ۣ
`, '\u{06E4}': `ۤ
`, '\u{06E5}': `ۥ
`, '\u{06E6}': `ۦ
`, '\u{06E7}': `ۧ
`, '\u{06E8}': `ۨ
`, '\u{06E9}': `۩
`, '\u{06EA}': `۪
`, '\u{06EB}': `۫
`, '\u{06EC}': `۬
`, '\u{06ED}': `ۭ
`, '\u{06EE}': `ۮ
`, '\u{06EF}': `ۯ
`, '\u{06F0}': `۰
Shape۰, ie. same as ٠ [U+0660 ARABIC-INDIC DIGIT ZERO].
`, '\u{06F1}': `۱
Shape۱, ie. same as ١ [U+0661 ARABIC-INDIC DIGIT ONE].
`, '\u{06F2}': `۲
Shape۲, ie. same as ٢ [U+0662 ARABIC-INDIC DIGIT TWO].
`, '\u{06F3}': `۳
Shape۳, ie. same as ٣ [U+0663 ARABIC-INDIC DIGIT THREE].
`, '\u{06F4}': `۴
Shape۴, cf. ٤ [U+0664 ARABIC-INDIC DIGIT FOUR].
`, '\u{06F5}': `۵
Shape۵, cf. ٥ [U+0665 ARABIC-INDIC DIGIT FIVE].
`, '\u{06F6}': `۶
Shape۶, cf. ٦ [U+0666 ARABIC-INDIC DIGIT SIX].
`, '\u{06F7}': `۷
Shape۷, ie. same as ٧ [U+0667 ARABIC-INDIC DIGIT SEVEN].
`, '\u{06F8}': `۸
Shape۸, ie. same as ٨ [U+0668 ARABIC-INDIC DIGIT EIGHT].
`, '\u{06F9}': `۹
Shape۹, ie. same as ٩ [U+0669 ARABIC-INDIC DIGIT NINE].
`, '\u{06FA}': `ۺ
`, '\u{06FB}': `ۻ
`, '\u{06FC}': `ۼ
`, '\u{06FD}': `۽
`, '\u{06FE}': `۾
`, '\u{06FF}': `ۿ
`, '\u{FDFC}': `﷼
Created by a typewriter standardisation committee in 1973, this is intended to be a condensed version of the word for the Iranian currency. The compatibility decomposition of this character is ریال (click on the red text to see the list).
The Unicode Standard says that this was only used for a short while in typewriters and keyboard layouts, and so is provided mainly for compatability reasons. Persian users are said to prefer typing the word rather than using this symbol.
`, }