/* */ var charDetails = { // MAIN BLOCK // Use _tools/generate_details_page_stubs.html to generate stubs to go here '\u{0B01}': `

Nasalises the vowel, eg. ମୁଁହ

Where 2 vowels appear together, the nasalisation sign is rendered above the second, eg. ଜ୍ୱାଇଁ

`, '\u{0B02}': `

m ŋ for syllable-final consonants, eg. ଜଂତୁ ଜଂଗଲ ଏବଂ

`, '\u{0B03}': `

This has 2 different functions, depending on its position in a word.

h when it appears at the end of a word.

In the middle of a word it doubles the preceding consonant, eg. ଦୁଃଖ

`, '\u{0B05}': `

ɔ independent vowel.

ଅଣ୍ଡା

ନଅ

`, '\u{0B06}': `

a independent vowel.

ଆଠ

ଗୋଆ

`, '\u{0B07}': `

i independent vowel.

ଇଟା

ରୁଷିଆ

`, '\u{0B08}': `

i independent vowel.

ଗାଈ

`, '\u{0B09}': `

u independent vowel.

ଉଟ

ଚଉଡା

`, '\u{0B0A}': `

u independent vowel.

ଊତ୍ତୋଳନ

`, '\u{0B0B}': `

Usually only used for Sanskrit transcriptions. Rare.

ru vocalic.

ଋତୁ

`, '\u{0B0C}': `

Usually only used for Sanskrit transcriptions.Rare.

lu vocalic.

`, '\u{0B0F}': `

e independent vowel.

ଏଗାର

କୋଡ଼ିଏ

`, '\u{0B10}': `

ɔi independent vowel.

ଐରାଵତ

`, '\u{0B13}': `

o independent vowel.

ଓଡ଼ିଆ

ସେଓ

`, '\u{0B14}': `

ɔu independent vowel.

ଔଷଧ

`, '\u{0B15}': `

k consonant with inherent vowel ɔ.

କାନ

କଙ୍କାଳ

ଚିକ୍କଣ

Combinations

କ୍ଷ

kʰj is କ୍ଷ

ଅକ୍ଷର

`, '\u{0B16}': `

consonant with inherent vowel ɔ.

ଖାଇବା

ଦେଖିବା

`, '\u{0B17}': `

ɡ consonant with inherent vowel ɔ.

ଗାଈ

ଅଙ୍ଗାରକ

`, '\u{0B18}': `

consonant with inherent vowel ɔ.

ଘର

ଶୁଙ୍ଘିବା

`, '\u{0B19}': `

ŋ consonant with inherent vowel ɔ.

ଅଙ୍ଗାରକ

ରଙ୍ଗ

Only occurs in homorganic clustersd,406.

`, '\u{0B1A}': `

t͡ʃ consonant with inherent vowel ɔ.

ଚଟିଆ

ପାଞ୍ଚ

`, '\u{0B1B}': `

t͡ʃʰ consonant with inherent vowel ɔ.

ଛଅ

ମାଛ

`, '\u{0B1C}': `

d͡ʒ consonant with inherent vowel ɔ.

ଜହ୍ନ

ଗଞ୍ଜା

`, '\u{0B1D}': `

d͡ʒʰ consonant with inherent vowel ɔ.

ଝାଡୁ

`, '\u{0B1E}': `

ɲ consonant with inherent vowel ɔ.

ପାଞ୍ଚ

ରାଜ୍ଞୀ

Only occurs in homorganic clustersd,406.

`, '\u{0B1F}': `

ʈ consonant with inherent vowel ɔ.

ଟାପୁ

ଘଣ୍ଟା

`, '\u{0B20}': `

ʈʰ consonant with inherent vowel ɔ.

ଠିକ

ଆଣ୍ଠୁ

`, '\u{0B21}': `

ɖ consonant with inherent vowel ɔ.

ଡାଲମା

ଅଣ୍ଡା

Combinations

ଡ଼

ɽ is ଡ଼

ଓଡ଼ିଆ

A precomposed character exists for this combination, 0B5C, but the Unicode Standard recommends use of the decomposed sequence since this is what is produced by NFC normalisation.

`, '\u{0B22}': `

ɖʰ consonant with inherent vowel ɔ.

ପଢିବା

Combinations

ଢ଼

ɽʰ is ଢ଼

ଦାଢ଼ି

A precomposed character exists for this combination, 0B5D, but the Unicode Standard recommends use of the decomposed sequence since this is what is produced by NFC normalisation.

`, '\u{0B23}': `

ɳ consonant with inherent vowel ɔ.

କାରଣ

କୃଷ୍ଣ

`, '\u{0B24}': `

t consonant with inherent vowel ɔ.

ତିନି

ତିନ୍ତା

`, '\u{0B25}': `

consonant with inherent vowel ɔ.

ଥଣ୍ଡା

ଅସ୍ଥି

`, '\u{0B26}': `

d consonant with inherent vowel ɔ.

ଦୁଇ

ଚନ୍ଦ୍ର

`, '\u{0B27}': `

consonant with inherent vowel ɔ.

ଧୂଳି

ବାନ୍ଧିବା

`, '\u{0B28}': `

n consonant with inherent vowel ɔ.

ନଦୀ

ଜହ୍ନ

`, '\u{0B2A}': `

p consonant with inherent vowel ɔ.

ପଦ୍ମ

ସାପ

`, '\u{0B2B}': `

consonant with inherent vowel ɔ.

ଫଳ

ବରଫ

`, '\u{0B2C}': `

b consonant with inherent vowel ɔ.

ବିଭ୍ରାଟ

ଆମ୍ବ

`, '\u{0B2D}': `

consonant with inherent vowel ɔ.

ଭିଜା

ବିଭ୍ରାଟ

`, '\u{0B2E}': `

m consonant with inherent vowel ɔ.

ମହିଳା

ପଦ୍ମ

`, '\u{0B2F}': `

d͡ʒ consonant with inherent vowel ɔ.

ଯକୃତ

ମତ୍ସ୍ଯ

`, '\u{0B30}': `

r consonant with inherent vowel ɔ.

ରସି

ଚନ୍ଦ୍ର

ଶାର୍ଟ

Shaping

At the beginning of a consonant cluster this letter becomes a small diacritic which is rendered at the top right of the orthographic syllable (see the 3rd example above). At the end of a cluster it has a standard, reduced shape that appears below the preceding consonant glyph (see the 2nd example above).

`, '\u{0B32}': `

l consonant with inherent vowel ɔ.

ଲାଲ

ବଲ୍ଲରି

`, '\u{0B33}': `

ɭ consonant with inherent vowel ɔ.

ମୂଳ

ଅମ୍ଳଜାନ

`, '\u{0B35}': `

ʋ consonant with inherent vowel ɔ.

ଵିଷ୍ଣୁ

ଦ୍ଵୀପ

`, '\u{0B36}': `

s consonant with inherent vowel ɔ.

ଶାର୍ଟ

କେଶ

`, '\u{0B37}': `

s consonant with inherent vowel ɔ.

ତୁଷାର

ଅକ୍ଷର

`, '\u{0B38}': `

s consonant with inherent vowel ɔ.

ସାପ

ମତ୍ସ୍ଯ

`, '\u{0B39}': `

h eg. ହସିବା ହ୍ରଦ

`, '\u{0B39}': `

ɦ consonant with inherent vowel ɔ.

ହସିବା

ହ୍ରଦ

`, '\u{0B3C}': `

Used to extend the repertoire of letters to additional sounds.

It is used for 2 regularly occuring sounds:

ଡ଼

ɽ is ଡ଼

ଢ଼

ɽʰ is ଢ଼

Non-native sounds

It may also be used to produce other non-native sounds. Wiktionary describes the following:

q is କ଼

x is ଖ଼

ʒ is ଝ଼

ɻ is ଷ଼

`, '\u{0B3D}': `

Usually only used for Sanskrit transcriptions.

`, '\u{0B3E}': `

a eg. ନାରୀ

`, '\u{0B3E}': `

a vowel-sign.

ନାରୀ

`, '\u{0B3F}': `

ି

i vowel-sign.

କିପରି

Shaping

After some consonants, and in some fonts, this vowel sign may be rendered as an angled shape below the bottom-right corner of the base letter.

ଆଖି

`, '\u{0B40}': `

i vowel-sign. Phonetically, modern Odia makes no distinction between long and short vowels, but the difference is retained in the spelling.

ନାରୀ

`, '\u{0B41}': `

u vowel-sign.

କୁହୁଡ଼ି

`, '\u{0B42}': `

u vowel-sign. Phonetically, modern Odia makes no distinction between long and short vowels, but the difference is retained in the spelling.

ମୂଳ

`, '\u{0B43}': `

ru dependent vocalic.

କୃମି

`, '\u{0B44}': `

Usually only used for Sanskrit transcriptions. Rare.

ru dependent vocalic.

`, '\u{0B47}': `

e vowel sign.

ମେଘ

This pre-base vowel-sign is typed and stored after the base letter, but displayed before it.

`, '\u{0B48}': `

ɔi vowel sign.

ସ୍ତ୍ରୈଣ

This circumgraph vowel-sign is typed and stored after the base letter, but displayed to either side of it.

`, '\u{0B4B}': `

o vowel sign.

ପୋଲ

This circumgraph vowel-sign is typed and stored after the base letter, but displayed to either side of it.

`, '\u{0B4C}': `

ɔu vowel sign.

ଚୌଦ

This circumgraph vowel-sign is typed and stored after the base letter, but displayed to either side of it.

`, '\u{0B4D}': `

Indicates that the inherent vowel is suppressed.

ଇ-ମେଲ୍

ରଞ୍ଜ୍

Between consonants in a consonant cluster it produces a conjunct form, and is invisible.

ପଦ୍ମ

କମ୍ପ୍ୟୁଟର

`, '\u{0B55}': `

`, '\u{0B56}': `

A lengthening mark, used only for the decomposed form of 0B48. The Unicode Standard recommends the use of the precomposed form.

`, '\u{0B57}': `

A lengthening mark, used only for the decomposed form of 0B4C. The Unicode Standard recommends the use of the precomposed form.

`, '\u{0B5C}': `

Best avoided. Decomposed sequence is recommended.

ɽ consonant with inherent vowel ɔ.

`, '\u{0B5D}': `

Best avoided. Decomposed sequence is recommended.

ɽʰ consonant with inherent vowel ɔ.

`, '\u{0B5F}': `

j eg. ବାୟୁ ଶୁନ୍ୟ

`, '\u{0B5F}': `

j consonant with inherent vowel ɔ.

ବାୟୁ

ଶୁନ୍ୟ

`, '\u{0B60}': `

Rare. Usually only used for Sanskrit transcriptions.

ru vocalic.

`, '\u{0B61}': `

Rare. Usually only used for Sanskrit transcriptions.

lu vocalic.

`, '\u{0B62}': `

Rare. Usually only used for Sanskrit transcriptions.

lu dependent vocalic.

`, '\u{0B63}': `

Rare. Usually only used for Sanskrit transcriptions.

lu dependent vocalic.

`, '\u{0B66}': `

0 digit.

`, '\u{0B67}': `

1 digit.

`, '\u{0B68}': `

2 digit.

`, '\u{0B69}': `

3 digit.

`, '\u{0B6A}': `

4 digit.

`, '\u{0B6B}': `

5 digit.

`, '\u{0B6C}': `

6 digit.

`, '\u{0B6D}': `

7 digit.

`, '\u{0B6E}': `

8 digit.

`, '\u{0B6F}': `

9 digit.

`, '\u{0B70}': `

`, '\u{0B71}': `

w eg. ଦ୍ୱାର ସ୍ୱପ୍ନ

`, '\u{0B71}': `

w consonant with inherent vowel ɔ.

ଦ୍ୱାର

ସ୍ୱପ୍ନ

`, '\u{0B72}': `

`, '\u{0B73}': `

`, '\u{0B74}': `

`, '\u{0B75}': `

`, '\u{0B76}': `

`, '\u{0B77}': `

`, // DEVANAGARI // danda '\u{0964}': `

`, // double danda '\u{0965}': `

`, // COMMON PUNCTUATION // ".. '\u{201C}': ` `, // .." '\u{201D}': ` `, // '.. '\u{2018}': ` `, // ..' '\u{2019}': ` `, // « '\u{00AB}': ` `, // » '\u{00BB}': ` `, // ; '\u{003B}': ` `, // : '\u{003A}': ` `, // . '\u{002E}': ` `, // ? '\u{003F}': ` `, // ! '\u{0021}': ` `, // ( '\u{0028}': ` `, // ) '\u{0029}': ` `, // … '\u{2026}': ` `, // – '\u{2013}': ` `, // — '\u{2014}': ` `, // § '\u{00A7}': ` `, '\u{2020}': `

Called dagger, but also known as obelisk, obelus, or long cross.b321

A reference mark, used primarily with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

Also a death sign in European typography, used to mark the year of death or the names of dead persons.b321

In lexicography it marks obsolete forms, and in editing of classical texts flags passages judged to be corrupt.b321

`, '\u{2021}': `

Called dagger, but also known as diesis, or double obelisk.b321

A reference mark used with footnotes. When used for this purpose with other signs, the traditional order is * † ‡ § ‖ ¶.b68

`, '\u{2032}': `

Abbreviation for feet (1′ = 12″).b330

Also used for minutes of arc (eg. 60′=1°).b330

`, '\u{2033}': `

Abbreviation for inches (1′ = 12″).b321

Also used for seconds of arc (eg. 360″=1°).b321

`, // FORMATTING CHARACTERS // zwsp '\u{200B}': `

An invisible character, used to signal line-break and word-break opportunities. It was originally provided for use with writing systems such as Thai, Myanmar, Khmer, Japanese, etc. that don't use spaces between words.

Justification may visibly adjust the space between the characters on either side of this character, doing so as if the ZWSP wasn't there, eg. the Thai text อักษร​ไทย may look like อั ก ษ ร ไ ท ย when justified, or when letter-spacing is applied, even though the two words are separated by a ZWSP (click on the word to see the composition).

`, // zwj '\u{200D}': `

Creates glyph joining behaviour in the absence of normal joining contexts.

`, // zwnj '\u{200C}': `

Prevents glyph joining behaviour.

`, // word-break '\u{2060}': `

An invisible character, equivalent to a zero-width no-break space, and used to prevent line-breaks, eg. it can be used around the + sign in base⁠+delta⁠ to prevent a line break occuring in that sequence of characters. It has no effect on word segmentation.

It can also be used to bracket other characters to turn them into non-breaking characters, such as U+2009 THIN SPACE or [U+2015 HORIZONTAL BAR].

Not to be confused with U+200D ZERO WIDTH JOINER or U+034F COMBINING GRAPHEME JOINER​, since it has no effect on shaping.

This functionality is also provided by U+FEFF ZERO WIDTH NO-BREAK SPACE, but since that character also represents the byte-order mark, the use of this word joiner character (added in Unicode 3.2) is strongly preferred over the latter.

`, // rli '\u{2067}': `

Sets the base direction for the following text to RTL, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // lri '\u{2066}': `

Sets the base direction for the following text to LTR, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, //fsi '\u{2068}': `

Sets the base direction for the following text to the direction of the first strong directional character, per Unicode Bidirectional Algorithm rules, and isolates it (ie. stops the bidirectional algorithm causing interactions across the boundaries of the embedded text).

`, // pdi '\u{2069}': `

Ends the range of text that started with RLI, LRI, or FSI.

`, // rle '\u{202B}': `

Sets the base direction for the following text to RTL, with no isolation. The Unicode Standard recommends use of RLI, instead.

`, // lre '\u{202A}': `

Sets the base direction for the following text to LTR, with no isolation. The Unicode Standard recommends use of LRI, instead.

`, // pdf '\u{202C}': `

Ends the range of text that started with RLE, or LRE.

`, // rlm '\u{200F}': `

An invisible character with a strong RTL directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // lrm '\u{200E}': `

An invisible character with a strong LTR directional property. Can be used to correct local issues with the Unicode Bidirectional Algorithm.

`, // cgj '\u{034F}': `

Semantically separates characters. Can be used to prevent pairs of characters being treated as digraphs, or to block canonical reordering of combining marks during normalization. The word 'joiner' in the name is a misnomer.

`, // alm '\u{061C}': `

Helps produce the correct ordering for sequences with no strong directional characters by overriding the Unicode Bidirectional Algorithm default rules. Used particularly for text in the Arabic language, and languages using Syriac and Thaana scripts. Not usually needed for Hebrew, N'Ko, or Persian.

`, }