Updated Sun 15 Oct 2017 • tags devanagari, scriptnotes
This page provides information about the characteristics of the devanagari script used to write Hindi, as described in the Writing Systems Tutorial. It is not intended to be exhaustively scientific – merely to give a basic idea of the essential features of the script.
Click on the orange text in the table to the right to see more details about that aspect of the script. Click on red text in the main sample area to see a list of code points for that text.
हालाँकि सूर के जीवन के बारे में कई जनश्रुतियाँ प्रचलित हैं, पर इन में कितनी सच्चाई है यह कहना कठिन है। कहा जाता है उनका जन्म सन् १४७८ में दिल्ली के पास एक ग़रीब ब्राह्मीण परिवार में हुआ। जनश्रुति के अनुसार सूरदास जन्म से ही अंधे थे। आजकल थी अंधे आदमी अक्सर 'सूरदास' कहलाते हैं। कई लोगों ने उन्हें गुरु के रूप में अपनाया और उनकी पूजा करना शुरु कर दिया ।
Script name | Devanagari |
---|---|
Script type | abugida |
Number of characters | 156 |
Many more glyphs than characters? | yes |
Combining characters | 52 |
Context-based positioning | yes |
Multiple combining characters | yes |
Contextual shaping | yes |
Cursive script | no |
Text direction | ltr |
Space is word separator | yes |
Baseline | high |
Case distinction | no |
Wraps at | word |
Justification | word |
Native digits? | yes |
Click on the orange text in the features list (right column) to see examples and notes. Click on highlighted text in the Sample section to see the characters. Click on the vertical blue bar, bottom right, to change font settings.
Combining characters need to be placed in different positions, according to the context.
The example on the left below displays the dot (anusvara) immediately over the long vertical stroke. The example to the right has moved the dot slightly to the right in order to accomodate the vowel sign.
vs.
In the following the black image shows the normal position of the long U vowel sign, beneath the first letter. The red example shows that character displayed higher up and to the right when combined with the base character RA.
The shape of a character when displayed can vary, often dramatically, according to the context.
One very common example in most indic scripts is the handling of 'conjunct consonants', ie. groups of consonants with no intervening vowel sounds. Since consonants in indic scripts have an inherent vowel sound, when two consonants are combined this way you have to indicate that the vowel of the initial consonant suppressed. This is normally done by altering the shape of the first consonant, or merging the shape of the two consonants.
To tell the font to do this, in Unicode you add a 'virama' or 'vowel-killer' character between the two consonants. The visual result of that is the change in the shapes of the glyphs used that indicates to the reader that this is a conjunct. The actual outcome is font dependent. For the word highlighted above which contains a conjunct of two LA characters (making a long L sound) you may see a reduction in the shape of the first LA (a so-called 'half-form') or you may see (as shown on the right) a more complicated arrangement.
vs.
There are other types of context-based shaping in the sample text above, which are applied by the particular font I used when generating my examples. One is shown below. The width of the glyph for the I character differs according to the base character to which it is attached.
vs.
Devanagari regularly combines multiple combining characters with a base consonant. The picture below shows the two combining characters that are positioned above the base character in the highlighted word above. One is a vowel sign, and the other a nasalisation mark.
Justification in devanagari works with the inter-word spaces.
Use the control below to see how this browser justifies Hindi text.
The Devanagari script characters in Unicode 7.0 are contained in 2 blocks:
The following is an incomplete list of languages and the number of characters they use, per version 26 of CLDR's lists of characters (exemplarCharacters).