Tutorial sample text:
Devanagari

Updated Sun 15 Oct 2017 • tags devanagari, scriptnotes

This page provides information about the characteristics of the devanagari script used to write Hindi, as described in the Writing Systems Tutorial. It is not intended to be exhaustively scientific – merely to give a basic idea of the essential features of the script.

Click on the orange text in the table to the right to see more details about that aspect of the script. Click on red text in the main sample area to see a list of code points for that text.

Sample (Hindi)

हालाँकि सूर के जीवन के बारे में कई जनश्रुतियाँ प्रचलित हैं, पर इन में कितनी सच्चाई है यह कहना कठिन है। कहा जाता है उनका जन्म सन् १४७८ में दिल्ली के पास एक ग़रीब ब्राह्मीण परिवार में हुआ। जनश्रुति के अनुसार सूरदास जन्म से ही अंधे थे। आजकल थी अंधे आदमी अक्सर 'सूरदास' कहलाते हैं। कई लोगों ने उन्हें गुरु के रूप में अपनाया और उनकी पूजा करना शुरु कर दिया ।

Script name

Devanagari

Script type abugida
Number of characters 156
Many more glyphs than characters? yes
Combining characters 52
Context-based positioning yes
Multiple combining characters yes
Contextual shaping yes
Cursive script no
Text direction ltr
Space is word separator yes
Baseline high
Case distinction no
Wraps at word
Justification word
Native digits? yes

Show all notes

Click on the orange text in the features list (right column) to see examples and notes. Click on highlighted text in the Sample section to see the characters. Click on the vertical blue bar, bottom right, to change font settings.

Context-based positioning

Combining characters need to be placed in different positions, according to the context.

The example on the left below displays the dot (anusvara) immediately over the long vertical stroke. The example to the right has moved the dot slightly to the right in order to accomodate the vowel sign.

 vs.

In the following the black image shows the normal position of the long U vowel sign, beneath the first letter. The red example shows that character displayed higher up and to the right when combined with the base character RA.

Context-based shaping

The shape of a character when displayed can vary, often dramatically, according to the context.

One very common example in most indic scripts is the handling of 'conjunct consonants', ie. groups of consonants with no intervening vowel sounds. Since consonants in indic scripts have an inherent vowel sound, when two consonants are combined this way you have to indicate that the vowel of the initial consonant suppressed. This is normally done by altering the shape of the first consonant, or merging the shape of the two consonants.

To tell the font to do this, in Unicode you add a 'virama' or 'vowel-killer' character between the two consonants. The visual result of that is the change in the shapes of the glyphs used that indicates to the reader that this is a conjunct. The actual outcome is font dependent. For the word highlighted above which contains a conjunct of two LA characters (making a long L sound) you may see a reduction in the shape of the first LA (a so-called 'half-form') or you may see (as shown on the right) a more complicated arrangement.

 vs.

There are other types of context-based shaping in the sample text above, which are applied by the particular font I used when generating my examples. One is shown below. The width of the glyph for the I character differs according to the base character to which it is attached.

 vs.

Multiple combining characters

Devanagari regularly combines multiple combining characters with a base consonant. The picture below shows the two combining characters that are positioned above the base character in the highlighted word above. One is a vowel sign, and the other a nasalisation mark.

Justification

Justification in devanagari works with the inter-word spaces.

Use the control below to see how this browser justifies Hindi text.

Character list

The Devanagari script characters in Unicode 7.0 are contained in 2 blocks:

The following is an incomplete list of languages and the number of characters they use, per version 26 of CLDR's lists of characters (exemplarCharacters).

First published 22 Dec 2002. This version 2017-10-15 8:59 GMT.  •  Copyright r12a@w3.org. Licence CC-By.