Kashmiri (deva) orthography notes

Basic features

The Devanagari script is an abugida, ie. each consonant contains an inherent vowel sound.

Kashmiri uses fewer consonants than Hindi, but has more vowels. The orthography includes some Kashmiri-specific characters.

❯ basicV

Vowels The inherent vowel is pronounced ə.

Post-consonant vowels are written using 16 combining marks (vowel signs) in the initial 2009 orthography reform. Two vowel signs and letters represent diphthongs. A single Unicode character is used per base consonant.

Some organisations use the characters ॅ, ॉ, ॲ, and ऑ to represent the sounds ɔ and ɔː.

Vowels may be nasalised, using the candrabindu diacritic.

There is 1 pre-base glyph but there are no circumgraphs, or composite vowel signs.

Standalone vowel sounds are written using 17 independent vowel letters, one for each vowel sound, including the inherent vowel.

This orthography has vocalics, however of the four vocalic letters in the script (plus their 4 vowel signs) only 1 is in use for contemporary text.

❯ consonantSummary

Consonants Kashmiri has 24 basic consonant letters. 093C can be used to represent 3 more sounds. Kashmiri also commonly indicates palatalisation of consonants, using य.

Eleven more letters are used to represent unassimilated Sanskrit spellings in words. Phonetically, Kashmiri has only three forms of plosive, illustrated here with the bilabial stop: unvoiced p, voiced b, aspirated pʰ. The murmured bʱ is not used, although these letters may crop up in Sanskrit or Hindi loan words. It also has a set of retroflex consonants. Kashmiri normally uses only one letter for m and one for n, although other nasals may occur in words borrowed from Sanskrit.

Vowel absence Vowel absence is commonly not marked, but may be indicated using conjunct forms (especially for palatalisation) or a visible combining mark for nasal codas. There are no dedicated medial consonant characters.

094D is used to kill the inherent vowel and to form conjuncts. Usually it is invisible, but it is rendered visibly when a word ends with a palatalisation marker.

Conjunct forms occur as half-forms, stacked consonants, and ligated glyphs. Half-forms typically remove the vertical stroke from the basic glyph.

As part of a cluster, र has special forms. When initial in an orthographic syllable it appears as a hook at the top right of the whole syllable. When non-initial it appears as one of 2 special marks applied to the other consonants. A palatalised RA at the beginning of a word needs special treatment to avoid repha formation.

Syllable codas are written using ordinary consonant letters. Nasal codas are most commonly written using 0902.

Layout Devanagari text runs left to right in horizontal lines. Words are separated by spaces. There is no case distinction.

It uses mostly ASCII punctuation, but dandas may be used for phrase boundaries.

Orthographic syllables (as opposed to phonetic syllables) play a significant role in Devanagari. An orthographic syllable starts at the beginning of any cluster of consonants and incorporates the whole cluster plus any following vowels and diacritics.

Glyphs 'hang' from a top bar (shiroreka) that runs across a word, and which can sometimes be treated as a baseline.

	Front	Central	Back
High	i iː	ɨ ɨː	u uː
Mid	e eː	ə əː	o oː
Low		a aː	ɔ

	Bilabial	Dental	Alveolar	Retroflex	Alveolo -palatal	Velar	Glottal
Stop / Affricate	plain	p b	t d	ts	ʈ ɖ	tʃ dʒ	k ɡ
aspirated	pʰ	tʰ	tsʰ	ʈʰ	tʃʰ	kʰ
Fricative			s z		ʃ		h
Nasal	m	n
Approximant		l			j	w
Trill		r

Vowels

	Post-consonant	Standalone
Plain	ि,ी,ॖ,ॗ,ु,ू	इ,ई,ॶ,ॷ,उ,ऊ
	ॆ,े,ॊ,ो	ऎ,ए,ऒ,ओ
	ऺ,ऻ	ॳ,ॴ
	ॏ	ॵ
	ⓘ,ा	अ,आ
Diphthongs	ै,ौ	ऐ,औ
Vocalics	ृ	ऋ

ⓘ represents the inherent vowel. Diacritics are added to the vowels to indicate nasalisation (not shown here).

Inherent vowel

क ka

The inherent vowel for Kashmiri is pronounced ə, so kʰə is written by simply using the consonant letter, eg.

खबर

ख,ब,र

Since Kashmiri consonants normally include an inherent vowel, the orthography has ways to indicate a consonant that is not followed by a vowel sound. See novowel.

Post-consonant vowels

Post-consonant vowels are written using using vowel signs. All vowel signs are combining marks.

There are no multipart vowels and no circumgraphs. There is 1 pre-base vowel sign.

All vowel signs are typed and stored after the base consonant, whether or not they precede it when displayed. The glyph rendering system takes care of the positioning at display time. Conjuncts are treated as indivisible units when it comes to rendering vowel signs, meaning that pre-base vowel signs are rendered before the conjunct as a whole (see prebase).

Eight vowel signs are spacing combining characters, meaning that they consume horizontal space when added to a base consonant.

Plain vowels

की kiː

Kashmiri uses the following dedicated combining marks for vowels.

ि,ी,ॖ,ॗ,ु,ू,ॆ,े,ऺ,ऻ,ॊ,ो,ॏ,ा

Some of these vowel signs are the result of recent standardisation of the orthography (see previousOrthographies).

eg.

च़ऺर

कऻशुर

कॏह

Rajanvr points to further changes to the orthography since the original 2009 reform, involving the following vowels.

ॅ,ॉ,ो

The representation of the sound ɔː has not typically been clear. However Rajanvr reports that in publications of a few Kashmiri organisations the sounds ɔ and ɔː are represented using ॅ and ॉ, respectively. (These characters were previously used for the sounds ə and əː.) This allows for a clear distinction.

eg.

वॅरी

Rajanvr also suggests that in the 2009 reform and before, ɔː was represented using ौ, which is also used for the diphthong əŭ.

Diphthongs

कै kəĭ

Kashmiri also uses single vowel signs for 2 diphthongs. These are also dedicated combining marks for vowels.

ै,ौ

Vowel length

Vowel length is indicated by the vowel sign used (see plainV).

Nasalisation

Nasalisation of the vowel in a syllable can be indicated using ँ.

eg.

मुँह

वाँदुर

Standalone vowels

Kashmiri represents standalone vowels using a set of independent vowel letters. The set contains a character to represent the inherent vowel sound.

इ,ई,ॶ,ॷ,उ,ऊ,ऎ,ए,ॳ,ॴ,ऒ,ओ,ॵ,अ,आ,,ऐ,औ

As was the case for the vowel signs, some of these letters are the result of recent standardisation of the orthography (see previousOrthographies).

eg.

अंब

ॳंजीर

Rajanvr points to further changes to the orthography since the original 2009 reform, involving the following vowels.

ॲ,ऑ,औ

The representation of the sound ɔː has not typically been clear. However Rajan reports that in publications of a few Kashmiri organisations the sounds ɔ and ɔː are represented using ॲ and ऑ, respectively. (These characters were previously used for the sounds ə and əː.) This allows for a clear distinction.

Rajanvr also suggests that in the 2009 reform and before, ɔː was represented using औ, which is also used for the diphthong əŭ.

Vowel components

This section describes various vowel components and behaviours associated with this orthography.

Pre-base vowel sign

कि ki

One vowel sign appears to the left of the base consonant letter or cluster, eg.

ि

eg.

बिचोर

This is a combining mark that is always typed and stored after the base consonant(s), ie. the codepoints follow the order in which the items are pronounced. The rendering process places the glyph before the base consonant without changing the code points. The following shows the sequence of code points that make up the word just above.

ब,ि,च,ो,र

It is placed before the start of a conjunct, regardless of the number of consonants in that conjunct. In fig_prebase the sequence of glyphs for the orthographic syllable is rendered VCC, whereas the pronunciation is CCV. In conjuncts with 3 consonants, it will still be rendered before the consonants.

show composition

बेत्रि

However, if the cluster is split by a visible virama, this creates two syllables and the pre-base vowel sign appears after the last consonant with the virama. The sequence of displayed glyphs is now CVC. If the conjunct contains 3 consonants, the displayed order will be CCVC.

Vowel sign placement

The following list shows where vowel signs are positioned around a base consonant to produce vowels, and how many instances of that pattern there are.

1 pre-base, eg. कि ki
7 post-base, eg. का kā
4 superscript, eg. के kē
4 subscript, eg. कु ku

Previous orthographies

Prior to 1995 there was no standard way to write Kashmiri, and people spelled words in different ways.rt§7 There was an orthographic standardisation reform in 1995, followed by another in 2002epmkr, and a further revision in 2009.

Rajanvr describes an additional revision in 2009 which reinstated the letters which had been dropped for ə and əː by assigning them to the sounds ɔ and ɔː. He describes this change as the orthography used by a few Kashmiri organizations in many of their publications.

fig_orthographic_changes shows the changes as they were introduced. Dependent vowels appear above, and independent vowels below in each cell.

phoneme	1995	2002	2009	2009(2)	Current usage
ɨ	ॅु	ॖ ॶ			0956 0976
ɨː	ॅू	ॗ ॷ			0957 0977
e	े'	ॆ ऎ			0946 090E
o	ो'	ो ओ			094B 0913
ə	ऽ	ॅ ॲ	ऺ ॳ		093A 0973
əː		ॉ ऑ	ऻ ॴ		093B 0974
ɔ	व		ॏ ॵ	ॅ ॲ	094F 0975
ɔː			ौ औ	ॉ ऑ	ॏ ॵ

Vowels changed during the 2002 and 2009 reforms.

The 2009 revision gives the set of characters used in this page.l The new characters were added in Unicode v6.

The reform introduced a new character, ॵ, and its equivalent vowel sign, 094F, to replace the use of ्व for the vowel ɔ. For example, the following shows the spelling changes for the word sɔkʰmoth.

Old: *स्वखNew: सॏख

Principle changes also included the substitution of 0973 and 0974 for ॲ and ऑ, respectively. However, the modified 2009 reform adopted by some organisations then reinstated ॲ and ऑ for use with the sounds ɔ and ɔː.

In the gap, there was also some experimentation with Gurmukhi characters for the phonemes ɨ and ɨː.

Vowel sounds to characters

This section maps Kashmiri vowel sounds to common graphemes in the Devanagari orthography. It shows characters used since the reforms of 2009, and ignores previous orthographies (see previousOrthographies).

Plain vowels

vowel sign ि

standalone इ

iː

vowel sign ी

standalone ई

vowel sign ॖ

standalone ॶ

ɨː

vowel sign ॗ

standalone ॷ

vowel sign ु

standalone उ

uː

vowel sign ू

standalone ऊ

vowel sign ॆ

standalone ऎ

eː

vowel sign े

standalone ए

vowel sign ॊ

standalone ऒ

oː

vowel sign ो

standalone ओ

vowel sign ऺ

standalone ॳ

əː

vowel sign ऻ

standalone ॴ

vowel sign ॏ

vowel sign ॅ In some organisations.

standalone ॵ

standalone ॲ In some organisations.

ɔː

vowel sign ौ

vowel sign ॉ In some organisations.

standalone औ

standalone ऑ In some organisations.

inherent vowel eg. दर्शुन

standalone अ

aː

vowel sign ा

standalone आ

Diphthongs

əĭ

vowel sign ै

standalone ऐ

əŭ

vowel sign ौ

standalone औ

Vowel absence

Vowel absence principally occurs either when a consonant is a syllable coda, or when a consonant is part of a consonant cluster.

Given that consonants normally include an inherent vowel, the orthography needs a way to indicate when a consonant is not followed by a vowel.

Follow these links for more information.

Unmarked vowel absence.
Conjuncts. There are a number of possibilities here:
1. Half-forms : Reduce the shape of all consonants in the cluster except the last to a 'half-form' by removing the vertical stroke.
2. Stacking : Reduce a non-initial consonant in size and shape and position it below the first.
3. Special ligation : Create a fusion of the two shapes, where one or other of the components may not be easily recognisable.
4. The letter RA has its own idiosyncratic way of combining with other consonants, whether it precedes or follows them.
Show a visible virama below the non-final consonants in the cluster.
Coda diacritics. Some Kashmiri codas can be written using combining marks. (Note that onset medials are written using conjuncts.)

Unmarked vowel absence

For some consonants without a following vowel sound Kashmiri uses a (sometimes invisible) 094D to indicate vowel absence (see clusters).

eg.

पम्पोश

However, Kashmiri commonly suppresses the inherent vowel without a conjunct or visible virama appearing in the orthography, whether this is in a consonant cluster or at the end of a word. It is necessary to just know that the vowel should not be pronounced.

eg.

अख़बार

अ,ख़,बा,र

अतलास

अ,त,ला,स

Conjuncts

Conjunct formation

To produce a conjunct, ् is added between the consonants in the cluster. There are exceptions, but this type of virama is usually not displayed, for example:

क,्,ष,क्ष

The font usually determines which visual method is used, although it is possible to influence this (see joiner).

The rendered shape of the conjunct may also vary from font to font. For example, compare the sequences below which are identical except that Noto Serif Devanagari is used for the top row, and Annapurna SIL is used for the bottom row.

क,्,क,क्क

See a table of 2-consonant clusters.
The table allows you to test results for various fonts.

Conjoined half-forms

A half-form is typically created by removing the vertical line in the consonant shape, where there is one. (The vertical line is associated with the inherent vowel, and around two-thirds of Devanagari consonant shapes contain one.) There is often some additional tweaking of glyphs in order to join the components neatly. The last consonant in the cluster retains its full shape.

त,्,य,त्य

म,्,ब,म्ब

त,्,स,्,व,त्स्व

Examples of conjuncts formed by using half-forms.

Vertical stacks

This is more common for Sanskrit, and few modern fonts reorder glyphs in this way, or do so for a limited number of combinations.

ट,्,ठ,ट्ठ

द,्,ध,द्ध

ह,्,व,ह्व

Conjuncts formed by subjoining non-initial consonants.

Ligated conjuncts

Typically, only a small number of clusters are combined in a way that makes it difficult to spot the component parts. This is, however, the default for 3 particular clusters:

क,्,ष,क्ष

ज,्,ञ,ज्ञ

क,्,त,क्त

Conjuncts formed by subjoining non-initial consonants.

Conjuncts with RA

When RA occurs in a cluster, either as a medial consonant or a coda followed by another consonant, there are special rules for rendering. See medial_ra and coda_ra for details.

Using ZWJ & ZWNJ

It's possible to prevent the formation of conjuncts, and force a visible virama, using 200C ( ZWNJ ). To produce a half-form, rather than a ligated form, use 200D ( ZWJ ).

क,्,क,,क्क

क,्,‍,क,क्‍क

क,्,‌,क,क्‌क

Use of ZWNJ and ZWJ to control conjunct rendering.

If a font doesn't have a half-form glyph for a letter (eg. such as ड), it will fall back to showing a visible virama (ie. ड्‍).

200D can also be used to produce standalone half-forms (for educational text) such as the following:

क,्,‍,क्‍

घ,्,‍,घ्‍

ह,्,‍,ह्‍

Standalone half-forms produced using ZWJ.

Visible virama

Occasionally, a visible virama can be seen, especially in Kashmiri words that end with palatalisation (see palatalisation).

eg.

द्वद्

थऺन्य्

र्‌यथ

This is also used for educational or expository purposes to indicate an isolated consonant sound, such as क् to indicate the consonant sound k.

The ability to form conjuncts also depends on the richness of the font. Where a font is not able to produce a half-form or ligature, etc., it will leave a visible virama glyph below the initial consonant(s) to indicate the missing vowel sound, as illustrated in fig_virama_visible.

ङ्ख — A consonant cluster for which there exists a conjunct form in the Tiro Hindi font (left), but not in the Noto Serif Devanagari font (right). The latter indicates that this is a cluster by showing a visible virama.

An important consequence of representing clusters in this way is that the syllable boundaries are different. For example, if we follow the cluster with a left-positioned vowel sign, it will now appear after the virama, rather than before the cluster, eg. compare the position of the pre-base vowel sign in fig_virama_vowel. This change is also reflected in segmentation of the text for line-breaking, inter-character spacing, etc.

ङ्खि — Positioning of the pre-base vowel sign in relation to the same consonant cluster where a conjunct forms (left) vs. where a visible virama appears (right).

Consonants

	प,ब,त,द,ट,ड,क,ग, ,फ,थ,ठ,ख
	च़,च,ज, ,छ़,छ
	स,ज़,श,ह
	म,ं,न,ं,ं
	व,व,र,ल,य

Basic consonants

Basic set of consonants used for Kashmiri. The basic set of letters is highly phonetic.

Click on each letter for usage notes, alternative pronunciations, and for examples of usage.

प,फ,ब,त,थ,द,ट,ठ,ड,क,ख,ग,च़,छ़,च,छ,ज,स,ज़,श,ह,म,न,व,र,ल,य

Nuktas

Three sounds are written as combinations of 093C and another character.

च़,छ़,ज़

Only one of those combinations exists in atomic form. The other two have to be typed and stored as two characters.

ज़

NFC does not recombine the decomposed version of this character into a precomposed character. Instead, normalisation produces decomposed forms when using both NFC and NFD. So both approaches are canonically equivalent, but the decomposed form is recommended by the Unicode Standard.

Palatalisation

Palatalisation is a frequent feature of Kashmiri words. It is represented using य as the final element of a cluster.

YA forms a conjunct with the preceding consonant.

eg.

त्यम्बॖर

At the end of a word, the YA may be followed by a visible virama.

eg.

थऺन्य्

Usage preceding the inherent vowel is typically transcribed using ê, eg. têmbar. At the end of a word, it is often transcribed using a superscript i, eg. tånⁱ

At the beginning of a word, some care needs to be taken when the palatalisation follows RA, so as to prevent the sequence from forming a repha.

eg.

र्‌यथ

The required rendering can be achieved using 200C. The sequences below show the outcomes with and without the ZWNJ, respectively.

र,्,य,,र्य

र,्,‌,य,र्‌य

Word-internal use of the repha with palatalisation can, however, be seen.

eg.

पऻर्यज़ान

Since they are palatal sounds, the YA is not needed after the following consonants.

च,ज,छ,श

Sanskrit letters

Words directly borrowed from Sanskrit and Hindi may use additional characters that are not normally used in Kashmiri.mkr

Nasals

ण,ञ,ङ

Kashmiri normally uses only 2 of the 5 standard nasal letters in Sanskrit. The missing letters shown just above are normally rendered in Kashmiri using 0902mkr, eg. compare*ब्रह्मण्ड b͓rh͓mɳ͓ɖब्रह्मांड

They may, however, be found occasionally in conjuncts,rt§9 eg. ang in the Kashmiri orthography is typically written as in the top example just below, but may also be written like the the one on the bottom.

अंग

अङ्ग

On the other hand, they normally never appear outside of a conjunct, ie. ganapatʰ is more properly written in Kashmiri as per the top example, rather than the Sanskrit one on the bottom.

गनपथ gnptʰ

गणपथ gɳptʰ

That said, some writers will nonetheless use the Sanskrit forms.rt§9

Voiced aspirated plosives

भ,ध,ढ,झ,घ

The voiced aspirated plosive letters of Devanagari shown just above may be used to write Sanskrit words, or those words may be written without, eg. dharma may be written धर्म using Sanskrit letters, or दर्म in the Kashmiri style.rt§9

Others

ष,क्ष,ज्ञ

The letter and the two special conjuncts listed just above are also not used in Kashmiri, although they may pop up sometimes in words borrowed directly from Sanskrit.

Onsets

Clusters of consonant letters at the beginning of an orthographic syllable occur in Kashmiri, and they are generally handled as described in the section clusters.

However, a medial र is rendered idiosyncratically, as is a medial य (see palatalisation).

Medial RA

When ra follows another consonant or consonants in a syllable onset, it is typically rendered as a small, diagonal line pointing downwards to the left, eg.

क्र,ग्र,ब्र,ह्र,श्र

After त, however, it produces:

त्र

After 5 other consonants, it is rendered as an upside-down v shape below, ie.

ट्र,ठ्र,ड्र,ढ्र,छ्र

Codas

Syllable codas are typically represented by ordinary consonant letters. When the following syllable has a consonant onset, the coda and onset will typically form a conjunct (see clusters).

Consonant clusters involving an -r coda have special joining forms. Kashmiri also has a dedicated diacritic.

RA coda

When RA precedes a consonant, it is rendered as a small hook above that consonant, typically above the rightmost vertical line. Where it precedes a cluster of 2 more consonants, it is aligned with the vertical line of the trailing consonant. Examples:

र्क,र्ल,र्य,र्स्प

However, if there is a spacing vowel sign with a vertical line to the right of the cluster, it aligns with that, eg.

र्का,र्की

Coda diacritics

0902 represents a nasal that is homorganic with a following consonant. It is positioned over the previous consonant or vowel sign.mkr

eg.

पॖंच़ॗह

ज़ॊंग

See also the candrabindu diacritic, which nasalises a vowel.

The visarga is not used in Kashmiri.rt§8

Consonant length

Gemination and consonant lengthening are handled using the normal approach to consonant clusters (see clusters).

Consonant sounds to characters

This section maps Kashmiri consonant sounds to common graphemes in the Devanagari orthography.

The list includes consonant letters are not normally found in Kashmiri words, and generally occur only in loan words that have kept their original spelling. They are often replaced with one of the consonants used for Kashmiri sounds.

Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.

Onsets

consonant प

pʰ

consonant फ

consonant ब

consonant भ Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant त

tʰ

consonant थ

t͡s

consonant च़

t͡sʰ

consonant छ़

t͡ʃ

consonant च

t͡ʃʰ

consonant छ

consonant द

consonant ध Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

d͡ʒ

consonant ज

consonant झ Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant ट

ʈʰ

consonant ठ

consonant ड

consonant ढ Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant क

kʰ

consonant ख

kʂ

consonant cluster क्ष Spelling retained in Sanskrit loans.

consonant ग

consonant घ Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

ɡj

consonant cluster ज्ञ Spelling retained in Sanskrit loans.

consonant व

consonant स

consonant ज़

precomposed consonant ज़

consonant श

consonant ष Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant ह

consonant म

final nasal ं Coda.

consonant न

final nasal ं Coda.

consonant ञ Rare. Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant ण Spellings retained in Sanskrit loans.

final nasal ं Coda.

consonant ङ Rare. Retained in unassimilated loan words. Often replaced by mainstream Kashmiri consonants.

consonant व

consonant र

vocalic vowel sign ृ

vocalic independent vowel ऋ

riː

vowel sign ॄ

independent vowel ॠ

consonant ल

consonant/palatalisation marker य

palatalisation marker य् Palatalisation indicator.

consonant/palatalisation marker य Palatalisation indicator.

Encoding choices

This section looks at alternative strategies for typing and storing letters used by Kashmiri, taking into consideration the effects of normalising the text using Unicode Normalisation Form D (NFD), and Normalisation Form C (NFC).

Vowel signs

The single code points on the left should be used, and not the sequences on the right, because they are not made the same by normalisation. Therefore the content will be regarded as different, which will affect searching and other operations on the text.

Use	Do not use
094B	093E 0947
094C	093E 0948
094A	093E 0946
093B	093E 093A

The next table shows vowel signs that were rendered obsolete by recent standardisation work. Use the characters on the left, rather than those on the right. (See previousOrthographies.)

Use	Do not use
0956	0945 0941
0957	0945 0942
093A	0945 except where it is used by a few organisations to represent the sound ɔ. 093D
093B	0949 except where it is used by a few organisations to represent the sound ɔː.
0946	0947 02BC
094B	094B 02BC
094F	व

Independent vowels

Again, the single code points on the left should be used, and not the sequences on the right, because they are not made the same by normalisation.

Use	Do not use
आ	0905 093E
ॳ	0905 093A
ॴ	0905 093B
ओ	0905 094B
औ	0905 094C
ऒ	0905 094A
ॶ	0905 0956
ॷ	0905 0957
ऐ	090F 0947
ऎ	090F 0946

The next table shows vowel signs that were rendered obsolete by recent standardisation work. Use the characters on the left, rather than those on the right. (See previousOrthographies.)

Use	Do not use
ॶ	0945 0941
ॷ	0945 0942
ॳ	ॲ except where it is used by a few organisations to represent the sound ɔ. ऽ
ॴ	ऑ except where it is used by a few organisations to represent the sound ɔː.
ऎ	0947 02BC
ओ	094B 02BC
ॵ	व

Consonants

The table just below shows precomposed and decomposed representation of a Kashmiri letter which are treated as canonically equivalent by Unicode, meaning that you can use either. The Unicode Standard, however, recommends the use of the decomposed version, because normalisation does not reconstitute the precomposed from the decomposed.

Recommended	Not recommended
ज़	ज़

Glyph shaping & positioning

You can experiment with examples using the Kashmiri character workbench.

Glyph joining

Within a Kashmiri word, spacing glyphs are typically joined together at the top bar (shirorekha).

eg.

काहवऺट

The top bar extends across or through most spacing letters, including both consonants and vowels, but some letters create a gap in the line (while still joining at either side). Two such letters can be seen in the following example.

अथॖ

Characters that create these gaps include digits and the following:

ॶ,ॷ,ॳ,ऒ,ॴ,ओ,अ,आ,ॵ,औ,थ,श

Alignment of the top bar may be appropriate when mixing text of different sizes (see initials). Also, when Gurmukhi text is mixed with another script that also has a top bar, such as Devanagari, the top bars of both scripts may need to be aligned.

Context-based shaping & positioning

Context-based shaping

The shape of a character when displayed can vary, often dramatically, according to the context.

One very common example in most indic scripts is the handling of 'conjunct consonants', ie. groups of consonants with no intervening vowel sounds. Since consonants in indic scripts have an inherent vowel sound, when two consonants are combined this way you have to indicate that the vowel of the initial consonant is suppressed. This is normally done by altering the shape of the first consonant, or merging the shape of the two consonants.

To tell the font to do this, in Unicode you add 094D between the two consonants. This produces the change in the shapes of the glyphs that indicates to the reader that this is a conjunct. The actual outcome is font dependent. For the word below which contains a conjunct of two ल characters (making a long L sound) you may see a 'half-form' used for the first LA (shown on the left) or you may see (as shown on the right) a ligated form.

दिल्ली — Alternative representations of a geminated l consonant.

There are other types of context-based shaping, which are font specific. One is shown below. The width of the glyph for 093F differs according to the base character to which it is attached.

हालाँकि — Context-sensitive shaping of the glyph for i.

प्रचलित — Context-sensitive shaping of the glyph for i.

Multiple combining characters

Diacritics regularly combine with a vowel sign attached to the same consonant or consonant cluster. The example below shows two combining characters that are positioned above the base character in a very common form of the verb 'to be'. One is 0948, and the other the nasalisation mark 0902.

हैं — Multiple combining characters over one base character.

Context-based positioning

Combining characters need to be placed in different positions, according to the context.

The example on the left below displays the dot (anusvara) immediately over the long vertical stroke. The example to the right has moved the dot slightly to the right in order to accomodate the vowel sign.

अंधे — Context-sensitive placement of the anusvara diacritic.

में — Context-sensitive placement of the anusvara diacritic.

In the following the image to the left shows the normal position of 0942, beneath the first letter. The example on the right shows that character displayed higher up and to the right when combined with the base character र.

पूजा — Context-dependent placement of the glyph representing ra.

परू — Context-dependent placement of the glyph representing ra.

phrase	, ; :
sentence	। ? !
paragraph	॥

	start	end
standard	(	)

Notes, footnotes, etc

See inlinenotes for purely inline annotations, such as ruby or warichu. This section is about annotation systems that separate the reference marks and the content of the notes.

	labial	dental	alveolar	post- alveolar	retroflex	palatal	velar	glottal
stops	p b	t d			ʈ ɖ		k ɡ
aspirated	pʰ	tʰ			ʈʰ		kʰ
affricates		t͡s		t͡ʃ d͡ʒ
aspirated		t͡sʰ		t͡ʃʰ
fricatives			s z	ʃ				h
nasals	m		n
approximants	w		l			j
trills/flaps			r

Devanagari, Kashmiri

Sample

Usage & history

Basic features

Character index

Letters

Basic consonants

Sanskrit/Hindi consonants

Vowel letters

Vocalic

Not used for modern Kashmiri

Combining marks

Vowel marks

Vocalic

Finals

Other

Not used for modern Kashmiri

Punctuation

ASCII

Other

To be investigated

Structure

Phonology

Vowel sounds

Plain vowels

Diphthongs

Consonant sounds

Tone

Vowels

Inherent vowel

Post-consonant vowels

Plain vowels

Diphthongs

Vowel length

Nasalisation

Standalone vowels

Vowel components

Pre-base vowel sign

Vowel sign placement

Previous orthographies

Vowel sounds to characters

Plain vowels

Diphthongs

Vocalics

Vowel absence

Unmarked vowel absence

Conjuncts

Conjunct formation

Conjoined half-forms

Vertical stacks

Ligated conjuncts

Conjuncts with RA

Using ZWJ & ZWNJ

Visible virama

Consonants

Basic consonants

Nuktas

Palatalisation

Sanskrit letters

Nasals

Voiced aspirated plosives

Others

Onsets

Medial RA

Codas

RA coda

Coda diacritics

Consonant length

Consonant sounds to characters

Onsets

Encoding choices

Vowel signs

Independent vowels

Consonants

Numbers, dates, currency, etc

Text direction

Glyph shaping & positioning

Glyph joining

Context-based shaping & positioning

Context-based shaping