Assyrian orthographic notes

Usage & history

Origins of the Syriac script, 6thC – today.

Phoenician

└ Aramaic

└ Syriac

+ Hebrew

+ Nabataean

+ Palmyrene

+ Hatran

+ Mandaic

+ Elymaic

+ Pahlavi

+ Kharosthi

+ Brahmi

Ethnologue lists around 600,000 speakers of Assyrian Neo-Aramaic, also know as Suret, in all countries. These speak a number of dialects, with relatively high mutual intelligibility. One prestige dialect that arose from missionary activity in the mid 1800s is Urmian, with users located in Iran. A more recent standard, Iraqi Koine, developed in the 20th century. Other major dialects include Nineveh Plains and Ashiret. See Wikipedia for a map of distribution.

Instability throughout the Middle East over the past century has led to a worldwide diaspora of Assyrian speakers, with most speakers now living abroad in such places as North and South America, Australia, Europe and Russia, but the homeland includes Upper Mesopotamia, Iranian Azerbaijan, southeastern Anatolia and the northeastern Levant, which is a large region stretching from the plain of Urmia in northwestern Iran through to the Erbil, Kirkuk and Duhok regions in northern Iraq. Speakers of Assyrian are ethnic Assyrians and are the descendants of the ancient inhabitants of Mesopotamia.ws

ܣܘܼܪܝܬ

The orthography used to write Assyrian Neo-Aramaic derives from the Estrangela form of the Syriac script, which dates from the 1st century AD. The Madnhaya, or 'eastern', version formed as a form of shorthand developed from Esṭrangela and progressed further as handwriting patterns changed. Modern usage differs from the orthography used for Syriac in that it usually includes vowel diacritics.

Basic features

Syriac is, in principle, an abjad. The script relies mostly on consonant sounds to write words, although in Modern Aramaic written in Syriac vowel sounds tend to be written using diacritics, making it more like an alphabet. See the table to the right for a brief overview of features for the modern Assyrian Neo-Aramaic orthography.

The Syriac script has three main orthographic systems: maḏnḥāyā (ܡܲܕ݂ܢܚܵܝܵܐ) (eastern), ʾesṭrangēlā (ܐܣܛܪܢܓܠܐ), and serṭā (ܣܶܪܛܳܐ) (western). Assyrian Neo-Aramaic uses a version of the maḏnḥāyā orthography, derived from East Syriac texts. However, the Estrangelo style may be used for titlesr,5.

Words in Syriac are separated by spaces.

Text runs from right to left in horizontal lines. Numbers run left to right within the right to left flow.

❯ consonantSummary

All the letters in the Syriac block are consonants. There are 22 basic consonant letters, but these can be combined with one of 2 diacritics to create 3 additional sounds. Six letters can represent either plosives (hard sounds) or fricative sounds (soft), but diacritics usually indicate which is which.

Geminated consonants are indicated by the vowels used. There is no equivalent of the Arabic sukun.

Other diacritics are used to describe sounds in long consonant clusters, indicate unpronounced consonants, identify plural forms, and disambiguate identical words in unpointed text.

❯ basicV

Modern Aramaic written in Syriac is usually fully pointed, making it more an alphabet than an abjad. There are however obligatory points and optional diacritics. For vowels, Assyrian Neo-Aramaic uses a set of dotted diacritics. (rather than the Greek symbols used in western orthographies). There are 3 matres lectionis, and 6 vowel diacritics.

Standalone vowels are written using ܐ as a carrier. ❯ standalone

There is no equivalent of the Arabic sukun to indicate clusters of consonant sounds. ❯ clusters

Assyrian Aramaic normally uses ASCII digits, but there is also a native numbering system based on alphabetic characters.

Phonology

These are sounds of the Assyrian Neo-Aramiac language, but take into account some dialectal variation.

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Vowel sounds

Plain vowels

Diphthongs

Consonant sounds

	labial	dental	alveolar	post- alveolar	palatal	velar	uvular	pharyngeal	glottal
stops	p b		t d			k ɡ	q		ʔ
emphatic			tˤ
affricates				t͡ʃ d͡ʒ
fricative	f v	θ ð	s z	ʃ ʒ		x ɣ		ħ ʕ	h
emphatic			sˤ
nasal	m		n
approximants, trills, flaps	w		l r		j

Among most Assyrian Neo-Aramaic speakers, the pharyngeal ʕ is pronounced as ʔ or ∅, or geminates a previous consonant.

Show notes on dialectal variations, taken directly from Wikipedia:

In Iraqi Koine Assyrian and many Urmian & Northern dialects, the palatals c, ɟ and aspirate cʰ are considered the predominate realisation of k, g and aspirate kʰ.
The phoneme ħ is only used by Assyrian-speakers under larger Arabic influence. In most dialects, it is realised as x. The one exception to this is the dialect of Hértevin, which merged the two historical phonemes into ħ, thus lacking x instead.
The pharyngeal ʕ, represented by the letter `e, is a marginal phoneme that is generally upheld in formal or religious speech. Among the majority of Assyrian speakers, `e would be realised as aɪ̯, eɪ̯, ɛ, j, deleted, or even geminating the previous consonant, depending on the dialect and phonological context.
f is a phoneme heard in the Tyari, Barwari and Chaldean dialects. In most of the other Assyrian varieties, it merges with p. though f is found in loanwords for these varieties of Assyrian.
The phonemes t and d have allophonic realisations of θ and ð (respectively) in most Lower Tyari, Barwari and Chaldean dialects, which is a carryover of begadkefat from the Ancient Aramaic period.
In the Upper Tyari dialects, θ is realised as ʃ or t; in the Marga dialect, the t may at times be replaced with s.
In the Urmian dialect, w has a widespread allophone ʋ (it may vacillate to v for some speakers).
In the Jilu dialect, q is uttered as a tense k. This can also occur in other dialects.
ɡ is affricated, thus pronounced as d͡ʒ in some Urmian, Tyari and Nochiya dialects. k would be affricated to t͡ʃ in the same process.
ɣ is a marginal phoneme that occurs across all dialects. Either a result of the historic splitting of g, through loanwords, or by contact of x with a voiced consonant.
ʒ is found predominately from loanwords, but, in some dialects, also from the voicing of ʃ (e.g. (ḥašbunā) xaʒbu:na:, counting, from the root ḥ-š-b, to count) as in the Jilu dialect or the fortition of j (e.g. Urmiynāyā > Urmižnāyā Uɾ:mɪ:ʒna:ja:, Urmian from mija water)
n can be pronounced ŋ before velar consonants x and q and as m before labial consonants.

Tone

Assyrian Neo-Aramaic is not a tonal language.

Structure

tbd

Vowels

The phonetics described here are based on the particular dialect mentioned at the top of this page. There are a number of different dialects which tend to write the text the same way, but pronounce it differently. For more detail, see Wikipedia.

Vowel summary table

This table summarises basic vowel to character assignments.

	post-consonant	standalone
Plain	ܝܼ␣ ␣◌ܸ␣ ␣ܘܼ	ܐܝܼ␣ ␣ܐܸ␣ ␣ܐܘܼ
	◌ܹ␣◌ܹܝ␣ ␣◌ܸ␣ ␣◌ܘܿ	ܐܹ␣ܐܹܝ␣ ␣ܐܸ␣ ␣ܐܘܿ
	◌ܵ␣◌ܲ	ܐܵ␣ܐܲ

For additional details see vowel_mappings.

Post-consonant vowels

There are 3 matres lectionis, and 6 vowel diacritics.

Combining marks used for vowels

Two of the following diacritics are only used in combination with a mater lectionis (see vletter). Other vowels are expressed by simply applying diacritics to a consonant letter. This is the complete set of diacritics used for vowels.

◌ܼ␣◌ܸ␣◌ܹ␣◌ܿ␣◌ܲ␣◌ܵ

Consonant sounds following 0732 and 0738 are usually geminated.

Two diacritics for one base. The sound ija can be written with a single yodh consonant rather than 2, and vowel diacritics both above and below it, eg. see the sequence ܝܼܵ in the word ܐܝܼܛܵܠܝܼܵܐ in fig_italia. Note that both combining marks must follow the YUDH.

show composition

ܐܝܼܛܵܠܝܼܵܐ

Consonants representing vowels (matres lectionis)

Three consonants are used in combination with diacritics to represent vowels.

ܐ␣ܝ␣ܘ

ܐ is usually found at the beginning or end of a word. Words that begin with a vowel sound typically start with this letter, carrying a vowel diacritic, or preceding one of the other two. At the end of a word it is usually silent.

ܐܲܩܠܵܐ

ܘ and ܝ, when used as a vowel, always have a dot above or below, and those dots are only used in conjunction with those letters. The possibilities are as follows.

i ܝܼ

ܡܝܼܫ̰
u ܘܼ

ܩܘܼܦܬܵܐ
o ܘܿ

ܡܘܿܕܵܐ

Multipart vowels

The 4 multipart vowels listed here all consist of a diacritic and a mater lectionis. Diphthongs and glides are not included here.

Click on the letters for examples.

ܝܼ␣ܘܼ␣◌ܹܝ␣ܘܿ

Standalone vowels

At the beginning of a word, all vowels are attached to or follow a silent ܐ.

i	ܐܝܼ	ɪ	ܐܸ	u	ܐܘܼ
e	ܐܹ ܐܹܝ	ə	ܐܸ	o	ܐܘܿ
		a	ܐܲ	ɑ	ܐܵ

Simplified table of word-initial vowel sounds with ALAPH as the base.

Vowel sounds to characters

This section maps Assyrian Neo-Aramaic vowel sounds to common graphemes in the Madnhaya orthography.

Plain vowels

mater lectionis + vowel ܝܼ May also be transcribed as iː.

vowel diacritic ܸ

mater + vowel ܘܼ

vowel diacritic ܹ

composite vowel ܹܝ

composite vowel 0735 071D

mater + vowel ܘܿ Also transcribed as oː, ʊ, or ʊː.

mater + vowel 0732 0718

vowel diacritic 0738

vowel diacritic 0732 071D

Also æ,ä, or ɐ.

vowel diacritic ܲ

Also aː,ɑː, or a.

vowel diacritic ܵ

Diphthongs and other combinations

vowel diacritic ܲܝ

vowel diacritic ܲܘ

Consonants

Consonant summary table

This table summarises basic consonant to character assignments.

The right column shows the use of diacritics to disambiguate various sounds.

	normal	disabiguated
	ܦ␣ܒ␣ܬ␣ܛ␣ܕ␣ܟ␣ܓ␣ܩ␣ܐ	ܦ݁␣ܒ݁␣ܬ݁␣ܕ݁␣ܟ݁␣ܓ݁
	ܟ̰␣ܓ̰
	ܦ̮␣ܒ݂␣ܬ݂␣ܕ݂␣ܣ␣ܙ␣ܨ␣ܫ␣ܫ̰␣ܚ␣ܓ݂␣ܟ݂␣ܥ␣ܗ
	ܡ␣ܢ
	ܘ␣ܪ␣ܠ␣ܝ

For additional details see consonant_mappings.

Basic consonants

All the letters in the Syriac block are consonants. There are 22 basic consonants, but these can be combined with one of 3 diacritics to create additional sounds. See consonantSummary for the combinations as well as the simple consonants.

Click on each letter for more details and for examples of usage, especially where more than one sound is indicated.

ܦ␣ܒ␣ܬ␣ܕ␣ܛ␣ܟ␣ܓ␣ܩ␣ܐ␣ܣ␣ܙ␣ܨ␣ܫ␣ܚ␣ܥ␣ܗ␣ܡ␣ܢ␣ܘ␣ܪ␣ܠ␣ܝ

ܐ is also regarded as a mater lectionis. Its use is described in matres.

Assyrian Neo-Aramaic uses many diacritics to produce additional sounds from the basic set of Syriac consonants. Hard and soft diacritics and the maǧlīyānā extend the consonant repertoire; the marhtana is used with 3-consonant clusters; talqana silences consonants; syame indicates plural forms; and there are some additional marks used to disambiguate words.

Hard and soft sounds

݂␣݁␣̮

Six Syriac consonant symbols represent two sounds, one 'hard' and one 'soft'. The hard sound is an unaspirated plosive, the soft sound is an aspirated fricative. The intended sound of the letter can be made explicit using diacritics.

In the maḏnḥāyā style, soft form marks may be omitted if they would interfere with the vowel marks. For native words, softening depends on the letter's position within a word or syllable, location relative to other consonants and vowels, gemination, etymology, and other factors. Foreign words do not always follow the rules for softening.w

Hard form. In principle, a high dot indicates the hard form. The code point to use is 0741. However, in Assyrian it is not normally used.

ܦ݁␣ܒ݁␣ܬ݁␣ܕ݁␣ܟ݁␣ܓ݁

Soft form. A low dot indicates the soft form for 5 plosives. The code point to use is 0742. However, to produce f use ܦ̮w.

ܦ̮␣ܒ݂␣ܬ݂␣ܕ݂␣ܟ݂␣ܓ݂

When it is used with ܕ, which already has a dot below, the two dots appear side by side, ie. ܕ݂ d‐̣

Repertoire extension (maǧlīyānā)

̰␣̃

0330, called maǧlīyānā (ܡܲܓ̰ܠܝܼܵܢܵܐ) is used to represent sounds that are not present in Classical Syriac, and is typically found in loan words.

Davisr,35-6 lists only 3 uses, all of which appear below the base consonant:

d͡ʒ	ܓ̰	ܓ̰ܵܘܹܓ̰
t͡ʃ	ܟ̰	ܟ̰ܹܟܡܲܟ̰ܵܐ
ʒ	ܫ̰	ܡܝܼܫ̰

Hobermand,506 lists one additional combination, but no new sounds, however he places the tilde above the base consonant. In this position, the diacritic is still called maǧlīyānā, but uses the code point 0303.

ʒ	ܙ̃
ʒ	ܫ̃

Plurals (syame)

0308 is used to represent the Syriac syame (ܣܝ̈ܡܐ), which indicates plural nouns, adjectives and participles. It is needed for unpointed text because many plural words would otherwise look the same as the singular word, eg. the following could be read as either malkā king or as malkē kings.

ܡܠܟܐ mlk̋ʾ

Instead, the plural form can be written

ܡܠܟ̈ܐ mlk̋ʾ

Some modern usage omits this diacritic when vowel marks are present, because it is redundant, however it is still generally used.

ܡܲܠܟܵܐ ܡܲܠܟܹ̈ܐ

Although it's not strictly needed, even in unpointed text, for non-regular words, it is also used for them, eg. ܒܲܝܬܵܐ ˈbaj.tɑ house ܒܵܬܹ̈ܐ bɑtte houses

An author can place this mark above any letter in a word, but if the word contains one or more ܪ the mark is generally placed over the one which is nearest the word end, and replaces the single dot above it, eg.

ܢܘܼܟ݂ܪ̈ܵܝܹܐ

Other likely locations include low rising letters, and letters near the middle or end of a word.w

Disambiguation marks

̇␣̣␣݀

Diacritics can be used to disambiguate the pronunciation of otherwise identical-looking words in unpointed text. For example:

ܩ̇ܛܠܬ qᵵlt I killed ܩܛ̣ܠܬ qᵵlt you (m.) killed ܩܛܠܬ݀ qᵵlt she killed

Dots (ṭipā)

0307 and 0323 were used for unpointed Classical Syriac to disambiguate certain letters, morphemes or words, and they are still in use for Assyrian in a few words, eg. compare ܡ̇ܢ ṁn man who ܡ̣ܢ ṃn mɪn from

The dot may also be written over the 3rd person feminine suffix.

ܐܝܼܕ݂ܵܗ̇

ܐ݇ܬܹܐ ܠܵܗ̇

Feminine marker

ܬ݀ is a feminine marker used with ܬ to indicate a feminine suffix. East Syriac fonts should render as two dots below the base letter, whereas West Syriac fonts render as a single dot to the left of the base, eg. compare in the Eastern (top) and Western (bottom) orthographies in fig_feminine (click on the images to see the underlying code points):

ܕܲܫܘܵܬ݀ — The same word in Eastern (left) and Western (right) script styles, showing the different appearance of the feminine marker in each (coloured here, after or below the last character.)

ܕܰܫܘܳܬ݀ — The same word in Eastern (left) and Western (right) script styles, showing the different appearance of the feminine marker in each (coloured here, after or below the last character.)

There appear to be no words in the Wiktionary list that use this diacritic, and it isn't mentioned in Davis, even though there is a specific shape for eastern script styles.

Isolated forms

Isolated versions of 3 letters, such as may be found in counter styles, are usually presented as a doubled letter, using intial and final forms, ie. ܟܟ k ܡܡ m ܢܢ n

The letter ܟ when handwritten alone may also look like ܟـ k

Single letter words

Four short, single letter words are written with the word that follows them, not separate. They are:

ܒ ܕ ܘ ܠ

Before a word that begins with a vowel, or a consonant followed by a vowel, these four words have no vowel markings. If the next consonant is not followed by a vowel, however, they are written with a following 0732. For example, compare:

ܒܐܵܗܵܐ

ܘ before ܝ is pronounced u.

Silent letters (talqana)

݇␣ܑ␣݈

0747 is used in the Eastern style to indicate letters that are not pronounced. It is frequently used in the modern Aramaic koine to bridge difference in dialects. For example, ܒܬ݇ܪ is pronounced baθar in some modern dialects, harking back to the classical pronunciation, but bar in Urmi and the koine.

ܐ݇ܟ݂ܵܠ݇ܪܲܡܫܵܐ ܫܹܢ݇ܬܵܐ

The letters ܐ, ܥ, ܗ, and ܝ, when included for etymological reasons, are often silent, though without using the talqana.n

The Unicode Standard says that 0748 is used in a similar way.u

0711 is used in East Syriac texts to indicate an etymological alaph, eg. ܩܲܖ݄ܡܵܝܑܼܬ̣ qaḋ‒݄māyˈit‒̜

Consonant clusters (marhᵊtˤɑnɑ)

There is no equivalent to the Arabic sukun to indicate clusters of consonant sounds.

See, however, the note about collapsing 2 yodh characters to 1 in combiningV.

̄␣̱

0331 and 0304 are used with sequences of 3 consonants. The first lengthens the middle consonant, while the second adds a short epenthetic sound to aid pronunciation.

More diacritics are described in diacritics.

Consonant length

The short a and ɪ vowels are only used in closed syllables, so if they are followed by an intervocalic consonant, it indicates that the consonant is doubled,d

ܣܲܡܲܐ

Consonant sounds to characters

This section maps Assyrian Neo-Aramaic consonant sounds to common graphemes in the Eastern Syriac orthography.

0726072607260726 consonant ܦ

0712071207120712 consonant ܒ

072C072C consonant ܬ

tˤ

071B071B071B071B consonant ܛ

t͡ʃ

071F071F071F071F consonant ܟ̰

07150715 consonant ܕ

d͡ʒ

0713071307130713 consonant ܓ̰

071F071F071F071F consonant ܟ

0713071307130713 consonant ܓ

0729072907290729 consonant ܩ

0710071007100710 consonant/mater lectionis ܐ

0726072607260726 consonant ܦ̮

0712071207120712 consonant ܒ݂

072C072C consonant ܬ݂

07150715 consonant ܕ݂

0723072307230723 consonant ܣ

sˤ

07280728 consonant ܨ

07190719 consonant ܙ

072B072B072B072B consonant ܫ

072B072B072B072B consonant ܫ̰

072B072B072B072B consonant ܫ̃

071A071A071A071A consonant ܚ

0713071307130713 consonant ܓ݂

071F071F071F071F consonant ܟ݂

0725072507250725 consonant ܥ

07170717 consonant ܗ

0721072107210721 consonant ܡ

0722072207220722 consonant ܢ

07180718 consonant ܘ

consonant ܒ when syllable-final.

072A072A consonant ܪ

0720072007200720 consonant ܠ

071D071D071D071D consonant/mater lectionis ܝ

	1	2	3	4
assyrian (additive)	ܐ	ܒ	ܓ	ܕ

	11	22	33	44
assyrian (additive)	ܝܐ	ܟܒ	ܠܓ	ܡܕ

	111	222	333	444
assyrian (additive)	ܩܝܐ	ܪܟܒ	ܫܠܓ	ܬܡܕ

Text direction

Syriac script is written horizontally, right-to-left. Like other RTL scripts, such as Arabic and Hebrew, modern numbers and text in LTR scripts are displayed left-to-right (producing 'bidirectional' text).

ܘܝܩܝܦܕܝܐ ܗܘ ܬܪܡܝܬܐ ܕܡܛܟܣܬܐ ܕܘܝܩܝܡܝܕܝܐ ܘܫܬܐܣ ܒ15 ܟܢܘܢ ܒ 2001 ܒܠܫܢܐ ܐܢܓܠܝܐ ܘܡܩܝܡܢܐ ܕܘܝܩܝܦܕܝܐ ܗܘ ܓܝܡܝ ܘܝܠܙ (Jimmy Wales). — Bidirectional Syriac text. Numbers and Latin text (highlighted) are read left-to-right, and the rest of the text flows right-to-left.

The Unicode Bidirectional Algorithm automatically takes care of the ordering for all the text in fig_bidi_text, as long as the 'base direction' is set to RTL. In HTML this can be set using the dir attribute, or in plain text using formatting controls.

If the base direction is not set appropriately, the directional runs will be ordered incorrectly as shown in fig_bidi_no_base_direction.

ܐܝ ܦܝ (IP) ܕܝܠܟ ܢܬܟܬܒ ܒܬܫܥܝܬܐ ܕܦܐܬܐ. — The exact same sequence of characters with the base direction set to RTL (top), and with no base direction set on this LTR page (bottom).

Show default bidi_class properties for characters in the Assyrian Neo-Aramaic orthography described here.

For other aspects of dealing with right-to-left writing systems see the following sections:

directioncontrols
expressions
breaking_latin
mirrored_characters
page

For more information about how directionality and base direction work, see Unicode Bidirectional Algorithm basics. For information about plain text formatting characters see How to use Unicode controls for bidi text. And for working with markup in HTML, see Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts.

Managing text direction

Unicode provides a set of 10 formatting characters that can be used to control the direction of text when displayed. These characters have no visual form in the rendered text, however text editing applications may have a way to show their location.

202B (RLE), 202A (LRE), and 202C (PDF) are in widespread use to set the base direction of a range of characters. RLE/LRE comes at the start, and PDF at the end of a range of characters for which the base direction is to be set.

In Unicode 6.1, the Unicode Standard added a set of characters which do the same thing but also isolate the content from surrounding characters, in order to avoid spillover effects. They are 2067 (RLI), 2066 (LRI), and 2066 (PDI). The Unicode Standard recommends that these be used instead.

061C (ALM) is used to produce correct sequencing of numeric data. Follow the link and see expressions for details.

There is also 2068 (FSI), used initially to set the base direction according to the first recognised strongly-directional character.

200F (RLM) and 200E (LRM) are invisible characters with strong directional properties that are also sometimes used to produce the correct ordering of text.

For more information about how to use these formatting characters see How to use Unicode controls for bidi text. Note, however, that when writing HTML you should generally use markup rather than these control codes. For information about that, see Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts.

Expressions & sequences

A sequence of European numbers, for example a range separated by hyphens, runs from right to left in the Syriac script (and Arabic or Thaana scripts), whereas for Persian, Hebrew, N’Ko or Adlam scripts it runs left to right.

fig_range shows some Syriac text, which is right-to-left overall, containing a numeric range that is ordered RTL, ie. it starts with 240 and ends with 250.

ܛܪܦܐ 240-250 ܩܘܼܛܢ — A numeric range in Syriac language text.

The Unicode Bidirectional Algorithm automatically produces the expected ordering when a sequence or expression follows Syriac characters. However, a sequence that appears alone on a line doesn't benefit from this, so to make the text appear correctly for Syriac you should add 061C (ALM) at the start of the line (see fig_ALM). This is an invisible formatting character.

A numeric date alone on a line of RTL text, with ALM before it (top), and without (bottom). (Click on each line to see the code points.)

Similar special ordering is applied to numbers in equations, such as 1 + 2 = 3, for Syriac language text.

For additional details on how direction of ranges interacts with surrounding characters and separators used, see the section Expressions & sequences in the Modern Standard Arabic orthography description.

Glyph shaping & positioning

Experiment with examples using the Assyrian Neo-Aramaic character app or the Syriac character app.

Font styles

Syriac has 3 major variant writing styles. The code points for the consonant letters are the same, but the shapes of the letters and code points and shapes of vowel diacritics can vary significantly. fig_writing_styles shows the differences using typical fonts for each style.

ܒܪܫܝܬ ܐܝܬܘܗܝ ܗܘܐ ܡܠܬܐ. — The opening words of the Gospel of St John in (top to bottom) Estrangelo, Eastern Syriac and Western Syriac. Source `w,#Alphabet_forms`

ܒ݁ܪܹܫܝܼܬ݂ ܐܝܼܬ݂ܵܘܗ݇ܝ ܗ݇ܘܵܐ ܡܸܠܬ݂ܵܐ. — The opening words of the Gospel of St John in (top to bottom) Estrangelo, Eastern Syriac and Western Syriac. Source `w,#Alphabet_forms`

Assyrian Neo-Aramaic often uses the Assyrian Estrangela style for headings, which looks like a cross between the typical Assyrian font and the Estrangelo Edessa font.

east syriac with estrangelo headings — An East Syriac text with Assyrian Estrangela styles in the headings.`n,40`

The style of lettering in the title of fig_heading_styles_east uses a special Assyrian style of Estrangela. fig_assyrian_styles shows typical letter shapes for 2 Assyrian styles and Syriac Estrangela. The top line has shapes typically used for normal Assyrian text, and the middle line shows a style used for headings.

Assyrian letter styles — A comparison of letter shapes in Assyrian Adiabene (top; used for standard body text), in Assyrian Estrangela (middle; used for headings), and in Edessa Estrangelo (bottom; used for Syriac). `r,54`

Cursive text

Syriac is cursive, ie. letters in a word are joined up. Fonts need to produce the appropriate joining form for a code point, according to its visual context, but the code point used for a given letter doesn't change.

ܦܘܠܝܛܝܩܝܬܐ

Letters join on the right or both sides in Syriac script.

Eight letters join only to the right.

ܐ␣ܬ␣ܕ␣ܨ␣ܙ␣ܗ␣ܪ␣ܘ

All other consonants join on both sides.

Cursive joining forms

The cursive treatment produces only minor changes to glyph shapes in most cases. A small number of letters, however, exhibit noteworthy changes, especially in word final positions. fig_joining_forms and fig_right_joining_forms show all the basic shapes in Assyrian and what their joining forms look like. Significant variations are highlighted.

isolated	right-joined	dual-join	left-joined	Assyrian letters
ܒ	ـܒ	ـܒـ	ܒـ	ܒ␣ܒ݂
ܦ	ـܦ	ـܦـ	ܦـ	ܦ␣ܦ̮
ܣ	ـܣ	ـܣـ	ܣـ	ܣ
ܩ	ـܩ	ـܩـ	ܩـ	ܩ
ܫ	ـܫ	ـܫـ	ܫـ	ܫ␣ܫ̰
ܛ	ـܛ	ـܛـ	ܛـ	ܛ
ܡ	ـܡ	ـܡـ	ܡـ	ܡ
ܟ	ـܟ	ـܟـ	ܟـ	ܟ␣ܟ݂␣ܟ̰
ܚ	ـܚ	ـܚـ	ܚـ	ܚ
ܝ	ـܝ	ـܝـ	ܝـ	ܝ
ܓ	ـܓ	ـܓـ	ܓـ	ܓ␣ܔ␣ܓ݂␣ܓ̰
ܠ	ـܠ	ـܠـ	ܠـ	ܠ
ܥ	ـܥ	ـܥـ	ܥـ	ܥ
ܢ	ـܢ	ـܢـ	ܢـ	ܢ

Joining forms for shapes that join on both sides.

isolated	right-joined	Assyrian letters
ܐ	ـܐ	ܐ
ܬ	ـܬ	ܬ␣ܬ݂
ܙ	ـܙ	ܙ␣ܙ̰␣ܙ̃
ܨ	ـܨ	ܨ
ܘ	ـܘ	ܘ
ܗ	ـܗ	ܗ
ܕ	ـܕ	ܕ␣ܕ݂␣ܪ

Joining forms for shapes that join on the right only.

Managing glyph shaping

200D (ZWJ) and 200C (ZWNJ) are used to control the joining behaviour of cursive glyphs. They are particularly useful in educational contexts, but also have real world applications.

ZWJ permits a letter to form a cursive connection without a visible neighbour.

ZWNJ prevents two adjacent letters forming a cursive connection with each other when rendered.

Context-based shaping & positioning

Context-based shaping

See just above for shaping related to cursive joining.

Ligatures

Apart from the shaping required to support cursive behaviour, there are also typical ligatures, such as those shown in fig_serto_lig, some of which are optional or font-dependent.

ܐܠܗܝ — Ligatures in East Syriac style orthography.

ܬܫܟܘܚܬܐ — Ligatures in East Syriac style orthography.

Context-based positioning

Sometimes clashes between diacritic marks have to be resolved by repositioning one of the diacritics, or sometimes producing a different solution.

For example, marks are usually centred vertically over or under a base character. If, however, 0742 appears below ܕ when the glyph for that has a dot below, the mark is moved slightly to the right, as shown here.

ܕ݂

Rukkakha moves to the right to accommodate the dot under dalath.

If 0308 appears above　ܪ the mark replaces the single dot above the base letter.

ܪ̈

Combining diaeresis replaces the dot over rish.

In this example, the RISH character carries not only a combining diaeresis, but also a vowel mark, which is moved upwards to ride above the former.

ܪ̈ܵ vs ܝܵ

Rish + diaeresis + vowel mark causes stacking diacritics.

Alaph shaping

A feature of Eastern and Western Syriac styles is that an unjoined alaph within a word has a different shape according to whether or not it is word-final. For example, fig_alaph_joining shows the word ܡܠܘܿܐܵܐ where the 2 alaph characters at the end have different shapes, although both are unconnected.

ܡܠܘܿܐܵܐ — A word showing different shapes for alaph.

Alaph also ligates word-finally with ܬ when following a connecting letter, eg. compare the shaping at the end of ܐܸܓܲܪܬܵܐ and ܐܸܫܬܵܐ (see fig_alaph_ligature).

After ܠ the letter alaph typically has a special, ligated shape, which also appears at word end. fig_alaph_ligature_l shows this in the word ܠܲܝܠܹܐ, however the default font used for Assyrian text on this page (East Syriac Adiabene) doesn't support it (Noto fonts do).

Punctuation & inline features

Phrase & section boundaries

،␣؛␣܆␣܇␣.␣؟

Modern Syriac uses ASCII punctuation and punctuation borrowed from Arabic. For separators at the sentence level and below, the following are used.

phrase	، ؛ ܆ ܇
sentence	. ؟

phrase

،
؛
܆
܇

sentence

Bracketed text

(␣)

Assyrian commonly uses ASCII parentheses to insert parenthetical information into text.

	start	end
standard	(	)

Mirrored characters

The words 'left' and 'right' in the Unicode names for parentheses, brackets, and other paired characters should be ignored. LEFT should be read as if it said START, and RIGHT as END. The direction in which the glyphs point will be automatically determined according to the base direction of the text.

a > b > c — Both of these lines use > U+003E GREATER-THAN SIGN, but the direction it faces depends on the base direction at the point of display.

ܐ > ܒ > ܓ — Both of these lines use > U+003E GREATER-THAN SIGN, but the direction it faces depends on the base direction at the point of display.

The number of characters that are mirrored in this way is around 550, most of which are mathematical symbols. Some are single characters, rather than pairs. The following are some of the more common ones.

(␣)␣<␣>␣[␣]␣{␣}␣«␣»␣‹␣›

Abbreviation, ellipsis & repetition

What characters are used to indicate abbreviation, ellipsis & repetition?

070F (SAM) indicates that a sequence of characters is an abbreviation (see fig_sam_abbrev). The line would ideally have a small circle at the start, middle and end. It normally starts to the left of the nearest tall letter to the end of the abbreviation.

Modern East Syriac texts use a punctuation mark for contractions of this sort.

ܬ܏ܫܒܘ — A Syriac abbreviation mark (using the Estrangela style) applied to an abbreviation (above), and the unabbreviated word (below).

ܬܫܒܘܚܬܐ — A Syriac abbreviation mark (using the Estrangela style) applied to an abbreviation (above), and the unabbreviated word (below).

Other inline features

Numbers

The Syriac abbreviation mark is used in older texts to identify letters used as numbers by drawing a line above them. See numbers for more information.

032D is also used as a digit marker.u

Line & paragraph layout

Line breaking & hyphenation

Basic line-break opportunities occur between the space-separated words.

They are not broken at the small gaps that appear where a character doesn't join on the left.

Show (default) line-breaking properties for characters in the modern Assyrian Neo-Aramaic orthography.

Breaking between Latin words

When a line break occurs in the middle of an embedded left-to-right sequence, the items in that sequence are rearranged visually so that the reading direction remains top-to-bottom. latin-line-breaks shows how two Latin words are apparently reordered in the flow of text to accommodate this rule.

Text with no line break in Latin text. — Syriac (estrangelo) with embedded Latin text. The lower of these two images shows the result of decreasing the line width, so that text wraps between a sequence of Latin words.

Text with line break in Latin text. — Syriac (estrangelo) with embedded Latin text. The lower of these two images shows the result of decreasing the line width, so that text wraps between a sequence of Latin words.

In digital text the rearrangement is automatic. Only the positions of the font glyphs are changed: nothing affects the order of the characters in memory.

Text alignment & justification

Does text in a paragraph needs to have flush lines down both sides? Does the script need assistance to conform to a grid pattern? Does the script allow punctuation to hang outside the text box at the start or end of a line? Where adjustments are need to make a line flush, how is that done? Does the script shrink/stretch space between words and/or letters? Are word baselines stretched, as in Arabic? What about paragraph indents?

ـ can be used, as in Arabic, to lengthen the baseline inside Syriac words.

Observation: It's not clear, however, whether the use of that is for justification, or simply for word stretching. Sometimes a word appears to contain a baseline elongation in order to provide more space for wide diacritics on adjacent bases.

Baselines, line height, etc.

tbd

Syriac uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

To include the long ascenders and descenders in Syriac, plus the (sometimes stacked) diacritics, line heights need to be slightly larger than for English text.

Notes, footnotes, etc

See inlinenotes for purely inline annotations, such as ruby or warichu. This section is about annotation systems that separate the reference marks and the content of the notes.

	he + yudh	ܐܠܗܝ
	taw + alaph	ܬܫܟܘܚܬܐ
	taw + yudh	ܟܚܬܝ

Syriac, Assyrian Neo-Aramaic

Sample

Usage & history

Basic features

Character index

Letters

Consonants

Vowels

Other

Combining marks

Vowels

Other

Punctuation

ASCII

Other

To be investigated

Phonology

Vowel sounds

Plain vowels

Diphthongs

Consonant sounds

Tone

Structure

Vowels

Vowel summary table

Post-consonant vowels

Combining marks used for vowels

Consonants representing vowels (matres lectionis)

Multipart vowels

Standalone vowels

Vowel sounds to characters

Plain vowels

Diphthongs and other combinations

Consonants

Consonant summary table

Basic consonants

Hard and soft sounds

Repertoire extension (maǧlīyānā)

Plurals (syame)

Disambiguation marks

Dots (ṭipā)

Feminine marker

Isolated forms

Single letter words

Silent letters (talqana)

Consonant clusters (marhᵊtˤɑnɑ)

Consonant length

Consonant sounds to characters

Numbers

Native numbering system

Text direction

Managing text direction

Expressions & sequences

Glyph shaping & positioning

Font styles

Cursive text

Cursive joining forms

Managing glyph shaping

Context-based shaping & positioning

Context-based shaping

Ligatures

Context-based positioning

Alaph shaping

Typographic units

Word boundaries

Graphemes

Punctuation & inline features

Phrase & section boundaries

Bracketed text

Mirrored characters

Abbreviation, ellipsis & repetition

Other inline features

Numbers

Line & paragraph layout

Line breaking & hyphenation

Breaking between Latin words

Text alignment & justification

Baselines, line height, etc.

Page & book layout

General page layout & progression