Updated 24 February, 2018 • tags ethiopic, scriptnotes.
This page provides basic information about the Ethiopic script. It is not authoritative, peer-reviewed information – these are just notes I have gathered or copied from various places as i learned. For character-specific details follow the links to the Ethiopic character notes.
For similar information related to other scripts, see the Script comparison table.
Clicking on red text examples, or highlighting part of the sample text shows a list of characters, with links to more details. Click on the vertical blue bar (bottom right) to change font settings for the sample text.
አንቀጽ፡፪፤ እያንዳንዱ፡ሰው፡የዘር፡የቀለም፡የጾታ፡የቋንቋ፡የሃይማኖት፡የፖለቲካ፡ወይም፡የሌላ፡ዓይነት፡አስተሳሰብ፡የብሔራዊ፡ወይም፡የኀብረተሰብ፡ታሪክ፡የሀብት፡የትውልድ፡ወይም፡የሌላ፡ደረጃ፡ልዩነት፡ሳይኖሩ፡በዚሁ፡ውሳኔ፡የተዘረዘሩት፡መብቶችንና፡ነጻነቶች፡ሁሉ፡እንዲከበሩለት፡ይገባል። ከዚህም፡በተቀረ፡አንድ፡ሰው፡ከሚኖርበት፡አገር፡ወይም፡ግዛት፡የፖለቲካ፡የአገዛዝ፡ወይም፡የኢንተርናሽናል፡አቋም፡የተነሳ፡አገሩ፡ነጻም፡ሆነ፡በሞግዚትነት፡አስተዳደር፡ወይም፡እራሱን፡ችሎ፡የማይተዳደር፡አገር፡ተወላጅ፡ቢሆንም፡በማንኛውም፡ዓይነት፡ገደብ፡ያለው፡አገዛዝ፡ሥር፡ቢሆንም፡ልዩነት፡አይፈጸምበትም።
The Ethiopic (Ge'ez) script was developed as the writing system of the Ge'ez language, a Semitic language spoken in Ethiopia and Eritrea until the 10th to the 12th centuries. Although the language ceased to be used in vernacular speech (it now serves a liturgical function only), the script is still widely used for writing the Ethiopian and Eritrean Semitic languages such as Tigré, Amharic and Tigrinya. In some languages, the script is called fidäl (ፊደል), which means 'alphabet', and individual letters are referred to as fidel. The script is believed by many to have derived from the epigraphic South Arabian script, of Proto-Sinaitic heritage, although there is some dispute surrounding this assertion; some also believe it to have descended from Egyptian hieroglyphics. According to the tradition of the Ethiopian Orthodox Tewahedo Church, the script was divinely revealed to Enos, grandson of the first man, Adam. Unlike other Semitic scripts, Ethiopic is written from left to right.
The original Ge'ez script was an abjad - vowels were not written - but the current script is classified as an abugida. Each symbol represents a CV syllable, but vowels are not inherent in the consonant. The original Ethiopic script contained 182 characters, although the basic (unmarked) consonants number only 26. The script has since been extended for other languages and now contains over 500 symbols. Some of the new symbols represent phonological processes such as palatalization, pharyngealization and labialization.
The Ge'ez script has been adapted to write other, mostly Semitic, languages, particularly Amharic in Ethiopia, and Tigrinya in both Eritrea and Ethiopia. It is also used for Sebatbeit, Me'en, and most other languages of Ethiopia. In Eritrea it is used for Tigre, and it has traditionally been used for Blin, a Cushitic language. Tigre, spoken in western and northern Eritrea, is considered to resemble Ge'ez more than do the other derivative languages. Some other languages in the Horn of Africa, such as Oromo, used to be written using Ge'ez, but have migrated to Latin-based orthographies.
The script is a featural syllabary – it resembles an abugida in that vowels are indicated by largely standardised adaptations to the base consonant. See the table to the right for a brief overview of features, taken from the Script Comparison Table.
The script is unicameral, and has only three combining characters, which are rarely used. Characters don't interact, and the baseline is standard.
Ethiopic does have a range of native punctuation. In particular, although words in modern text are increasingly separated by spaces they may be separated by a wordspace character instead.
Ethiopic also has its own numeric digits, which are used in an additive way, rather than in the way numbers are formed in Western text.
Text runs horizontally, left to right.
The Ethiopic script characters in Unicode 10.0 are in the following block:
The following links give information about characters used for languages associated with this script. The numbers in parentheses are for non-ASCII characters.
For character-specific details see Ethiopic character notes.
The Ethiopic syllables in the basic set start with one of the following consonants. The pronunciation listed is for Amharic (which has lost the phonetic distinction between some characters).
The consonant shapes above come from the original Ge'ez script, which was an abjad. The script became an abugida when small changes were added to those shapes to indicate the following vowel sound. The consonants above can be followed by one of 8 vowel sounds (though not all combinations exist). The original consonant shape is known as the 'first order', and the other shapes constitute incremental orders. The illustration below is based on the m consonant.
Three consonants also have a -yä ending:
Then there is a set of common labiovelar consonants, which are followed by only 5 of the vowel sounds.
Additional sets of consonants match the sounds in the various different languages that use the Ethiopic script.
Amharic, Tigrinya, Tigre, and Blin each use a selection from the following set.
The remaining characters, largely including those in the extension blocks, are for writing the sounds of other languages, such as Me'en, Sebatbeit, Gamo-Gofa-Dawro, Basketo, Gumuz, etc. The set of extended characters also includes combinations of the previous characters with an oa vowel sound.
The አ and ዐ series have lost their consonantal values and are vowel carriers in modern Amharic. Though sometimes the glottal stop ʔ is pronounced in word initial and medial positions, it is often dropped, eg. አየሩ ʾäyäru əyyəru the weather.a
Many words end with a consonant followed by no vowel. These are written using the ə syllable, eg. ስም sǝmǝ sɨm name, however the syllable is ambiguous – in some cases the vowel could be read.
The same syllable is also used for clusters of consonants with no intervening vowels, eg. ኢትዮጵያ ʾitəyop̣əya ʾītyōṗṗyā Ethiopia.
Doubled consonants do occur in Amharic and other languages that use the Ethiopic script, and they can be important to distinguish one word from another. However, they are not marked in the script (see the previous example for Ethiopia).
According to Wikipediaa, Ethiopian novelist Haddis Alemayehu, who was an advocate of Amharic orthography reform, indicated gemination in his novel Fǝqǝr Ǝskä Mäqabǝr by placing a dot above the characters whose consonants were geminated, but this practice is rare. Unicode provides ◌፟ [U+135F ETHIOPIC COMBINING GEMINATION MARK] for this, or sometimes ◌̎ [U+030E COMBINING DOUBLE VERTICAL LINE ABOVE] is used.
The Ethiopic blocks have only 3 combining characters.
The first is a vowel length, the second a gemination indicator (see above), and the third a combination of both.
There are 9 punctuation mark in the Ethiopic blocks.
These are described in the Text layout section below.
European digits are often used, but Ethiopic also has a native numbering system that is additive in nature.
You can generate Ethiopic numbers using the Counter styles converter app. Type in a number at the top and select ethiopic-numeric from the select box.
Note that there should be an unbroken line across the whole number at the top and bottom.
The Ethiopic script is written horizontally, left to right.
Word boundaries.Words are often separated by spaces in modern text, however they may be separated by ፡ [U+1361 ETHIOPIC WORDSPACE] instead (see the Amharic sample text above).
Some Western punctuation may be used, but Ethiopic has several native punctuation characters.
Phrases. ፣ [U+1363 ETHIOPIC COMMA] or ፥ [U+1365 ETHIOPIC COLON] are both roughly equivalent to a comma. They are considered glyph variants for the same punctuation symbol, although usually a document will consistently use only one or the other. The latter is more common in religious texts, and is used for biblical references where English would use a colon, eg. ማቴ4፥23 Matt 4:23.
፤ [U+1364 ETHIOPIC SEMICOLON]
፦ [U+1366 ETHIOPIC PREFACE COLON] Follows clarification of a subject. It will preface validation statements and examples that support the clarification.
Sentences.። [U+1362 ETHIOPIC FULL STOP] may be used, immediately preceded by a wordspace character. It is also possible to find the ASCII full stop used.
Similarly, western question marks may be used, or ፧ [U+1367 ETHIOPIC QUESTION MARK].
¡ [U+00A1 INVERTED EXCLAMATION MARK], known as “Timirte Slaq” (ትእምርተ፡ሥላቅ) appears at the end of a sentence and denotes sarcasm.→e 2.3.1
Paragraphs. ፨ [U+1368 ETHIOPIC PARAGRAPH SEPARATOR] may be used to conclude the final paragraph of a section in lieu of ።. Like ፠ below, three or more may also be used together on a line of their own.
Sections.፠ [U+1360 ETHIOPIC SECTION MARK] Used to divide sections or subsections; generally three or more used together on a line of their own.
Ethiopic text is generally wrapped wherever it hits the right margin, whether wordspace or space are used to separate words, and no hyphenation occurs. However, a new line should not start with a space, math operator or any of the following:e
Full justification is a common typesetting practice. Ethiopic is usually justified by adjusting inter-word spacing. Where words are separated with ፡ [U+1361 ETHIOPIC WORDSPACE] this is still the case, however no extra spaces should be added – the width of the wordspace character changes.
When the wordspace character width changes, the wordspace glyph may be centred, or may appear alongside the previous word, depending on preference.
Use the control below to see how your browser justifies the text sample here.
Further information needed for this section includes: baselines, emphasis & highlighting, text decoration, abbreviations & ellipsis, hyphens & dashes glyph controls quotations, repetition, justification & alignment, first-letter styling, vertical text, notes & footnotes, page layout