Updated 16 December, 2024
This page brings together basic information about the Cyrillic script and its use for the Ukrainian language. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Ukrainian using Unicode.
Richard Ishida, Ukrainian Orthography Notes, 16-Dec-2024, https://r12a.github.io/scripts/cyrl/uk
Стаття 1. Всі люди народжуються вільними і рівними у своїй гідності та правах. Вони наділені розумом і совістю і повинні діяти у відношенні один до одного в дусі братерства.
Стаття 2. Кожна людина повинна мати всі права і всі свободи, проголошені цією Декларацією, незалежно від раси, кольору шкіри, статі, мови, релігії, політичних або інших переконань, національного чи соціального походження, майнового, станового або іншого становища. Крім того, не повинно проводитися ніякого розрізнення на основі політичного, правового або міжнародного статусу країни або території, до якої людина належить, незалежно від того, чи є ця територія незалежною, підопічною, несамоврядованою або як‐небудь інакше обмеженою у своєму суверенітеті.
Source: Unicode UDHR, articles 1 & 2
Origins of the Latin script, 7thC – today.
Phoenician
└ Greek
└ Old Italic
└ Cyrillic
+ Glagolitic
+ Latin
+ Armenian
+ Georgian
+ Coptic
+ Runes
Ukrainian is the native language of around 40 million people and is written using a version of the Cyrillic script. Although Ukraine has many Russian speakers, particularly in the east and south of the country, in 2001 around 88% of Ukrainians were able to communicate in the Ukrainian language, and its use in the population has been increasing for some years.wul
українська абетка ʊkrɐˈjinʲsʲkɐ ɐˈbɛtkɐ Ukrainian alphabet
The script is named in honor of the two Byzantine brothers, Saints Cyril and Methodius, who created the earlier Glagolitic alphabet. Modern scholars believe that Cyrillic was developed and formalized by early disciples of Cyril and Methodius.
Cyrillic is derived from the Greek uncial script, augmented by letters from the older Glagolitic alphabet, including some ligatures. These additional letters were used for Old Church Slavonic sounds not found in Greek, and the Cyrillic script was initially used for writing Old Church Slavonic (also called Old Bulgarian), The script has changed over the intervening centuries to the point that Old Church Slavonic is sometimes considered a separate script.
Several orthographic reforms were introduced over the course of the 19th and 20th centuries. The latest version of the orthography was defined by the Ukrainian National Commission on Spelling and approved by the Cabinet of Ministers of Ukraine on May 22, 2019.
Sources: Scriptsource, Wikipedia.
Cyrillic is an alphabet. Letters typically represent a consonant or vowel sound. See the table to the right for a brief overview of features for the modern Ukrainian language.
Of the 441 characters in the Unicode Cyrillic blocks, 177 are historic (33%) and 2 are for Lithuanian dialectology. The remaining 262 are just letters – no punctuation, digits, or combining characters. These are all bicameral, which brings the number of distinct modern letters to 131. Although modern Cyrillic text tends to use precomposed forms, rather than combining diacritics separately with base letters, many extended characters are formed by slightly tweaking a set of basic shapes.
Ukrainian text runs left-to-right in horizontal lines. Words are separated by spaces. The visual forms of letters don't usually interact.
The script is bicameral. The shapes of the upper and lowercase forms are typically the same. There can be a significant difference, however, between regular and cursive/italic shapes for the same character.
Modern Ukrainian has 32 basic consonant letters (64 if you include both upper- and lowercase), plus the hard sign and soft sign.
❯ basicV
Ukrainian is an alphabet where vowels are written using 10 vowel letters, 4 of which usually indicate palatalisation of the previous consonant. Palatalisation can otherwise be applied or removed using the hard and soft signs.
There are no special mechanisms to represent standalone vowels. Combining marks are normally not used, and only occur in decomposed text.
Numbers use ASCII digits.
The visual forms of letters don't usually interact.
These are sounds for the Ukrainian language. For a useful summary of allophonic variants see Wikipedia.
Click on the sounds to reveal locations in this document where they are mentioned.
Phones in a lighter colour are non-native or allophones. Source Wikipedia.
u̯ and i̯ appear after the syllable nucleus where the orthography has в or й, respectively.
The other lighter coloured phones are allophones that appear in unstressed vowels, according to the following transformations:wup
Ukrainian is not a tonal language.
tbd
Click on the characters to find where they are mentioned in this page.
The Ukrainian alphabet has 33 letters.wua Each has upper and lowercase forms, shown above and below, respectively.
The following table summarises the main vowel to character assigments.
Vowels preceded by ʲ cause the preceding consonant to be palatalised. Where 2 transcriptions are given for a symbol they show stressed and unstressed pronunciations. Diphones typically occur for standalone vowel sounds. The bottom row shows the glides/semivowels used to make diphthongs.
Simple | ||
---|---|---|
Diphones | ||
Diphthongs |
For additional details see vowel_mappings.
Vowels following consonants are written using 10 vowel letters, 4 of which usually indicate palatalisation of the previous consonant. Palatalisation can otherwise be applied or removed using the hard and soft signs.
Standard Ukrainian uses 10 vowel letters (20 characters).
The following 6 vowels are normally used after a 'hard' consonant (ie. one that isn't palatalised).
The other letters usually occur after a 'soft' (palatalized) consonant or as standalone vowels.
When used after a palatalised consonant these vowels don't begin with j-, eg. the following shows letter to sound associations for the word будяк
On the other hand, these letters may also be used after a hard consonant followed by ʼ (the hard sign), in which case the vowel sound is preceded by j-, eg. інтервʼю
The vowels generally have different pronunciations in stressed vs. unstressed syllables, however such allophonic variations are not captured in the orthography. Click on the letters in the tables above for more information on that.
Vowel diacritics only appear in decomposed text. In normal Ukrainian texts there are no combining marks.
The marks that appear after decomposition are:
Two consonant characters are used to form diphthongs.
й is occasionally used as a consonant, but is mainly used after a vowel to create diphthongs, eg. буйний
в is generally used as a consonant, but when it appears as the coda of a syllable it is pronounced u̯, eg. кров
At the start of a word, or after another vowel, Ukrainian uses the 'ioticised' letters. Used in these locations, the sound of all the above vowels except і is preceded by j-, eg. юний союз компʼютер імла
This section maps Ukrainian vowel sounds to common graphemes in the Cyrillic orthography.
The right-hand side of each item shows uppercase forms.
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.
І lc і
Й lc й when it occurs after a vowel, or (in some words) before an initial consonant.
Ї lc ї
И lc и
У lc у in unstressed syllables.
Ю lc ю after a palatalised consonant in unstressed syllables.
Ю lc ю as a standalone vowel or after a hard sign in unstressed syllables.
У lc у in stressed syllables.
Ю lc ю after palatalised consonant.
В lc в allophone occurring as a syllable coda.
Ю lc ю as a standalone vowel or after a hard sign.
Е lc е in unstressed syllables.
Є lc є after a palatalised consonant in unstressed syllables.
И lc и in unstressed syllables.
Є lc є in unstressed syllables.
Е lc е
Є lc є after a palatalised consonant.
Є lc є as a standalone vowel or after a hard sign.
О lc о
А lc а in stressed syllables.
Я lc я after a palatalised consonant in stressed syllables.
Я lc я as a standalone vowel or after a hard sign.
А lc а after a palatalised consonant in unstressed syllables.
Я lc я after a palatalised consonant in unstressed syllables.
Я lc я as a standalone vowel or after a hard sign in unstressed syllables.
The following table summarises the main consonant to character assigments.
The left column is lowercase, and the right uppercase.
Standard | ||
---|---|---|
Palatalisation |
For additional details see consonant_mappings.
Ukrainian uses 20 consonants (40 characters, if you include uppercase and lowercase), plus a hard and soft sign. Click on the lowercase letters for detail information about usage.
Most of the consonants can be pronounced with or without palatisation, ie. 'hard' or 'soft', respectively. During palatalisation the tongue is raised towards the palate while the consonant is pronounced, eg. compare банан баня
Palatalisation is indicated by the letter that follows the consonant. When the vowel letters ї ю є я follow a consonant, the initial j- sound which is associated with those characters in standalone positions is dropped, and the preceding consonant is palatalised.
Consonants can be palatalised when no vowel follows using the soft sign, ь, eg. більшість
The soft sign may also be used when you want the consonant to be palatalised but retain the initial j- of the following vowel, eg. досьє
To write a hard consonant followed by a vowel that retains the initial j-, follow the consonant with the hard sign, ʼ, eg. пʼять
Note that this shows that there is a difference between пʼят and пять.
A few consonant sounds are generally always hard.
Unlike Russian, voiced consonants at the end of a syllable or word remain voiced, eg. шоколад трубкозуб
This section maps Ukrainian consonant sounds to common graphemes in the Cyrillic orthography.
The right-hand side of each item shows uppercase forms.
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.
П lc п
Б lc б
Т lc т
Тʼ lc тʼ
Т lc т when followed by a softening vowel.
ТЬ lc ть
Ц lc ц
Цʼ lc цʼ
Ц lc ц when followed by a softening vowel.
ЦЬ lc ць
Ч lc ч
Д lc д
Дʼ lc дʼ
Д lc д when followed by a softening vowel.
ДЬ lc дь
ДЗ lc дз
ДЖ lc дж
К lc к
К lc к when followed by 0456.
Ґ lc ґ
В lc в Allophone sometimes found before back vowels.
Ф lc ф
Ф lc ф Usually only when followed by і.
С lc с
Сʼ lc сʼ
С lc с when followed by a softening vowel.
СЬ lc сь
З lc з
Зʼ lc зʼ
З lc з when followed by a softening vowel.
ЗЬ lc зь
Ш lc ш
Ш lc ш when followed by a softening vowel.
ШЬ lc шь
Щ lc щ
Щ lc щ when followed by a softening vowel.
Ж lc ж
Х lc х
Х lc х when followed by і.
Г lc г
М lc м
М lc м when followed by і.
Н lc н
Нʼ lc нʼ
Н lc н when followed by a softening vowel.
НЬ lc нь
В lc в generally found before back vowels.
В lc в allophone occuring before front vowels.
Р lc р
Рʼ lc рʼ
Р lc р when followed by a softening vowel.
РЬ lc рь
Л lc л
Лʼ lc лʼ
Л lc л
ЛЬ lc ль
Й lc й
0301 is used to indicate where the stress falls in a word for educational materials, dictionaries, and such. The position of the stress is distinctive, and not always predictable, eg. compare за́мок замо́к
The other diacritics shown above are produced only by Unicode decomposition: 0306 from й, and 0308 from ї. Normally precomposed characters are used.
Cyrillic runs left to right in horizontal lines.
Show default bidi_class
properties for characters in the Ukrainian orthography described here.
Cyrillic uses ASCII digits.
You can experiment with examples using the All Cyrillic character app and the Ukrainian character app.
Cyrillic doesn't normally have any of the changeability of complex scripts. Characters are typically separate and self-contained. However, there can be a significant difference in shape between regular and italic/cursive font shapes for the same character.
Note in particular the italic form of т in the figure just above, which looks similar to the italic form of м shown in the previous figure.
The shapes of the italic forms can also vary by language.w
The shape of the breve sign in Cyrillic is different from that used for Latin text.s A font such as Brill can detect the appropriate shape from the adjacent characters.
Ukrainian is bicameral, and applications may need to enable transforms to allow the user to switch between cases.
Words are separated by spaces.
Ukrainian graphemes are straightforward, and can be mapped to Unicode grapheme clusters.
Base (Combining_mark)*
The 2 combining marks that occur in Ukrainian appear only on the rare occasions when the text is decomposed, and only one combining mark at a time appears after any base. All such decompositions conform to Unicode grapheme clusters.
Click on the text version of this word to see more detail about the composition.
гіркий | |
(decomposed) |
Cyrillic uses ASCII punctuation.
phrase | , ; : |
---|---|
sentence | . ? ! |
Ukrainian commonly uses ASCII parentheses to insert parenthetical information into text.
start | end | |
---|---|---|
standard | ( |
) |
Ukrainian uses angle quotation marks without space, like Russian, or the low and high marks used for Polish and German.wua,#Letterforms_and_typography
start | end | |
---|---|---|
initial | « |
» |
nested | „ |
“ |
Spaces between words provide the primary line break opportunities.u
As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.
Show (default) line-breaking properties for characters in the modern Ukrainian orthography.
The following list gives examples of typical behaviours for some of the characters used in Ukrainian. Context may affect the behaviour of some of these and other characters.
Click/tap on the characters to show what they are.
Justification is done, principally, by adjusting the space between words.
Cyrillic uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.
Cyrillic has little in the way of ascenders and descenders, and mostly the font metrics are the same as for ASCII text. One difference is the use of a couple of diacritics, which rise above the ASCII ascender height in capital letters..
To give an approximate idea, fig_baselines compares Latin and Cyrillic glyphs from Noto fonts.
fig_baselines_other shows similar comparisons for the Doulos SIL and Helvetica fonts.
You can experiment with counter styles using the Counter styles converter. Patterns for using these styles in CSS can be found in Ready-made Counter Styles, and we use the names of those patterns here to refer to the various styles.
The modern Ukrainian orthography uses 4 alphabetic styles, besides the ASCII decimal numeric style.
The lower-Ukrainian alphabetic style uses these letters.
Examples:
The lower-Ukrainian-full alphabetic style uses these letters.
Examples:
The upper-Ukrainian alphabetic style uses these letters.
Examples:
The upper-Ukrainian-full alphabetic style uses these letters.
Examples:
The default list style uses a full stop + space as a suffix.
Examples: