Use accesskey "n" to jump to the internal navigation links at any point. Right now you can

 
r12a >> docs

Modern Israeli Hebrew

orthography notes

Updated 26 November, 2024 • recent changes scripts/hebr/he • leave a comment

This page brings together basic information about the Hebrew script and its use for the modern Israeli Hebrew language. It doesn't aim to cover Biblical usage. It aims to provide a brief, descriptive summary of the modern, printed orthography and typographic features, and to advise how to write Hebrew using Unicode.

Referencing this document

Richard Ishida, Modern Israeli Hebrew Orthography Notes, 26-Nov-2024, https://r12a.github.io/scripts/hebr/he

 

Click to toggle Table of Contents.

Phonological transcriptions should be treated as a guide, only. They are taken from the sources consulted, and may be narrow or broad, phonemic or phonetic, depending on what is available. They mostly represent pronunciation of words in isolation. For more detailed information about allophones, alternations, sandhi, dialectal differences, and so on, follow the links to cited references.

This is an interactive document. Click/tap on the following to reveal detailed information and examples for each character: (a) coloured characters in examples and lists; (b) link text on character names. If your browser supports it, your cursor will change to look like as you hover over these items.

More about using this page

Character names. The names of characters in codepoint markup drop the initial HEBREW label (purely to reduce the length of the examples). In other places the full name can be found.

Navigation. The Toggle images icon opens the table of contents in a popup window. Dismiss it by clicking on the X alongside it, or by hitting the ESC key.

Detailed character notes. Clicking on coloured characters in lists or on character names opens panels that give detailed information about each character. This information is taken from the companion document, Hebrew Character Notes. (Those panels can be dismissed by pressing on the ESC key.)

Transcriptions & transliterations. Phonological transcriptions are surrounded by ⌈corner brackets⌋, to indicate that they vary between narrow, [phonetic] and broad, /phonemic/ transcriptions.
Latin transcriptions between <angle brackets>, represent the letters as commonly written in the Latin script.
A transliteration has also been developed especially for this orthography, and is generally based on the sound of a letter where possible, but where a letter has multiple pronunciations, the transliteration represents only one.
Transliterations provide perfect round-trip conversion between the native script and Latin, whereas Latin transcriptions rarely do.
When you click on an example to see its composition, the top of the panel that opens contains a transliteration, followed by the native text, then (if available) an IPA transcription.

Copied !
TOC.
Accessibility settings
ˇ

Languages using the Hebrew scriptHebrew pickerTerms listCharacter notesHebrew linksOther orthography notes

Sample

Select part of this sample text to show a list of characters, with links to more details.
Change size:   28px

סעיף א. כל בני אדם נולדו בני חורין ושווים בערכם ובזכויותיהם. כולם חוננו בתבונה ובמצפון, לפיכך חובה עליהם לנהוג איש ברעהו ברוח של אחוה.

סעיף ב. כל אדם זכאי לזכויות ולחרויות שנקבעו בהכרזש זו ללא הפליה כלשהיא מטעמי גזע, צבע, מין, לשון, דח, דעה פוליטית או דעה בבעיות אחרות, בגלל מוצא לאומי או חברתי, קנין, לידה או מעמד אחר. גדולה מזו, לא יופלה אדם על פי מעמדה המדיני, על פי סמכותה או על פי מעמדה הבינלאומי של המדינה או הארץ שאליה הוא שייך, דין שהארץ היא עצמאית, ובין שהיא נתונה לנאמנות, בין שהיא נטולת שלטון עצמי ובין שריבונותה מוגבלת כל הגבלה אחרת.

Source: Unicode UDHR, articles 1 & 2

Usage & history

Origins of the Hebrew script, BCE 3rdC – today.

Phoenician

└ Aramaic

└ Hebrew

+ Nabataean

+ Syriac

+ Palmyrene

+ Edessan

+ Hatran

+ Elymaic

+ Mandaic

+ Pallavi

+ Kharosthi

The Hebrew script is widely used by the Jewish community and is used to write modern Hebrew in Israel. It is the script used for Jewish sacred texts. It is also used for a number of other languages, including Samaritan, Yiddish, and Judeo-Arabic.

אָלֶף־בֵּית עִבְרִי alefbet ivri Hebrew alphabet

Before the Jewish exile in Babylon, Hebrew was written using a Paleo-Hebrew script that resembles the Samaritan alphabet. The current script, known as 'square', or 'block' script, derives from Aramaic writing. It is generally referred to as the Ashuri (Assyrian) script, although there are a few alternate writing styles. It dates from the 5th century BCE.

More information: Scriptsource and Wikipedia.

Script codehebr
Language codehe
Script typeabjad
Originwasia
Native speakers9,000,000
  
Total characters59
Letters27
Combining marks16
Symbols1
Punctuation6
Other9
Possible other14
Unicode blocks2
  
Character counts above are for this
orthography but exclude ASCII.
  
Text directionrtl
Post-consonant vowelsletters
marks
hides vowels
Standalone vowels
Case distinctionno
Cursive scriptno
Combining marks>1 per base
Clusters markedno
Dedicated finalsletters
Other ligaturesno
Word separatorspace
Wraps atword
Hyphenation?
G Clusters OK?yes
Justificationspaces
Baselineromn

Basic features

Hebrew is essentially an abjad. This means that in normal use the script represents consonants but not all vowels. This approach is helped by the strong emphasis on consonant patterns in Semitic languages. See the table to the right for a brief overview of features for the modern Hebrew orthography.

Note that the focus of this page is on everyday use for contemporary Israeli Hebrew, including educational materials, but not including biblical texts, prayer books, and the like. The latter tend to include additional characters, such as cantillation marks.

Hebrew text runs right-to-left in horizontal lines, but numbers and embedded Latin text are read left-to-right.

There is no case distinction. Words are separated by spaces.

The Modern Israeli Hebrew alphabet uses 22 letters, plus 5 word-final letters that have their own code points. Additional sounds can be represented using dagesh, shin/sin dots, or geresh.

❯ More vowel details

Hebrew has 11 vowel diacritics in regular use to express vowel sounds (called niqqud or points), but rarely uses them in normal text. Hebrew readers are usually able to understand the pronunciation from the context and the regular structure of Hebrew words. These and other phonetic diacritics are written, however, where needed to clarify ambiguities or for educational purposes.

Vowel locations can be marked by 4 matres lectionis (consonants indicating vowel locations).

A spelling innovation introduced by modern Hebrew uses matres lectionis to spell certain short vowels that would not have been marked in older texts. Although the hiding of short vowel niqqud would generally qualify Hebrew as an abjad, this 'full spelling' approach makes it partially alphabetic.

In vowelled text, there is a diacritic to indicate the absence of a vowel in consonant clusters.

Modern Hebrew uses both European digits, and ASCII punctuation marks.

Character index

The index points to locations where a character is mentioned in this page, and indicates whether it is used by the Hebrew orthography described here.

Manage characters.

Click on the image to the left to view all the 'main' and 'infrequent' characters in the index in various groupings or open related apps.

Letters

Show

Consonants

list all 27
א05D0
HEBREW LETTER ALEFconsonant ʔ ∅
ע05E2
HEBREW LETTER AYINconsonant ʔ ∅
ט05D8
HEBREW LETTER TETconsonant t t
ת05EA
HEBREW LETTER TAVconsonant t t
ד05D3
HEBREW LETTER DALETconsonant d d
ק05E7
HEBREW LETTER QOFconsonant k k
ג05D2
HEBREW LETTER GIMELconsonant ɡ g
צ05E6
HEBREW LETTER TSADIconsonant t͡s ts
פ05E4
HEBREW LETTER PEconsonant f p f
ב05D1
HEBREW LETTER BETconsonant v b v
ו05D5
HEBREW LETTER VAVconsonant v w v
ש05E9
HEBREW LETTER SHINconsonant s ʃ s sh
ס05E1
HEBREW LETTER SAMEKHconsonant s s
ז05D6
HEBREW LETTER ZAYINconsonant z z
ח05D7
HEBREW LETTER HETconsonant χ
כ05DB
HEBREW LETTER KAFconsonant x k kh
ר05E8
HEBREW LETTER RESHconsonant ʁ r
ה05D4
HEBREW LETTER HEconsonant h ∅ h
מ05DE
HEBREW LETTER MEMconsonant m m
נ05E0
HEBREW LETTER NUNconsonant n n
ל05DC
HEBREW LETTER LAMEDconsonant l l
י05D9
HEBREW LETTER YODconsonant j y
ץ05E5
HEBREW LETTER FINAL TSADIword-final consonant variant t͡s ts
ף05E3
HEBREW LETTER FINAL PEword-final consonant variant f p f
ך05DA
HEBREW LETTER FINAL KAFword-final consonant variant x k kh
ם05DD
HEBREW LETTER FINAL MEMword-final consonant variant m m
ן05DF
HEBREW LETTER FINAL NUNword-final consonant variant n n

Combining marks

Show

Vowels

list all 13
ִ05B4
(rare)    HEBREW POINT HIRIQvowel point i
ֻ05BB
(rare)    HEBREW POINT QUBUTSvowel point u
ֵ05B5
(rare)    HEBREW POINT TSEREvowel point e ej
ֶ05B6
(rare)    HEBREW POINT SEGOLvowel point e
ֱ05B1
(rare)    HEBREW POINT HATAF SEGOLvowel point e
ֹ05B9
(rare)    HEBREW POINT HOLAMvowel point o o
ֳ05B3
(rare)    HEBREW POINT HATAF QAMATSvowel point o o
ְ05B0
(rare)    HEBREW POINT SHEVAepenthetic vowel/no vowel e ∅
ָ05B8
(rare)    HEBREW POINT QAMATSvowel point a o
ַ05B7
(rare)    HEBREW POINT PATAHvowel point a
ֲ05B2
(rare)    HEBREW POINT HATAF PATAHvowel point a
ׇ05C7
(rare)    HEBREW POINT QAMATS QATANvowel o o
ֺ05BA
(rare)    HEBREW POINT HOLAM HASER FOR VAVvowel point used only with consonantal VAV o o

Other

list all 3
ּ05BC
(rare)    HEBREW POINT DAGESH OR MAPIQhard consonant diacritic
ׁ05C1
(rare)    HEBREW POINT SHIN DOTshin dot
ׂ05C2
(rare)    HEBREW POINT SIN DOTsin dot

Punctuation

Show
list all 6
־05BE
HEBREW PUNCTUATION MAQAFhyphen
״05F4
HEBREW PUNCTUATION GERSHAYIMacronym indicator
׳05F3
HEBREW PUNCTUATION GERESHpronunciation change/initialism/numeral indicator
2019
RIGHT SINGLE QUOTATION MARKquotation mark
201D
RIGHT DOUBLE QUOTATION MARKquotation mark
201E
(infrequent)    DOUBLE LOW-9 QUOTATION MARKquotation mark becoming old fashioned

ASCII

list all 11
(0028
LEFT PARENTHESISparenthesis (
)0029
RIGHT PARENTHESISparenthesis )
,002C
COMMAcomma ,
-002D
HYPHENhyphen -
.002E
FULL STOPfull stop .
:003A
COLONcolon :
;003B
SEMICOLONsemicolon ;
?003F
QUESTION MARKquestion mark ?
!0021
EXCLAMATION MARKexclamation mark !
%0025
PERCENT SIGNpercentage mark
/002F
SOLIDUSfor the keyboard

Symbols

Show
list
20AA
NEW SHEQEL SIGNshekel

Other

Show
list all 9
RLI2067
RIGHT-TO-LEFT ISOLATErtl isolate
RLE202B
RIGHT-TO-LEFT EMBEDDINGrtl embed
LRI2066
LEFT-TO-RIGHT ISOLATEltr isolate
LRE202A
LEFT-TO-RIGHT EMBEDDINGltr embed
FSI2068
FIRST STRONG ISOLATEfirst-strong isolate
PDI2069
POP DIRECTIONAL ISOLATEpop direction isolate
PDF202C
POP DIRECTIONAL FORMATTINGpop direction
RLM200F
RIGHT-TO-LEFT MARKrtl mark
LRM200E
LEFT-TO-RIGHT MARKltr mark

To be investigated

list all 19
[005B
(tbc)    LEFT SQUARE BRACKETbracket [
]005D
(tbc)    RIGHT SQUARE BRACKETbracket ]
{007B
(tbc)    LEFT CURLY BRACKET
}007D
(tbc)    RIGHT CURLY BRACKET
ʼ02BC
(tbc)    MODIFIER LETTER APOSTROPHEapostrophe ʼ
͏034F
(tbc)    COMBINING GRAPHEME JOINERcombining grapheme joiner
2011
(tbc)    NON-BREAKING HYPHENnon-breaking hyphen
2013
(tbc)    EN DASHen dash
2014
(tbc)    EM DASHem dash
2018
(tbc)    LEFT SINGLE QUOTATION MARKquotation mark
201C
(tbc)    LEFT DOUBLE QUOTATION MARKquotation mark
2026
(tbc)    HORIZONTAL ELLIPSISellipsis
2030
(tbc)    PER MILLE SIGNper mille mark
2060
(tbc)    U+2060 WORD JOINERword joiner

Phonology

These are phonemes of Israeli Hebrew.

Click on the sounds to reveal locations in this document where they are mentioned.

Phones in a lighter colour are non-native or allophones. Source Wikipedia.

Notes on phonology

Modern Israeli Hebrew was born from speakers who brought their own accents and pronunciations from different parts of the world. There are still variations in pronunciation, but two main types predominate today: Oriental and Occidental. Oriental Hebrew was chosen as the preferred accent for Israel by the Academy of the Hebrew Language, but has since declined in popularity. Age is often a factor in individual pronunciation.13→

In particular, there are alternative pronunciations for x~ħ, ʁ~r, ʔ~ʕ. In this document we use the left-hand side of each of these pairings.

Younger speakers also tend to make all consonants in a cluster voiced or unvoiced, depending on the last consonant, eg. לִסְגֹּר lisᵊgoˑʁ lis'ɡoʁ to close becomes liz'ɡoʁ, and אַבְטָחָה ʔavᵊtāxāh avta'xa security becomes afta'xa.

For more details, see: Wikipedia.

Vowel sounds

Plain vowels

i u e o ə ə a

Diphthongs

ej aj

See also the Notes on phonology.

Consonant sounds

labial alveolar post-
alveolar
palatal velar uvular glottal
stop p b t d     k ɡ   ʔ
affricate   t͡s t͡ʃ d͡ʒ        
fricative f v s z ʃ ʒ
  x ʁ h
nasal m n        
approximant w l   j    
trill/flap   r ɾ    

x is sometimes described as χ, and ʁ as r. For more variants see Notes on phonology.

Final -h is rarely pronounced in modern Hebrew.13

Tone

Hebrew is not a tonal language.

Structure

tbd

Vowels

Hebrew has 11 vowel diacritics in regular use to express vowel sounds (called niqqud or points), but rarely uses them in normal text. Hebrew readers are usually able to understand the pronunciation from the context and the regular structure of Hebrew words. These and other phonetic diacritics are written, however, where needed to clarify ambiguities or for educational purposes.

Vowel locations can be marked by 4 matres lectionis (consonants indicating vowel locations).

A spelling innovation introduced by modern Hebrew uses matres lectionis to spell certain short vowels that would not have been marked in older texts. Although the hiding of short vowel niqqud would generally qualify Hebrew as an abjad, this 'full spelling' approach makes it partially alphabetic.

In vowelled text, there is a diacritic to indicate the absence of a vowel in consonant clusters.

Hebrew spelling

Wikipedia describes 3 main types of spelling found in contemporary Israeli text.

כתיב חסר Ktiv haser (missing spelling) uses no niqqud or additional matres lectionis. It looks anachronistic and can be ambiguous but can still be found sometimes in newspapers and published books.

אמץ ˈˀomets courage

אויר ˀaˈvir air

חלקה ħaluˈqa distribution

כתיב מנוקד Ktiv menuqad (pointed spelling), rarely used in everyday life because it is too cumbersome, shows all the vowels using the niqqud points. It is used wherever the pronunciation needs to be clear and unambiguous, such as in children's books, poetry, language instruction for newcomers, or ambiguous or foreign terms.

אֹמֶץ ˈˀomets courage

אֲוִיר ˀaˈvir air

חֲלֻקָּה ħaluˈqa distribution

כתיב מלא Ktiv male (full spelling) or ktiv hasar niqqud is the predominant approach, including in personal correspondence, movies subtitles, etc. It adds matres lectionis to words where certain niqqud points would occur in the pointed spelling approach. To avoid confusion, consonantal וU+05D5 LETTER VAV v and יU+05D9 LETTER YOD j are doubled in the middle of words.

אומץ ˈˀomets courage

אוויר ˀaˈvir air

חלוקה ħaluˈqa distribution

More vowel details

Hebrew is normally unvocalised (ie. the vowel diacritics are hidden). In this case, vowels are written using a small number of matres lectionis (consonants marking vowel locations), and some vowels are not marked at all. Figure 1 shows ways of writing Hebrew vowels in unvocalised text. For each letter there are 3 lines, indicating initial, medial, and final forms. In 3 places no vowel is marked at all.

i אי- ע-
-י- --
-י -יא
u או- ו- עו-
-ו-
-ו
e א- ע-
--
-ה -א -י
o או- עו-
-ו-
-ו -ה
a א- ע-
-- -א-
-ה -ע -א
Vowels in unvocalised Hebrew text.

In vocalised text Hebrew uses a mixture of matres lectionis and combining diacritics to represent vowels. Figure 2 shows ways of writing Hebrew vowels in vocalised text.

i אִ- אִי- עִ-
-◌ִי- -◌ִ-
-◌ִי -◌ִיא
u אוּ- וּ- עוּ-
-וּ-
-וּ
e אֵ- אֶ- אֱ- עֵ-
-◌ֵ- -◌ֶ- -◌ְ-
-◌ֶה -◌ֶא -◌ֵא -◌ֵי
o אוֹ- אֳ- עוֹ-
-וֹ- -◌ֹ- -◌ֳ-
-וֹ -◌ֹה
a אַ- אֲ- אָ- עַ- עֲ- עָ-
-◌ַ- -◌ָ- -◌ֲ- -◌ָא-
-◌ָה -◌ַע -◌ַא
Vowels in vocalised Hebrew text.

Vowel basics

Matres lectionis

In the spelling of Arabic, Hebrew, and other Semitic languages, mater lectionis refers to a consonant that may also indicate the location of a vowel.w

The following consonant letters may indicate the location of a vowel in full spelling text. (They may alternatively indicate the location of a consonant.) In pointed text, when used as matres lectionis, they appear after the niqqud point that indicates the vowel sound.


4
א∅ a eʔ05D0
ה∅ a ehh05D4
וu ov05D5
יiyy05D9

Click on the characters in the list above for detailed information; here we will just summarise a few key points.

There is a trend in Modern Hebrew towards the use of matres lectionis to indicate vowels that have traditionally gone unwritten, a practice known as full spelling.15 For example, compare the pointed vs full spellings of the following:

אישה  ≡  אִשָּׁה iʃˈa wife, woman

אלוה  ≡  אֱלֹהַּ, אֱלוֹהַּ eˈlo.a a god

Since modern Israeli Hebrew doesn't make vowel length distinctive, the use of matres lectionis doesn't correspond to clear distinctions between vowel length or quality in the way they do in Arabic. They may, however, reflect historical differences that are still maintained in the spelling.

אU+05D0 LETTER ALEF is most commonly found as a glottal stop or vowel carrier for standalone vowels (see Standalone vowels), whereas הU+05D4 LETTER HE is most likely to be found in word-final position (especially after a word that ends with a).

Ayin

עU+05E2 LETTER AYIN often appears where a vowel would be pronounced, or as a vowel carrier in vocalised text. It is not a mater lectionis, and nominally represents a glottal stop, although the stop is often not pronounced.

Niqqud points

A series of points, known as niqqud, can be used to give precision about vowel sounds. They are rarely used outside of educational, children's, and religious texts, or for foreign or ambiguous words.

אָלֶף־בֵּית עִבְרִי
'Hebrew alphabet', alef-bet ivri, spelled out using diacritic points.
show composition

אָלֶף־בֵּית עִבְרִי alef-bet ivri Hebrew alphabet

The Modern Israeli Hebrew orthography we are discussing here uses the following niqqud to represent vowel sounds.


11
ִrareii05B4
ֻrareuu05BB
ֵraree ejē05B5
ֶrareee05B6
ֱrareeĕ05B1
ֹrareoo05B9
ֳrareoŏ05B3
ְraree ∅05B0
ָrarea oā05B8
ַrareaa05B7
ֲrareaă05B2

Redundancy arises because the modern orthography retains alternative points that in the past expressed length differences, whereas modern Israeli Hebrew pronunciation ignores phonetic length.

Three of the above code points have glyphs that combine the glyph for ְU+05B0 POINT SHEVA (sh'va) with that of another point. These combinations were used in older Hebrew to indicate particularly short vowel lengths, and the spelling has been retained in modern Hebrew. The Unicode Standard recommends that you use a single, precomposed Unicode code point for each of these combinations. These precomposed characters don't decompose during normalisation.


both
ׇrareo05C7
ֺrareo05BA

The two code points just above may be used, mostly in biblical texts or prayer books, to preserve fine typographic differences in usage. For more information, click on the characters.

Vowel length

Hebrew spelling points back to a time when there were different ways of writing long vowels, short vowels, and very short vowels. However, these vowel length differences have fallen away in modern Israeli Hebrew, and length, where it varies, is determined by phonetic context.

That said, echoes of historically long vowels can be found in the use of certain matres lectionis, particularly in pointed text.

Standalone vowels

Standalone vowels are vowel sounds that are not preceded by a consonant sound, or are preceded by only a glottal stop. They may appear at the beginning of a word or in the middle of a word after a preceding vowel.

In theory, Hebrew doesn't have any true standalone vowels, since vowels are always preceded by a consonant. However, when that consonant is a glottal stop, the stop is often dropped and in spoken Hebrew a vowel then occurs at the beginning of a word or word-medially after another vowel.

These vowels are always written with a base consonant, which is usually one of אU+05D0 LETTER ALEF or עU+05E2 LETTER AYIN. The alef is by far the most common when word initial.

אֵזוֹר eˈzoʁ region, zone

אוֹתְיוֹם otˈjom immediately

עֵץ et͡s tree

עוֹלָם oˈlam the world, Earth

בעיה be.aˈja problem

גַּבְרִיאֵל ɡavʁiˈɛl Gabriel

Vowel absence

In pointed text, ְU+05B0 POINT SHEVA may be used to express an absence of vowel between two consonants. However, in various contexts the sh'va may indicate that the consonant is followed by a short, epenthetic vowel e.

אגרוף  ≡  אֶגְרוֹף eɡˈʁof fist

בהמות  ≡  בְּהֵמוֹת beheˈmot behemoth

A word-final consonant typically doesn't take a diacritic. (See Finals.)

אַנְגְּלִית anɡˈlit English

בָּחוּר baˈχur guy, boy

Vowel sounds to characters

This section maps modern Israeli Hebrew vowel sounds to common graphemes in the Hebrew orthography, as found in the accompanying terms list.

Graphemes in full spelling are shown to the left, and pointed spelling is shown on the right. Note that sometimes these may differ more than just by addition of niqqud.

'Initial' means a vowel that follows an optional glottal stop, either word-initially or within a word.

Plain vowels

i
initial

איU+05D0 LETTER ALEF + U+05D9 LETTER YOD
אישה iʃˈa wife, woman
איראן iˈʁan Iran.

עU+05E2 LETTER AYIN
עברית ivˈʁit Hebrew

אִU+05D0 LETTER ALEF + U+05B4 POINT HIRIQ
אִשָּׁה (ʔ)iˈʃa woman, wife.

אִיU+05D0 LETTER ALEF + U+05B4 POINT HIRIQ + U+05D9 LETTER YOD
אִירָאן iˈʁan Iran.

עִU+05E2 LETTER AYIN + U+05B4 POINT HIRIQ
עִבְרִית ivˈʁit Hebrew

 
medial

יU+05D9 LETTER YOD
נין nin grandson

No marker.
מסעדה mis.ʔaˈda restaurant

ִיU+05B4 POINT HIRIQ + U+05D9 LETTER YOD
נִין nin grandson

ִU+05B4 POINT HIRIQ
מִסְעָדָה mis.ʔaˈda restaurant

 
final

יU+05D9 LETTER YOD
פרי pʁi fruit

יאU+05D9 LETTER YOD + U+05D0 LETTER ALEF
הביא heˈvi to bring

u
initial

אוU+05D0 LETTER ALEF + U+05D5 LETTER VAV
אולי uˈlaj maybe

וU+05D5 LETTER VAV
ובכן uvˈχɛn in that case

עוU+05E2 LETTER AYIN + U+05D5 LETTER VAV
עוגייה u.ɡiˈja cookie

 
medial

וU+05D5 LETTER VAV
אבולוציה (ʔ)evoˈlut͡sja evolution

וּU+05D5 LETTER VAV + U+05BC POINT DAGESH OR MAPIQ
אֶבוֹלוּצְיָה (ʔ)evoˈlut͡sja evolution

 
final

וU+05D5 LETTER VAV
הודו ˈho.du India

וּU+05D5 LETTER VAV + U+05BC POINT DAGESH OR MAPIQ
הֹדּוּ ˈho.du India

e
initial

אU+05D0 LETTER ALEF
אזור eˈzoʁ region, zone
אפשר efˈʃaʁ possible
אלוה eˈlo.a a god

עU+05E2 LETTER AYIN
עץ et͡s tree

אֵU+05D0 LETTER ALEF + U+05B5 POINT TSERE
אֵזוֹר eˈzoʁ region, zone

אֶU+05D0 LETTER ALEF + U+05B6 POINT SEGOL
אֶפְשַׁר efˈʃaʁ possible

אֱU+05D0 LETTER ALEF + U+05B1 POINT HATAF SEGOL
אֱלֹהַּ eˈlo.a a god

עֵU+05E2 LETTER AYIN + U+05B5 POINT TSERE
עֵץ et͡s tree

 
medial

No marker.
בן ben son
ברז ˈbeʁez tap, faucet
בהמות beheˈmot behemoth

ֵU+05B5 POINT TSERE
בֵּן ben son

ֶU+05B6 POINT SEGOL
בֶּרֶז ˈbeʁez tap, faucet

ְU+05B0 POINT SHEVA
בְּהֵמוֹת beheˈmot behemoth, but only in certain circumstances. For details of usage in modern Israeli, see Wikipedia.

 
final

הU+05D4 LETTER HE
אופה oˈfe baker

אU+05D0 LETTER ALEF
דשא ˈdeʃe grass
ייבא jiˈbe to import

יU+05D9 LETTER YOD
אחרי a.χaˈʁe afterword

ֶהU+05B6 POINT SEGOL + U+05D4 LETTER HE
אוֹפֶה oˈfe baker

ֶאU+05B6 POINT SEGOL + U+05D0 LETTER ALEF
דֶּשֶׁא ˈdeʃe grass

ֵאU+05B5 POINT TSERE + U+05D0 LETTER ALEF
יִבֵּא jiˈbe to import

ֵיU+05B5 POINT TSERE + U+05D9 LETTER YOD
אַחֲרֵי a.χaˈʁe afterword

o
initial

אוU+05D0 LETTER ALEF + U+05D5 LETTER VAV
אופה oˈfe baker
אורנים oʁaˈnim pines

עוU+05E2 LETTER AYIN + U+05D5 LETTER VAV
עוֹלָם oˈlam the world, Earth

אוֹU+05D0 LETTER ALEF + U+05D5 LETTER VAV + U+05B9 POINT HOLAM
אוֹפֶה oˈfe baker

אֳU+05D0 LETTER ALEF + U+05B3 POINT HATAF QAMATS
אֳרָנִים oʁaˈnim pines

עוֹU+05E2 LETTER AYIN + U+05D5 LETTER VAV + U+05B9 POINT HOLAM
עוֹלָם oˈlam the world, Earth

 
medial

וU+05D5 LETTER VAV
אותיום otˈjom immediately
אדום aˈdom red
ציפורים tsipoˈʁim birds

וֹU+05D5 LETTER VAV + U+05B9 POINT HOLAM
אוֹתְיוֹם otˈjom immediately

ֹU+05B9 POINT HOLAM
אָדֹם aˈdom red

ֳU+05B3 POINT HATAF QAMATS
צִפֳּרִים tsipoˈʁim birds

 
final

וU+05D5 LETTER VAV
יריחו jɛʁiˈχo Jericho

הU+05D4 LETTER HE
איפה eˈfo where?

וֹU+05D5 LETTER VAV + U+05B9 POINT HOLAM
יְרִיחוֹ jɛʁiˈχo Jericho

ֹהU+05B9 POINT HOLAM + U+05D4 LETTER HE
אֵיפֹה eˈfo where?

ə
 

Any of the five short vowels may be realized as a schwa when far from lexical stress.13

a
initial

אU+05D0 LETTER ALEF
אברהם av.ʁaˈham Abraham
אגם aˈɡam lake, pool
אמר aˈmaʁ to say

עU+05E2 LETTER AYIN
עקרב akˈʁav scorpion
עבודה avoˈda worship
עש moth

אַU+05D0 LETTER ALEF + U+05B7 POINT PATAH
אַבְרָהָם av.ʁaˈham Abraham

אֲU+05D0 LETTER ALEF + U+05B2 POINT HATAF PATAH
אֲגַם aˈɡam lake, pool

אָU+05D0 LETTER ALEF + U+05B8 POINT QAMATS
אָמַר aˈmaʁ to say

עַU+05E2 LETTER AYIN + U+05B7 POINT PATAH
עַקְרָב akˈʁav scorpion

עֲU+05E2 LETTER AYIN + U+05B2 POINT HATAF PATAH
עֲבוֹדָה avoˈda worship

עָU+05E2 LETTER AYIN + U+05B8 POINT QAMATS
עָשׁ moth

 
medial

No marker.
אחת aˈχat one (f)
אזהרה az.haˈra warning
אהבה a.haˈva love, affection

אU+05D0 LETTER ALEF
איראן iˈʁan Iran

ַU+05B7 POINT PATAH
אַחַת aˈχat one (f)

ָU+05B8 POINT QAMATS
אַזְהָרָה az.haˈra warning

ֲU+05B2 POINT HATAF PATAH
אַהֲבָה a.haˈva love, affection

ָאU+05B8 POINT QAMATS + U+05D0 LETTER ALEF
אִירָאן iˈʁan Iran

 
final

הU+05D4 LETTER HE
אדמה adaˈma earth, soil

עU+05E2 LETTER AYIN
ארבע aʁˈba four (f)

אU+05D0 LETTER ALEF
אבא ˈa.ba dad

ָהU+05B8 POINT QAMATS + U+05D4 LETTER HE
אֲדָמָה adaˈma earth, soil

ַעU+05B7 POINT PATAH + U+05E2 LETTER AYIN
אַרְבַּע aʁˈba four (f)

ַאU+05B7 POINT PATAH + U+05D0 LETTER ALEF
אַבָּא ˈa.ba dad

Diphthongs

Consonants

The Modern Israeli Hebrew alphabet uses 22 letters, plus 5 word-final letters that have their own code points. Additional sounds can be represented using dagesh, shin/sin dots, or geresh.

Basic consonants

These are the basic consonant letters used in modern Hebrew.


22
אʔ ∅ʔ05D0
עʔ ∅ ʔ̇05E2
   
טttt05D8
תtt05EA
דddd05D3
קkkk05E7
גɡgg05D2
   
צt͡stsʦ05E6
   
פf pff05E4
בv bvv05D1
וv wv05D5
שs ʃs sh05E9
סsss05E1
זzzz05D6
חχχ05D7
כx kkhx05DB
רʁrʁ05E8
הh ∅hh05D4
   
מmmm05DE
נnnn05E0
   
לlll05DC
יjyy05D9

Matres lectionis

Three of the letters can also represent vowel locations. See Matres lectionis.

Repertoire extensions

Methods used to modify the sound of a consonant. See also Cantillation & other marks.

Dagesh

ּU+05BC POINT DAGESH OR MAPIQ is used in pointed text with 3 consonant letters (and one final form) to indicate that they map to 'hard' sounds. This is similar to the distinction made in Syriac. Dagesh is the only diacritic to appear inside a consonant. Below, the hard sounds are shown to the left, and the normal to the right.


8
פּp05E4
05BC
בּb05D1
05BC
כּk05DB
05BC
ךּk05DA
05BC
פf05E4
בv05D1
כx05DB
ךx05DA

Dagesh can also be found alongside other letters, without any sound change, due to preservation of archaic spelling. The pairs tθ, dð and ɡɣ were lost over time, leaving:


3
תּttṫˑ05EA
05BC
דּdd05D3
05BC
גּɡg05D2
05BC

See also Consonant length.

Shin & sin dots


both
שׁʃshṡᶴ05E9
05C1
שׂssṡˢ05E9
05C2

שU+05E9 LETTER SHIN represents two phonemes: ʃ and s. If it is necessary to indicate which is intended, one of two diacritics (used only with this character) are used: ׂU+05C2 POINT SIN DOT and ׁU+05C1 POINT SHIN DOT. They look identical, but the side to which they are positioned makes the difference.

שקל  ≡  שֶׁקֶל ˈʃɛkɛl shekel

משוש  ≡  מָשׂוֹשׂ maˈsos rejoicing

Geresh

Certain consonants are extended to represent non-native sounds by use of a following ׳U+05F3 PUNCTUATION GERESH.

This first set is used in loanwords and slang that are part of the everyday Hebrew colloquial vocabulary.15


5
ג׳loand͡ʒǧ05D2
05F3
ז׳loanʒž05D6
05F3
צ׳loant͡ʃčʦ´05E6
05F3
ו׳ wwv̇´05D5
05F3
וו w v̇v̇05D5
05D5

The graphemes ו׳ and וו are alternative ways of writing the same thing.

A second set is only used to transliterate foreign sounds, especially Arabic.15


5
ד׳loanð 05D3
05F3
ת׳loanθ ṫ´05EA
05F3
ח׳loanχ 05D7
05F3
ר׳loanʁ ʁ´05E8
05F3
ע׳loanʁ ʔ̇´05E2
05F3

Onsets

Consonant clusters in syllable onsets are simply written using a sequence of consonant letters in full spelling text, and apply ְU+05B0 POINT SHEVA in pointed text.

ברכה  ≡  בְּרָכָה bʁaˈχa congratulation

גבול  ≡  גְּבוּל ɡvul boundary

Finals

Five letters have special word-final forms, called sofit. They are encoded as separate code points in Unicode, and appear as separate keys on a keyboard, so no special processing is needed to display or store them (unlike Arabic).


5
ץt͡stsʦ̽05E5
ףf pf05E3
ךx kkh05DA
םmm05DD
ןnn05DF

Foreign words and names may sometimes use the normal forms at the end of a word, rather than the sofit form. In those cases, use the non-final code points.

Consonants that are not followed by a vowel are normally written with ְU+05B0 POINT SHEVA in pointed text, but not when they are word-final.

בִּגְלַל biˈɡlal because

אֶגְזוֹז eɡˈzoz exhaust pipe

אָקוֹרְד aˈkoʁd chord (music)

Observation: One exception appears to be ךְU+05DA LETTER FINAL KAF + U+05B0 POINT SHEVA.

בֶּרֶךְ ˈbɛʁɛχ knee

רַךְ ʁaχ soft

Consonant clusters

As with onsets, consonant clusters are simply written using a sequence of consonant letters in full spelling text, and are marked by ְU+05B0 POINT SHEVA in pointed text. The offglide of a diphthong is written in the same way.

גמבה  ≡  גַּמְבָּה ˈɡamba bell pepper

בייגל  ≡  בֵּיְגֶּל ˈbej.ɡel bagel

Consonant length

From Wikipedia: gemination is not adhered to in modern Hebrew and is only used in careful pronunciation, such as reading of scriptures in a synagogue service, recitations of biblical or traditional texts or on ceremonious occasions, and then only by very precise readers.

In pre-Modern Hebrew, gemination was indicated using ּU+05BC POINT DAGESH OR MAPIQ.

Consonant sounds to characters

This section maps modern Israeli Hebrew consonant sounds to common graphemes in the Hebrew orthography.

Graphemes in full spelling are shown to the left, while pointed spelling and word-final forms are shown on the right.

p
 

פU+05E4 LETTER PE
פונדק punˈdak inn

פּU+05E4 LETTER PE + U+05BC POINT DAGESH OR MAPIQ
פּוּנְדָק punˈdak inn

t
 

טU+05D8 LETTER TET
קט kat tiny.

תU+05EA LETTER TAV
תות tut strawberry.

תּU+05EA LETTER TAV + U+05BC POINT DAGESH OR MAPIQ. Archaic spelling, still found sometimes in pointed text.
תּוּת tut strawberry.

d
 

דU+05D3 LETTER DALET
דוד dod uncle.

דּU+05D3 LETTER DALET + U+05BC POINT DAGESH OR MAPIQ. Archaic spelling, still found sometimes in pointed text.
דּוּד dod boiler.

k
 

קU+05E7 LETTER QOF
קול kol voice, sound

כU+05DB LETTER KAF
הכה hiˈka to hit

ɡ
 

גU+05D2 LETTER GIMEL
גג gag roof

גּU+05D2 LETTER GIMEL + U+05BC POINT DAGESH OR MAPIQ Archaic spelling, still sometimes used in pointed text.
גַּג gag roof

ʔ
 

עU+05E2 LETTER AYIN
מועיל moʔil useful

אU+05D0 LETTER ALEF
שאל ʃaʔal to ask

t͡s
 

צU+05E6 LETTER TSADI
ציץ t͡sit͡s bud

ץU+05E5 LETTER FINAL TSADI Word-final.
ציץ t͡sit͡s bud

t͡ʃ
 

צ׳U+05E6 LETTER TSADI + U+05F3 PUNCTUATION GERESH Used in loanwords and slang.
ריצ׳רץ׳ ʁit͡ʃʁat͡ʃ zip

ץ׳U+05E5 LETTER FINAL TSADI + U+05F3 PUNCTUATION GERESH Word-final.
ריצ׳רץ׳ ʁit͡ʃʁat͡ʃ zip

d͡ʒ
 

ג׳U+05D2 LETTER GIMEL + U+05F3 PUNCTUATION GERESH Used in loanwords and slang.
ג׳וק d͡ʒuk cockroach, bug

f
 

פU+05E4 LETTER PE
פיספס fisfɛs to miss

ףU+05E3 LETTER FINAL PE Word-final.
כנף kaˈnaf wing

v
 

בU+05D1 LETTER BET
טוב tov goodness, fairness

וU+05D5 LETTER VAV
וו vav hook

ווU+05D5 LETTER VAV + U+05D5 LETTER VAV when word-medial in full spelling.
צוואר t͡saˈvaʁ neck

θ
 

Foreign sound.

ת׳U+05EA LETTER TAV + U+05F3 PUNCTUATION GERESH
ת׳רסטון Thurston.

ð
 

Foreign sound.

ד׳U+05D3 LETTER DALET + U+05F3 PUNCTUATION GERESH
ד׳ו אל-חיג׳ה ðu al ħid͡ʒa (Dhu al-Hijjah).

s
 

סU+05E1 LETTER SAMEKH
סוף ˈsof end

שU+05E9 LETTER SHIN
משוש maˈsos rejoicing

שׂU+05E9 LETTER SHIN + U+05C2 POINT SIN DOT Explicit form, used in pointed text to distinguish from ʃ
שָׂם sam to put, turn on

z
 

זU+05D6 LETTER ZAYIN
זה ze this

ʃ
 

שU+05E9 LETTER SHIN
שקל ˈʃɛkɛl shekel

שׁU+05E9 LETTER SHIN + U+05C1 POINT SHIN DOT Explicit form, used in pointed text to distinguish from s.
שֶׁקֶל ˈʃɛkɛl shekel

ʒ
 

ז׳U+05D6 LETTER ZAYIN + U+05F3 PUNCTUATION GERESH Used in loan words & slang.
ז׳רגון ʒarɡon jargon

χ
 

כU+05DB LETTER KAF
סכך sχaχ branch-roofing

חU+05D7 LETTER HET
חם ˈχam hot

ח׳U+05D7 LETTER HET + U+05F3 PUNCTUATION GERESH Used to indicate that this sound should be used rather than h in non-Hebrew (esp. Arabic) text.
שייח׳ شيخ Sheikh

ךU+05DA LETTER FINAL KAF Word final.

ʁ
 

רU+05E8 LETTER RESH
עיר city

ר׳U+05E8 LETTER RESH + U+05F3 PUNCTUATION GERESH Explicitly indicates the sound for Arabic transliteration.
ר׳ג׳ר Ghajar

h
 

הU+05D4 LETTER HE
הד hɛd echo

m
 

מU+05DE LETTER MEM
מום mum defect

םU+05DD LETTER FINAL MEM Word-final.
מום mum defect

n
 

נU+05E0 LETTER NUN
נין nin grandson

ןU+05DF LETTER FINAL NUN Word-final.
נין nin grandson

ŋɡ
 

נגU+05E0 LETTER NUN + U+05D2 LETTER GIMEL
אנגלית anɡˈlit English

w
 

ווU+05D5 LETTER VAV + U+05D5 LETTER VAV Non-standard orthography.
אוטווה otawa Ottawa

ו׳U+05D5 LETTER VAV + U+05F3 PUNCTUATION GERESH Non-standard orthography (not common).
ו׳יליאם ˈwiljam William

וU+05D5 LETTER VAV In pointed text.
אוֹטָוָה otawa Ottawa

r
 

See ʁ.

l
 

לU+05DC LETTER LAMED
לשון laˈʃon language

j
 

יU+05D9 LETTER YOD
ים ˈjam sea

ייU+05D9 LETTER YOD + U+05D9 LETTER YOD when word-medial in full spelling.
איים iˈjem to threaten

Other features

Cantillation & other marks

In Biblical and older Hebrew texts, many additional diacritics are attached to the base character alongside the niqqud. Nearly all of the following additional marks in the Hebrew Unicode block are cantillation marks, used to indicate how to chant ritual readings from the Hebrew Bible in synagogue services.


35
֑unused0591
֒unused0592
֓unused0593
֔unused0594
֕unused0595
֖unused0596
֗unused0597
֘unused0598
֙unused0599
֚unused059A
֛unused059B
֜unused059C
֝unused059D
֞unused059E
֟unused059F
֠unused05A0
֡unused05A1
֢unused05A2
֣unused05A3
֤unused05A4
֥unused05A5
֦unused05A6
֧unused05A7
֨unused05A8
֩unused05A9
֪unused05AA
֫unused05AB
֬unused05AC
֭unused05AD
֮unused05AE
֯unused05AF
ֽunused05BD
ֿunused05BF
ׄunused05C4
ׅunused05C5

Formatting characters

Hebrew text makes use of a set of invisible formatting characters, especially in plain text, most of which are used to manage text direction. Descriptions of these characters can be found in the following section:

Encoding choices

This section is still in development.

Contents should include:

Numbers

This section describes typographic features related to digits, dates, currencies, etc.

Hebrew uses the same digits as the Latin script.

For about a thousand years from the 2nd century BC, Hebrew used letters as numbers. Nowadays, they are only used this way for the Hebrew calendar, for school grades, for counter styles, and in religious contexts. See Additive in the section about counter styles for more information.

Currency


¤20AA

The denomination is generally expressed by the following abbreviation10, which stands for שקל חדש ˈʃɛkɛl χaˈdaʃ new shekel:

ש״ח ʃaχ new shekel

U+20AA NEW SHEQEL SIGN may also be used. It is displayed to the left of the amount, with no separation or with a thin space, eg.

₪12,000

Observation: Wikipedia says that this requires the sheqel sign to be typed after the amount, however, the opposite is the case for all major browsers.10

Text direction

Hebrew script is written right-to-left in the main, but as with most RTL scripts, numbers and embedded LTR script text are written left-to-right (producing bidirectional text). In the following example, the Hebrew words are read right-to-left, starting with the one on the right, and the numeric expression ("10-12") is read left-to-right, ie. it starts with 10 and ends with 12. (Note that this is unlike Arabic, where the 10 and 12 would be in opposite positions.)

התאריכים 10-12 במרץ

Bidirectional Hebrew text.

Characters in Unicode text are typed and stored in 'logical' order, ie. the order in which they are pronounced. The Unicode Bidirectional Algorithm then steps in when text is printed or displayed to rearrange the text as needed. It automatically takes care of the ordering for all the text in Figure 4, as long as the 'base direction' is set to RTL. In HTML this can be set using the dir attribute (which also sets the alignment of the text), or in plain text using formatting controls.

If the base direction is not set appropriately, the directional runs will be ordered incorrectly as shown in Figure 5, and can become unreadable.

ב־HMTL5 זה מתבצע על ידי הוספת אלמנט ה־inline bdo.

ב־HMTL5 זה מתבצע על ידי הוספת אלמנט ה־inline bdo.

The exact same sequence of characters with the base direction set to RTL (top), and with no base direction set on this LTR page (bottom).

Show default bidi_class properties for characters in the Hebrew orthography described here.

For more information about how directionality and base direction work, see Unicode Bidirectional Algorithm basics. For information about plain text formatting characters see How to use Unicode controls for bidi text. And for working with markup in HTML, see Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts.

On this page, see also Expressions & sequences and Line breaking & hyphenation for additional features related to direction.

Managing text direction

Unicode provides a set of 10 formatting characters that can be used to control the direction of text when displayed. These characters have no visual form in the rendered text, however text editing applications may have a way to show their location.

‫U+202B RIGHT-TO-LEFT EMBEDDING (RLE), ‪U+202A LEFT-TO-RIGHT EMBEDDING (LRE), and ‬U+202C POP DIRECTIONAL FORMATTING (PDF) are in widespread use to set the base direction for an inline range of characters. RLE/LRE come at the start, and PDF at the end of a range of characters for which the base direction is to be set.

More recently, the Unicode Standard added a set of characters which do the same thing but also isolate the content from surrounding characters, in order to avoid spillover effects. They are ⁧U+2067 RIGHT-TO-LEFT ISOLATE (RLI), ⁦U+2066 LEFT-TO-RIGHT ISOLATE (LRI), and ⁩U+2069 POP DIRECTIONAL ISOLATE (PDI). The Unicode Standard recommends that these be used instead.

There is also ⁨U+2068 FIRST STRONG ISOLATE (FSI), used initially to set the base direction according to the first recognised strongly-directional character.

‏U+200F RIGHT-TO-LEFT MARK (RLM) and ‎U+200E LEFT-TO-RIGHT MARK (LRM) are invisible characters with strong directional properties that are also sometimes used to produce the correct ordering of text.

For more information about how to use these formatting characters see How to use Unicode controls for bidi text. Note, however, that when writing HTML you should generally use markup rather than these control codes. For information about that, see Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts.

Expressions & sequences

A sequence of numbers, for example a range separated by hyphens, generally runs left to right in Hebrew (unlike Arabic).

Glyph shaping & positioning

This section describes typographic features related to font/writing styles, cursive text, context-based shaping, context-based positioning, letterform slopes, weights & italics, and case & other character transforms.

You can experiment with examples using the Hebrew character app.

Font styles

Hebrew has a number of different writing styles.

The standard, 'square script' is derived from Aramaic. There are serif and sans-serif fonts.

Serif (top) and sans (bottom) examples of the standard writing style.

The STAM style is used for sacred texts such as the Torah. Certain letters have decorative tags above.4

Text written in the STAM writing style.

The rashi style is used for commentaries on sacred texts. Letters have a more rounded, almost cursive style.4

Text written in the rashi writing style.

Hebrew also has a 'cursive' style, which means 'handwriting' style. Letters are not normally joined. Cursive fonts are only used as display fonts. Many glyphs look very different from the standard letter forms.

Text in Yoav Cursive font.
Text written in the 'cursive' writing style.

Before the Babylonian exile (from which the square script derives), Hebrew was written with different shapes, which are similar to those used for Samaritan.

Context-based shaping & positioning

Are special glyph forms needed, depending on the context in which a character is used? Do glyphs interact in some circumstances? Are there requirements to position diacritics or other items specially, depending on context? Does the script have multiple diacritics competing for the same location relative to the base?

In Hebrew several characters have a different shape at the end of a word, but each shape variant has it's own code point and keyboard key, so there is no need for rendering rules to choose the correct glyph.

This example shows מU+05DE LETTER MEM and םU+05DD LETTER FINAL MEM (on the left).

מקום
Two different shapes for mem, depending on position in the word.
show composition

מקום  ≡  מָקוֹם maˈkom station

Multiple diacritics for one base character are common where the various types of diacritic are mixed.

תֹ֙הוּ֙ וָבֹ֔הוּ וְחֹ֖שֶׁךְ עַל־פְּנֵ֣י
Text using a mixture of vowel (light highlight), consonant, and cantillation (dark highlight) diacritic points.

Combinations of ְU+05B0 POINT SHEVA with other vowel diacritics are represented by single, non-decomposable code points, eg. ֱU+05B1 POINT HATAF SEGOL.

In NFC normalised text, a dagesh or shin/sin dot always follows the vowel diacritic. It may be necessary to reorder the diacritics for some applications, eg. for transcriptions that map a consonant+dagesh to a single letter.

The diacritic ֹU+05B9 POINT HOLAM illustrates how positioning can be context-sensitive. Figure 12 shows 3 examples.

תֹּאַר תֹּאר שֹׂבַע
Three slightly different placements of ֹU+05B9 POINT HOLAM, depending on the surrounding context.

Letterform slopes, weights, & italics

Are italicisation, bolding, oblique, etc relevant? Do italic fonts lean in the right direction? Is synthesised italicisation problematic? Are there other problems relating to bolding or italicisation - perhaps relating to generalised assumptions of applicability?

Bold text is used as one way to highlight or emphasise text. The degree of bolding is often quite light. Bold-italic is typically only used for large display text.3

Italics may also be used, however its use is not abundant, and many of the italic faces in fonts are designed for display use, rather than to accompany a regular font.3

There are different preferences for the direction of the slant for italicised Hebrew text. The choice as to which is preferred appears to be down to the individual, and is a question of whether the slant matches the direction of the Hebrew text, or embedded Latin text.3

עברית
Example of forward-leaning italics (bottom).
show composition

עברית  ≡  עִבְרִית ivˈʁit Hebrew

Typographic units

Word boundaries

Are words separated by spaces, or other characters? Are there special requirements when double-clicking on the text? Are words hyphenated?

The concept of 'word' is difficult to define in any language (see What is a word?). Here, a word is a vaguely-defined, but recognisable semantic unit that is typically smaller than a phrase and may comprise one or more syllables.

Words are separated by spaces.

Hyphens.־U+05BE PUNCTUATION MAQAF is the proper punctuation for representing hyphens between compounds,7.

תל־אביב tel (ʔ)a.ˈviv Tel Aviv

However, it is less common online because it is not always easily available on keyboards. Therefore, - U+002D HYPHEN-MINUS is often substituted, even though the position of that character is too low when displayed.7

The Unicode Standard indicates that lines should not break on either side of the maqaf.g

Graphemes

Grapheme clusters

A grapheme is a user-perceived unit of text. Text operations that use graphemes as a unit of text include line-breaking, forwards deletion, cursor movement & selection, character counts, text spacing, text insertion, justification, case conversions, and sorting. The Unicode Standard uses generalised rules to define 'grapheme clusters', which approximate the likely grapheme boundaries in a writing system, however they don't work well with many complex scripts.

Hebrew typographic units consist of base characters, optionally followed by one or more combining marks. Unicode grapheme clusters can be applied to Hebrew without problems. There are no special issues related to operations that use grapheme clusters as their basic unit of text.

Punctuation & inline features

This section describes typographic features related to word boundaries, phrase & section boundaries, bracketed text, quotations & citations, emphasis, abbreviation, ellipsis & repetition, inline notes & annotations, other punctuation, and other inline text decoration.

Phrase & section boundaries

What characters are used to indicate the boundaries of phrases, sentences, and sections?


6
,002C
;003B
:003A
.002E
?003F
!0021

Hebrew uses ASCII punctuation for the most part. Full stops, question marks, exclamation marks, and commas are used as in English. There are 6 additional punctuation characters in the Hebrew Unicode block.

phrase

,U+002C COMMA

;U+003B SEMICOLON

:U+003A COLON

sentence

.U+002E FULL STOP

?U+003F QUESTION MARK

!U+0021 EXCLAMATION MARK

Note that the direction of the question mark (?) is the same as in English, and not reversed like for Arabic. The same is true for the comma ( , ).

Biblical & liturgical usage. ׀ is used as a word separator.7

Prayer books and similar use ׃ as a full stop.7

Bracketed text


both
(0028
)0029

Hebrew commonly uses ASCII parentheses to insert parenthetical information into text.

  start end
standard

)U+0029 RIGHT PARENTHESIS

(U+0028 LEFT PARENTHESIS

Hebrew uses the same parentheses as English, and uses (U+0028 LEFT PARENTHESIS] at the start (right) and )U+0029 RIGHT PARENTHESIS] at the end (left).7 These are mirrored characters in Unicode, so the glyph for each character is automatically reversed in RTL text.

For example, click on the following to see the component characters.

(סוגריים)

The first character in memory is the paren on the right. The consequence of this is that, when writing Hebrew, the parentheses should be used as if they were named U+0028 START PARENTHESIS and U+0028 END PARENTHESIS, respectively.

Quotations & citations

What characters are used to indicate quotations? Do quotations within quotations use different characters? What characters are used to indicate dialogue? Are the same mechanisms used to cite words, or for scare quotes, etc? What about citing book or article names?


3
201D
201D
2019

Hebrew texts use quotation marks around quotations. Note, however, that these are not paired. Of course, due to keyboard design, quotations may also be surrounded by ASCII double and single quote marks.

  start end
initial

U+201D RIGHT DOUBLE QUOTATION MARK

U+201D RIGHT DOUBLE QUOTATION MARK
nested

U+2019 RIGHT SINGLE QUOTATION MARK

U+2019 RIGHT SINGLE QUOTATION MARK

In principle, for modern quotations, U+201D RIGHT DOUBLE QUOTATION MARK is used at the start and at the end.

”ישראל” Israel

Nested quotations use different quote marks, which would typically be U+2019 RIGHT SINGLE QUOTATION MARK.1

However, in practice, Hebrew texts often use " U+0022 QUOTATION MARK and ' U+0027 APOSTROPHE.

Up to around 1970 Hebrew used U+201E DOUBLE LOW-9 QUOTATION MARK instead for the initial quotation mark, ie. „ישראל”but this changed, partly due to inadequate keyboard designs.10

Emphasis

How are emphasis and highlighting achieved? If lines are drawn alongside, over or through the text, do they need to be a special distance from the text itself? Is it important to skip characters when underlining, etc? How do things change for vertically set text?

Increased tracking is a common way to express emphasis in Hebrew.

The last part of this text is stretched to show emphasis.

Aternatives include the use of a different typeface, and/or underlining.3

Abbreviation, ellipsis & repetition

What characters are used to indicate abbreviation, ellipsis & repetition?


both
״05F4
׳05F3

Acronyms and abbreviations are indicated by placing ״U+05F4 PUNCTUATION GERSHAYIM before the last character.

אונסכ״ו uˈnes.ko UNESCO

סופ״ש soˈfaʃ weekend

׳U+05F3 PUNCTUATION GERESH may also be used to indicate an abbreviation,8 eg. גברת gvéret madam, lady is abbreviated as גב׳ gvéret Mrs.

Due to keyboard inadequacies, these are often replaced by ASCII single and double quote characters, even though in general they are visually too high. 

Line & paragraph layout

This section describes typographic features related to line breaking & hyphenation, text alignment & justification, text spacing, baselines, line height, counters, lists, and styling initials.

Line breaking & hyphenation

Are there special rules about the way text wraps when it hits the end of a line? Does line-breaking wrap whole 'words' at a time, or characters, or something else (such as syllables in Tibetan and Javanese)? What characters should not appear at the end or start of a line, and what should be done to prevent that? Is hyphenation used, or something else? What rules are used? What difficulties exist?

Lines are normally broken at word boundaries.

Breaking between Latin words

When a line break occurs in the middle of an embedded left-to-right sequence, the items in that sequence need to be rearranged visually so that it isn't necessary to read lines from top to bottom.

Figure 15 shows how two Latin words are apparently reordered in the flow of text to accommodate this rule. Of course, the rearragement is only that of the visual glyphs: nothing affects the order of the characters in memory.

Text with no line break in Latin text.

Text with line break in Latin text.
The lower of these two images shows the result of decreasing the line width, so that text wraps between a sequence of Latin words.

Line-edge rules

As in almost all writing systems, certain punctuation characters should not appear at the end or the start of a line. The Unicode line-break properties help applications decide whether a character should appear at the start or end of a line.

Show (default) line-breaking properties for characters in the modern Hebrew orthography described here.

The following list gives examples of typical behaviours for some of the characters used in modern Hebrew. Context may affect the behaviour of some of these and other characters.

Click/tap on the characters to show what they are.

  • ’ ” (   should not be the last character on a line.
  • ’ ” ) . , ; ! ? ־ %   should not begin a new line.
  •   should be kept with any number, even if separated by a space or parenthesis.

Text spacing

This section looks at ways in which spacing is applied between characters over and above that which is introduced during justification. For example, does the orthography create emphasis or other effects by spacing out the words, letters or syllables in a word? (For justification related spacing, see Text alignment & justification, above).

Increased tracking is a common way to express emphasis in Hebrew.

Examples of letter-spacing (highlighted by the red lines) in Hebrew text.

Baselines, line height, etc.

Does the script have special requirements for baseline alignment between mixed scripts and in general? Is line height special for this script? Are there other aspects that affect line spacing, or positioning of items vertically within a line?

Hebrew uses the so-called 'alphabetic' baseline, which is the same as for Latin and many other scripts.

The Hebrew characters are commonly slightly taller than the Latin x-height. Figure 17 shows ascenders and descenders for Hebrew letters in the Noto Serif fonts. In this font combination the maximum height of the Hebrew letters reaches slightly higher than the Latin extenders.

qhxאבלך
Font metrics for text in the Noto Serif and Noto Serif Hebrew fonts.

Counters, lists, etc.

Are there list or other counter styles in use? If so, what is the format used? Do counters need to be upright in vertical text? Are there other aspects related to counters and lists that need to be addressed?

You can experiment with counter styles using the Counter styles converter. Patterns for using these styles in CSS can be found in Ready-made Counter Styles, and we use the names of those patterns here to refer to the various styles.

The Hebrew orthography uses an additive style, in addition to numeric decimal style based on ASCII digits.

Additive

The hebrew additive style uses the letters shown below. It is specified for a range between 1 and 10,999. This system manually specifies the values for 19-15 to force the correct display of 15 and 16, which are commonly rewritten to avoid a close resemblance to the Tetragrammaton. Implementations may, and some do, implement this manually to a higher range.


37
י׳ 1000005D9
05F3
ט׳ 900005D8
05F3
ח׳loan800005D7
05F3
ז׳loan700005D6
05F3
ו׳ 600005D5
05F3
ה׳ 500005D4
05F3
ד׳loan400005D3
05F3
ג׳loan300005D2
05F3
ב׳ 200005D1
05F3
א׳ 100005D0
05F3
ת 40005EA
ש 30005E9
ר 20005E8
ק 10005E7
צ 9005E6
פ 8005E4
ע 7005E2
ס 6005E1
נ 5005E0
מ 4005DE
ל 3005DC
כ 2005DB
יט 1905D9
05D8
יח 1805D9
05D7
יז 1705D9
05D6
טז 1605D8
05D6
טו 1505D8
05D5
י 1005D9
ט 905D8
ח 805D7
ז 705D6
ו 605D5
ה 505D4
ד 405D3
ג 305D2
ב 205D1
א 105D0

Examples:


12
א105D0
ב205D1
ג305D2
ד405D3
יא1105D9
05D0
כב2205DB
05D1
לג3305DC
05D2
מד4405DE
05D3
קיא11105E7
05D9
05D0
רכב22205E8
05DB
05D1
שלג33305E9
05DC
05D2
תמד44405EA
05DE
05D3

Prefixes and suffixes

The default list style uses a full stop + space as a suffix.

Examples:

א. ב. ג. ד. ה.
Separator for Hebrew list counters.

Styling initials

Does the script use special styling of the initial letter of a line or paragraph, such as for drop caps or similar? How about the size relationship between the large letter and the lines alongide? where does the large letter anchor relative to the lines alongside? is it normal to include initial quote marks in the large letter? is the large letter really a syllable? Are dropped, sunken, and raised types found? etc.

It is possible to find the first letter in a paragraph styled so that it is larger and sits alongside several lines of the continuing paragraph text.

An enlarged initial letter in the word לפגי at the beginning of a paragraph.

Observation: The glyph in Figure 19 rises above the normal top line of most Hebrew characters. It also rises above the top line of the adjacent glyphs when positioned alongside them. The bottom of the glyph is aligned with the bottom of the glyphs on the 3rd line down.

Boxed initials can also be found, such as the one in Figure 20. Here, the initial letter is centred horizontally and vertically inside the space created by the box. The box extends from the top line of the first line of text to the baseline of the 6th line.

An enlarged initial letter in the word גבול at the beginning of a paragraph, set in a box.

Page & book layout

This section describes typographic features related to general page layout & progression; grids & tables, notes, footnotes, etc, forms & user interaction, and page numbering, running headers, etc.

General page layout & progression

How are the main text area and ancilliary areas positioned and defined? Are there any special requirements here, such as dimensions in characters for the Japanese kihon hanmen? The book cover for scripts that are read right-to-left scripts is on the right of the spine, rather than the left. When content can flow vertically and to the left or right, how to specify the location of objects, text, etc. relative to the flow? Do tables and grid layouts work as expected? How do columns work in vertical text? Can you mix block of vertical and horizontal text? Does text scroll in a different direction?

Hebrew books, magazines, etc., are bound on the right-hand side, and pages progress from right to left.

عنوان كتاب

Binding configuration for Hebrew books, magazines, etc.

Columns are vertical but run right-to-left across the page.

Acknowledgements

Many thanks for detailed review and comments by Ben Denckla.

References & sources

1Hebrew Academy, האקרמיה ללשון העברת

2Peter T. Daniels and William Bright, The World's Writing Systems, Oxford University Press, 487-497, ISBN 0-19-507993-0

3Liron Lavi Turkenich, Go bolder, just slant it—secondary styles in Hebrew (video), in TYPO International Design Talks

4ScriptSource, Hebrew

5Unicode Consortium, The Unicode Standard, Version 13.0, Chapter 9.1: Middle East-I, Hebrew, 359-364, ISBN 978-1-936213-16-0.

6Unicode Consortium, Unicode Line Breaking Algorithm (UAX#14)

7Wikipedia, Cantillation

8Wikipedia, Geresh

9Wikipedia, Holam

10Wikipedia, Hebrew punctuation

11Wikipedia, Help:IPA/Hebrew

12Wikipedia, Kamatz

13Wikipedia, Modern Hebrew Phonology

14Wikipedia, Schwa

15Wikipedia, Hebrew alphabet

16Wikipedia, Waw (letter)

See recent changes.  •  Make a comment.  •  Licence CC-By © r12a.