Latin character notes

Updated 11 June, 2018 • tags bengali, scriptnotes

This page lists characters in the following Unicode block and provides information about them.

This is not authoritative, peer-reviewed information – these are just notes I have gathered and copied from various places.

We have usage data for 243 languages that use the Latin script. Because of the size of the Latin repertoire, this page only shows characters for which notes exist.

Find

If you click on any red example text, you will see at the bottom right of the page a list of the characters that make up the example.

Read more about how to use this page.

To find a character by codepoint, type #char0000 at the end of the URL in the address bar, where 0000 is a four-figure, hex codepoint number, all in uppercase. Or type the character or the hex number in the Find control above.

To view this page as intended, you need a good font. Click the blue vertical bar at the bottom right of the page to apply other fonts, if you have them on your system. I recommend the excellent and free Doulos SIL font. The large character in the box will not be rendered unless the webfont downloaded with the page or a system font has a glyph for it. (If there is no glyph and you want to see what it looks like, click on See in UniView.)

Information about languages that use these characters is taken from the list maintained for the Character Use app. The list is not exhaustive.

References are indicated by superscript characters. Wherever possible, those contain direct links to the source material. When such a pointer is alongside an arrow → it means that it's worth following the link for the additional information it provides. Digits refer to the main sources, which are listed at the bottom of a set of notes.

When you are using UniView and you turn on Show notes, UniView will pull in information about characters from this page.

Punctuation

'

U+0027 APOSTROPHE

Description in the Unicode standard:

= apostrophe-quote (1.0)
= APL quote
• neutral (vertical) glyph with mixed usage
• 2019 is preferred for apostrophe
• preferred characters in English for paired quotation marks are 2018 & 2019
• 05F3 is preferred for geresh when writing Hebrew
→ (modifier letter prime - 02B9)
→ (modifier letter apostrophe - 02BC)
→ (modifier letter vertical line - 02C8)
→ (combining acute accent - 0301)
→ (hebrew punctuation geresh - 05F3)
→ (prime - 2032)
→ (latin small letter saltillo - A78C)

Lisu nasalisation mark.

Placed after a vowel to make it nasalised, eg. ꓳ'33 ʔõ goose.

-

U+002D HYPHEN-MINUS

Description in the Unicode standard:

= hyphen or minus sign
• used for either hyphen or minus sign
→ (hyphen - 2010)
→ (non-breaking hyphen - 2011)
→ (figure dash - 2012)
→ (en dash - 2013)
→ (hyphen bullet - 2043)
→ (minus sign - 2212)
→ (roman uncia sign - 10191)

Lisu punctuation

Used to bind together syllables in a name, eg. ꓡꓲ-ꓢꓴ lisu (Lisu).

_

U+005F LOW LINE

Description in the Unicode standard:

= spacing underscore (1.0)
• this is a spacing character
→ (modifier letter low macron - 02CD)
→ (combining macron below - 0331)
→ (combining low line - 0332)
→ (double low line - 2017)

Lisu vowel, a glide.

Pronounced ɑ and usually bearing a 31 pitch. Written after a verbal form to mark various aspects, eg. ꓠꓴ ꓙꓰꓻ_ ꓥꓳꓻ nu33ʤe33ɑ44ŋo33 (you will go) and ꓖꓳꓻ ꓡꓱꓻ ꓥ_ ꓟꓲꓻ go3333ŋɑ44ɑ31mi33 (but).

¡

U+00A1 INVERTED EXCLAMATION MARK

Description in the Unicode standard:

• Spanish, Asturian, Galician
→ (exclamation mark - 0021)

Spanish, Asturian, Galician punctuation

Used to signal the start of a sentence or phrase that is an exclamation. The end of the exclamation is signalled by ASCII U+0021 EXCLAMATION MARK !, eg. "No sabe nadar, ¡pero sí que sabe correr!" He doesn't know how to swim, but he certainly knows how to run!

Refs: [1] [elreq] §2.3.1

Amharic punctuation, timirte slaq ትእምርተ፡ሥላቅ

Used at the end of a sentence to indicate sarcasm.

Refs: [1] [elreq] §2.3.1

«

U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK

Description in the Unicode standard:

= left guillemet
= chevrons (in typography)
• usually opening, sometimes closing
→ (much less-than - 226A)
→ (left double angle bracket - 300A)

Used for quotations in Armenian, etc.

»

U+00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK

Description in the Unicode standard:

= right guillemet
• usually closing, sometimes opening
→ (much greater-than - 226B)
→ (right double angle bracket - 300B)

Used for quotations in Armenian, etc.

Reference expansions

  1. [Daniels] Peter T. Daniels and William Bright, The World's Writing Systems, Oxford University Press, ISBN 0-19-507993-0
  2. [elreq] Daniel Yacob, Ethiopic Layout Requirements
First published 3 Feb 2014. This version 2018-06-11 8:20 GMT.  •  Copyright r12a@w3.org. Licence CC-By.