This Unicode character picker allows you to produce or analyse runs of Tibetan text using the Tibetan script. Character pickers are especially useful for people who don't know a script well, as characters are displayed in ways that aid identification.
To properly display the text you will need to choose a font that is loaded on your system or device. The font list indicates which fonts are standard for Mac (Snow Leopard/Lion) and Windows7. Note that the fonts aren't guarranteed to work on every system/device, because the font often relies on rendering algorithms provided by the operating system. See more information about standard OS fonts in Mac and Windows.
You can also add codepoints and escapes via the "Add codepoint" field (hit return to add to the output field). You can also paste text into the output field to get information about it. Use the yellow boxes to set preferences or search. Regular expressions are allowed when searching – for example, to find characters with the word KA in their name, enter \bka\b, or the short form :ka:.
When working on an iPad or similar device, you should turn off more controls/Autofocus. This prevents the keyboard popping up after you input every character. You may also need to select a character twice to add it to the output field.
About the chart
Includes characters in the Unicode Tibetan block as of Unicode 7.0.
All Tibetan text is output in Unicode normalisation form NFC by default. You can change to NFD or no normalisation by clicking on 'more controls' and using the switch in the yellow area. Note that normalization only takes place when you add a character by clicking on the table – text pasted into the box won't be normalised until you click on another character below, or change the control in the yellow area.
The following alternative views are available by clicking on the vertical grey bar to the left of the selection area.
Default Clicking on this turns off the other features described in this section. The default table is likely to be most useful to people who are somewhat familiar with the alphabet and characters of Tibetan.
Characters have been arranged so that is easy to input them, and especially to ensure that multiple combining characters are input in the right normalised order. Consonants used in native Tibetan words appear to the left, with native Tibetan vowels below. To the right of each consonant is its subjoined version. Progressing to the right, we have syllable and other separators, then additional consonants and vowels used in transliterations of languages such as Sanskrit, Chinese and English. The far right hand block contains numerals and some symbols, and below all run some head marks and paired punctuation.
Open the expanding link for more cantillation signs, symbols, astrological signs, religious signs, vocalic modification and transliteration head letters, and other marks and signs. Another expanding link displays half numbers and precomposed characters that are decomposed in NFC. Decomposed versions of these characters are available from the main table.
Another expanding link below the main chart area reveals characters that are normalised under NFC to more than one character. (Note that you need to turn normalisation off for this to output these characters as single code points.) The decomposed form of these graphemes appear in the main table, and it is best to use those. You will also find the half numbers in the this section.
Hints This changes the behaviour of the table view so that, when you mouse over a character, characters that are similar in appearance, and may be easily confused, are automatically highlighted. This can be particularly useful for people who are not familiar with the script, to avoid confusing similar characters, or to find the right character when two or more look similar.
Transcriptions There are two transcription panels available in this picker: Latin, and Wylie. The latter panel allows you to follow a transcription to produce some Tibetan text. Where there are multiple possible choices, these choices are presented in a small pop-up box; click on the choice you want to insert it into the output area.
The Latin characters panel provides additional characters you may need while typing in a Latin transcription from the keyboard.
The Wylie transcription panel allows you to generate some Tibetan text from a THL Extended Wylie Transliteration Schema transcription. It currently offers only a basic set of characters.
Tools above the input box
Controls above the input box allow you to run various operations on the text in the box. Most of them work on what you have selected within the box, or the whole box if nothing is selected.
Copy, select, delete. The icons on the left above the input box allow you to copy the text to the clipboard, select the text or delete the text, respectively.
Show codepoints. This produces a list of the Unicode code points in the input box. You can usually follow a link from a code point item to more detailed information about that character.
Convert to escapes. This opens a new window for the converter app, which shows various different ways of representing the text in the input box using escapes.
Old English to IPA. This produces a rough phonetic transcription of the highlighted text in the box. It's useful for generating reminders about how to pronounce the text. The transcription appears just below the input box, where you can copy it, move it into the input box at the caret, or delete it.
Transcribe to Wylie. Transcribes the contents of the box below from tibetan to latin script using the extended version of Wylie's transcription method as described by The Tibetan & Himalayan Library. By default, the whole content of the box is transcribed, but if you have selected a range of text only that range will be transcribed. The result appears in a separate area below the box.
Where ambiguity exists, and there is more than one way to transcribe a character, the displayed text will show alternatives. Click on the alternative that you prefer, and the others will disappear.
Note: I implemented most of the rules, but I used diacritics over lowercase letters rather than uppercase letters, except for the fixed form characters. I also didn't provide conversions for many of the symbols – they will appear without change in the transcription. There may be some bugs for the less common aspects of the transcription, but the are not likely to occur for native tibetan words.
Make example. This may be useful to speed up the creation of examples. You can create an example with up to four parts, delimited by /, in the following order:  Tibetan text,  IPA transcription,  other transcription,  meaning. You don't need to add all four elements, but if you want to skip one in the middle of the sequence, use //.
Character markup. This may be useful to speed up the creation of markup for a specific character or set of characters. Select one or more characters in the output area, then click this button. It will return something like the following for each of the characters:
<span class="uname">U+0F40 TIBETAN LETTER KA</span> (<span lang="bo">ཀ</span>)
Notes on other controls
Controls at the bottom of the page allow you to modify fonts used, the font size, and the height of the output box.
Searching by character name or codepoint. If you are searching for a particular character and know (at least part of) the name or the codepoint, type that in the search box and hit return. All characters with matching text in the name or codepoint number will be highlighted. The highlighting is only removed when you click on the X next to the search input field. You can also use regular expression syntax to improve your search results. For example, to find the letter 'ha', but not 'gha' etc, you can use '\bha\b' (or the shortcut, ':b:').