This Unicode character picker allows you to produce or analyse runs of Urdu text using the Arabic script. Character pickers are especially useful for people who don't know a script well, as characters are displayed in ways that aid identification.
To display the text properly, choose a font that is installed on your system or device. The font list indicates which fonts are standard for Mac (Snow Leopard/Lion) and Windows 7. See more information about standard OS fonts in Mac and Windows.
You can add codepoints and escapes via the "Add codepoint" field (hit return to add them to the output field), or paste text into the output field to get information about it. Use the yellow boxes to set preferences or search. Regular expressions are allowed when searching: for example, to find characters with the word KA in their name, enter \bka\b, or the short form :ka:.
When working on an iPad or similar device, you should turn off more controls/Autofocus. This prevents the keyboard popping up after you input every character. You may also need to select a character twice to add it to the output field.
About the chart
The chart includes characters in the Unicode Arabic block that are used for Urdu. Most of the characters in the Urdu standard UZT 1.01 are included.
All text is output in Unicode normalisation form NFC by default. You can change to NFD or no normalisation by clicking on the buttons in the yellow area. Note that normalisation only takes place when you click on a character: text pasted into the box won't be normalised until you click on another character above, or click on a button in the yellow area. (Note: normalisation is turned off for Han characters in this application.)
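The difference between the NFC, NFD and unnormalised settings can be seen with a minimal Python sketch. It uses the standard unicodedata module and is not the picker's own code; the helper names normalise and codepoints are hypothetical. The example character is U+0622 ARABIC LETTER ALEF WITH MADDA ABOVE, which decomposes canonically into U+0627 followed by U+0653.

```python
import unicodedata
from typing import Optional

# U+0622 ARABIC LETTER ALEF WITH MADDA ABOVE decomposes canonically
# into U+0627 ARABIC LETTER ALEF + U+0653 ARABIC MADDAH ABOVE.
PRECOMPOSED = "\u0622"
DECOMPOSED = "\u0627\u0653"

def normalise(text: str, form: Optional[str] = "NFC") -> str:
    # Hypothetical helper: form is "NFC", "NFD", or None for no normalisation.
    if form is None:
        return text
    return unicodedata.normalize(form, text)

def codepoints(text: str) -> str:
    # List the codepoints in the text as U+XXXX values.
    return " ".join(f"U+{ord(ch):04X}" for ch in text)

print(codepoints(normalise(DECOMPOSED, "NFC")))   # U+0622
print(codepoints(normalise(PRECOMPOSED, "NFD")))  # U+0627 U+0653
print(codepoints(normalise(DECOMPOSED, None)))    # U+0627 U+0653
```

With normalisation off, the text keeps whichever form it was entered in, which is why pasted text can differ from text built by clicking on characters.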
The following alternative views are available by clicking on the vertical grey bar to the left of the selection area.
Default. Clicking on this turns off the other features described in this section. The default table is likely to be most useful to people who are somewhat familiar with the alphabet and characters of Urdu.
By default, the table is right-aligned and flows from right to left, but you can flip it using more controls/table direction.
Hints. This changes the behaviour of the table view so that, when you mouse over a character, characters that look similar and may easily be confused with it are automatically highlighted. This is particularly useful for people who are not familiar with the script, either to avoid confusing similar characters or to find the right character when two or more look alike.
Transcriptions. There are two transcription panels available in this picker: Latin, and Library of Congress. The latter panel allows you to follow a transcription to produce some Urdu text. Where there are multiple possible choices, they are presented in a small pop-up box; click on the choice you want to insert it into the output area.
The Latin panel provides additional characters you may need while typing in a Latin transcription from the keyboard.
The LOC panel allows you to generate some Urdu text from characters used for Urdu transcription in both the Library of Congress scheme and Teach Yourself Urdu by David Matthews and Mohamed Kasim Dalvi.
Notes on other controls
The Split syllables button inserts, at the cursor position, a copy of the text that has been split apart at syllable boundaries. Syllables joined by subjoining consonants do not have a space inserted. As with other controls, this works on a subset of the text if it is highlighted, and replaces any highlighted text.
Controls at the bottom of the page allow you to modify fonts used, the font size, and the height of the output box.
Searching by character name or codepoint. If you are searching for a particular character and know (at least part of) the name or the codepoint, type that in the search box and hit return. All characters with matching text in the name or codepoint number will be highlighted. The highlighting is only removed when you click on the X next to the search input field. You can also use regular expression syntax to improve your search results. For example, to find the letter 'ha', but not 'gha' etc, you can use \bha\b (or the shortcut, :ha:).
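The effect of the word-boundary syntax can be sketched in Python. This is only an illustration of the matching behaviour, not the picker's implementation: the functions expand_shortcut and search are hypothetical, the way the :word: shortcut is expanded is an assumption, and the example uses HEH because that word appears in character names in the Arabic block.

```python
import re
import unicodedata

ARABIC_BLOCK = range(0x0600, 0x0700)  # the Unicode Arabic block

def expand_shortcut(pattern: str) -> str:
    # Assumed expansion: rewrite :word: as the word wrapped in \b ... \b.
    return re.sub(r":(\w+):", lambda m: r"\b" + m.group(1) + r"\b", pattern)

def search(pattern: str, codepoints=ARABIC_BLOCK) -> list:
    # Return the codepoints whose hex number or character name matches.
    regex = re.compile(expand_shortcut(pattern), re.IGNORECASE)
    hits = []
    for cp in codepoints:
        name = unicodedata.name(chr(cp), "")
        if not name:
            continue  # unassigned codepoint, nothing to match against
        if regex.search(f"{cp:04X} {name}"):
            hits.append(cp)
    return hits

plain = search("heh")       # also finds U+067F ARABIC LETTER TEHEH
whole = search(r"\bheh\b")  # HEH as a whole word only
for cp in whole:
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}")
```

Here search(":heh:") returns the same list as search(r"\bheh\b"), while the plain search for heh additionally picks up names such as TEHEH that merely contain those letters.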
CC base. You would normally expect combining characters, such as accents and vowel signs, to be shown with a dotted circle when displayed alone. However, these glyphs are handled inconsistently from one browser or font to the next. The picker is set up for a given web font initially, but if you change the table font you may need to adjust the display so that combining characters remain easy to click on. The CC base control allows you to specify a base character that will be used before each combining character (or no base character). This should help for most font and browser combinations.
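The idea behind the CC base control can be sketched as follows. This is a rough illustration under stated assumptions, not the picker's code: the names is_combining and display_form are hypothetical, and treating every character in a Unicode Mark category as a combining character is an assumption about which characters the control affects.

```python
import unicodedata

DOTTED_CIRCLE = "\u25CC"  # the conventional placeholder base

def is_combining(ch: str) -> bool:
    # Assumption: any character in a Mark category (Mn, Mc, Me) counts.
    return unicodedata.category(ch).startswith("M")

def display_form(ch: str, base: str = DOTTED_CIRCLE) -> str:
    # Put the chosen base before a combining character; an empty base
    # means the character is shown on its own.
    if base and is_combining(ch):
        return base + ch
    return ch

fatha = "\u064E"  # ARABIC FATHA, a combining vowel sign
beh = "\u0628"    # ARABIC LETTER BEH, an ordinary letter

print(display_form(fatha))       # dotted circle + fatha
print(display_form(fatha, beh))  # beh + fatha
print(display_form(beh))         # unchanged
```

Only the table display would change in this sketch; the base is for showing the character and is not meant to be added to the output.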