Language Characters

This document lists the characters you need to recognize in order to support localization of applications to various languages.

The characters represent those that can be entered by a standard keyboard for the language or are expected to be found in typical Internet usage of the language. Depending on your application, you may need fewer of the characters than listed here. If the characters your application uses can be entirely pre-determined then the list can likely be reduced. If your application allows arbitrary keyboard input or system clipboard support then your application may need to be prepared to deal with any Unicode character passed to it. Some characters listed below aren't commonly used in the given language but appear on keyboards for the language and thus are directly accessible to users.

For most languages there are characters that are occasionally used but which aren't listed here. Even in English we see this with words like résumé and mediæval. This document does not list these characters, as they are rare, usually have workarounds (e.g. resume and mediaeval), and the extent of them doesn't have any clear bounds.

A number of the characters are English alphabetic characters with accents of various sorts, such as è. An application can support these characters as independent characters or instead may choose to support them as a two character pair (e.g. e + `).

An application should support most or all of the requisite special characters, as they are important for controlling text layout. The EAText package supports all Unicode functionality of significance related to these characters.

Text from this document can be directly read by the BitmapFontEditor application, though you need to paste it into a Unicode text file first and read it from there. BitmapFontEditor recognizes the characters below and recognizes U+XXXX sequences for any characters that aren't visible.

US English

<Requisite Special Characters>
<Basic Symbols>
U+0041 A
U+0042 B
U+0043 C
U+0044 D
U+0045 E
U+0046 F
U+0047 G
U+0048 H
U+0049 I
U+004a J
U+004b K
U+004c L
U+004d M
U+004e N
U+004f O
U+0050 P
U+0051 Q
U+0052 R
U+0053 S
U+0054 T
U+0055 U
U+0056 V
U+0057 W
U+0058 X
U+0059 Y
U+005a Z
U+0061 a
U+0062 b
U+0063 c
U+0064 d
U+0065 e
U+0066 f
U+0067 g
U+0068 h
U+0069 i
U+006a j
U+006b k
U+006c l
U+006d m
U+006e n
U+006f o
U+0070 p
U+0071 q
U+0072 r
U+0073 s
U+0074 t
U+0075 u
U+0076 v
U+0077 w
U+0078 x
U+0079 y
U+007a z

UK English

<US English>
<Basic Symbols>
<Requisite Special Characters>
U+00A6 ¦
U+00C9 É
U+00D3 Ó
U+00CD Í
U+00DA Ú
U+00E9 é
U+00ED í
U+00F3 ó
U+00FA ú
U+20AC €

Dutch

The Dutch alphabet is the same as the 26 letter English alphabet and since the spelling reform of the 1990s it no longer uses accents except for foreign words (particularly French). You may or may not need the accented characters, depending on your application. DutchGrammar.com discusses this fairly well. You can also support most of the accented characters via <Combining Accents>.

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c0 À
U+00c1 Á
U+00c2 Â
U+00c4 Ä
U+00c7 Ç
U+00c8 È
U+00c9 É
U+00ca Ê
U+00cb Ë
U+00cc Ì
U+00cd Í
U+00ce Î
U+00cf Ï
U+00d2 Ò
U+00d3 Ó
U+00d4 Ô
U+00d6 Ö
U+00d9 Ù
U+00da Ú
U+00db Û
U+00dc Ü
U+00e0 à
U+00e1 á
U+00e2 â
U+00e4 ä
U+00e7 ç
U+00e8 è
U+00e9 é
U+00ea ê
U+00eb ë
U+00ec ì
U+00ed í
U+00ee î
U+00ef ï
U+00f2 ò
U+00f3 ó
U+00f4 ô
U+00f6 ö
U+00f9 ù
U+00fa ú
U+00fb û
U+00fc ü
U+00a8 ¨
U+00b0 °
U+00b4 ´
U+20AC €

Icelandic

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c1 Á
U+00c4 Ä
U+00c5 Å
U+00c6 Æ
U+00c9 É
U+00cb Ë
U+00cd Í
U+00d0 Ð
U+00d3 Ó
U+00d6 Ö
U+00da Ú
U+00dc Ü
U+00dd Ý
U+00de Þ
U+00e1 á
U+00e4 ä
U+00e5 å
U+00e6 æ
U+00e9 é
U+00eb ë
U+00ed í
U+00f0 ð
U+00f3 ó
U+00f6 ö
U+00fa ú
U+00fc ü
U+00fd ý
U+00fe þ
U+00ff ÿ
U+0178 Ÿ
U+00a8 ¨
U+00b0 °
U+00b4 ´
U+20AC €

Norwegian

<US English>
<Basic Symbols>
<Requisite Special Characters>
U+00c5 Å
U+00c6 Æ
U+00d8 Ø
U+00e5 å
U+00e6 æ
U+00f8 ø
U+00a3 £
U+00a4 ¤
U+00a7 §
U+00a8 ¨
U+00b4 ´
U+00b5 µ
U+20AC €

Hungarian

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c1 Á
U+00c9 É
U+00cd Í
U+00d3 Ó
U+00d6 Ö
U+00da Ú
U+00dc Ü
U+00e1 á
U+00e9 é
U+00ed í
U+00f3 ó
U+00f6 ö
U+00fa ú
U+00fc ü
U+0150 Ő
U+0151 ő
U+0170 Ű
U+0171 ű
U+00a7 §
U+00d7 ×
U+00f7 ÷
U+20AC €

Finnish

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c4 Ä
U+00c5 Å
U+00d6 Ö
U+00e4 ä
U+00e5 å
U+00f6 ö
U+00a3 £
U+00a4 ¤
U+00a7 §
U+00a8 ¨
U+00bd ½
U+20AC €
U+00b5 µ

Swedish

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c4 Ä
U+00c5 Å
U+00d6 Ö
U+00e4 ä
U+00e5 å
U+00f6 ö
U+00a3 £
U+00a4 ¤
U+00a7 §
U+00a8 ¨
U+00bd ½
U+20AC €
U+00b5 µ

French

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00e0 à
U+00e2 â
U+00e7 ç
U+00e8 è
U+00e9 é
U+00ea ê
U+00eb ë
U+00ee î
U+00ef ï
U+00f4 ô
U+00f9 ù
U+00fb û
U+00fc ü
U+00a3 £
U+00a4 ¤
U+00a7 §
U+00a8 ¨
U+00b0 °
U+00b2 ²
U+00b5 µ
U+20AC €

German

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c4 Ä
U+00d6 Ö
U+00dc Ü
U+00df ß
U+00e4 ä
U+00f6 ö
U+00fc ü
U+00a7 §
U+00b0 °
U+00b2 ²
U+00b3 ³
U+03bc μ
U+20AC €

Spanish

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c1 Á
U+00c9 É
U+00cd Í
U+00d1 Ñ
U+00d3 Ó
U+00da Ú
U+00dc Ü
U+00e1 á
U+00e9 é
U+00ed í
U+00f1 ñ
U+00f3 ó
U+00fa ú
U+00fc ü
U+00a1 ¡
U+00a8 ¨
U+00ac ¬
U+00b4 ´
U+00bf ¿
U+20AC €

Danish

<US English>
<Basic Symbols>
<Requisite Special Characters>
U+00c5 Å
U+00c6 Æ
U+00d8 Ø
U+00e5 å
U+00e6 æ
U+00f8 ø
U+00a3 £
U+03bc μ
U+20AC €

Italian

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c0 À
U+00c8 È
U+00c9 É
U+00cc Ì
U+00d2 Ò
U+00d3 Ó
U+00d9 Ù
U+00e0 à
U+00e8 è
U+00e9 é
U+00ec ì
U+00f2 ò
U+00f3 ó
U+00f9 ù
U+20AC €

Polish

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00d3 Ó
U+00f3 ó
U+0104 Ą
U+0105 ą
U+0106 Ć
U+0107 ć
U+0118 Ę
U+0119 ę
U+0141 Ł
U+0142 ł
U+0143 Ń
U+0144 ń
U+015a Ś
U+015b ś
U+0179 Ź
U+017a ź
U+017b Ż
U+017c ż

Russian

<Basic Symbols>
<Requisite Special Characters>
U+0401 Ё
U+0410 А
U+0411 Б
U+0412 В
U+0413 Г
U+0414 Д
U+0415 Е
U+0416 Ж
U+0417 З
U+0418 И
U+0419 Й
U+041a К
U+041b Л
U+041c М
U+041d Н
U+041e О
U+041f П
U+0420 Р
U+0421 С
U+0422 Т
U+0423 У
U+0424 Ф
U+0425 Х
U+0426 Ц
U+0427 Ч
U+0428 Ш
U+0429 Щ
U+042a Ъ
U+042b Ы
U+042c Ь
U+042d Э
U+042e Ю
U+042f Я
U+0430 а
U+0431 б
U+0432 в
U+0433 г
U+0434 д
U+0435 е
U+0436 ж
U+0437 з
U+0438 и
U+0439 й
U+043a к
U+043b л
U+043c м
U+043d н
U+043e о
U+043f п
U+0440 р
U+0441 с
U+0442 т
U+0443 у
U+0444 ф
U+0445 х
U+0446 ц
U+0447 ч
U+0448 ш
U+0449 щ
U+044a ъ
U+044b ы
U+044c ь
U+044d э
U+044e ю
U+044f я
U+0451 ё

Greek

<Basic Symbols>
<Requisite Special Characters>
U+00a3 £
U+00a4 ¤
U+00a6 ¦
U+00a7 §
U+00ab «
U+00ac ¬
U+00b0 °
U+00b1 ±
U+00b2 ²
U+00b3 ³
U+00b6 ¶
U+00bb »
U+00bd ½
U+0386 Ά
U+0388 Έ
U+0389 Ή
U+038a Ί
U+038c Ό
U+038e Ύ
U+038f Ώ
U+0391 Α
U+0392 Β
U+0393 Γ
U+0394 Δ
U+0395 Ε
U+0396 Ζ
U+0397 Η
U+0398 Θ
U+0399 Ι
U+039a Κ
U+039b Λ
U+039c Μ
U+039d Ν
U+039e Ξ
U+039f Ο
U+03a0 Π
U+03a1 Ρ
U+03a3 Σ
U+03a4 Τ
U+03a5 Υ
U+03a6 Φ
U+03a7 Χ
U+03a8 Ψ
U+03a9 Ω
U+03aa Ϊ
U+03ab Ϋ
U+03ac ά
U+03ad έ
U+03ae ή
U+03af ί
U+03b1 α
U+03b2 β
U+03b3 γ
U+03b4 δ
U+03b5 ε
U+03b6 ζ
U+03b7 η
U+03b8 θ
U+03b9 ι
U+03ba κ
U+03bb λ
U+03bc μ
U+03bd ν
U+03be ξ
U+03bf ο
U+03c0 π
U+03c1 ρ
U+03c2 ς
U+03c3 σ
U+03c4 τ
U+03c5 υ
U+03c6 φ
U+03c7 χ
U+03c8 ψ
U+03c9 ω
U+03ca ϊ
U+03cb ϋ
U+03cc ό
U+03cd ύ
U+03ce ώ
U+20AC €

Portuguese (Portugal)

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c0 À
U+00c1 Á
U+00c2 Â
U+00c3 Ã
U+00c7 Ç
U+00c9 É
U+00ca Ê
U+00cd Í
U+00d3 Ó
U+00d4 Ô
U+00d5 Õ
U+00da Ú
U+00e0 à
U+00e1 á
U+00e2 â
U+00e3 ã
U+00e7 ç
U+00e9 é
U+00ea ê
U+00ed í
U+00f3 ó
U+00f4 ô
U+00f5 õ
U+00fa ú
U+00a3 £
U+00a7 §
U+00aa ª
U+00ab «
U+00a8 ¨
U+00b0 °
U+00b4 ´
U+00bb »
U+20AC €

Portuguese (Brazil)

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c0 À
U+00c1 Á
U+00c2 Â
U+00c3 Ã
U+00c7 Ç
U+00c9 É
U+00ca Ê
U+00cd Í
U+00d3 Ó
U+00d4 Ô
U+00d5 Õ
U+00da Ú
U+00dc Ü
U+00e0 à
U+00e1 á
U+00e2 â
U+00e3 ã
U+00e7 ç
U+00e9 é
U+00ea ê
U+00ed í
U+00f3 ó
U+00f4 ô
U+00f5 õ
U+00fa ú
U+00fc ü
U+00a2 ¢
U+00a3 £
U+00a4 ¤
U+00a6 ¦
U+00a7 §
U+00ac ¬
U+20a2 ₢
U+00b0 °
U+00b2 ²
U+00b3 ³
U+00b4 ´
U+00b9 ¹
U+00bd ½
U+20AC €

Czech

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c1 Á
U+00c9 É
U+00cc Ì
U+00d3 Ó
U+00da Ú
U+00dd Ý
U+00e1 á
U+00e9 é
U+00ec ì
U+00f3 ó
U+00fa ú
U+00fd ý
U+010c Č
U+010d č
U+010e Ď
U+010f ď
U+011a Ě
U+011b ě
U+0147 Ň
U+0148 ň
U+0158 Ř
U+0159 ř
U+0160 Š
U+0161 š
U+0164 Ť
U+0165 ť
U+016e Ů
U+016f ů
U+017d Ž
U+017e ž
U+00a7 §

Romanian

The four characters U+0218 - U+0219, U+021a, U+021b, are unique to Romanian. They are S, s, T, t with comma below. Many older TrueType fonts don't have these characters and so instead the cedilla is often used instead of a comma: ŞşŢţ. However, there is a difference between a comma and a cedilla and the comma is more correct.

<US English>
<Basic Symbols>
<Requisite Special Characters>
<possibly Combining Accents>
U+00c2 Â
U+00ce Î
U+00e2 â
U+00ee î
U+0102 Ă
U+0103 ă
U+0218 Ș
U+0219 ș
U+021a Ț
U+021b ț

Chinese (Traditional)

<US English>
<Full Width English>
<Basic Symbols>
<Requisite Special Characters>
U+3000 " " Ideographic Space
U+4E00 - U+9FFF (subset of these)

Chinese (Simplified)

<US English>
<Full Width English>
<Basic Symbols>
<Requisite Special Characters>
U+3000 " " Ideographic Space
U+4E00 - U+9FFF (subset of these)

Japanese

<US English>
<Full Width English>
<Basic Symbols>
<Requisite Special Characters>
U+3000 " " Ideographic Space
U+3041 - U+309F
U+30A0 - U+30FF
U+4E00 - U+9FFF (subset of these)

Korean

<US English>
<Full Width English>
<Basic Symbols>
<Requisite Special Characters>
<possibly various Han ideographs>
<possibly U+3131 - U+318E (compatibility Jamo)>
U+3000 " " Ideographic Space
U+AC00 - U+D7A3
U+1100 - U+11F9

Thai

A few of the Thai characters are uncommonly used, including the Thai digits (U+0E50 - U+0E5A).

<Basic Symbols>
<Requisite Special Characters>
U+0e01 ก
U+0e02 ข
U+0e03 ฃ
U+0e04 ค
U+0e05 ฅ
U+0e06 ฆ
U+0e07 ง
U+0e08 จ
U+0e09 ฉ
U+0e0a ช
U+0e0b ซ
U+0e0c ฌ
U+0e0d ญ
U+0e0e ฎ
U+0e0f ฏ
U+0e10 ฐ
U+0e11 ฑ
U+0e12 ฒ
U+0e13 ณ
U+0e14 ด
U+0e15 ต
U+0e16 ถ
U+0e17 ท
U+0e18 ธ
U+0e19 น
U+0e1a บ
U+0e1b ป
U+0e1c ผ
U+0e1d ฝ
U+0e1e พ
U+0e1f ฟ
U+0e20 ภ
U+0e21 ม
U+0e22 ย
U+0e23 ร
U+0e24 ฤ
U+0e25 ล
U+0e26 ฦ
U+0e27 ว
U+0e28 ศ
U+0e29 ษ
U+0e2a ส
U+0e2b ห
U+0e2c ฬ
U+0e2d อ
U+0e2e ฮ
U+0e2f ฯ
U+0e30 ะ
U+0e31  ั
U+0e32 า
U+0e33 ำ
U+0e34  ิ
U+0e35  ี
U+0e36  ึ
U+0e37  ื
U+0e38  ุ
U+0e39  ู
U+0e3a  ฺ
U+0e3f ฿
U+0e40 เ
U+0e41 แ
U+0e42 โ
U+0e43 ใ
U+0e44 ไ
U+0e45 ๅ
U+0e46 ๆ
U+0e47  ็
U+0e48  ่
U+0e49  ้
U+0e4a  ๊
U+0e4b  ๋
U+0e4c  ์
U+0e4d  ํ
U+0e4e  ๎
U+0e4f ๏
U+0e50 ๐
U+0e51 ๑
U+0e52 ๒
U+0e53 ๓
U+0e54 ๔
U+0e55 ๕
U+0e56 ๖
U+0e57 ๗
U+0e58 ๘
U+0e59 ๙
U+0e5a ๚
U+0e5b ๛
U+f700 
U+f701  
U+f702  
U+f703  
U+f704  
U+f705  
U+f706  
U+f707  
U+f708  
U+f709  
U+f70a  
U+f70b  
U+f70c  
U+f70d  
U+f70e  
U+f70f 
U+f710  
U+f711  
U+f712  
U+f713  
U+f714  
U+f715  
U+f716  
U+f717  
U+f718  
U+f719  
U+f71a  

Hebrew

<Basic Symbols>
<Requisite Special Characters>
<possibly combining chars U+0591 - U+05C4>
U+05bc ּ
U+05be ־
U+05d0 א
U+05d1 ב
U+05d2 ג
U+05d3 ד
U+05d4 ה
U+05d5 ו
U+05d6 ז
U+05d7 ח
U+05d8 ט
U+05d9 י
U+05da ך
U+05db כ
U+05dc ל
U+05dd ם
U+05de מ
U+05df ן
U+05e0 נ
U+05e1 ס
U+05e2 ע
U+05e3 ף
U+05e4 פ
U+05e5 ץ
U+05e6 צ
U+05e7 ק
U+05e8 ר
U+05e9 ש
U+05ea ת
U+05f0 װ
U+05f1 ױ
U+05f2 ײ
U+05f3 ׳
U+05f4 ״

Arabic

A number of these characters are uncommonly used, including the Arabic digits. Arabic is cursive (like handwriting) and cannot be displayed by simply stringing together characters as with English and other Latin-based languages. Arabic needs to have OpenType information within the font which tells how to substitute glyphs for other glyphs based on their positions within words so that it looks like smooth handwriting. 

<Basic Symbols>
<Requisite Special Characters>
<contextually substituted glyphs, usually via OpenType>
U+0600 ؀
U+0601 ؁
U+0602 ؂
U+0603 ؃
U+060c ،
U+060d ؍
U+060e ؎
U+060f ؏
U+0610 ؐ
U+0611 ؑ
U+0612 ؒ
U+0613 ؓ
U+0614 ؔ
U+0615 ؕ
U+061b ؛
U+061f ؟
U+0621 ء
U+0622 آ
U+0623 أ
U+0624 ؤ
U+0625 إ
U+0626 ئ
U+0627 ا
U+0628 ب
U+0629 ة
U+062a ت
U+062b ث
U+062c ج
U+062d ح
U+062e خ
U+062f د
U+0630 ذ
U+0631 ر
U+0632 ز
U+0633 س
U+0634 ش
U+0635 ص
U+0636 ض
U+0637 ط
U+0638 ظ
U+0639 ع
U+063a غ
U+0640 ـ
U+0641 ف
U+0642 ق
U+0643 ك
U+0644 ل
U+0645 م
U+0646 ن
U+0647 ه
U+0648 و
U+0649 ى
U+064a ي
U+064b ً
U+064c ٌ
U+064d ٍ
U+064e َ
U+064f ُ
U+0650 ِ
U+0651 ّ
U+0652 ْ
U+0653 ٓ
U+0654 ٔ
U+0655 ٕ
U+0656 ٖ
U+0657 ٗ
U+0658 ٘
U+0660 ٠
U+0661 ١
U+0662 ٢
U+0663 ٣
U+0664 ٤
U+0665 ٥
U+0666 ٦
U+0667 ٧
U+0668 ٨
U+0669 ٩
U+066a ٪
U+066b ٫
U+066c ٬
U+066d ٭
U+066e ٮ
U+066f ٯ
U+0670 ٰ
U+0671 ٱ
U+0672 ٲ
U+0673 ٳ
U+0674 ٴ
U+0675 ٵ
U+0676 ٶ
U+0677 ٷ
U+0678 ٸ
U+0679 ٹ
U+067a ٺ
U+067b ٻ
U+067c ټ
U+067d ٽ
U+067e پ
U+067f ٿ
U+0680 ڀ
U+0681 ځ
U+0682 ڂ
U+0683 ڃ
U+0684 ڄ
U+0685 څ
U+0686 چ
U+0687 ڇ
U+0688 ڈ
U+0689 ډ
U+068a ڊ
U+068b ڋ
U+068c ڌ
U+068d ڍ
U+068e ڎ
U+068f ڏ
U+0690 ڐ
U+0691 ڑ
U+0692 ڒ
U+0693 ړ
U+0694 ڔ
U+0695 ڕ
U+0696 ږ
U+0697 ڗ
U+0698 ژ
U+0699 ڙ
U+069a ښ
U+069b ڛ
U+069c ڜ
U+069d ڝ
U+069e ڞ
U+069f ڟ
U+06a0 ڠ
U+06a1 ڡ
U+06a2 ڢ
U+06a3 ڣ
U+06a4 ڤ
U+06a5 ڥ
U+06a6 ڦ
U+06a7 ڧ
U+06a8 ڨ
U+06a9 ک
U+06aa ڪ
U+06ab ګ
U+06ac ڬ
U+06ad ڭ
U+06ae ڮ
U+06af گ
U+06b0 ڰ
U+06b1 ڱ
U+06b2 ڲ
U+06b3 ڳ
U+06b4 ڴ
U+06b5 ڵ
U+06b6 ڶ
U+06b7 ڷ
U+06b8 ڸ
U+06b9 ڹ
U+06ba ں
U+06bb ڻ
U+06bc ڼ
U+06bd ڽ
U+06be ھ
U+06bf ڿ
U+06c0 ۀ
U+06c1 ہ
U+06c2 ۂ
U+06c3 ۃ
U+06c4 ۄ
U+06c5 ۅ
U+06c6 ۆ
U+06c7 ۇ
U+06c8 ۈ
U+06c9 ۉ
U+06ca ۊ
U+06cb ۋ
U+06cc ی
U+06cd ۍ
U+06ce ێ
U+06cf ۏ
U+06d0 ې
U+06d1 ۑ
U+06d2 ے
U+06d3 ۓ
U+06d4 ۔
U+06d5 ە
U+06d6 ۖ
U+06d7 ۗ
U+06d8 ۘ
U+06d9 ۙ
U+06da ۚ
U+06db ۛ
U+06dc ۜ
U+06dd ۝
U+06de ۞
U+06df ۟
U+06e0 ۠
U+06e1 ۡ
U+06e2 ۢ
U+06e3 ۣ
U+06e4 ۤ
U+06e5 ۥ
U+06e6 ۦ
U+06e7 ۧ
U+06e8 ۨ
U+06e9 ۩
U+06ea ۪
U+06eb ۫
U+06ec ۬
U+06ed ۭ
U+06ee ۮ
U+06ef ۯ
U+06f0 ۰
U+06f1 ۱
U+06f2 ۲
U+06f3 ۳
U+06f4 ۴
U+06f5 ۵
U+06f6 ۶
U+06f7 ۷
U+06f8 ۸
U+06f9 ۹
U+06fa ۺ
U+06fb ۻ
U+06fc ۼ
U+06fd ۽
U+06fe ۾
U+06ff ۿ
U+00d7 ×
U+00f7 ÷

Hindi

Hindi (Devanagari script) has a number of combined characters that aren't representable as Unicode characters and require OpenType instructions from the .ttf font in order to be correctly displayed.

<Basic Symbols>
<Requisite Special Characters>
U+0901 ँ
U+0902 ं
U+0903 ः
U+0904 ऄ
U+0905 अ
U+0906 आ
U+0907 इ
U+0908 ई
U+0909 उ
U+090a ऊ
U+090b ऋ
U+090c ऌ
U+090d ऍ
U+090e ऎ
U+090f ए
U+0910 ऐ
U+0911 ऑ
U+0912 ऒ
U+0913 ओ
U+0914 औ
U+0915 क
U+0916 ख
U+0917 ग
U+0918 घ
U+0919 ङ
U+091a च
U+091b छ
U+091c ज
U+091d झ
U+091e ञ
U+091f ट
U+0920 ठ
U+0921 ड
U+0922 ढ
U+0923 ण
U+0924 त
U+0925 थ
U+0926 द
U+0927 ध
U+0928 न
U+0929 ऩ
U+092a प
U+092b फ
U+092c ब
U+092d भ
U+092e म
U+092f य
U+0930 र
U+0931 ऱ
U+0932 ल
U+0933 ळ
U+0934 ऴ
U+0935 व
U+0936 श
U+0937 ष
U+0938 स
U+0939 ह
U+093c ़
U+093d ऽ
U+093e ा
U+093f ि
U+0940 ी
U+0941 ु
U+0942 ू
U+0943 ृ
U+0944 ॄ
U+0945 ॅ
U+0946 ॆ
U+0947 े
U+0948 ै
U+0949 ॉ
U+094a ॊ
U+094b ो
U+094c ौ
U+094d ्
U+0950 ॐ
U+0951 ॑
U+0952 ॒
U+0953 ॓
U+0954 ॔
U+0958 क़
U+0959 ख़
U+095a ग़
U+095b ज़
U+095c ड़
U+095d ढ़
U+095e फ़
U+095f य़
U+0960 ॠ
U+0961 ॡ
U+0962 ॢ
U+0963 ॣ
U+0964 ।
U+0965 ॥
U+0966 ०
U+0967 १
U+0968 २
U+0969 ३
U+096a ४
U+096b ५
U+096c ६
U+096d ७
U+096e ८
U+096f ९
U+0970 ॰

Basic Symbols

Possibly U+00a9 © and U+00ae ® should be included in this list.

U+0021 !
U+0022 "
U+0023 #
U+0024 $
U+0025 %
U+0026 &
U+0027 '
U+0028 (
U+0029 )
U+002a *
U+002b +
U+002c ,
U+002d -
U+002e .
U+002f /
U+0030 0
U+0031 1
U+0032 2
U+0033 3
U+0034 4
U+0035 5
U+0036 6
U+0037 7
U+0038 8
U+0039 9
U+003a :
U+003b ;
U+003c <
U+003d =
U+003e >
U+003f ?
U+0040 @
U+005b [
U+005c \
U+005d ]
U+005e ^
U+005f _
U+0060 `
U+007b {
U+007c |
U+007d }
U+007e ~

Extended Symbols

U+00a1 ¡
U+00a2 ¢
U+00a3 £
U+00a4 ¤
U+00a5 ¥
U+00a6 ¦
U+00a7 §
U+00a8 ¨
U+00a9 ©
U+00aa ª
U+00ab «
U+00ac ¬
U+00ad ­
U+00ae ®
U+00af ¯
U+00b0 °
U+00b1 ±
U+00b2 ²
U+00b3 ³
U+00b4 ´
U+00b5 µ
U+00b6 ¶
U+00b7 ·
U+00b8 ¸
U+00b9 ¹
U+00ba º
U+00bb »
U+00bc ¼
U+00bd ½
U+00be ¾
U+00bf ¿
U+00d7 ×
U+00f7 ÷
U+20AC €
U+2018 ‘
U+2019 ’
U+201a ‚
U+201b ‛
U+201c “
U+201d ”
U+201e „
U+201f ‟
U+2026 …
U+2318 ⌘
U+25cf ●

Space Characters

The Unicode standard describes numerous space

U+0009 " " Horizontal Tab
U+0020 " " Space
U+00a0 " " No-Break Space
U+2000 " " En Quad
U+2001 " " Em Quad
U+2002 " " En Space  
U+2003 " " Em Space  
U+2004 " " Three-per-Em Space
U+2005 " " Four-per-Em Space
U+2006 " " Six-per-Em Space
U+2007 " " Figure Space (non-breaking, same size as digits)
U+2008 " " Punctuation Space
U+2009 " " Thin Space  
U+200a " " Hair Space
U+200b "​" Zero Width Space​
U+3000 " " Ideographic Space

Requisite Special Characters

U+0009 " " Horizontal Tab
U+000A " " Linefeed
U+000D " " Return
U+0020 " " Space
U+00a0 " " No-Break Space
U+200B "​" Zero-Width Space
U+200C "‌"Zero-Width Non-Joiner
U+200D "‍" Zero-Width Joiner
U+2026 … (ellipsis)
U+2060 "⁠" Word Joiner
U+25CF ● (password char)

Additional Special Characters

U+000B Vertical Tab
U+001B Escape
U+0085 Next Line (replaces /r/n)
U+2000 En Quad
U+2001 Em Quad
U+2002 En Space
U+2003 Em Space
U+2004 3-Em Space
U+2005 4-Em Space
U+2006 6-Em Space
U+2007 Figure Space
U+2008 Punctuation Space
U+2009 Thin Space
U+200A Hair Space
U+2028 Line Separator
U+2029 Paragraph Separator
U+25CC Dotted Circle
U+3000 Ideographic Space
U+FFFC Object replacement character
U+FFFD Replacement character
U+FEFF Zero-Width No-Break Space

Combining Accents

These can be used to add accents to existing non-accented characters and save font memory. A combining accent is not the same as a standard spacing accent, as with the ASCII ` char. Combining accents have font metrics that cause them to draw to the left of the current position and thus above (or below) the previous character.

U+0300   ̀
U+0301   ́
U+0302   ̂
U+0303   ̃
U+0307   ̇
U+0308   ̈
U+030a   ̊
U+030b   ̋
U+030c   ̌
U+0315   ̕
U+0327   ̧
U+0328   ̨

Full Width English

U+ff01 !
U+ff02 "
U+ff03 #
U+ff04 $
U+ff05 %
U+ff06 &
U+ff07 '
U+ff08 (
U+ff09 )
U+ff0a *
U+ff0b +
U+ff0c ,
U+ff0d -
U+ff0e .
U+ff0f /
U+ff10 0
U+ff11 1
U+ff12 2
U+ff13 3
U+ff14 4
U+ff15 5
U+ff16 6
U+ff17 7
U+ff18 8
U+ff19 9
U+ff1a :
U+ff1b ;
U+ff1c <
U+ff1d =
U+ff1e >
U+ff1f ?
U+ff20 @
U+ff21 A
U+ff22 B
U+ff23 C
U+ff24 D
U+ff25 E
U+ff26 F
U+ff27 G
U+ff28 H
U+ff29 I
U+ff2a J
U+ff2b K
U+ff2c L
U+ff2d M
U+ff2e N
U+ff2f O
U+ff30 P
U+ff31 Q
U+ff32 R
U+ff33 S
U+ff34 T
U+ff35 U
U+ff36 V
U+ff37 W
U+ff38 X
U+ff39 Y
U+ff3a Z
U+ff3b [
U+ff3c \
U+ff3d ]
U+ff3e ^
U+ff3f _
U+ff40 `
U+ff41 a
U+ff42 b
U+ff43 c
U+ff44 d
U+ff45 e
U+ff46 f
U+ff47 g
U+ff48 h
U+ff49 i
U+ff4a j
U+ff4b k
U+ff4c l
U+ff4d m
U+ff4e n
U+ff4f o
U+ff50 p
U+ff51 q
U+ff52 r
U+ff53 s
U+ff54 t
U+ff55 u
U+ff56 v
U+ff57 w
U+ff58 x
U+ff59 y
U+ff5a z
U+ff5b {
U+ff5c |
U+ff5d }
U+ff5e ~
U+ffe1 £
U+ffe4 ¦
U+ffe5 ¥