Unicode

ASCII 7 bits (128 characters)
ISO8859 8 bits (256 characters)
Other Encoding: EBCDIC (IBM); JIS, Shift-JIS (Japan); TIS (Thailand), ISCII (India)
Unicode à 16 bits (normal) to 21 bits. 16 bits = 65000 characters
Registered in Unicode Standard: Latin; Cyrillic (Eastern Europe); Arabic, Hebrew (Middle East); Han characters (China, Taiwan, Japan, Korea); Hiragana, Katakana (Japan); Hangul (Korea); Thai, Lao, Khmer, Burmese (South East Asia); Devanagari, Bengali, Tamil, Telugu, Malayalam, Gurmukhi, Punjabi, Sinhala (India, Srilangka)
Unicode – first 6500 characters