47 lines
968 B
Plaintext
47 lines
968 B
Plaintext
Following is a list of character sets along with their widths:
|
|
--------------------------------------------------------------
|
|
|
|
1 Octet 8bit:
|
|
-------------
|
|
Windows 125* (CP125*)
|
|
CP*
|
|
ANSI
|
|
ISO-8859-* (IEC-8859-*)
|
|
Macintosh (Mac OS Roman)
|
|
KOI8-U (potentially KOI*8-*)
|
|
KOI8-R
|
|
MIK
|
|
Cork (T1)
|
|
ISCII
|
|
VISCII
|
|
|
|
|
|
1 Octet 7bit:
|
|
-------------
|
|
US-ASCII
|
|
K0I7
|
|
|
|
2 octets 16 bit:
|
|
----------------
|
|
UCS-2
|
|
UTF-16* (UTF-16BE etc)
|
|
|
|
4-octets 32 bit:
|
|
----------------
|
|
UCS-4
|
|
UTF-32
|
|
|
|
Variable-width:
|
|
----------------------------
|
|
Big5 - http://en.wikipedia.org/wiki/Big5 (1-2 bytes: 00-7f=1, 81-fe=2)
|
|
HKSCS - http://en.wikipedia.org/wiki/HKSCS (a big5 variant, but some variants use 10646)
|
|
ISO-10646 (IEC-10646) - http://en.wikipedia.org/wiki/ISO_10646 (unicode)
|
|
UTF-8 (1-5 bytes)
|
|
ISO-2022 (IEC-2022) - http://en.wikipedia.org/wiki/ISO_2022
|
|
Shift-JIS - http://en.wikipedia.org/wiki/Shift-JIS
|
|
|
|
A good resource:
|
|
----------------
|
|
http://en.wikipedia.org/wiki/Character_encoding#Simple_character_sets
|
|
|