47 lines
968 B
Plaintext
47 lines
968 B
Plaintext
|
Following is a list of character sets along with their widths:
|
||
|
--------------------------------------------------------------
|
||
|
|
||
|
1 Octet 8bit:
|
||
|
-------------
|
||
|
Windows 125* (CP125*)
|
||
|
CP*
|
||
|
ANSI
|
||
|
ISO-8859-* (IEC-8859-*)
|
||
|
Macintosh (Mac OS Roman)
|
||
|
KOI8-U (potentially KOI*8-*)
|
||
|
KOI8-R
|
||
|
MIK
|
||
|
Cork (T1)
|
||
|
ISCII
|
||
|
VISCII
|
||
|
|
||
|
|
||
|
1 Octet 7bit:
|
||
|
-------------
|
||
|
US-ASCII
|
||
|
K0I7
|
||
|
|
||
|
2 octets 16 bit:
|
||
|
----------------
|
||
|
UCS-2
|
||
|
UTF-16* (UTF-16BE etc)
|
||
|
|
||
|
4-octets 32 bit:
|
||
|
----------------
|
||
|
UCS-4
|
||
|
UTF-32
|
||
|
|
||
|
Variable-width:
|
||
|
----------------------------
|
||
|
Big5 - http://en.wikipedia.org/wiki/Big5 (1-2 bytes: 00-7f=1, 81-fe=2)
|
||
|
HKSCS - http://en.wikipedia.org/wiki/HKSCS (a big5 variant, but some variants use 10646)
|
||
|
ISO-10646 (IEC-10646) - http://en.wikipedia.org/wiki/ISO_10646 (unicode)
|
||
|
UTF-8 (1-5 bytes)
|
||
|
ISO-2022 (IEC-2022) - http://en.wikipedia.org/wiki/ISO_2022
|
||
|
Shift-JIS - http://en.wikipedia.org/wiki/Shift-JIS
|
||
|
|
||
|
A good resource:
|
||
|
----------------
|
||
|
http://en.wikipedia.org/wiki/Character_encoding#Simple_character_sets
|
||
|
|