
Difference between revisions of "CharacterEncodings"


Revision as of 09:52, 27 February 2014

UBIK Character Encodings

UBIK can import and export text files (CSV, etc.) in different character encodings (http://en.wikipedia.org/wiki/Character_encoding). The encoding is selected by entering its name (e.g. "iso-8859-1"); the supported names are listed in the following table.

Name Description
IBM037 IBM EBCDIC (US-Canada)
IBM437 OEM United States
IBM500 IBM EBCDIC (International)
ASMO-708 Arabic (ASMO 708)
DOS-720 Arabic (DOS)
ibm737 Greek (DOS)
ibm775 Baltic (DOS)
ibm850 Western European (DOS)
ibm852 Central European (DOS)
IBM855 OEM Cyrillic
ibm857 Turkish (DOS)
IBM00858 OEM Multilingual Latin I
IBM860 Portuguese (DOS)
ibm861 Icelandic (DOS)
DOS-862 Hebrew (DOS)
IBM863 French Canadian (DOS)
IBM864 Arabic (864)
IBM865 Nordic (DOS)
cp866 Cyrillic (DOS)
ibm869 Greek, Modern (DOS)
IBM870 IBM EBCDIC (Multilingual Latin-2)
windows-874 Thai (Windows)
cp875 IBM EBCDIC (Greek Modern)
shift_jis Japanese (Shift-JIS)
gb2312 Chinese Simplified (GB2312)
ks_c_5601-1987 Korean
big5 Chinese Traditional (Big5)
IBM1026 IBM EBCDIC (Turkish Latin-5)
IBM01047 IBM Latin-1
IBM01140 IBM EBCDIC (US-Canada-Euro)
IBM01141 IBM EBCDIC (Germany-Euro)
IBM01142 IBM EBCDIC (Denmark-Norway-Euro)
IBM01143 IBM EBCDIC (Finland-Sweden-Euro)
IBM01144 IBM EBCDIC (Italy-Euro)
IBM01145 IBM EBCDIC (Spain-Euro)
IBM01146 IBM EBCDIC (UK-Euro)
IBM01147 IBM EBCDIC (France-Euro)
IBM01148 IBM EBCDIC (International-Euro)
IBM01149 IBM EBCDIC (Icelandic-Euro)
utf-16 Unicode
unicodeFFFE Unicode (Big endian)
windows-1250 Central European (Windows)
windows-1251 Cyrillic (Windows)
Windows-1252 Western European (Windows)
windows-1253 Greek (Windows)
windows-1254 Turkish (Windows)
windows-1255 Hebrew (Windows)
windows-1256 Arabic (Windows)
windows-1257 Baltic (Windows)
windows-1258 Vietnamese (Windows)
Johab Korean (Johab)
macintosh Western European (Mac)
x-mac-japanese Japanese (Mac)
x-mac-chinesetrad Chinese Traditional (Mac)
x-mac-korean Korean (Mac)
x-mac-arabic Arabic (Mac)
x-mac-hebrew Hebrew (Mac)
x-mac-greek Greek (Mac)
x-mac-cyrillic Cyrillic (Mac)
x-mac-chinesesimp Chinese Simplified (Mac)
x-mac-romanian Romanian (Mac)
x-mac-ukrainian Ukrainian (Mac)
x-mac-thai Thai (Mac)
x-mac-ce Central European (Mac)
x-mac-icelandic Icelandic (Mac)
x-mac-turkish Turkish (Mac)
x-mac-croatian Croatian (Mac)
utf-32 Unicode (UTF-32)
utf-32BE Unicode (UTF-32 Big endian)
x-Chinese-CNS Chinese Traditional (CNS)
x-cp20001 TCA Taiwan
x-Chinese-Eten Chinese Traditional (Eten)
x-cp20003 IBM5550 Taiwan
x-cp20004 TeleText Taiwan
x-cp20005 Wang Taiwan
x-IA5 Western European (IA5)
x-IA5-German German (IA5)
x-IA5-Swedish Swedish (IA5)
x-IA5-Norwegian Norwegian (IA5)
us-ascii US-ASCII
x-cp20261 T.61
x-cp20269 ISO-6937
IBM273 IBM EBCDIC (Germany)
IBM277 IBM EBCDIC (Denmark-Norway)
IBM278 IBM EBCDIC (Finland-Sweden)
IBM280 IBM EBCDIC (Italy)
IBM284 IBM EBCDIC (Spain)
IBM285 IBM EBCDIC (UK)
IBM290 IBM EBCDIC (Japanese katakana)
IBM297 IBM EBCDIC (France)
IBM420 IBM EBCDIC (Arabic)
IBM423 IBM EBCDIC (Greek)
IBM424 IBM EBCDIC (Hebrew)
x-EBCDIC-KoreanExtended IBM EBCDIC (Korean Extended)
IBM-Thai IBM EBCDIC (Thai)
koi8-r Cyrillic (KOI8-R)
IBM871 IBM EBCDIC (Icelandic)
IBM880 IBM EBCDIC (Cyrillic Russian)
IBM905 IBM EBCDIC (Turkish)
IBM00924 IBM Latin-1
EUC-JP Japanese (JIS 0208-1990 and 0212-1990)
x-cp20936 Chinese Simplified (GB2312-80)
x-cp20949 Korean Wansung
cp1025 IBM EBCDIC (Cyrillic Serbian-Bulgarian)
koi8-u Cyrillic (KOI8-U)
iso-8859-1 Western European (ISO)
iso-8859-2 Central European (ISO)
iso-8859-3 Latin 3 (ISO)
iso-8859-4 Baltic (ISO)
iso-8859-5 Cyrillic (ISO)
iso-8859-6 Arabic (ISO)
iso-8859-7 Greek (ISO)
iso-8859-8 Hebrew (ISO-Visual)
iso-8859-9 Turkish (ISO)
iso-8859-13 Estonian (ISO)
iso-8859-15 Latin 9 (ISO)
x-Europa Europa
iso-8859-8-i Hebrew (ISO-Logical)
iso-2022-jp Japanese (JIS)
csISO2022JP Japanese (JIS-Allow 1 byte Kana)
iso-2022-jp Japanese (JIS-Allow 1 byte Kana - SO/SI)
iso-2022-kr Korean (ISO)
x-cp50227 Chinese Simplified (ISO-2022)
euc-jp Japanese (EUC)
EUC-CN Chinese Simplified (EUC)
euc-kr Korean (EUC)
hz-gb-2312 Chinese Simplified (HZ)
GB18030 Chinese Simplified (GB18030)
x-iscii-de ISCII Devanagari
x-iscii-be ISCII Bengali
x-iscii-ta ISCII Tamil
x-iscii-te ISCII Telugu
x-iscii-as ISCII Assamese
x-iscii-or ISCII Oriya
x-iscii-ka ISCII Kannada
x-iscii-ma ISCII Malayalam
x-iscii-gu ISCII Gujarati
x-iscii-pa ISCII Punjabi
utf-7 Unicode (UTF-7)
utf-8 Unicode (UTF-8)
ANSI Uses the ANSI code page defined in the Windows system locale setting
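
To illustrate why the encoding name matters when importing and exporting CSV text, here is a minimal sketch in Python. It is not UBIK code: the helper names resolve_encoding, export_csv and import_csv are hypothetical, and Python's codec registry is only assumed to behave similarly for common names such as "iso-8859-1" and "utf-8". It does not recognize every name in the table above.

```python
import codecs
import csv
import io


def resolve_encoding(name: str) -> str:
    """Return Python's canonical codec name for `name` (hypothetical helper)."""
    try:
        return codecs.lookup(name).name
    except LookupError as exc:
        raise ValueError(f"unknown character encoding: {name!r}") from exc


def export_csv(rows, encoding: str) -> bytes:
    """Serialize rows as CSV text, then encode the text with the named encoding."""
    codec = resolve_encoding(encoding)
    buffer = io.StringIO(newline="")
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue().encode(codec)


def import_csv(data: bytes, encoding: str):
    """Decode bytes with the named encoding, then parse the text as CSV."""
    codec = resolve_encoding(encoding)
    text = data.decode(codec)
    return list(csv.reader(io.StringIO(text, newline="")))


rows = [["name", "city"], ["Jürgen", "Köln"]]

# The same rows produce different bytes depending on the encoding name.
latin1_bytes = export_csv(rows, "iso-8859-1")
utf8_bytes = export_csv(rows, "utf-8")

# Importing with the matching name restores the original rows.
restored = import_csv(latin1_bytes, "iso-8859-1")
```

In this sketch, the file must be imported with the same encoding name that was used for the export. Decoding the ISO-8859-1 bytes as "utf-8" fails because the bytes for "ü" and "ö" are not valid UTF-8 sequences.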