You are here: Reference > appendix > character sets
Character sets
The character set tells the browser what character encoding needs to be used to encode characters.
To set a character set for a HTML document, use the meta element.
To get the character encoding used for the document, use the charset and characterSet properties.
To set or retrieve a character set for a script block or a linked document, use the charset property.
List of available character sets:
Description | Charset Name | Aliases |
---|---|---|
Arabic (864) | IBM864 | cp864, csIBM864 |
Arabic (DOS) | DOS-720 | |
Arabic (ISO) | iso-8859-6 | iso-ir-127, ISO_8859-6, ISO-8859-6:1987, ECMA-114, ASMO-708, arabic, csISOLatinArabic |
Arabic (Mac) | x-mac-arabic | |
Arabic (Windows) | windows-1256 | |
Baltic (DOS) | ibm775 | cp775, csPC775Baltic |
Baltic (ISO) | iso-8859-4 | iso-ir-110, ISO_8859-4, ISO-8859-4:1988, latin4, l4, csISOLatin4 |
Baltic (Windows) | windows-1257 | |
Central European (DOS) | ibm852 | cp852, 852, csPCp852 |
Central European (ISO) | iso-8859-2 | iso-ir-101, ISO_8859-2, ISO-8859-2:1987, latin2, l2, csISOLatin2 |
Central European (Mac) | x-mac-ce | |
Central European (Windows) | windows-1250 | x-cp1250 |
Chinese Simplified (EUC) | EUC-CN | x-euc-cn |
Chinese Simplified (GB18030) | GB18030 | |
Chinese Simplified (GB2312) | gb2312 | CN-GB, csGB2312, csGB231280, GB_2312-80, GB231280, GB2312-80, GBK |
Chinese Simplified (GB2312-80) | x-cp20936 | iso-ir-58, chinese, csISO58GB231280 |
Chinese Simplified (HZ) | hz-gb-2312 | |
Chinese Simplified (ISO-2022) | x-cp50227 | |
Chinese Simplified (Mac) | x-mac-chinesesimp | |
Chinese Traditional (Big5) | big5 | csBig5, cn-big5, x-x-big5 |
Chinese Traditional (CNS) | x-Chinese-CNS | |
Chinese Traditional (Eten) | x-Chinese-Eten | |
Chinese Traditional (Mac) | x-mac-chinesetrad | |
Croatian (Mac) | x-mac-croatian | |
Cyrillic (DOS) | IBM866 | cp866, 866, csIBM866 |
Cyrillic (ISO | Windows) | iso-8859-5 | windows-1251, iso-ir-144, ISO_8859-5, ISO-8859-5:1988, cyrillic, csISOLatinCyrillic, KOI8-U |
Cyrillic (KOI8-R) | koi8-r | csKOI8R, koi8-r, koi, koi8, koi8r |
Cyrillic (KOI8-U) | koi8-u | koi8-ru |
Cyrillic (Mac) | x-mac-cyrillic | |
Cyrillic (Windows) | windows-1251 | x-cp1251 |
Estonian (ISO) | iso-8859-13 | |
Europa | x-Europa | |
French Canadian (DOS) | IBM863 | cp863, 863, csIBM863 |
German (IA5) | x-IA5-German | |
Greek (DOS) | ibm737 | |
Greek (ISO) | iso-8859-7 | iso-ir-126, ISO_8859-7, ISO-8859-7:1987, ELOT_928, ECMA-118, greek, greek8, csISOLatinGreek |
Greek (Mac) | x-mac-greek | |
Greek (Windows) | windows-1253 | |
Greek, Modern (DOS) | ibm869 | cp869, 869, cp-gr, csIBM869 |
Hebrew (DOS) | DOS-862 | |
Hebrew (ISO-Logical) | iso-8859-8-i | csISO88598I, ISO_8859-8-I, logical |
Hebrew (ISO-Visual) | iso-8859-8 | iso-ir-138, ISO_8859-8, ISO-8859-8:1988, hebrew, csISOLatinHebrew, visual |
Hebrew (Mac) | x-mac-hebrew | |
Hebrew (Windows) | windows-1255 | ISO_8859-8-I, ISO-8859-8 |
IBM EBCDIC (Arabic) | IBM420 | cp420, ebcdic-cp-ar1, csIBM420, x-EBCDIC-Arabic |
IBM EBCDIC (Cyrillic Russian) | IBM880 | cp880, EBCDIC-Cyrillic, csIBM880, x-EBCDIC-CyrillicRussian |
IBM EBCDIC (Cyrillic Serbian-Bulgarian) | cp1025 | x-EBCDIC-CyrillicSerbianBulgarian |
IBM EBCDIC (Denmark-Norway) | IBM277 | EBCDIC-CP-DK, EBCDIC-CP-NO, csIBM277, x-EBCDIC-DenmarkNorway |
IBM EBCDIC (Denmark-Norway-Euro) | IBM01142 | CCSID01142, CP01142, ebcdic-dk-277+euro, ebcdic-no-277+euro, x-ebcdic-denmarknorway-euro |
IBM EBCDIC (Finland-Sweden) | IBM278 | CP278, ebcdic-cp-fi, ebcdic-cp-se, csIBM278, x-EBCDIC-FinlandSweden |
IBM EBCDIC (Finland-Sweden-Euro) | IBM01143 | CCSID01143, CP01143, ebcdic-fi-278+euro, ebcdic-se-278+euro, x-ebcdic-finlandsweden-euro |
IBM EBCDIC (France) | IBM297 | cp297, ebcdic-cp-fr, csIBM297, x-ebcdic-france |
IBM EBCDIC (France-Euro) | IBM01147 | CCSID01147, CP01147, ebcdic-fr-297+euro, x-ebcdic-france-euro |
IBM EBCDIC (Germany) | IBM273 | CP273, csIBM273, x-EBCDIC-Germany |
IBM EBCDIC (Germany-Euro) | IBM01141 | CCSID01141, CP01141, ebcdic-de-273+euro, x-ebcdic-germany-euro |
IBM EBCDIC (Greek Modern) | cp875 | x-EBCDIC-GreekModern |
IBM EBCDIC (Greek) | IBM423 | cp423, ebcdic-cp-gr, csIBM423, x-EBCDIC-Greek |
IBM EBCDIC (Hebrew) | IBM424 | cp424, ebcdic-cp-he, csIBM424, x-EBCDIC-Hebrew |
IBM EBCDIC (Icelandic) | IBM871 | CP871, ebcdic-cp-is, csIBM871, x-EBCDIC-Icelandic |
IBM EBCDIC (Icelandic-Euro) | IBM01149 | CCSID01149, CP01149, ebcdic-is-871+euro, x-ebcdic-icelandic-euro |
IBM EBCDIC (International) | IBM500 | CP500, ebcdic-cp-be, ebcdic-cp-ch, csIBM500, x-ebcdic-international |
IBM EBCDIC (International-Euro) | IBM01148 | CCSID01148, CP01148, ebcdic-international-500+euro, x-ebcdic-international-euro |
IBM EBCDIC (Italy) | IBM280 | CP280, ebcdic-cp-it, csIBM280, x-EBCDIC-Italy |
IBM EBCDIC (Italy-Euro) | IBM01144 | CCSID01144, CP01144, ebcdic-it-280+euro, x-ebcdic-italy-euro |
IBM EBCDIC (Japanese and Japanese Katakana) | x-EBCDIC-JapaneseAndKana | |
IBM EBCDIC (Japanese and Japanese-Latin) | x-EBCDIC-JapaneseAndJapaneseLatin | |
IBM EBCDIC (Japanese and US-Canada) | x-EBCDIC-JapaneseAndUSCanada | |
IBM EBCDIC (Japanese katakana) | IBM290 | cp290, EBCDIC-JP-kana, csIBM290, x-EBCDIC-JapaneseKatakana |
IBM EBCDIC (Korean and Korean Extended) | x-EBCDIC-KoreanAndKoreanExtended | |
IBM EBCDIC (Korean Extended) | x-EBCDIC-KoreanExtended | |
IBM EBCDIC (Multilingual Latin-2) | IBM870 | CP870, ebcdic-cp-roece, ebcdic-cp-yu, csIBM870 |
IBM EBCDIC (Simplified Chinese) | x-EBCDIC-SimplifiedChinese | |
IBM EBCDIC (Spain) | IBM284 | CP284, ebcdic-cp-es, csIBM284, x-EBCDIC-Spain |
IBM EBCDIC (Spain-Euro) | IBM01145 | CCSID01145, CP01145, ebcdic-es-284+euro, x-ebcdic-spain-euro |
IBM EBCDIC (Thai) | IBM-Thai | csIBMThai, x-EBCDIC-Thai |
IBM EBCDIC (Traditional Chinese) | x-EBCDIC-TraditionalChinese | |
IBM EBCDIC (Turkish Latin-5) | IBM1026 | CP1026, csIBM1026 |
IBM EBCDIC (Turkish) | IBM905 | CP905, ebcdic-cp-tr, csIBM905, x-EBCDIC-Turkish |
IBM EBCDIC (UK) | IBM285 | CP285, ebcdic-cp-gb, csIBM285, x-EBCDIC-UK |
IBM EBCDIC (UK-Euro) | IBM01146 | CCSID01146, CP01146, ebcdic-gb-285+euro, x-ebcdic-uk-euro |
IBM EBCDIC (US-Canada) | IBM037 | cp037, ebcdic-cp-us, ebcdic-cp-ca, ebcdic-cp-wt, ebcdic-cp-nl, csIBM037, ebcdic-cp-us |
IBM EBCDIC (US-Canada-Euro) | IBM01140 | CCSID01140, CP01140, ebcdic-us-37+euro, x-ebcdic-cp-us-euro |
IBM Latin-1 | IBM01047 | |
IBM Latin-1-Euro | IBM00924 | CCSID00924, CP00924, ebcdic-Latin9--euro |
IBM5550 Taiwan | x-cp20003 | |
Icelandic (DOS) | ibm861 | cp861, 861, cp-is, csIBM861 |
Icelandic (Mac) | x-mac-icelandic | |
ISCII Assamese | x-iscii-as | |
ISCII Bengali | x-iscii-be | |
ISCII Devanagari | x-iscii-de | |
ISCII Gujarati | x-iscii-gu | |
ISCII Kannada | x-iscii-ka | |
ISCII Malayalam | x-iscii-ma | |
ISCII Oriya | x-iscii-or | |
ISCII Punjabi | x-iscii-pa | |
ISCII Tamil | x-iscii-ta | |
ISCII Telugu | x-iscii-te | |
ISO-6937 | x-cp20269 | |
Japanese (EUC) | euc-jp | csEUCPkdFmtJapanese, Extended_UNIX_Code_Packed_Format_for_Japanese, x-euc, x-euc-jp |
Japanese (JIS-Allow 1 byte Kana) | iso-2022-jp | csISO2022JP, _iso-2022-jp |
Japanese (JIS-Allow 2 byte Kana) | ISO-2022-JP-2 | csISO2022JP2 |
Japanese (Katakana) | JIS_C6220-1969-jp | JIS_C6220-1969, iso-ir-13, katakana, x0201-7, csISO13JISC6220jp |
Japanese (Mac) | x-mac-japanese | |
Japanese (Shift-JIS) | shift_jis | ms_Kanji , csShiftJIS, csWindows31J, shift-jis, x-ms-cp932, x-sjis |
Korean | ks_c_5601-1987 | iso-ir-149, KS_C_5601-1989, KSC_5601, korean, csKSC56011987 |
Korean (EUC) | euc-kr | csEUCKR |
Korean (ISO) | iso-2022-kr | csISO2022KR |
Korean (Johab) | Johab | |
Korean (Mac) | x-mac-korean | |
Korean Wansung | x-cp20949 | |
Latin 3 (ISO) | iso-8859-3 | iso-ir-109, ISO_8859-3, ISO-8859-3:1988, latin3, l3, csISOLatin3 |
Latin 9 (ISO) | iso-8859-15 | ISO_8859-15, Latin-9, l9, csISOLatin9 |
Nordic (DOS) | IBM865 | cp865, 865, csIBM865 |
Norwegian (IA5) | x-IA5-Norwegian | |
OEM Cyrillic | IBM855 | cp855, 855, csIBM855 |
OEM Multilingual Latin I | IBM00858 | CCSID00858, CP00858, PC-Multilingual-850+euro |
OEM United States | IBM437 | cp437, 437, csPC8CodePage437 |
Portuguese (DOS) | IBM860 | cp860, 860, csIBM860 |
Romanian (Mac) | x-mac-romanian | |
Swedish (IA5) | x-IA5-Swedish | |
T.61 | x-cp20261 | |
TCA Taiwan | x-cp20001 | |
TeleText Taiwan | x-cp20004 | |
Thai (Mac) | x-mac-thai | |
Thai (Windows) | windows-874 | DOS-874, iso-8859-11, TIS-620 |
Turkish (DOS) | ibm857 | cp857, 857, csIBM857 |
Turkish (ISO | Windows) | iso-8859-9 | windows-1254, iso-ir-148, ISO_8859-9, ISO-8859-9:1989, latin5, l5, csISOLatin5 |
Turkish (Mac) | x-mac-turkish | |
Ukrainian (Mac) | x-mac-ukrainian | |
Unicode | utf-16 | unicode |
Unicode (UTF-16 Big-Endian) | unicodeFFFE | UTF-16BE |
Unicode (UTF-32 Big-Endian) | utf-32BE | |
Unicode (UTF-32) | utf-32 | |
Unicode (UTF-7) | utf-7 | UNICODE-1-1-UTF-7, csUnicode11UTF7, x-unicode-2-0-utf-7 |
Unicode (UTF-8) | utf-8 | unicode-1-1-utf-8, unicode-2-0-utf-8, x-unicode-2-0-utf-8 |
US-ASCII | us-ascii | iso-ir-6, ANSI_X3.4-1986, ISO_646.irv:1991, ASCII, ISO646-US, us, IBM367, cp367, csASCII |
Vietnamese (Windows) | windows-1258 | |
Wang Taiwan | x-cp20005 | |
Western European (DOS) | ibm850 | cp850, 850, csPC850Multilingual |
Western European (IA5) | x-IA5 | |
Western European (ISO) | iso-8859-1 | iso-ir-100, ISO_8859-1, ISO_8859-1:1987, latin1, l1, IBM819, CP819, csISOLatin1 |
Western European (Mac) | macintosh | mac, csMacintosh |
Western European (Windows) | Windows-1252 |
External links:
User Contributed Comments