Webmaster Tools: codes for setting html charset


HTML charset

A character encoding consists of a code that pairs a set of characters with a set of something else, such as numbers or electrical pulses, in order to facilitate the storage of text in computers and the transmission of text through telecommunication networks.

Some Basic Ones

charset=big5      Chinese Traditional (Big5)
charset=euc-kr   Korean (EUC)
charset=iso-8859-1    Western Alphabet
charset=iso-8859-2     Central European Alphabet (ISO)
charset=iso-8859-3     Latin 3 Alphabet (ISO)
charset=iso-8859-4     Baltic Alphabet (ISO)
charset=iso-8859-5     Cyrillic Alphabet (ISO)
charset=iso-8859-6     Arabic Alphabet (ISO)
charset=iso-8859-7     Greek Alphabet (ISO)
charset=iso-8859-8     Hebrew Alphabet (ISO)
charset=koi8-r           Cyrillic Alphabet (KOI8-R)
charset=shift-jis         Japanese (Shift-JIS)
charset=x-euc           Japanese (EUC)
charset=utf-8            Universal Alphabet (UTF-8)
charset=windows-1250   Central European Alphabet (Windows)
charset=windows-1251   Cyrillic Alphabet (Windows)
charset=windows-1252   Western Alphabet (Windows)
charset=windows-1253   Greek Alphabet (Windows)
charset=windows-1254   Turkish Alphabet      
charset=windows-1255    Hebrew Alphabet (Windows)
charset=windows-1256    Arabic Alphabet (Windows)
charset=windows-1257    Baltic Alphabet (Windows)
charset=windows-1258    Vietnamese Alphabet (Windows)
charset=windows-874      Thai (Windows)

A Longer List

Arabic (ASMO 708) ##charset=ASMO-708
Arabic (DOS) ##charset=DOS-720
Arabic (ISO) ##charset=iso-8859-6
Arabic (Mac) ##charset=x-mac-arabic
Arabic (Windows) ##charset=windows-1256
Baltic (DOS) ##charset=ibm775
Baltic (ISO) ##charset=iso-8859-4
Baltic (Windows) ##charset=windows-1257
Central European (DOS) ##charset=ibm852
Central European (ISO) ##charset=iso-8859-2
Central European (Mac) ##charset=x-mac-ce
Central European (Windows) ##charset=windows-1250
Chinese Simplified (EUC) ##charset=EUC-CN
Chinese Simplified (GB2312) ##charset=gb2312
Chinese Simplified (HZ) ##charset=hz-gb-2312
Chinese Simplified (Mac) ##charset=x-mac-chinesesimp
Chinese Traditional (Big5) ##charset=big5
Chinese Traditional (CNS) ##charset=x-Chinese-CNS
Chinese Traditional (Eten) ##charset=x-Chinese-Eten
Chinese Traditional (Mac) ##charset=x-mac-chinesetrad ##charset=950
Cyrillic (DOS) ##charset=cp866
Cyrillic (ISO) ##charset=iso-8859-5
Cyrillic (KOI8-R) ##charset=koi8-r
Cyrillic (KOI8-U) ##charset=koi8-u
Cyrillic (Mac) ##charset=x-mac-cyrillic
Cyrillic (Windows) ##charset=windows-1251
Europa ##charset=x-Europa
German (IA5) ##charset=x-IA5-German
Greek (DOS) ##charset=ibm737
Greek (ISO) ##charset=iso-8859-7
Greek (Mac) ##charset=x-mac-greek
Greek (Windows) ##charset=windows-1253

Greek, Modern (DOS) ##charset=ibm869
Hebrew (DOS) ##charset=DOS-862
Hebrew (ISO-Logical) ##charset=iso-8859-8-i
Hebrew (ISO-Visual) ##charset=iso-8859-8
Hebrew (Mac) ##charset=x-mac-hebrew
Hebrew (Windows) ##charset=windows-1255
IBM EBCDIC (Arabic) ##charset=x-EBCDIC-Arabic
IBM EBCDIC (Cyrillic Russian) ##charset=x-EBCDIC-CyrillicRussian
IBM EBCDIC (Cyrillic Serbian-Bulgarian) ##charset=x-EBCDIC-CyrillicSerbianBulgarian
IBM EBCDIC (Denmark-Norway) ##charset=x-EBCDIC-DenmarkNorway
IBM EBCDIC (Denmark-Norway-Euro) ##charset=x-ebcdic-denmarknorway-euro
IBM EBCDIC (Finland-Sweden) ##charset=x-EBCDIC-FinlandSweden
IBM EBCDIC (Finland-Sweden-Euro) ##charset=x-ebcdic-finlandsweden-euro
IBM EBCDIC (Finland-Sweden-Euro) ##charset=x-ebcdic-finlandsweden-euro
IBM EBCDIC (France-Euro) ##charset=x-ebcdic-france-euro
IBM EBCDIC (Germany) ##charset=x-EBCDIC-Germany
IBM EBCDIC (Germany-Euro) ##charset=x-ebcdic-germany-euro
IBM EBCDIC (Greek Modern) ##charset=x-EBCDIC-GreekModern
IBM EBCDIC (Greek) ##charset=x-EBCDIC-Greek
IBM EBCDIC (Hebrew) ##charset=x-EBCDIC-Hebrew
IBM EBCDIC (Icelandic) ##charset=x-EBCDIC-Icelandic
IBM EBCDIC (Icelandic-Euro) ##charset=x-ebcdic-icelandic-euro
IBM EBCDIC (International-Euro) ##charset=x-ebcdic-international-euro
IBM EBCDIC (Italy) ##charset=x-EBCDIC-Italy
IBM EBCDIC (Italy-Euro) ##charset=x-ebcdic-italy-euro
IBM EBCDIC (Japanese and Japanese Kaakana) ##charset=x-EBCDIC-JapaneseAndKana
IBM EBCDIC (Japanese and Japanese-Latin) ##charset=x-EBCDIC-JapaneseAndJapaneseLatin
IBM EBCDIC (Japanese and US-Canada) ##charset=x-EBCDIC-JapaneseAndUSCanada
IBM EBCDIC (Japanese katakana) ##charset=x-EBCDIC-JapaneseKatakana
IBM EBCDIC (Korean and Korean Extended) ##charset=x-EBCDIC-KoreanAndKoreanExtended
IBM EBCDIC (Korean Extended) ##charset=x-EBCDIC-KoreanExtended
IBM EBCDIC (Multilingual Latin-2) ##charset=CP870
IBM EBCDIC (Simplified Chinese) ##charset=x-EBCDIC-SimplifiedChinese
IBM EBCDIC (Spain) ##charset=X-EBCDIC-Spain
IBM EBCDIC (Spain-Euro) ##charset=x-ebcdic-spain-euro
IBM EBCDIC (Thai) ##charset=x-EBCDIC-Thai
IBM EBCDIC (Traditional Chinese) ##charset=x-EBCDIC-TraditionalChinese
IBM EBCDIC (Turkish Latin-5) ##charset=CP1026
IBM EBCDIC (Turkish) ##charset=x-EBCDIC-Turkish
IBM EBCDIC (UK) ##charset=x-EBCDIC-UK
IBM EBCDIC (UK-Euro) ##charset=x-ebcdic-uk-euro
IBM EBCDIC (US-Canada) ##charset=ebcdic-cp-us
IBM EBCDIC (US-Canada-Euro) ##charset=x-ebcdic-cp-us-euro
Icelandic (DOS) ##charset=ibm861
Icelandic (Mac) ##charset=x-mac-icelandic
ISCII Assamese ##charset=x-iscii-as
ISCII Bengali ##charset=x-iscii-be
ISCII Devanagari ##charset=x-iscii-de
ISCII Gujarathi ##charset=x-iscii-gu
ISCII Kannada ##charset=x-iscii-ka
ISCII Malayalam ##charset=x-iscii-ma
ISCII Oriya ##charset=x-iscii-or
ISCII Panjabi ##charset=x-iscii-pa
ISCII Tamil ##charset=x-iscii-ta
ISCII Telugu ##charset=x-iscii-te
Japanese (EUC) ##charset=euc-jp ##charset=x-euc-jp
Japanese (JIS) ##charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana - SO/SI) ##charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana) ##charset=csISO2022JP
Japanese (Mac) ##charset=x-mac-japanese
Japanese (Shift-JIS) ##charset=shift_jis
Korean ##charset=ks_c_5601-1987
Korean (EUC) ##charset=euc-kr
Korean (ISO) ##charset=iso-2022-kr
Korean (Johab) ##charset=Johab
Korean (Mac) ##charset=x-mac-korean
Latin 3 (ISO) ##charset=iso-8859-3
Latin 9 (ISO) ##charset=iso-8859-15
Norwegian (IA5) ##charset=x-IA5-Norwegian
OEM United States ##charset=IBM437
Swedish (IA5) ##charset=x-IA5-Swedish
Thai (Windows) ##charset=windows-874
Turkish (DOS) ##charset=ibm857
Turkish (ISO) ##charset=iso-8859-9
Turkish (Mac) ##charset=x-mac-turkish
Turkish (Windows) ##charset=windows-1254
Unicode ##charset=unicode
Unicode (Big-Endian) ##charset=unicodeFFFE
Unicode (UTF-7) ##charset=utf-7
Unicode (UTF-8) ##charset=utf-8
US-ASCII ##charset=us-ascii
Vietnamese (Windows) ##charset=windows-1258
Western European (DOS) ##charset=ibm850
Western European (IA5) ##charset=x-IA5
Western European (ISO) ##charset=iso-8859-1
Western European (Mac) ##charset=macintosh
Western European (Windows) ##charset=Windows-1252