Character sets supported by Gecko

  • Revision slug: Character_Sets_Supported_by_Gecko
  • Revision title: Character sets supported by Gecko
  • Revision id: 31304
  • Creator: Dria
  • Is current revision? No

Revision Content

This page is still far from finished; at this stage it is only meant to give a general impression of where I'm heading. Because of that, no links from other pages to this one have been added yet.

Introduction

Character set names are used in the header of HTML documents to identify the character set with which the content of a page should be processed.
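
As an illustration of such a declaration, here is a minimal sketch of a page header that names its character set. It assumes the common `meta http-equiv` form; the charset ISO-8859-2 is simply one entry picked from the list on this page, and the title and body text are placeholders. The same name can also be sent by the server in the HTTP `Content-Type` header.

```html
<!DOCTYPE html>
<html>
  <head>
    <!-- Tells the browser which character set to use when decoding this page.
         ISO-8859-2 is only an example value; substitute the set the file is actually saved in. -->
    <meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-2">
    <title>Example page</title>
  </head>
  <body>
    <p>Example content</p>
  </body>
</html>
```

The declared name should match the encoding the file was actually saved in; otherwise characters outside the ASCII range are likely to display incorrectly.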

Supported Character Sets

The following character sets are supported by Mozilla-based browsers:

Charset name    Language
IBM-864         Arabic
ISO-8859-6      Arabic
MacArabic       Arabic
Windows-1256    Arabic
ARMSCII-8       Armenian
ISO-8859-13     Baltic
ISO-8859-4      Baltic
Windows-1257    Baltic
ISO-8859-14     Celtic
IBM-852         Central European
ISO-8859-2      Central European
MacCE           Central European
Windows-1250    Central European
GB18030         Chinese Simplified
GB2312          Chinese Simplified
GBK             Chinese Simplified
HZ              Chinese Simplified
ISO-2022-CN     Chinese Simplified
Big5            Chinese Traditional
Big5-HKSCS      Chinese Traditional
EUC-TW          Chinese Traditional
MacCroatian     Croatian
IBM-855         Cyrillic
ISO-8859-5      Cyrillic
ISO-IR-111      Cyrillic
KOI8-R          Cyrillic
MacCyrillic     Cyrillic
Windows-1251    Cyrillic
CP-866          Cyrillic/Russian
KOI8-U          Cyrillic/Ukrainian
MacUkrainian    Cyrillic/Ukrainian
MacFarsi        Farsi
GEOSTD8         Georgian
ISO-8859-7      Greek
MacGreek        Greek
Windows-1253    Greek
MacGujarati     Gujarati
MacGurmukhi     Gurmukhi
IBM-862         Hebrew
ISO-8859-8-I    Hebrew
MacHebrew       Hebrew
Windows-1255    Hebrew
ISO-8859-8      Hebrew Visual
MacDevanagari   Hindi
MacIcelandic    Icelandic
EUC-JP          Japanese
ISO-2022-JP     Japanese
Shift_JIS       Japanese
EUC-KR          Korean
ISO-2022-KR     Korean
JOHAB           Korean
UHC             Korean
ISO-8859-10     Nordic
ISO-8859-16     Romanian
MacRomanian     Romanian
ISO-8859-3      South European
ISO-8859-11     Thai
TIS-620         Thai
Windows-874     Thai
IBM-857         Turkish
ISO-8859-9      Turkish
MacTurkish      Turkish
Windows-1254    Turkish
UTF-16BE        Unicode
UTF-16LE        Unicode
UTF-16          Unicode
UTF-32BE        Unicode
UTF-32LE        Unicode
UTF-7           Unicode
UTF-8           Unicode
TCVN            Vietnamese
VISCII          Vietnamese
VPS             Vietnamese
Windows-1258    Vietnamese
IBM-850         Western
ISO-8859-1      Western
ISO-8859-15     Western
MacRoman        Western
Windows-1252    Western

Revision Source

<p><span class="comment">This page is still far from finished; at this stage it is only meant to give a general impression of where I'm heading. Because of that, no links from other pages to this one have been added yet.</span>
</p>
<h3 name="Introduction">Introduction</h3>
<p>Character set names are used in the header of HTML documents to identify the character set with which the content of a page should be processed.
</p>
<h3 name="Supported_Character_Sets">Supported Character Sets</h3>
<p>The following character sets are supported by Mozilla-based browsers:
</p>

<table class="standard-table">
  <tbody>
    <tr>
      <td class="header">Charset name</td>
      <td class="header">Language</td>
    </tr>
    <tr> <td>IBM-864</td> <td>Arabic</td> </tr>
    <tr> <td>ISO-8859-6</td> <td>Arabic</td> </tr>
    <tr> <td>MacArabic</td> <td>Arabic</td> </tr>
    <tr> <td>Windows-1256</td> <td>Arabic</td> </tr>
    <tr> <td>ARMSCII-8</td> <td>Armenian</td> </tr>
    <tr> <td>ISO-8859-13</td> <td>Baltic</td> </tr>
    <tr> <td>ISO-8859-4</td> <td>Baltic</td> </tr>
    <tr> <td>Windows-1257</td> <td>Baltic</td> </tr>
    <tr> <td>ISO-8859-14</td> <td>Celtic</td> </tr>
    <tr> <td>IBM-852</td> <td>Central European</td> </tr>
    <tr> <td>ISO-8859-2</td> <td>Central European</td> </tr>
    <tr> <td>MacCE</td> <td>Central European</td> </tr>
    <tr> <td>Windows-1250</td> <td>Central European</td> </tr>
    <tr> <td>GB18030</td> <td>Chinese Simplified</td> </tr>
    <tr> <td>GB2312</td> <td>Chinese Simplified</td> </tr>
    <tr> <td>GBK</td> <td>Chinese Simplified</td> </tr>
    <tr> <td>HZ</td> <td>Chinese Simplified</td> </tr>
    <tr> <td>ISO-2022-CN</td> <td>Chinese Simplified</td> </tr>
    <tr> <td>Big5</td> <td>Chinese Traditional</td> </tr>
    <tr> <td>Big5-HKSCS</td> <td>Chinese Traditional</td> </tr>
    <tr> <td>EUC-TW</td> <td>Chinese Traditional</td> </tr>
    <tr> <td>MacCroatian</td> <td>Croatian</td> </tr>
    <tr> <td>IBM-855</td> <td>Cyrillic</td> </tr>
    <tr> <td>ISO-8859-5</td> <td>Cyrillic</td> </tr>
    <tr> <td>ISO-IR-111</td> <td>Cyrillic</td> </tr>
    <tr> <td>KOI8-R</td> <td>Cyrillic</td> </tr>
    <tr> <td>MacCyrillic</td> <td>Cyrillic</td> </tr>
    <tr> <td>Windows-1251</td> <td>Cyrillic</td> </tr>
    <tr> <td>CP-866</td> <td>Cyrillic/Russian</td> </tr>
    <tr> <td>KOI8-U</td> <td>Cyrillic/Ukrainian</td> </tr>
    <tr> <td>MacUkrainian</td> <td>Cyrillic/Ukrainian</td> </tr>
    <tr> <td>MacFarsi</td> <td>Farsi</td> </tr>
    <tr> <td>GEOSTD8</td> <td>Georgian</td> </tr>
    <tr> <td>ISO-8859-7</td> <td>Greek</td> </tr>
    <tr> <td>MacGreek</td> <td>Greek</td> </tr>
    <tr> <td>Windows-1253</td> <td>Greek</td> </tr>
    <tr> <td>MacGujarati</td> <td>Gujarati</td> </tr>
    <tr> <td>MacGurmukhi</td> <td>Gurmukhi</td> </tr>
    <tr> <td>IBM-862</td> <td>Hebrew</td> </tr>
    <tr> <td>ISO-8859-8-I</td> <td>Hebrew</td> </tr>
    <tr> <td>MacHebrew</td> <td>Hebrew</td> </tr>
    <tr> <td>Windows-1255</td> <td>Hebrew</td> </tr>
    <tr> <td>ISO-8859-8</td> <td>Hebrew Visual</td> </tr>
    <tr> <td>MacDevanagari</td> <td>Hindi</td> </tr>
    <tr> <td>MacIcelandic</td> <td>Icelandic</td> </tr>
    <tr> <td>EUC-JP</td> <td>Japanese</td> </tr>
    <tr> <td>ISO-2022-JP</td> <td>Japanese</td> </tr>
    <tr> <td>Shift_JIS</td> <td>Japanese</td> </tr>
    <tr> <td>EUC-KR</td> <td>Korean</td> </tr>
    <tr> <td>ISO-2022-KR</td> <td>Korean</td> </tr>
    <tr> <td>JOHAB</td> <td>Korean</td> </tr>
    <tr> <td>UHC</td> <td>Korean</td> </tr>
    <tr> <td>ISO-8859-10</td> <td>Nordic</td> </tr>
    <tr> <td>ISO-8859-16</td> <td>Romanian</td> </tr>
    <tr> <td>MacRomanian</td> <td>Romanian</td> </tr>
    <tr> <td>ISO-8859-3</td> <td>South European</td> </tr>
    <tr> <td>ISO-8859-11</td> <td>Thai</td> </tr>
    <tr> <td>TIS-620</td> <td>Thai</td> </tr>
    <tr> <td>Windows-874</td> <td>Thai</td> </tr>
    <tr> <td>IBM-857</td> <td>Turkish</td> </tr>
    <tr> <td>ISO-8859-9</td> <td>Turkish</td> </tr>
    <tr> <td>MacTurkish</td> <td>Turkish</td> </tr>
    <tr> <td>Windows-1254</td> <td>Turkish</td> </tr>
    <tr> <td>UTF-16BE</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-16LE</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-16</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-32BE</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-32LE</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-7</td> <td>Unicode</td> </tr>
    <tr> <td>UTF-8</td> <td>Unicode</td> </tr>
    <tr> <td>TCVN</td> <td>Vietnamese</td> </tr>
    <tr> <td>VISCII</td> <td>Vietnamese</td> </tr>
    <tr> <td>VPS</td> <td>Vietnamese</td> </tr>
    <tr> <td>Windows-1258</td> <td>Vietnamese</td> </tr>
    <tr> <td>IBM-850</td> <td>Western</td> </tr>
    <tr> <td>ISO-8859-1</td> <td>Western</td> </tr>
    <tr> <td>ISO-8859-15</td> <td>Western</td> </tr>
    <tr> <td>MacRoman</td> <td>Western</td> </tr>
    <tr> <td>Windows-1252</td> <td>Western</td> </tr>
  </tbody>
</table>