  |
IANA: Character Sets - http://www.iana.org/assignments/character-sets
The official names for character sets that may be used in the Internet and referred to in Internet documentation, held at the Internet Assigned Numbers Authority. |
  |
HTML Document Representation - http://www.w3.org/TR/REC-html40/charset.html
Chapter covering document character sets and encodings in HTML from the World Wide Web Consortium's HTML 4.0 Specification. |
  |
World Wide Web Consortium - http://www.w3.org/International/O-charset.html
Covers code tables, Unicode, HTML and XML, with links to other resources. Also discusses internationalization and localization issues relating to character sets. |
  |
ECMA: Character Code Structure and Extension Techniques - http://www.ecma-international.org/publications/standards/ECMA-035.HTM
ECMA-35 specifies the structure of 8-bit and 7-bit codes that provide for the coding of character sets. Includes a detailed PDF document. |
  |
ISO 639 Language Names - http://xml.coverpages.org/iso639a.html
The standard names for use in SGML and XML, including a complete list of language name codes. |
  |
Characters and Encodings - http://www.cs.tut.fi/~jkorpela/chars/
A tutorial on character code issues in digital processing and transfer of text data, on the Internet or otherwise. Includes tables and a detailed listing of control codes. In English and Finnish. |
  |
HTML Validation: Using Character Encodings - http://www.htmlhelp.com/tools/validator/charset.html
How to validate HTML documents in various character encodings. |
  |
EKI Letter Database - http://www.eki.ee/letter/
Query character sets, encodings, code pages and Unicode information through an easy-to-use web form. Held at the Institute of the Estonian Language. |
  |
MS Windows characters in HTML - http://www.cs.tut.fi/~jkorpela/www/windows-chars.html
A review of the HTML authoring problems caused by some special characters that belong to the MS Windows character set but not to ISO Latin 1. Includes technical details and substitution tables. In English and Finnish. |
  |
LangBox International - http://www.langbox.com/
Code tables for ISO 8859-6, ASMO 449 plus, ASMO 708 (Arabic) and ISO 8859-8 (Hebrew), and further information about the company's work in multilingual UNIX. |
  |
An Early History of Character Set Standardization - http://www.cwi.nl/~dik/english/codes/stand.html
Covers the beginnings of the ASCII standards from ASCII-1963 onwards, with information on Cyrillic, Japanese, Korean, Thai and Vietnamese encoding systems, including various localized versions of EBCDIC. With tables and links to other resources. |
  |
ASCII and EBCDIC Compared - http://www.dynamoo.com/technical/ascii-ebcdic.htm
A comparison of these two basic encoding systems, with tables. |
  |
WhatAsciiCode.com - http://www.whatasciicode.com
Quick reference and searchable ASCII code and conversion tables. |
  |
Tutorial: Shady Characters - http://webreference.com/html/tutorial17/
A tutorial from Webreference.com that explains HTML character sets, character encodings and character references. |
  |
Chilkat Charset Conversion Component - http://www.chilkatsoft.com/ChilkatCharset.asp
A character set conversion component for Unicode, Japanese, Chinese, Korean, Cyrillic, Arabic, Hebrew, Thai, Vietnamese and all Western languages. |