MULTEXT
-
Document MRC 1. MtRecode/Character sets.
| MtRecode: Character sets supported
|
In the current version, resource tables are provided for the following character sets:
- ISO 646-IRV (ASCII)
- ISO 8859 series. See the graphic representation of the code tables.
- ISO 8859-1 - Latin 1. Western Europe and Americas: Afrikaans, Basque, Catalan, Danish, Dutch,
English, Faeroese, Finnish, French, Galician, German, Icelandic, Irish,
Italian, Norwegian, Portuguese, Spanish and Swedish.
- ISO 8859-2 Latin 2. Latin-written Slavic and Central European languages: Czech, German,
Hungarian, Polish, Romanian, Croatian, Slovak, Slovene.
- ISO 8859-3 - Latin 3. Esperanto, Galician, Maltese, and Turkish.
- ISO 8859-4 - Latin 4. Scandinavia/Baltic (mostly covered by 8859-1 also): Estonian, Latvian, and
Lithuanian. It is an incomplete predecessor of Latin 6.
- ISO 8859-5 - Cyrillic. Bulgarian, Byelorussian, Macedonian, Russian, Serbian and Ukrainian.
- ISO 8859-6 - Arabic. Non-accented Arabic.
- ISO 8859-7- Modern Greek. Greek.
- ISO 8859-8 - Hebrew. Non-accented Hebrew.
- ISO 8859-9 - Latin 5. Same as 8859-1 except for Turkish instead of Icelandic.
- ISO 8859-10 - Latin 6. Covers the entire Nordic languages.
- Mac roman. The Macintosh base character set.
- EasyFrench This character set has been proposed by François Pinard for easy transcription of the French language with only ISO 646 characters. It uses combinations such as e' for e acute. See the Easy French table.
Resource tables are also provided for SGML entities.
Note the format of the character set mapping tables is the same as the one used by Unicode. You can load additional mapping tables from the Unicode site.
| Top
| Next
| MtRecode
| LPL/CNRS
| MULTEXT
Copyright © Centre National de la Recherche Scientifique, 1996.