|MIME / IANA||ISO-8859-9|
|Alias(es)||iso-ir-148, latin5, l5, csISOLatin5|
|Standard||TS 5881, ECMA-128, ISO/IEC 8859|
|Classification||ISO 8859 (extended ASCII, ISO 4873 level 1)|
|Based on||ISO/IEC 8859-1|
|Preceded by||ISO/IEC 8859-3|
|Other related encoding(s)||Windows-1254|
ISO/IEC 8859-9:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 9: Latin alphabet No. 5, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1989. It is designated ECMA-128 by Ecma International and TS 5881 as a Turkish standard. It is informally referred to as Latin-5 or Turkish. It was designed to cover the Turkish language, designed as being of more use than the ISO/IEC 8859-3 encoding. It is identical to ISO/IEC 8859-1 except for the replacement of six Icelandic characters (Ðð, Ýý, Þþ) with characters unique to the Turkish alphabet (Ğğ, İ, ı, Şş).
ISO-8859-9 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. In modern applications Unicode and UTF-8 are preferred; authors of new web pages and the designers of new protocols are instructed to use UTF-8 instead. Since August 2019, 0.1% of all web pages use ISO-8859-9, while 1.9% of web pages located in Turkey declare use of ISO-8859-9. However, the WHATWG Encoding Standard, which specifies the character encodings which are permitted in HTML5 and which compliant browsers must support, requires that web pages marked as ISO-8859-9 be handled as Windows-1254, which differs from ISO-8859-9 by using the CR range which ISO-8859-9 reserves for C1 control codes for additional graphical characters instead (analogous to the relationship between ISO-8859-1 and Windows-1252).
Microsoft has assigned code page 28599 a.k.a. Windows-28599 to ISO-8859-9 in Windows. IBM has assigned code page 920 (CCSID 920) to ISO-8859-9. It is published by Ecma International as ECMA-128.
Differences from ISO-8859-1 have the Unicode code point number below the character.
- Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
- "Latin-5: A list of the Latin-5 client and server CCSIDs, which includes Turkey". IBM. Archived from the original on 2022-02-13.
- van Kesteren, Anne. "Names and labels". Encoding Standard. WHATWG.
- "Historical trends in the usage of character encodings for websites". w3techs.com.
- "Frequently Asked Questions". w3techs.com.
- "Distribution of character encodings among websites that use Turkey". w3techs.com.
- "22.214.171.124. Character encodings". HTML 5.1 2nd Edition. W3C.
User agents must support the encodings defined in the WHATWG Encoding standard, including, but not limited to […]
- "Code page 920 information document". Archived from the original on 2017-01-16.
- "CCSID 920 information document". Archived from the original on 2016-03-27.
- Standard ECMA-128: 8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 (2nd ed.). 1999.
This Ecma publication is also approved as ISO 8859-9.
- Code Page CPGID 00920 (pdf) (PDF), IBM
- Code Page CPGID 00920 (txt), IBM
- International Components for Unicode (ICU), ibm-920_P100-1995.ucm, 2002-12-03