Tags (Unicode block)
(128 code points)
|Assigned||97 code points|
|Unused||31 reserved code points
|Unicode version history|
Tags is a Unicode block containing formatting characters.
U+E0001, U+E0020–U+E007E, and U+E007F were originally intended for invisibly tagging texts by language but that use is no longer recommended. All of those characters were deprecated in Unicode 5.1.
With the release of Unicode 8.0, U+E0020–U+E007E are no longer deprecated characters. (U+E0001 LANGUAGE TAG and U+E007F CANCEL TAG remain deprecated.) The change was made "to clear the way for the potential future use of tag characters for a purpose other than to represent language tags". Unicode states that "the use of tag characters to represent language tags in a plain text stream is still a deprecated mechanism for conveying language information about text.
Official Unicode Consortium code chart (PDF)
- "Unicode character database". The Unicode Standard. Retrieved 22 March 2013.
- "RFC2482: Language Tagging in Unicode Plain Text". Network Working Group. January 1999.
- "RFC6082: Deprecating Unicode Language Tag Characters: RFC 2482 is Historic". Internet Engineering Task Force (IETF). November 2010.
- "Unicode 8.0.0, Implications for Migration". Unicode Consortium.
|This writing system-related article is a stub. You can help Wikipedia by expanding it.|