Tags (Unicode block)

From Wikipedia, the free encyclopedia
Tags
Range: U+E0000..U+E007F (128 code points)
Plane: SSP
Scripts: Common
Assigned: 97 code points
Unused: 31 reserved code points
Deprecated: 2 code points
Unicode version history: 3.1: 97 (+97)
Note: [1]

Tags is a Unicode block containing formatting characters.

U+E0001, U+E0020–U+E007E, and U+E007F were originally intended for invisibly tagging texts by language,[2] but that use is no longer recommended.[3] All of those characters were deprecated in Unicode 5.1.

With the release of Unicode 8.0, U+E0020–U+E007E are no longer deprecated. (U+E0001 LANGUAGE TAG and U+E007F CANCEL TAG remain deprecated.) The change was made "to clear the way for the potential future use of tag characters for a purpose other than to represent language tags".[4] Unicode states that "the use of tag characters to represent language tags in a plain text stream is still a deprecated mechanism for conveying language information about text".[4]
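The status of each code point described above can be summarised in a few lines of Python. This is only an illustrative sketch of the ranges given in the text as of Unicode 8.0; the function name `tag_status` and the status labels are hypothetical and not part of any Unicode API.

```python
from collections import Counter

# Bounds of the Tags block, as given in the article.
TAGS_START, TAGS_END = 0xE0000, 0xE007F

def tag_status(cp: int) -> str:
    """Classify a code point per the article's description (Unicode 8.0).

    Hypothetical helper; the labels are chosen for illustration only.
    """
    if not TAGS_START <= cp <= TAGS_END:
        return "outside block"
    if cp in (0xE0001, 0xE007F):
        # LANGUAGE TAG and CANCEL TAG: assigned, but still deprecated.
        return "deprecated"
    if 0xE0020 <= cp <= 0xE007E:
        # Assigned and no longer deprecated since Unicode 8.0.
        return "assigned"
    return "reserved"

# Tally every code point in the block by status.
counts = dict(Counter(tag_status(cp) for cp in range(TAGS_START, TAGS_END + 1)))
print(counts)
```

The tally gives 95 non-deprecated assigned code points, 2 deprecated ones, and 31 reserved ones, which matches the 97 assigned and 31 unused code points out of 128 listed for the block.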

Tags[1][2][3]
Official Unicode Consortium code chart (PDF)
          0     1     2     3     4     5     6     7     8     9     A     B     C     D     E     F
U+E000x         LANG
U+E001x
U+E002x   SP    !     "     #     $     %     &     '     (     )     *     +     ,     -     .     /
U+E003x   0     1     2     3     4     5     6     7     8     9     :     ;     <     =     >     ?
U+E004x   @     A     B     C     D     E     F     G     H     I     J     K     L     M     N     O
U+E005x   P     Q     R     S     T     U     V     W     X     Y     Z     [     \     ]     ^     _
U+E006x   `     a     b     c     d     e     f     g     h     i     j     k     l     m     n     o
U+E007x   p     q     r     s     t     u     v     w     x     y     z     {     |     }     ~     END
Notes
1.^ As of Unicode version 8.0
2.^ Blank cells (grey areas in the rendered table) indicate non-assigned code points
3.^ Unicode code points U+E0001 and U+E0020 through U+E007F were deprecated with Unicode version 5.1; however, as of Unicode version 8.0, only U+E0001 and U+E007F remain deprecated
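
As the chart shows, U+E0020–U+E007E mirror the printable ASCII range U+0020–U+007E at a fixed offset of 0xE0000. The following Python sketch illustrates that arithmetic only; the names `to_tags` and `from_tags` are hypothetical, and the example is not meant as a recommended tagging mechanism.

```python
# Offset between a printable ASCII character and its tag counterpart,
# read off the chart: U+0041 "A" corresponds to U+E0041.
TAG_OFFSET = 0xE0000

def to_tags(text: str) -> str:
    """Map printable ASCII (U+0020..U+007E) to the matching tag characters."""
    out = []
    for ch in text:
        if not 0x20 <= ord(ch) <= 0x7E:
            raise ValueError(f"no tag counterpart for U+{ord(ch):04X}")
        out.append(chr(ord(ch) + TAG_OFFSET))
    return "".join(out)

def from_tags(tagged: str) -> str:
    """Map tag characters U+E0020..U+E007E back to printable ASCII."""
    out = []
    for ch in tagged:
        if not 0xE0020 <= ord(ch) <= 0xE007E:
            raise ValueError(f"U+{ord(ch):04X} is not an ASCII-mirroring tag character")
        out.append(chr(ord(ch) - TAG_OFFSET))
    return "".join(out)

tagged = to_tags("en")
print([f"U+{ord(c):05X}" for c in tagged])  # ['U+E0065', 'U+E006E']
print(from_tags(tagged))                    # en
```

U+E0001 and U+E007F are rejected by `from_tags` in this sketch because they have no printable ASCII counterpart.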

References

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 22 March 2013. 
  2. ^ "RFC2482: Language Tagging in Unicode Plain Text". Network Working Group. January 1999. 
  3. ^ "RFC6082: Deprecating Unicode Language Tag Characters: RFC 2482 is Historic". Internet Engineering Task Force (IETF). November 2010. 
  4. ^ a b "Unicode 8.0.0, Implications for Migration". Unicode Consortium.