Unicode anomaly

From Wikipedia, the free encyclopedia
Jump to: navigation, search

The Unicode Standard has imposed for itself strict rules to guarantee stability.[1] This implies that when mistakes against these permanent rules are published, these mistakes cannot be corrected. Depending on the grade of strictness of a rule, a change can be prohibited or allowed. For example, a "Name" given to a code point can not and will not change. But a "Script" property is more flexible, by Unicode's own rules.

Anomalies[edit]

  • U+0818 samaritan mark dagesh and U+0819 samaritan mark occlusion: Names mixed up.
Corrected text, names swapped:
U+0818 samaritan mark occlusion (HTML: ࠘ "strengthens" the consonant, for example changing /w/ to /b/) and
U+0819 samaritan mark dagesh (HTML: ࠙ indicates consonant gemination)[2]
The name says "capital", but it is a small letter. The true capital is U+1D4AB 𝒫 mathematical script capital p (HTML: 𝒫)[3]
  • U+FE18 presentation form for vertical right white lenticular brakcet (HTML: ︘): BRAKCET is spelled wrong. Since this is the fixed Character Name by policy, it cannot be changed.[4]
  • In 2006 Unicode has published a list of anomalies in character names.[5]

Stability policy[edit]

Version 1.0 versus Version 2.0 Names[edit]

In version 2.0, Unicode changed many code point "Names" from version 1. At the same moment, Unicode stated that from then on, an assigned Name to a code point will never change anymore.

References[edit]