Talk:Unicode block

This article has not yet been rated on Wikipedia's content assessment scale.
It is of interest to the following WikiProjects:

Please add the quality rating to the {{WikiProject banner shell}} template instead of this project banner. See WP:PIQA for details.

Computing List‑class Mid‑importance

	This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
List	This article has been rated as List-class on Wikipedia's content assessment scale.
Mid	This article has been rated as Mid-importance on the project's importance scale.

Typography List‑class Mid‑importance

	This article is within the scope of WikiProject Typography, a collaborative effort to improve the coverage of articles related to Typography on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
List	This article has been rated as List-class on Wikipedia's content assessment scale.
Mid	This article has been rated as Mid-importance on the importance scale.

Writing systems List‑class Mid‑importance

	This article falls within the scope of WikiProject Writing systems, a WikiProject interested in improving the encyclopaedic coverage and content of articles relating to writing systems on Wikipedia. If you would like to help out, you are welcome to drop by the project page and/or leave a query at the project’s talk page.
List	This article has been rated as List-class on Wikipedia's content assessment scale.
Mid	This article has been rated as Mid-importance on the project's importance scale.

Proposed moves of unicode block articles

It has been proposed in this section that multiple pages be renamed and moved.

A little more than half of the articles about Unicode character blocks have the qualifier "(Unicode block)"; the others (listed above) do not. I propose to rename the latter so that all of them do. Here are some reasons:

The current names are not descriptive of the subjects. A name like "Control Pictures", "Geometric Shapes", "Mathematical Operators", "Greek and Coptic" or even "Latin Extended-B" gives no clue that the topic of the article is a section of a specific computer character set.
The current names are not specific to the subject of the article. Even names like "Greek Extended" or "Miscellaneous Symbols", which one could infer are about character encodings, could apply to character sets other than Unicode that were in use in the rather recent past and would still deserve articles of their own.
The naming of the Unicode blocks is not consistent. The fact that only half of the articles have the "(Unicode block)" qualifier causes difficulties for editors and potential confusion for readers. For example, an editor intending to link to an article on the history and use of the Braille system may link to Braille Patterns instead by mistake.
The articles violate the Wikipedia standards for titles. Many of the unqualified Unicode block articles are in the plural, have unnecessary capitalizations, or violate the standards in other ways. For example, to satisfy the standards the article Mathematical Operators should be named Mathematical operator. But of course that is not the name of the Unicode block; and it is the name of an article with a very different subject. Adding the qualifier "(Unicode block)" would satisfy the naming standards, besides avoiding confusion.
Those topics are not very notable and are only of specialized and ephemeral technical interest. The division of the Unicode character space into blocks is mostly an artifact of the way the Unicode Consortium discusses, approves, and documents proposals to include characters. It has only tenuous (and often very questionable) connections to the history, usage, or semantics of those characters.
The division is relevant only to those who are interested in the history of Unicode, or who intend to propose new symbols for it.
The division is not relevant to users of Unicode. On the contrary, to find the Unicode code point for a desired glyph, like a special math symbol or a letter with a certain modifier, one should ignore the block division and use Google or some other generic search tool, because one cannot tell which block that symbol has been put into.
The division is not even useful to font designers. While at some point one would find computer fonts that were limited to one or two specific blocks, that has never been a rule, and fonts are increasingly cutting across the Unicode block boundaries.
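The lookup point can be illustrated with a minimal Python sketch. It assumes only the standard `unicodedata` module; the `BLOCKS` table and the helpers `block_of` and `describe` are hypothetical names introduced for this illustration, and the table is a hand-copied three-block subset, not the full block list. Three mathematical symbols are found by character name, and each turns out to sit in a different block:

```python
import unicodedata

# Hand-copied ranges for three blocks only (illustrative subset,
# not the complete list of Unicode blocks).
BLOCKS = [
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x2200, 0x22FF, "Mathematical Operators"),
    (0x27C0, 0x27EF, "Miscellaneous Mathematical Symbols-A"),
]


def block_of(ch):
    """Return the block name for ch, using only the sample table above."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return "(not in this sample table)"


def describe(char_name):
    """Find a character by its Unicode name; report code point and block."""
    ch = unicodedata.lookup(char_name)
    return f"U+{ord(ch):04X}", block_of(ch)


# The character is found by name; the block is known only afterwards.
for name in ("MULTIPLICATION SIGN", "N-ARY SUMMATION", "PERPENDICULAR"):
    print(name, *describe(name))
```

In this sketch the search goes by character name, and the block is derived from the code point only after the character has been found.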

Apparently the names above were assigned without the "(Unicode block)" because it was felt that the qualifier was unnecessary, since there was no other page in Wikipedia with that name. But that is not what "unnecessary" means. Most of the names above have a common-sense meaning that has nothing to do with Unicode; so a qualifier is necessary to differentiate them from those common meanings. If you say "Geometric Shapes", "Number Forms", or "Greek and Coptic" to someone, even to a computer expert, the last thing she will think of is the Unicode block of that name.

Initially the moves will create a redirect from each unqualified name to the corresponding qualified name. I will try to replace all uses of the former by the latter. In some cases, like "Tai Viet" or "Mathematical Operators", the redirect is inappropriate or pointless, in which case it will be deleted or redirected to a more appropriate article. Note that, if one types "Tai Viet" into the search window, the "(Unicode block)" article will be listed anyway as one of the suggested alternatives.

There are also half a dozen cases where the Unicode block article was merged into an article about a language, script, or typographic topic:

Phoenician (Unicode block) • Taixuanjing • Kangxi radical • Dingbat • Halfwidth and fullwidth forms

These merges should be undone, since the article about the Unicode block is supposed to have a lengthy section that documents the history of the block and the relevant Unicode Consortium publications, which do not belong in the articles above. For reference, the following articles are already named with the qualifier:

Adlam (Unicode block) • Aegean Numbers (Unicode block) • Ahom (Unicode block) • Alchemical Symbols (Unicode block) • Anatolian Hieroglyphs (Unicode block) • Ancient Greek Numbers (Unicode block) • Arabic (Unicode block) • Armenian (Unicode block) • Arrows (Unicode block) • Avestan (Unicode block) • Balinese (Unicode block) • Bamum (Unicode block) • Basic Latin (Unicode block) • Bassa Vah (Unicode block) • Batak (Unicode block) • Bengali (Unicode block) • Bhaiksuki (Unicode block) • Bopomofo (Unicode block) • Brahmi (Unicode block) • Buginese (Unicode block) • Buhid (Unicode block) • CJK Strokes (Unicode block) • CJK Unified Ideographs (Unicode block) • Carian (Unicode block) • Caucasian Albanian (Unicode block) • Chakma (Unicode block) • Cham (Unicode block) • Cherokee (Unicode block) • Chess Symbols (Unicode block) • Coptic (Unicode block) • Cuneiform (Unicode block) • Currency Symbols (Unicode block) • Cypriot Syllabary (Unicode block) • Cyrillic (Unicode block) • Deseret (Unicode block) • Devanagari (Unicode block) • Dogra (Unicode block) • Duployan (Unicode block) • Egyptian Hieroglyphs (Unicode block) • Elbasan (Unicode block) • Elymaic (Unicode block) • Emoticons (Unicode block) • Ethiopic (Unicode block) • Georgian (Unicode block) • Glagolitic (Unicode block) • Gothic (Unicode block) • Grantha (Unicode block) • Gujarati (Unicode block) • Gunjala Gondi (Unicode block) • Gurmukhi (Unicode block) • Hangul Jamo (Unicode block) • Hanifi Rohingya (Unicode block) • Hanunoo (Unicode block) • Hatran (Unicode block) • Hebrew (Unicode block) • Hiragana (Unicode block) • Ideographic Description Characters (Unicode block) • Imperial Aramaic (Unicode block) • Indic Siyaq Numbers (Unicode block) • Inscriptional Pahlavi (Unicode block) • Inscriptional Parthian (Unicode block) • Javanese (Unicode block) • Kaithi (Unicode block) • Kanbun (Unicode block) • Kannada (Unicode block) • Katakana (Unicode block) • Kayah Li (Unicode block) • Kharoshthi (Unicode block) • Khmer 
(Unicode block) • Khojki (Unicode block) • Khudawadi (Unicode block) • Lao (Unicode block) • Latin-1 Supplement (Unicode block) • Lepcha (Unicode block) • Limbu (Unicode block) • Linear A (Unicode block) • Lisu (Unicode block) • Lycian (Unicode block) • Lydian (Unicode block) • Mahajani (Unicode block) • Mahjong Tiles (Unicode block) • Makasar (Unicode block) • Malayalam (Unicode block) • Mandaic (Unicode block) • Manichaean (Unicode block) • Marchen (Unicode block) • Masaram Gondi (Unicode block) • Mayan Numerals (Unicode block) • Medefaidrin (Unicode block) • Meetei Mayek (Unicode block) • Mende Kikakui (Unicode block) • Meroitic Cursive (Unicode block) • Meroitic Hieroglyphs (Unicode block) • Miao (Unicode block) • Modi (Unicode block) • Mongolian (Unicode block) • Mro (Unicode block) • Multani (Unicode block) • Musical Symbols (Unicode block) • Myanmar (Unicode block) • NKo (Unicode block) • Nabataean (Unicode block) • Nandinagari (Unicode block) • New Tai Lue (Unicode block) • Newa (Unicode block) • Nushu (Unicode block) • Nyiakeng Puachue Hmong (Unicode block) • Ogham (Unicode block) • Ol Chiki (Unicode block) • Old Hungarian (Unicode block) • Old Italic (Unicode block) • Old North Arabian (Unicode block) • Old Permic (Unicode block) • Old Persian (Unicode block) • Old Sogdian (Unicode block) • Old South Arabian (Unicode block) • Old Turkic (Unicode block) • Optical Character Recognition (Unicode block) • Oriya (Unicode block) • Osage (Unicode block) • Osmanya (Unicode block) • Ottoman Siyaq Numbers (Unicode block) • Pahawh Hmong (Unicode block) • Palmyrene (Unicode block) • Pau Cin Hau (Unicode block) • Phags-pa (Unicode block) • Phaistos Disc (Unicode block) • Psalter Pahlavi (Unicode block) • Rejang (Unicode block) • Runic (Unicode block) • Samaritan (Unicode block) • Saurashtra (Unicode block) • Sharada (Unicode block) • Shavian (Unicode block) • Siddham (Unicode block) • Sogdian (Unicode block) • Sora Sompeng (Unicode block) • Soyombo (Unicode block) • 
Specials (Unicode block) • Sundanese (Unicode block) • Superscripts and Subscripts (Unicode block) • Sutton SignWriting (Unicode block) • Syloti Nagri (Unicode block) • Syriac (Unicode block) • Tagalog (Unicode block) • Tagbanwa (Unicode block) • Tags (Unicode block) • Tai Le (Unicode block) • Tai Tham (Unicode block) • Takri (Unicode block) • Tamil (Unicode block) • Tangut (Unicode block) • Telugu (Unicode block) • Thaana (Unicode block) • Thai (Unicode block) • Tibetan (Unicode block) • Tifinagh (Unicode block) • Tirhuta (Unicode block) • Ugaritic (Unicode block) • Vai (Unicode block) • Variation Selectors (Unicode block) • Wancho (Unicode block) • Warang Citi (Unicode block) • Zanabazar Square (Unicode block)

Jorge Stolfi (talk) 22:01, 1 May 2019 (UTC)

Comment. I think you're on the right track, but taken individually, I would say that while some clearly make sense (e.g. Mathematical Operators), others are a bit questionable (e.g. CJK Unified Ideographs Extension A), and the only justification for adding the parenthetical disambiguator in that case would be for WP:CONSISTENCY. I normally don't favor unnecessary disambiguation, especially not unnecessary parenthetical disambiguation, but I recognize that in rare circumstances it can be a valid solution, and a general rule like this is not without precedent, cf. Wikipedia:Naming conventions (UK Parliament constituencies). I will wait to see what others have to say about this proposal. -- King of ♥ ♦ ♣ ♠ 02:58, 2 May 2019 (UTC)

Agree. Makes sense, and I think this qualifies as a circumstance for which we can have a specific naming convention. Rajanala Samyak (talk) 01:52, 3 May 2019 (UTC)