CJK Symbols and Punctuation

From Wikipedia, the free encyclopedia
CJK Symbols and Punctuation
RangeU+3000..U+303F
(64 code points)
PlaneBMP
ScriptsHan (15 char.)
Hangul (2 char.)
Common (43 char.)
Inherited (4 char.)
Assigned64 code points
Unused0 reserved code points
Unicode version history
1.0.0 (1991)56 (+56)
1.0.1 (1992)56 (+0)
1.1 (1993)57 (+1)
3.0 (1999)61 (+4)
3.2 (2002)64 (+3)
Unicode documentation
Code chart ∣ Web page
Note: [1][2]
In Unicode 1.0.1, during the process of unifying with ISO 10646, the "IDEOGRAPHIC DITTO MARK" (仝) was unified with the unified ideograph at U+4EDD, allowing the Japanese Industrial Standard symbol to be moved from U+32FF in the Enclosed CJK Letters and Months block to the vacated code point at U+3004.[3]

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Block[edit]

CJK Symbols and Punctuation[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+300x ID
 SP 
U+301x
U+302x
U+303x  〾 
Notes
1.^ As of Unicode version 15.1

The block has variation sequences defined for East Asian punctuation positional variants.[4][5] They use U+FE00 VARIATION SELECTOR-1 (VS01) and U+FE01 VARIATION SELECTOR-2 (VS02):

Variation sequences for punctuation alignment
U+ 3001 3002 Description
base code point
base + VS01 、︀ 。︀ corner-justified form
base + VS02 、︁ 。︁ centered form

Chinese character[edit]

The CJK Symbols and Punctuation block contains one Chinese character: U+3007 IDEOGRAPHIC NUMBER ZERO. Although it is not covered under "Unified Ideographs", it is treated as a CJK character for all other intents and purposes.[6]

Emoji[edit]

The CJK Symbols and Punctuation block contains two emoji: U+3030 and U+303D.[7][8]

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[9]

Emoji variation sequences
U+ 3030 303D
base code point
base+VS15 (text) 〰︎ 〽︎
base+VS16 (emoji) 〰️ 〽️

History[edit]

In Unicode 1.0.1, two changes were made to this block in order to make Unicode 1.0.1 a proper subset of ISO 10646:[10][11][12]


The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Symbols and Punctuation block:

Version Final code points[a] Count L2 ID WG2 ID IRG ID Document
1.0.0 U+3000..3003, 3005..3037, 303F 56 (to be determined)
L2/11-402 Iancu, Laurențiu (2011-10-20), Proposal to change the General_Category of Hangul tone marks U+302E and U+302F
L2/14-198 N4606 Komatsu, Hiroyuki (2014-08-06), Proposal for the modification of the sample character layout of WAVE_DASH (U+301C)
L2/14-177 Moore, Lisa (2014-10-17), "WAVE_DASH (B.15.3)", UTC #140 Minutes
L2/16-052 N4603 (pdf, doc) Umamaheswaran, V. S. (2015-09-01), "M63.11v", Unconfirmed minutes of WG 2 meeting 63, Reverse the shape of current glyph for 301C WAVE DASH as requested in document N4606
L2/17-056 Lunde, Ken (2017-02-13), Proposal to add standardized variation sequences
L2/17-436 Lunde, Ken (2018-01-21), Proposal to add standardized variation sequences for fullwidth East Asian punctuation
L2/18-039 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Cook, Richard (2018-01-19), "24. Fullwidth East Asian Punctuation", Recommendations to UTC #154 January 2018 on Script Proposals
L2/18-115 Moore, Lisa (2018-05-09), "Consensus 154-C17", UTC #155 Minutes, Add 16 standardized variation sequences based on L2/17-436R, for Unicode 12.0.
1.0.1 U+3004 1 (to be determined)
3.0 U+3038..303A 3 L2/97-017 N1182 N202 Proposal to add 210 KangXi Radicals and 3 HANGZHOU Numbers in BMP for compatibility, 1995-03-23
N1203 Umamaheswaran, V. S.; Ksar, Mike (1995-05-03), "6.1.11", Unconfirmed minutes of SC2/WG2 Meeting 27, Geneva
N1303 (html, doc) Umamaheswaran, V. S.; Ksar, Mike (1996-01-26), Minutes of Meeting 29, Tokyo
L2/97-284 N1629 N486 Zhang, Zhoucai (1997-07-07), Kangxi Radicals and Hangzhou Numerals
L2/97-255R Aliprand, Joan (1997-12-03), "4.B.1 Hangzhou Numerals", Approved Minutes - UTC #73 & L2 #170 joint meeting, Palo Alto, CA - August 4-5, 1997, Motion [#73-M9]: That the UTC concurs with SC2/WG2 Resolution M32.11, and accepts the 3 Hangzhou numeral characters.
L2/98-112 N1629R Zhang, Zhoucai (1998-03-19), Kangxi Radicals, Hangzhou Numerals
L2/98-332 N1923 Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 15, AMENDMENT 15: Kang Xi radicals and CJK radicals supplement, 1998-10-28
L2/99-073 N1968 (html, doc) Summary of Voting on SC 2 N 3213, PDAM ballot on WD for 10646-1/Amd. 15: Kang Xi radicals and CJK radicals supplement, 1999-02-08
L2/99-119 Text for FPDAM ballot of ISO/IEC 10646, Amd. 15 - Kang Xi radicals and CJK radicals supplement, 1999-04-07
L2/99-232 N2003 Umamaheswaran, V. S. (1999-08-03), "6.1.1 PDAM15 - Kang Xi & CJK Radicals", Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15
L2/99-252 N2065 Summary of Voting on SC 2 N 3311, ISO 10646-1/FPDAM 15 - Kang Xi radicals and CJK radicals supplement, 1999-08-19
L2/99-300 N2122 Paterson, Bruce (1999-09-21), Revised Text for FDAM ballot of ISO/IEC 10646-1/FDAM 15, AMENDMENT 15: Kang Xi radicals and CJK radicals supplement
L2/00-044 Summary of FDAM voting: ISO 10646 Amendment 15: Kang Xi radicals and CJK radicals supplement, 2000-01-31
U+303E 1 N1431 N406, N406A Ideographic Variation Mark, 1996-06-27
N1453 Ksar, Mike; Umamaheswaran, V. S. (1996-12-06), "9.7 Ideographic Variation Mark", WG 2 Minutes - Quebec Meeting 31
L2/97-023 N1486 N437 IRG #8 Resolutions, 1997-01-16
N1489 Supplement to Ideographic Components and Composition Schemes, 1997-01-16
N1490 N436 "Response related to N1431 (Ideographic Variation Mark)", Response to WG2 question on Ideographic Structure Characters, 1997-01-16
L2/97-024 N1491 IRG proposal: Ideographic variant character, 1997-01-19
L2/97-030 N1503 (pdf, doc) Umamaheswaran, V. S.; Ksar, Mike (1997-04-01), "9.5", Unconfirmed Minutes of WG 2 Meeting #32, Singapore; 1997-01-20--24
L2/97-114 N1544 (html, doc) N453 Sato, T. K. (1997-04-08), Questions on the "Han structure method" described in WG2 N1490 (IRG N436)
N1678 (pdf, doc) Further explanation on Variation Mark, 1997-12-18
L2/98-100 N1728 Ad-hoc report on ideographic variation indicator, 1998-03-18
L2/98-158 Aliprand, Joan; Winkler, Arnold (1998-05-26), "Ideographic Variation Indicator", Draft Minutes - UTC #76 & NCITS Subgroup L2 #173 joint meeting, Tredyffrin, Pennsylvania, April 20-22, 1998
L2/98-286 N1703 Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "9.3", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
L2/98-321 N1905 Revised text of 10646-1/FPDAM 23, AMENDMENT 23: Bopomofo Extended and other characters, 1998-10-22
3.2 U+303B..303D 3 L2/99-238 Consolidated document containing 6 Japanese proposals, 1999-07-15
N2092 Addition of forty eight characters, 1999-09-13
L2/00-024 Shibano, Kohji (2000-01-31), JCS proposal revised
L2/00-098, L2/00-098-page5 N2195 Rationale for non-Kanji characters proposed by JCS committee, 2000-03-15
L2/00-234 N2203 (rtf, txt) Umamaheswaran, V. S. (2000-07-21), "8.20", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24
L2/00-298 N2258 Sato, T. K. (2000-09-04), JIS X 0213 symbols part-2
L2/00-342 N2278 Sato, T. K.; Everson, Michael; Whistler, Ken; Freytag, Asmus (2000-09-20), Ad hoc Report on Japan feedback N2257 and N2258
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.16 JIS X0213 Symbols", Minutes of the SC2/WG2 meeting in Athens, September 2000
L2/01-114 N2328 Summary of Voting on SC 2 N 3503, ISO/IEC 10646-1: 2000/PDAM 1, 2001-03-09
L2/11-438[b][c] N4182 Edberg, Peter (2011-12-22), Emoji Variation Sequences (Revision of L2/11-429)
  1. ^ Proposed code points and characters names may differ from final code points and names
  2. ^ See also L2/10-458, L2/11-414, L2/11-415, and L2/11-429
  3. ^ Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents

See also[edit]

References[edit]

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  4. ^ Lunde, Ken (2018-01-21). "L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF).
  5. ^ "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
  6. ^ GB/T 15835-2011《出版物上数字用法》. China Guojia Biaozhun. https://journals.usst.edu.cn/uploadfile/file/GBT%2015835-2011%E3%80%8A%E5%87%BA%E7%89%88%E7%89%A9%E4%B8%8A%E6%95%B0%E5%AD%97%E7%94%A8%E6%B3%95%E3%80%8B.pdf
  7. ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
  8. ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
  9. ^ "UTS #51 Emoji Variation Sequences". The Unicode Consortium.
  10. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  11. ^ "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  12. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.