Code page 932 (IBM)

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

IBM code page 932 (abbreviated as IBM-932[1] or ambiguously as CP932) is one of IBM's extensions of Shift JIS. The coded character sets are JIS X 0201:1976, JIS X 0208:1983,[1] IBM extensions and IBM extensions for IBM 1880 UDC. It is the combination of the single-byte Code page 897 and the double-byte Code page 301.[2]

IBM-932 resembles IBM-943. One difference is that IBM-932 encodes the JIS X 0208:1983 characters but preserves the 1978 ordering, whereas IBM-943 uses the 1983 ordering[1] (i.e. the character variant swaps made in JIS X 0208:1983). Another difference is that IBM-932 does not incorporate the NEC selected extensions, which IBM-943 includes for Microsoft compatibility.[1]

IBM-942 includes the same double-byte codes as IBM-932 (those from Code page 301) but includes additional single-byte extensions. International Components for Unicode treats "ibm-932" and "ibm-942" as aliases for the same decoder.[3]

IBM-932 contains 7-bit ISO 646 codes, and Japanese characters are indicated by the high bit of the first byte being set to 1. Some code points in this page require a second byte, so characters use either 8 or 16 bits for encoding.

Layout[edit]

First byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2 ! " # $ % & ' ( ) * + , - . /
3 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4 @ A B C D E F G H I J K L M N O
5 P Q R S T U V W X Y Z [ ¥ ] ^ _
6 ` a b c d e f g h i j k l m n o
7 p q r s t u v w x y z { | }
8
9
A
B ソ
C
D
E
F
Second byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
 
Non printable ASCII character
Unaltered ASCII character
Modified ASCII character
Single-byte half-width katakana
First byte of a double-byte character, used by JIS X 0208
Not used as first byte, unallocated space in JIS X 0208
First byte of a double-byte IBM extension character
First byte of a double-byte IBM-designated user defined character
Not used as first byte
Second byte of a double-byte character whose first half of the JIS sequence was odd
Second byte of a double-byte character whose first half of the JIS sequence was even
Unused as second byte of a double-byte character


See also[edit]

References[edit]

  1. ^ a b c d "IBM-943 and IBM-932". IBM Knowledge Center. IBM.
  2. ^ "Coded character set identifiers - CCSID 932". IBM Globalization. IBM.
  3. ^ "Converter Explorer: ibm-942_P12A-1999". ICU Demonstration. International Components for Unicode.

External links[edit]