EBCDIC 1047

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
EBCDIC 1047
MIME / IANAIBM1047
Alias(es)ibm-1047, cp1047[1]
ClassificationEBCDIC
Transforms / EncodesISO 8859-1
Other related encoding(s)EBCDIC 037-2
EBCDIC 924
MIME / IANAIBM00924
Alias(es)CCSID00924, CP00924, ebcdic-Latin9-euro[1]
ClassificationEBCDIC
Transforms / EncodesISO 8859-15

Code page 1047 (CCSID 1047)[2] is an EBCDIC code page with the full Latin-1 character set.[3] It is closely related to both EBCDIC 037-2 (with only two points differing) and EBCDIC 037 (with six points differing), both of which also encode Latin-1.

Code page 924 (CCSID 924) is an update of code page/CCSID 1047 which adds various characters including the euro sign.[4][5] It is an EBCDIC version of Latin-9 (ISO 8859-15).

Code page 1047 layout[edit]

Characters are shown with their Unicode equivalents. Differences from EBCDIC 037 are shown with a heavy border.

EBCDIC Code Page 01047[6][7][8][9]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_ NUL
0000
SOH
0001
STX
0002
ETX
0003
SEL
 
HT
0009
RNL
 
DEL
007F
GE
 
SPS
 
RPT
 
VT
000B
FF
000C
CR
000D
SO
000E
SI
000F
1_ DLE
0010
DC1
0011
DC2
0012
DC3
0013
res/enp
 
NL
0085
BS
0008
POC
 
CAN
0018
EM
0019
UBS
 
CU1
 
IFS
001C
IGS
001D
IRS
001E
ius/itb
001F
2_ DS
 
SOS
 
FS
 
WUS
 
byp/imp
 
LF
000A
ETB
0017
ESC
001B
SA
 
SFE
 
sm/sw
 
CSP
 
MFA
 
ENQ
0005
ACK
0006
BEL
0007
3_ SYN
0016
IR
 
PP
 
TRN
 
NBS
 
EOT
0004
SBS
 
IT
 
RFF
 
CU3
 
DC4
0014
NAK
0015
SUB
001A
4_ SP
0020
NBSP
00A0
â
00E2
ä
00E4
à
00E0
á
00E1
ã
00E3
å
00E5
ç
00E7
ñ
00F1
¢
00A2
.
002E
<
003C
(
0028
+
002B
|
007C
5_ &
0026
é
00E9
ê
00EA
ë
00EB
è
00E8
í
00ED
î
00EE
ï
00EF
ì
00EC
ß
00DF
!
0021
$
0024
*
002A
)
0029
;
003B
^
005E
6_ -
002D
/
002F
Â
00C2
Ä
00C4
À
00C0
Á
00C1
Ã
00C3
Å
00C5
Ç
00C7
Ñ
00D1
¦
00A6
,
002C
%
0025
_
005F
>
003E
?
003F
7_ ø
00F8
É
00C9
Ê
00CA
Ë
00CB
È
00C8
Í
00CD
Î
00CE
Ï
00CF
Ì
00CC
`
0060
:
003A
#
0023
@
0040
'
0027
=
003D
"
0022
8_ Ø
00D8
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
«
00AB
»
00BB
ð
00F0
ý
00FD
þ
00FE
±
00B1
9_ °
00B0
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
p
0070
q
0071
r
0072
ª
00AA
º
00BA
æ
00E6
¸
00B8
Æ
00C6
¤
00A4
A_ µ
00B5
~
007E
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
¡
00A1
¿
00BF
Ð
00D0
[
005B
Þ
00DE
®
00AE
B_ ¬
00AC
£
00A3
¥
00A5
·
00B7
©
00A9
§
00A7

00B6
¼
00BC
½
00BD
¾
00BE
Ý
00DD
¨
00A8
¯
00AF
]
005D
´
00B4
×
00D7
C_ {
007B
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
SHY
00AD
ô
00F4
ö
00F6
ò
00F2
ó
00F3
õ
00F5
D_ }
007D
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
P
0050
Q
0051
R
0052
¹
00B9
û
00FB
ü
00FC
ù
00F9
ú
00FA
ÿ
00FF
E_ \
005C
÷
00F7
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
²
00B2
Ô
00D4
Ö
00D6
Ò
00D2
Ó
00D3
Õ
00D5
F_ 0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
³
00B3
Û
00DB
Ü
00DC
Ù
00D9
Ú
00DA
EO
 

  Letter  Number  Punctuation  Symbol  Other  Undefined   Differences from EBCDIC 037

Code page 1047 translation[edit]

Since CP 01047 contains all of the standard Latin-1 characters, it is possible to translate the character codes from the CP 01047 charset to ISO 8859-1 character codes, so that translation back to the CP 01047 charset is an exact value-preserving round-trip conversion. Likewise, half of the control character codes can be translated into their exact ASCII equivalents. If the remaining EBCDIC-only control characters are translated (arbitrarily) into the remaining unused ASCII codes points (hex 80 to 9F) as well, the resulting translation covers all of the 256 character code points. Such a conversion table (for translating from CP 01047 to ISO 8859-1) is shown below:

CP 1047 → ISO 8859-1
  _0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_ 00 01 02 03 9C 09 86 7F 97 8D 8E 0B 0C 0D 0E 0F
1_ 10 11 12 13 9D 85 08 87 18 19 92 8F 1C 1D 1E 1F
2_ 80 81 82 83 84 0A 17 1B 88 89 8A 8B 8C 05 06 07
3_ 90 91 16 93 94 95 96 04 98 99 9A 9B 14 15 9E 1A
4_ 20 A0 E2 E4 E0 E1 E3 E5 E7 F1 A2 2E 3C 28 2B 7C
5_ 26 E9 EA EB E8 ED EE EF EC DF 21 24 2A 29 3B 5E
6_ 2D 2F C2 C4 C0 C1 C3 C5 C7 D1 A6 2C 25 5F 3E 3F
7_ F8 C9 CA CB C8 CD CE CF CC 60 3A 23 40 27 3D 22
8_ D8 61 62 63 64 65 66 67 68 69 AB BB F0 FD FE B1
9_ B0 6A 6B 6C 6D 6E 6F 70 71 72 AA BA E6 B8 C6 A4
A_ B5 7E 73 74 75 76 77 78 79 7A A1 BF D0 5B DE AE
B_ AC A3 A5 B7 A9 A7 B6 BC BD BE DD A8 AF 5D B4 D7
C_ 7B 41 42 43 44 45 46 47 48 49 AD F4 F6 F2 F3 F5
D_ 7D 4A 4B 4C 4D 4E 4F 50 51 52 B9 FB FC F9 FA FF
E_ 5C F7 53 54 55 56 57 58 59 5A B2 D4 D6 D2 D3 D5
F_ 30 31 32 33 34 35 36 37 38 39 B3 DB DC D9 DA 9F

Code page 924 layout[edit]

Characters are shown with their Unicode equivalents. Differences from EBCDIC 1047 are shown with a heavy border.

EBCDIC Code Page 00924[10][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_ NUL
0000
SOH
0001
STX
0002
ETX
0003
SEL
 
HT
0009
RNL
 
DEL
007F
GE
 
SPS
 
RPT
 
VT
000B
FF
000C
CR
000D
SO
000E
SI
000F
1_ DLE
0010
DC1
0011
DC2
0012
DC3
0013
res/enp
 
NL
0085
BS
0008
POC
 
CAN
0018
EM
0019
UBS
 
CU1
 
IFS
001C
IGS
001D
IRS
001E
ius/itb
001F
2_ DS
 
SOS
 
FS
 
WUS
 
byp/imp
 
LF
000A
ETB
0017
ESC
001B
SA
 
SFE
 
sm/sw
 
CSP
 
MFA
 
ENQ
0005
ACK
0006
BEL
0007
3_ SYN
0016
IR
 
PP
 
TRN
 
NBS
 
EOT
0004
SBS
 
IT
 
RFF
 
CU3
 
DC4
0014
NAK
0015
SUB
001A
4_ SP
0020
NBSP
00A0
â
00E2
ä
00E4
à
00E0
á
00E1
ã
00E3
å
00E5
ç
00E7
ñ
00F1
Ý
00DD
.
002E
<
003C
(
0028
+
002B
|
007C
5_ &
0026
é
00E9
ê
00EA
ë
00EB
è
00E8
í
00ED
î
00EE
ï
00EF
ì
00EC
ß
00DF
!
0021
$
0024
*
002A
)
0029
;
003B
^
005E
6_ -
002D
/
002F
Â
00C2
Ä
00C4
À
00C0
Á
00C1
Ã
00C3
Å
00C5
Ç
00C7
Ñ
00D1
Š
0160
,
002C
%
0025
_
005F
>
003E
?
003F
7_ ø
00F8
É
00C9
Ê
00CA
Ë
00CB
È
00C8
Í
00CD
Î
00CE
Ï
00CF
Ì
00CC
`
0060
:
003A
#
0023
@
0040
'
0027
=
003D
"
0022
8_ Ø
00D8
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
«
00AB
»
00BB
ð
00F0
ý
00FD
þ
00FE
±
00B1
9_ °
00B0
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
p
0070
q
0071
r
0072
ª
00AA
º
00BA
æ
00E6
ž
017E
Æ
00C6

20AC
A_ µ
00B5
~
007E
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
¡
00A1
¿
00BF
Ð
00D0
[
005B
Þ
00DE
®
00AE
B_ ¢
00A2
£
00A3
¥
00A5
·
00B7
©
00A9
§
00A7

00B6
Œ
0152
œ
0153
Ÿ
0178
¬
00AC
š
0161
¯
00AF
]
005D
Ž
017D
×
00D7
C_ {
007B
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
SHY
00AD
ô
00F4
ö
00F6
ò
00F2
ó
00F3
õ
00F5
D_ }
007D
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
P
0050
Q
0051
R
0052
¹
00B9
û
00FB
ü
00FC
ù
00F9
ú
00FA
ÿ
00FF
E_ \
005C
÷
00F7
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
²
00B2
Ô
00D4
Ö
00D6
Ò
00D2
Ó
00D3
Õ
00D5
F_ 0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
³
00B3
Û
00DB
Ü
00DC
Ù
00D9
Ú
00DA
EO
 

  Letter  Number  Punctuation  Symbol  Other  Undefined   Differences from EBCDIC 1047

References[edit]

  1. ^ a b Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
  2. ^ "CCSID 1047 information document". Archived from the original on 2015-06-07.
  3. ^ "Code page 1047 information document". Archived from the original on 2015-06-07.
  4. ^ "Code page 924 information document". Archived from the original on 2016-03-17.
  5. ^ "CCSID 924 information document". Archived from the original on 2016-03-26.
  6. ^ "01047" (PDF). Code pages by CPGID. IBM. Archived from the original (PDF) on 2013-01-21. Retrieved 2020-02-28.
  7. ^ Code Page CPGID 01047 (pdf) (PDF), IBM
  8. ^ Code Page CPGID 01047 (txt), IBM
  9. ^ International Components for Unicode (ICU), ibm-1047_P100-1995.ucm, 2002-12-03
  10. ^ "00924" (PDF). Code pages by CPGID. IBM. Archived from the original (PDF) on 2015-07-08. Retrieved 2020-02-28.
  11. ^ Code Page CPGID 00924 (pdf) (PDF), IBM
  12. ^ Code Page CPGID 00924 (txt), IBM