KOI-7

From Wikipedia, the free encyclopedia
Jump to: navigation, search

KOI-7 (КОИ-7) is a 7-bit character encoding, designed to cover Russian, which uses the Cyrillic alphabet.

In Russian, KOI-7 stands for Kod Obmena Informatsiey, 7 bit (Код Обмена Информацией, 7 бит) which means "Code for Information Exchange, 7 bit".[1]

Shift Out (SO) and Shift In (SI) control characters are used in KOI-7, where SO starts printing Russian letters (KOI-7 N1), and SI starts printing Latin letters again (KOI-7 N0), or for lowercase and uppercase switching. This version is also known as KOI7-switched aka csKOI7switched.[2][3]

On ISO 2022 compatible computer terminals KOI7-switched can be activated by the escape sequence ESC ( @ ESC ) N LS0.[2]

KOI-7 was used on machines like the SM EVM (СМ ЭВМ) and DVK (ДВК); KOI-7 N2 was utilized in the machine-language of the Электроника Д3-28 (ru) (Elektronika D3-28) as four-digit hexadecimal code, БЭСМ-6 (ru) (BESM-6), where it was called ВКД, internal data code). The encodings were also used on RSX-11, RT-11 and similar systems.[4]

KOI-7 N0[edit]

KOI-7 N0[2] (КОИ-7 Н0[1]) is identical to the IRV set in ISO 646:1983.[2] Compared to US-ASCII, the dollar sign ("$") at codepoint 24hex) was replaced by the universal currency sign "¤", but this wasn't maintained in all cases, in particular not after the fall of the Iron Curtain. Likewise, the IRV set in ISO/IEC 646:1991 also changed the character back to a dollar sign.

Legend:

KOI-7 N0
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
 
0_
 
NUL
0000
0
SOH
0001
1
STX
0002
2
ETX
0003
3
EOT
0004
4
ENQ
0005
5
ACK
0006
6
BEL
0007
7
BS
0008
8
HT
0009
9
LF
000A
10
VT
000B
11
FF
000C
12
CR
000D
13
SO
000E
14
SI
000F
15
 
1_
 
DLE
0010
16
DC1
0011
17
DC2
0012
18
DC3
0013
19
DC4
0014
20
NAK
0015
21
SYN
0016
22
ETB
0017
23
CAN
0018
24
EM
0019
25
SUB
001A
26
ESC
001B
27
FS
001C
28
GS
001D
29
RS
001E
30
US
001F
31
 
2_
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
¤ ($)
00A4 (0024)
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3_
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4_
 
@
0040
64
A
0041
65
B
0042
66
C
0043
67
D
0044
68
E
0045
69
F
0046
70
G
0047
71
H
0048
72
I
0049
73
J
004A
74
K
004B
75
L
004C
76
M
004D
77
N
004E
78
O
004F
79
 
5_
 
P
0050
80
Q
0051
81
R
0052
82
S
0053
83
T
0054
84
U
0055
85
V
0056
86
W
0057
87
X
0058
88
Y
0059
89
Z
005A
90
[
005B
91
\
005C
92
]
005D
93
^
005E
94
_
005F
95
 
6_
 
`
0060
96
a
0061
97
b
0062
98
c
0063
99
d
0064
100
e
0065
101
f
0066
102
g
0067
103
h
0068
104
i
0069
105
j
006A
106
k
006B
107
l
006C
108
m
006D
109
n
006E
110
o
006F
111
 
7_
 
p
0070
112
q
0071
113
r
0072
114
s
0073
115
t
0074
116
u
0075
117
v
0076
118
w
0077
119
x
0078
120
y
0079
121
z
007A
122
{
007B
123
|
007C
124
}
007D
125
~
007E
126
DEL
007F
127

KOI-7 N1[edit]

KOI-7 N1 (КОИ-7 Н1[1]) was first standardized in GOST 13052,[5][6][7] and later also in ISO 5427.[2] It is sometimes referred to as "koi-0" as well.[6]

Compared to ASCII and ISO 646 uppercase and lowercase letters are swapped in order to make it easier to recognize Russian text when presented using ASCII.[6]

To trim the alphabet into chunks of 32 characters the dotted Ё/ë was dropped.[6] In order to avoid conflicts with ASCII's and ISO 646's definition as DEL and its usage as EOF marker (-1) in some systems, it dropped the "CAPITAL HARD SIGN" Ъ that would have naturally resided at this location.[6]

In a Bulgarian variant the unnecessary Russian "CAPITAL YERY" Ы at codepoint 121 was replaced by the "CAPITAL HARD SIGN" Ъ.[6]

Legend:

KOI-7 N1[7]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
 
0_
 
NUL
0000
0
SOH
0001
1
STX
0002
2
ETX
0003
3
EOT
0004
4
ENQ
0005
5
ACK
0006
6
BEL
0007
7
BS
0008
8
HT
0009
9
LF
000A
10
VT
000B
11
FF
000C
12
CR
000D
13
SO
000E
14
SI
000F
15
 
1_
 
DLE
0010
16
DC1
0011
17
DC2
0012
18
DC3
0013
19
DC4
0014
20
NAK
0015
21
SYN
0016
22
ETB
0017
23
CAN
0018
24
EM
0019
25
SUB
001A
26
ESC
001B
27
FS
001C
28
GS
001D
29
RS
001E
30
US
001F
31
 
2_
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
¤ ($)
00A4 (0024)
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3_
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4_
 
ю
044E
64
а
0430
65
б
0431
66
ц
0446
67
д
0434
68
е
0435
69
ф
0444
70
г
0433
71
х
0445
72
и
0438
73
й
0439
74
к
043A
75
л
043B
76
м
043C
77
н
043D
78
о
043E
79
 
5_
 
п
043F
80
я
044F
81
р
0440
82
с
0441
83
т
0442
84
у
0443
85
ж
0436
86
в
0432
87
ь
044C
88
ы
044B
89
з
0437
90
ш
0448
91
э
044D
92
щ
0449
93
ч
0447
94
ъ
044A
95
 
6_
 
Ю
042E
96
А
0410
97
Б
0411
98
Ц
0426
99
Д
0414
100
Е
0415
101
Ф
0424
102
Г
0413
103
Х
0425
104
И
0418
105
Й
0419
106
К
041A
107
Л
041B
108
М
041C
109
Н
041D
110
О
041E
111
 
7_
 
П
041F
112
Я
042F
113
Р
0420
114
С
0421
115
Т
0422
116
У
0423
117
Ж
0416
118
В
0412
119
Ь
042C
120
Ы (Ъ)
042B (042A)
121
З
0417
122
Ш
0428
123
Э
042D
124
Щ
0429
125
Ч
0427
126
DEL
007F
127

KOI-7 N2[edit]

KOI-7 N2 (КОИ-7 Н2[1]) GOST 19768-74[7]

Kermit names it SHORT-KOI / short-koi.[8][9]

Legend:

KOI-7 N2[7][9]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
 
0_
 
NUL
0000
0
SOH
0001
1
STX
0002
2
ETX
0003
3
EOT
0004
4
ENQ
0005
5
ACK
0006
6
BEL
0007
7
BS
0008
8
HT
0009
9
LF
000A
10
VT
000B
11
FF
000C
12
CR
000D
13
SO
000E
14
SI
000F
15
 
1_
 
DLE
0010
16
DC1
0011
17
DC2
0012
18
DC3
0013
19
DC4
0014
20
NAK
0015
21
SYN
0016
22
ETB
0017
23
CAN
0018
24
EM
0019
25
SUB
001A
26
ESC
001B
27
FS
001C
28
GS
001D
29
RS
001E
30
US
001F
31
 
2_
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
¤[7] ($[9])
00A4 (0024)
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3_
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4_
 
@
0040
64
A
0041
65
B
0042
66
C
0043
67
D
0044
68
E
0045
69
F
0046
70
G
0047
71
H
0048
72
I
0049
73
J
004A
74
K
004B
75
L
004C
76
M
004D
77
N
004E
78
O
004F
79
 
5_
 
P
0050
80
Q
0051
81
R
0052
82
S
0053
83
T
0054
84
U
0055
85
V
0056
86
W
0057
87
X
0058
88
Y
0059
89
Z
005A
90
[
005B
91
\
005C
92
]
005D
93
[disambiguation needed] (^[7][9])
2191 (005E)
94
_
005F
95
 
6_
 
Ю
042E
96
А
0410
97
Б
0411
98
Ц
0426
99
Д
0414
100
Е
0415
101
Ф
0424
102
Г
0413
103
Х
0425
104
И
0418
105
Й
0419
106
К
041A
107
Л
041B
108
М
041C
109
Н
041D
110
О
041E
111
 
7_
 
П
041F
112
Я
042F
113
Р
0420
114
С
0421
115
Т
0422
116
У
0423
117
Ж
0416
118
В
0412
119
Ь
042C
120
Ы
042B
121
З
0417
122
Ш
0428
123
Э
042D
124
Щ
0429
125
Ч
0427
126
DEL
007F
127

See also[edit]

References[edit]

  1. ^ a b c d ГОСТ 27463-87. Системы обработки информации. 7-битные кодированные наборы символов (с Изменением N 1) [GOST 27463-87. Information processing systems. 7-bit coded character sets] (in Russian). Госстандарт СССР. 1987-10-29. Archived from the original on 2016-12-02. Retrieved 2016-12-02. 
  2. ^ a b c d e Sokolov, Michael (2004-01-23). "Charset name: KOI7-switched". International Free Computing Task Force (IFCTF). Archived from the original on 2016-12-02. Retrieved 2016-12-02. 
  3. ^ Freed, Ned; Dürst, Martin, eds. (2013-12-20). "Character Sets". RFC2978. Archived from the original on 2016-12-03. Retrieved 2016-12-03. 
  4. ^ Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". Archived from the original on 2016-12-05. Retrieved 2016-12-05. 
  5. ^ Clews, John (1988). Language Automation Worldwide - The Development of Character Set Standards (1 ed.). Sesame Computer Projects. ISBN 1-87009501-4. ISBN 978-1-87009501-3. 
  6. ^ a b c d e f Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.  [1] [2] [3] [4]
  7. ^ a b c d e f Hohlov, Yu. E. "Cyrillic Information Representation in Electronic Form - Character Set (Code Page) Tables". Archived from the original on 2016-12-05. Retrieved 2016-12-05. 
  8. ^ da Cruz, Frank (2010-04-02). "Kermit and MIME Character-Set Names". The Kermit Project. Columbia University, New York, USA. Archived from the original on 2016-12-02. Retrieved 2016-12-02. 
  9. ^ a b c d "SHORT KOI (KOI-7)". Kermit. Columbia University. Retrieved 2017-02-17. 

Further reading[edit]

  • Kornai, Andras; Birnbaum, David J.; da Cruz, Frank; Davis, Bur; Fowler, George; Paine, Richard B.; Paperno, Slava; Simonsen, Keld J.; Thobe, Glenn E.; Vulis, Dimitri; van Wingen, Johan W. (1993-03-13). "CYRILLIC ENCODING FAQ Version 1.3". 1.3. Retrieved 2017-02-18.