ANSEL

From Wikipedia, the free encyclopedia
Jump to: navigation, search

ANSEL, The American National Standard for Extended Latin Alphabet Coded Character Set for Bibliographic Use, was a character set used in text encoding. It provided a table of coded values for the representation of characters of the extended Latin alphabet in machine-readable form for thirty-five languages written in the Latin alphabet and for fifty-one romanized languages. The standard was reaffirmed in 2003 although it has been administratively withdrawn by ANSI effective 14 February 2013.[1] It is registered as Registration # 231 in the ISO International Register of Coded Character Sets to be Used with Escape Sequences.[2][3]

ANSEL is composed of a set of 63 graphic characters intended for use with ASCII, the American National Standard Code for Information Interchange, ANSI X3.4-1986,[3] including 29 combining diacritic characters. A combining diacritic character precedes the spacing character on which it should be superimposed.[1]

Code page layout[edit]

The following table shows ANSI/NISO Z39.47-1993 (R2003).[1] Each character is shown with its Unicode equivalent.

ANSI/NISO Z39.47-1993 (R2003)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
 
0_
 
NUL
0000
0
SOH
0001
1
STX
0002
2
ETX
0003
3
EOT
0004
4
ENQ
0005
5
ACK
0006
6
BEL
0007
7
BS
0008
8
HT
0009
9
LF
000A
10
VT
000B
11
FF
000C
12
CR
000D
13
SO
000E
14
SI
000F
15
 
1_
 
DLE
0010
16
DC1
0011
17
DC2
0012
18
DC3
0013
19
DC4
0014
20
NAK
0015
21
SYN
0016
22
ETB
0017
23
CAN
0018
24
EM
0019
25
SUB
001A
26
ESC
001B
27
FS
001C
28
GS
001D
29
RS
001E
30
US
001F
31
 
2_
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
$
0024
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3_
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4_
 
@
0040
64
A
0041
65
B
0042
66
C
0043
67
D
0044
68
E
0045
69
F
0046
70
G
0047
71
H
0048
72
I
0049
73
J
004A
74
K
004B
75
L
004C
76
M
004D
77
N
004E
78
O
004F
79
 
5_
 
P
0050
80
Q
0051
81
R
0052
82
S
0053
83
T
0054
84
U
0055
85
V
0056
86
W
0057
87
X
0058
88
Y
0059
89
Z
005A
90
[
005B
91
\
005C
92
]
005D
93
^
005E
94
_
005F
95
 
6_
 
`
0060
96
a
0061
97
b
0062
98
c
0063
99
d
0064
100
e
0065
101
f
0066
102
g
0067
103
h
0068
104
i
0069
105
j
006A
106
k
006B
107
l
006C
108
m
006D
109
n
006E
110
o
006F
111
 
7_
 
p
0070
112
q
0071
113
r
0072
114
s
0073
115
t
0074
116
u
0075
117
v
0076
118
w
0077
119
x
0078
120
y
0079
121
z
007A
122
{
007B
123
|
007C
124
}
007D
125
~
007E
126
DEL
007F
127
 
8_
 
 

128
 

129
 

130
 

131
 

132
 

133
 

134
 

135
 

136
 

137
 

138
 

139
 

140
 

141
 

142
 

143
 
9_
 
 

144
 

145
 

146
 

147
 

148
 

149
 

150
 

151
 

152
 

153
 

154
 

155
 

156
 

157
 

158
 

159
 
A_
 
 

160
Ł
0141
161
Ø
00D8
162
Đ
0110
163
Þ
00DE
164
Æ
00C6
165
Œ
0152
166
ʹ
02B9
167
·
00B7
168

266D
169
®
00AE
170
±
00B1
171
Ơ
01A0
172
Ư
01AF
173
ʼ
02BC
174
 

175
 
B_
 
ʻ
02BB
176
ł
0142
177
ø
00F8
178
đ
0111
179
þ
00FE
180
æ
00E6
181
œ
0153
182
ʺ
02BA
183
ı
0131
184
£
00A3
185
ð
00F0
186
 

187
ơ
01A1
188
ư
01B0
189
 

190
 

191
 
C_
 
°
00B0
192

2113
193

2117
194
©
00A9
195

266F
196
¿
00BF
197
¡
00A1
198
 

199
 

200
 

201
 

202
 

203
 

204
 

205
 

206
 

207
 
D_
 
 

208
 

209
 

210
 

211
 

212
 

213
 

214
 

215
 

216
 

217
 

218
 

219
 

220
 

221
 

222
 

223
 
E_
 
̉ 
0309
224
̀ 
0300
225
́ 
0301
226
̂ 
0302
227
̃ 
0303
228
̄ 
0304
229
̆ 
0306
230
̇ 
0307
231
̈ 
0308
232
̌ 
030C
233
̊ 
030A
234
︠ 
FE20
235
︡ 
FE21
236
̕ 
0315
237
̋ 
030B
238
̐ 
0310
239
 
F_
 
̧ 
0327
240
̨ 
0328
241
̣ 
0323
242
̤ 
0324
243
̥ 
0325
244
̳ 
0333
245
̲ 
0332
246
̦ 
0326
247
̜ 
031C
248
̮ 
032E
249
︢ 
FE22
250
︣ 
FE23
251
 

252
 

253
̓ 
0313
254
 

255
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F

Use[edit]

GEDCOM[edit]

The GEDCOM specification for exchanging genealogical data refers to ANSEL (ANSI/NISO Z39.47-1985) as a valid text encoding for GEDCOM files and extends it with additional characters which are shown in the following table.[4][5]

Hex Unicode Glyph Description
0xBE 25A1 empty box
0xBF 25A0 black box
0xCD 0065 e midline e
0xCE 006F o midline o
0xCF 00DF ß es zet
0xFC 0338  ̸  diacritic slash through char

MARC21[edit]

The MARC 21 Extended Latin character set is synchronized with ANSEL[3] but additionally supports the euro sign (€) and eszett (ß) characters.[6]

References[edit]

  1. ^ a b c "ANSI/NISO Z39.47-1993 (R2003) Extended Latin Alphabet Coded Character Set for Bibliographic Use (ANSEL)". 
  2. ^ "ANSI Z39.47-1993 (R2003)". 
  3. ^ a b c "International Register Of Coded Character Sets To Be Used With Escape Sequences". 
  4. ^ The Church of Jesus Christ of Latter-day Saints (2 December 1995). "GEDCOM 5.5". pp. 87–89. 
  5. ^ The Church of Jesus Christ of Latter-day Saints (4 November 1993). GEDCOM 5.3 (Technical report). pp. 67–72. 
  6. ^ "MARC 21 Code Table Extended Latin (ANSEL)". The Library of Congress. December 2007. 

External links[edit]