Indian Script Code for Information Interchange

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by PrimeBOT (talk | contribs) at 19:01, 23 June 2020 (→‎top: Task 24 - replacement of a template following a TFD). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Indian Script Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Assamese, Bengali (Bangla), Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. ISCII does not encode the writing systems of India based on Persian, but its writing system switching codes nonetheless provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic. The Persian-based writing systems were subsequently encoded in the PASCII encoding.

ISCII has not been widely used outside certain government institutions and has now been rendered largely obsolete by Unicode. Unicode uses a separate block for each Indic writing system, and largely preserves the ISCII layout within each block.

Background

The Brahmi-derived writing systems have similar structure. So ISCII encodes letters with the same phonetic value at the same code point, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki]. This will be rendered as കി in Malayalam, कि in Devanagari, as ਕਿ in Gurmukhi, and as கி in Tamil. The writing system can be selected in rich text by markup or in plain text by means of the ATR code described below.

One motivation for the use of a single encoding is the idea that it will allow easy transliteration from one writing system to another. However, there are enough incompatibilities that this is not really a practical idea. See About ISCII.

ISCII is an 8-bit encoding. The lower 128 code points are plain ASCII, the upper 128 code points are ISCII-specific. In addition to the code points representing characters, ISCII makes use of a code point with mnemonic ATR that indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or end-of-line. Another set of values select display modes such as bold and italic. ISCII does not provide a means of indicating the default writing system.

Codepage layout

The following table shows the character set for Devanagari. The code sets for Assamese, Bengali, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu are similar, with each Devanagari form replaced by the equivalent form in each writing system. Each character is shown with its decimal code and its Unicode equivalent.

ISCII Devanagari
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_
0
Template:Chset-color-ctrl|NUL
0000
Template:Chset-color-ctrl|SOH
0001
Template:Chset-color-ctrl|STX
0002
Template:Chset-color-ctrl|ETX
0003
Template:Chset-color-ctrl|EOT
0004
Template:Chset-color-ctrl|ENQ
0005
Template:Chset-color-ctrl|ACK
0006
Template:Chset-color-ctrl|BEL
0007
Template:Chset-color-ctrl|BS
0008
Template:Chset-color-ctrl|HT
0009
Template:Chset-color-ctrl|LF
000A
Template:Chset-color-ctrl|VT
000B
Template:Chset-color-ctrl|FF
000C
Template:Chset-color-ctrl|CR
000D
Template:Chset-color-ctrl|SO
000E
Template:Chset-color-ctrl|SI
000F
1_
16
Template:Chset-color-ctrl|DLE
0010
Template:Chset-color-ctrl|DC1
0011
Template:Chset-color-ctrl|DC2
0012
Template:Chset-color-ctrl|DC3
0013
Template:Chset-color-ctrl|DC4
0014
Template:Chset-color-ctrl|NAK
0015
Template:Chset-color-ctrl|SYN
0016
Template:Chset-color-ctrl|ETB
0017
Template:Chset-color-ctrl|CAN
0018
Template:Chset-color-ctrl|EM
0019
Template:Chset-color-ctrl|SUB
001A
Template:Chset-color-ctrl|ESC
001B
Template:Chset-color-ctrl|FS
001C
Template:Chset-color-ctrl|GS
001D
Template:Chset-color-ctrl|RS
001E
Template:Chset-color-ctrl|US
001F
2_
32
Template:Chset-color-misc|SP
0020
Template:Chset-color-punct|!
0021
Template:Chset-color-punct|"
0022
Template:Chset-color-punct|#
0023
Template:Chset-color-graph|$
0024
Template:Chset-color-punct|%
0025
Template:Chset-color-punct|&
0026
Template:Chset-color-punct|'
0027
Template:Chset-color-punct|(
0028
Template:Chset-color-punct|)
0029
Template:Chset-color-punct|*
002A
Template:Chset-color-graph|+
002B
Template:Chset-color-punct|,
002C
Template:Chset-color-punct|-
002D
Template:Chset-color-punct|.
002E
Template:Chset-color-punct|/
002F
3_
48
Template:Chset-color-digit|0
0030
Template:Chset-color-digit|1
0031
Template:Chset-color-digit|2
0032
Template:Chset-color-digit|3
0033
Template:Chset-color-digit|4
0034
Template:Chset-color-digit|5
0035
Template:Chset-color-digit|6
0036
Template:Chset-color-digit|7
0037
Template:Chset-color-digit|8
0038
Template:Chset-color-digit|9
0039
Template:Chset-color-punct|:
003A
Template:Chset-color-punct|;
003B
Template:Chset-color-graph|<
003C
Template:Chset-color-graph|=
003D
Template:Chset-color-graph|>
003E
Template:Chset-color-punct|?
003F
4_
64
Template:Chset-color-punct|@
0040
Template:Chset-color-letter|A
0041
Template:Chset-color-letter|B
0042
Template:Chset-color-letter|C
0043
Template:Chset-color-letter|D
0044
Template:Chset-color-letter|E
0045
Template:Chset-color-letter|F
0046
Template:Chset-color-letter|G
0047
Template:Chset-color-letter|H
0048
Template:Chset-color-letter|I
0049
Template:Chset-color-letter|J
004A
Template:Chset-color-letter|K
004B
Template:Chset-color-letter|L
004C
Template:Chset-color-letter|M
004D
Template:Chset-color-letter|N
004E
Template:Chset-color-letter|O
004F
5_
80
Template:Chset-color-letter|P
0050
Template:Chset-color-letter|Q
0051
Template:Chset-color-letter|R
0052
Template:Chset-color-letter|S
0053
Template:Chset-color-letter|T
0054
Template:Chset-color-letter|U
0055
Template:Chset-color-letter|V
0056
Template:Chset-color-letter|W
0057
Template:Chset-color-letter|X
0058
Template:Chset-color-letter|Y
0059
Template:Chset-color-letter|Z
005A
Template:Chset-color-punct|[
005B
Template:Chset-color-punct|\
005C
Template:Chset-color-punct|]
005D
Template:Chset-color-graph|^
005E
Template:Chset-color-punct|_
005F
6_
96
Template:Chset-color-graph|`
0060
Template:Chset-color-letter|a
0061
Template:Chset-color-letter|b
0062
Template:Chset-color-letter|c
0063
Template:Chset-color-letter|d
0064
Template:Chset-color-letter|e
0065
Template:Chset-color-letter|f
0066
Template:Chset-color-letter|g
0067
Template:Chset-color-letter|h
0068
Template:Chset-color-letter|i
0069
Template:Chset-color-letter|j
006A
Template:Chset-color-letter|k
006B
Template:Chset-color-letter|l
006C
Template:Chset-color-letter|m
006D
Template:Chset-color-letter|n
006E
Template:Chset-color-letter|o
006F
7_
112
Template:Chset-color-letter|p
0070
Template:Chset-color-letter|q
0071
Template:Chset-color-letter|r
0072
Template:Chset-color-letter|s
0073
Template:Chset-color-letter|t
0074
Template:Chset-color-letter|u
0075
Template:Chset-color-letter|v
0076
Template:Chset-color-letter|w
0077
Template:Chset-color-letter|x
0078
Template:Chset-color-letter|y
0079
Template:Chset-color-letter|z
007A
Template:Chset-color-punct|{
007B
Template:Chset-color-graph||
007C
Template:Chset-color-punct|}
007D
Template:Chset-color-graph|~
007E
Template:Chset-color-ctrl|DEL
007F
8_
128
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|
9_
144
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|
A_
160
Template:Chset-color-undef| Template:Chset-color-misc|
0901
Template:Chset-color-misc|
0902
Template:Chset-color-misc|
0903
Template:Chset-color-letter|
0905
Template:Chset-color-letter|
0906
Template:Chset-color-letter|
0907
Template:Chset-color-letter|
0908
Template:Chset-color-letter|
0909
Template:Chset-color-letter|
090A
Template:Chset-color-letter|
090B
Template:Chset-color-letter|
090E
Template:Chset-color-letter|
090F
Template:Chset-color-letter|
0910
Template:Chset-color-letter|
090D
Template:Chset-color-letter|
0912
B_
176
Template:Chset-color-letter|
0913
Template:Chset-color-letter|
0914
Template:Chset-color-letter|
0911
Template:Chset-color-letter|
0915
Template:Chset-color-letter|
0916
Template:Chset-color-letter|
0917
Template:Chset-color-letter|
0918
Template:Chset-color-letter|
0919
Template:Chset-color-letter|
091A
Template:Chset-color-letter|
091B
Template:Chset-color-letter|
091C
Template:Chset-color-letter|
091D
Template:Chset-color-letter|
091E
Template:Chset-color-letter|
091F
Template:Chset-color-letter|
0920
Template:Chset-color-letter|
0921
C_
192
Template:Chset-color-letter|
0922
Template:Chset-color-letter|
0923
Template:Chset-color-letter|
0924
Template:Chset-color-letter|
0925
Template:Chset-color-letter|
0926
Template:Chset-color-letter|
0927
Template:Chset-color-letter|
0928
Template:Chset-color-letter|
0929
Template:Chset-color-letter|
092A
Template:Chset-color-letter|
092B
Template:Chset-color-letter|
092C
Template:Chset-color-letter|
092D
Template:Chset-color-letter|
092E
Template:Chset-color-letter|
092F
Template:Chset-color-letter|य़
095F
Template:Chset-color-letter|
0930
D_
208
Template:Chset-color-letter|
0931
Template:Chset-color-letter|
0932
Template:Chset-color-letter|
0933
Template:Chset-color-letter|
0934
Template:Chset-color-letter|
0935
Template:Chset-color-letter|
0936
Template:Chset-color-letter|
0937
Template:Chset-color-letter|
0938
Template:Chset-color-letter|
0939
Template:Chset-color-ctrl|INV
 
Template:Chset-color-misc|
093E
Template:Chset-color-misc|ि
093F
Template:Chset-color-misc|
0940
Template:Chset-color-misc|
0941
Template:Chset-color-misc|
0942
Template:Chset-color-misc|
0943
E_
224
Template:Chset-color-misc|
0946
Template:Chset-color-misc|
0947
Template:Chset-color-misc|
0948
Template:Chset-color-misc|
0945
Template:Chset-color-misc|
094A
Template:Chset-color-misc|
094B
Template:Chset-color-misc|
094C
Template:Chset-color-misc|
0949
Template:Chset-color-misc|
094D
Template:Chset-color-misc|
093C
Template:Chset-color-punct|
0964
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-ctrl |ATR
 
F_
240
Template:Chset-color-ctrl|EXT
 
Template:Chset-color-digit|
0966
Template:Chset-color-digit|
0967
Template:Chset-color-digit|
0968
Template:Chset-color-digit|
0969
Template:Chset-color-digit|
096A
Template:Chset-color-digit|
096B
Template:Chset-color-digit|
096C
Template:Chset-color-digit|
096D
Template:Chset-color-digit|
096E
Template:Chset-color-digit|
096F
Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef| Template:Chset-color-undef|

  Letter  Number  Punctuation  Symbol  Other  Undefined

Special code points

INV character—code point D9 (217)
The INV character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क्‍ (half ka). The Unicode equivalent is U+200D ZERO WIDTH JOINER.
ATR character—code point EF (239)
The ATR character followed by a byte code is used to switch to a different font attribute (such as bold) or language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points.
EXT character—code point F0 (240)
The EXT character followed by a byte code indicates a Vedic accent. This has no direct Unicode equivalent, as Vedic accents are assigned to distinct code points.
Halant character ्—code point E8 (232)
The halant character removes the implicit vowel from a consonant and is used between consonants to represent conjunct consonants. For example, क (ka) + ् (halant) + त (ta) = क्त (kta). The sequence ् (halant) + ् (halant) displays a conjunct with an explicit halant, for example क (ka) + ् (halant) + ् (halant) + त (ta) = क्‌त. The sequence ् (halant) + ़ (nukta) displays a conjunct with half consonants, if available, for example क (ka) + ् (halant) + ़ (nukta) + त (ta) = क्‍त.
ISCII Unicode
single halant E8 halant 094D
halant + halant E8 E8 halant + ZWNJ 094D 200C
halant + nukta E8 E9 halant + ZWJ 094D 200D
Nukta character ़—code point E9 (233)
The nukta character after another ISCII character is used for a number of rarer characters which don't exist in the main ISCII set. For example क (ka) + ़ (nukta) = क़ (qa). These characters have precomposed forms in Unicode, as shown in the following table.
ISCII
code point
Original
character
Character
with nukta
Unicode
code point
A1 (161) 0950
A6 (166) 090C
A7 (167) 0961
AA (176) 0960
B3 (179) क़ 0958
B4 (180) ख़ 0959
B5 (181) ग़ 095A
BA (186) ज़ 095B
BF (191) ड़ 095C
C0 (192) ढ़ 095D
C9 (201) फ़ 095E
DB (219) ि 0962
DC (220) 0963
DF (223) 0944
EA (234) 093D

Code pages for ISCII conversion

To convert from Unicode (UTF-8) to an ISCII / ANSI coding, the following code pages may be used:

  • 57002: Devanagari (Hindi, Marathi, Sanskrit, Konkani)
  • 57003: Bengali
  • 57004: Tamil
  • 57005: Telugu
  • 57006: Assamese
  • 57007: Odia
  • 57008: Kannada
  • 57009: Malayalam
  • 57010: Gujarati
  • 57011: Punjabi (Gurmukhi)

Code points for all language

External links