
Tamil Script Code for Information Interchange

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by PrimeBOT (talk | contribs) at 19:04, 23 June 2020 (→‎top: Task 24 - replacement of a template following a TFD). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Tamil Script Code for Information Interchange (TSCII) is a coding scheme for representing the Tamil script. The lower 128 code points are plain ASCII; the upper 128 code points are TSCII-specific. After many years of use on the Internet by private agreement only, it was registered with the IANA in 2007.[1]

TSCII encodes characters in visual (written) order, paralleling the way they are typed on a Tamil typewriter.

Unicode, by contrast, encodes Tamil in logical order, following ISCII. This differs from its treatment of Thai, for which it adopted the visual-order encoding inherited from TIS-620.
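The difference between the two orders can be illustrated with a short Python sketch. It is not a complete converter: the table TSCII_SAMPLE holds only a handful of bytes copied from the code page layout in this article, and the names TSCII_SAMPLE, PREFIX_SIGNS and tscii_to_logical are made up for illustration. It assumes that the only reordering needed is moving a vowel sign written to the left of a consonant so that it follows that consonant, which is a simplification.

```python
# Partial TSCII-to-Unicode table (illustrative only). The entries are copied
# from a few cells of the code page layout; a real converter needs all of them.
TSCII_SAMPLE = {
    0xA1: "\u0BBE",  # vowel sign AA, written after the consonant
    0xA6: "\u0BC6",  # vowel sign E, written before the consonant
    0xA7: "\u0BC7",  # vowel sign EE, written before the consonant
    0xA8: "\u0BC8",  # vowel sign AI, written before the consonant
    0xB8: "\u0B95",  # letter KA
    0xC0: "\u0BAA",  # letter PA
}

# Vowel signs that appear to the left of their consonant in written Tamil.
PREFIX_SIGNS = {"\u0BC6", "\u0BC7", "\u0BC8"}


def tscii_to_logical(data: bytes) -> str:
    """Decode visual-order bytes into logical-order Unicode (simplified)."""
    out = []
    pending = None  # prefix vowel sign waiting for its consonant
    for byte in data:
        # Bytes below 0x80 are plain ASCII; others come from the sample table.
        # A byte missing from the sample table raises KeyError.
        ch = chr(byte) if byte < 0x80 else TSCII_SAMPLE[byte]
        if ch in PREFIX_SIGNS:
            pending = ch
            continue
        out.append(ch)
        if pending is not None:
            # Logical order: the consonant first, then its vowel sign.
            out.append(pending)
            pending = None
    if pending is not None:
        out.append(pending)  # stray sign at end of input, kept as-is
    return "".join(out)


if __name__ == "__main__":
    decoded = tscii_to_logical(bytes([0xA6, 0xB8]))
    print([f"U+{ord(c):04X}" for c in decoded])
```

With this sample table, the bytes A6 B8 (vowel sign E followed by KA, as written on the page) decode to U+0B95 U+0BC6 (KA followed by vowel sign E), which is the order Unicode stores.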

The government of Tamil Nadu endorses its own TAB/TAM standards for 8-bit encoding, and other, older encoding schemes can still be found on the Web.

The free e-text collection at Project Madurai uses the TSCII encoding, but has started to provide Unicode versions as well.

History

The need for a common encoding for Tamil was felt by members of various mailing-list-based forums in the mid-1990s, because multiple fonts with custom encodings were in use in those forums. While some of the commercial encodings were more popular than others, they were not accepted by the wider community because of conflicting commercial interests. Most participants accepted Unicode as the future standard, but most desktop systems at that time could not yet handle Unicode for the Tamil language, so an interim 8-bit encoding was required.

A separate mailing list for discussing such encodings (webmasters@tamil.net) was created in 1997. The discussion began with an email written by Dr. K. Kalyanasundaram to the popular Tamil author Sujatha, who headed the committee for the standardization of the Tamil keyboard.[2] The forum quickly attracted enthusiastic participants from across the globe, including several prominent Tamil scholars. Archives of these discussions are maintained by INFITT.[3]

After TSCII was published, most members of the webmasters@tamil.net mailing list became part of INFITT, a wider initiative for standardization and continued development in various areas of Tamil computing.

Codepage layout

TSCII, upper half (bytes 80 to FF). Each line gives the byte in hexadecimal, the byte in decimal, the character, its Unicode code point or code points in hexadecimal, and its class. The class is one of Letter, Number, Punctuation, Symbol or Other; unassigned bytes are marked "undefined".

Hex  Dec  Character  Unicode  Class
80  128  ௦  0BE6  Number [a]
81  129  ௧  0BE7  Number
82  130  ஸ்ரீ  0BB8 0BCD 0BB0 0BC0  Letter
83  131  ஜ  0B9C  Letter
84  132  ஷ  0BB7  Letter
85  133  ஸ  0BB8  Letter
86  134  ஹ  0BB9  Letter
87  135  க்ஷ  0B95 0BCD 0BB7  Letter
88  136  ஜ்  0B9C 0BCD  Letter
89  137  ஷ்  0BB7 0BCD  Letter
8A  138  ஸ்  0BB8 0BCD  Letter
8B  139  ஹ்  0BB9 0BCD  Letter
8C  140  க்ஷ்  0B95 0BCD 0BB7 0BCD  Letter
8D  141  ௨  0BE8  Number
8E  142  ௩  0BE9  Number
8F  143  ௪  0BEA  Number
90  144  ௫  0BEB  Number
91  145  ‘  2018  Punctuation
92  146  ’  2019  Punctuation
93  147  “  201C  Punctuation
94  148  ”  201D  Punctuation
95  149  ௬  0BEC  Number
96  150  ௭  0BED  Number
97  151  ௮  0BEE  Number
98  152  ௯  0BEF  Number
99  153  ஙு  0B99 0BC1  Letter
9A  154  ஞு  0B9E 0BC1  Letter
9B  155  ஙூ  0B99 0BC2  Letter
9C  156  ஞூ  0B9E 0BC2  Letter
9D  157  ௰  0BF0  Number
9E  158  ௱  0BF1  Number
9F  159  ௲  0BF2  Number
A0  160  NBSP  00A0  Other
A1  161  ா  0BBE  Other
A2  162  ி  0BBF  Other
A3  163  ீ  0BC0  Other
A4  164  ு  0BC1  Other
A5  165  ூ  0BC2  Other
A6  166  ெ  0BC6  Other
A7  167  ே  0BC7  Other
A8  168  ை  0BC8  Other
A9  169  ©  00A9  Symbol
AA  170  ௗ  0BD7  Other
AB  171  அ  0B85  Letter
AC  172  ஆ  0B86  Letter
AD  173  undefined
AE  174  ஈ  0B88  Letter
AF  175  உ  0B89  Letter
B0  176  ஊ  0B8A  Letter
B1  177  எ  0B8E  Letter
B2  178  ஏ  0B8F  Letter
B3  179  ஐ  0B90  Letter
B4  180  ஒ  0B92  Letter
B5  181  ஓ  0B93  Letter
B6  182  ஔ  0B94  Letter
B7  183  ஃ  0B83  Letter
B8  184  க  0B95  Letter
B9  185  ங  0B99  Letter
BA  186  ச  0B9A  Letter
BB  187  ஞ  0B9E  Letter
BC  188  ட  0B9F  Letter
BD  189  ண  0BA3  Letter
BE  190  த  0BA4  Letter
BF  191  ந  0BA8  Letter
C0  192  ப  0BAA  Letter
C1  193  ம  0BAE  Letter
C2  194  ய  0BAF  Letter
C3  195  ர  0BB0  Letter
C4  196  ல  0BB2  Letter
C5  197  வ  0BB5  Letter
C6  198  ழ  0BB4  Letter
C7  199  ள  0BB3  Letter
C8  200  ற  0BB1  Letter
C9  201  ன  0BA9  Letter
CA  202  டி  0B9F 0BBF  Letter
CB  203  டீ  0B9F 0BC0  Letter
CC  204  கு  0B95 0BC1  Letter
CD  205  சு  0B9A 0BC1  Letter
CE  206  டு  0B9F 0BC1  Letter
CF  207  ணு  0BA3 0BC1  Letter
D0  208  து  0BA4 0BC1  Letter
D1  209  நு  0BA8 0BC1  Letter
D2  210  பு  0BAA 0BC1  Letter
D3  211  மு  0BAE 0BC1  Letter
D4  212  யு  0BAF 0BC1  Letter
D5  213  ரு  0BB0 0BC1  Letter
D6  214  லு  0BB2 0BC1  Letter
D7  215  வு  0BB5 0BC1  Letter
D8  216  ழு  0BB4 0BC1  Letter
D9  217  ளு  0BB3 0BC1  Letter
DA  218  று  0BB1 0BC1  Letter
DB  219  னு  0BA9 0BC1  Letter
DC  220  கூ  0B95 0BC2  Letter
DD  221  சூ  0B9A 0BC2  Letter
DE  222  டூ  0B9F 0BC2  Letter
DF  223  ணூ  0BA3 0BC2  Letter
E0  224  தூ  0BA4 0BC2  Letter
E1  225  நூ  0BA8 0BC2  Letter
E2  226  பூ  0BAA 0BC2  Letter
E3  227  மூ  0BAE 0BC2  Letter
E4  228  யூ  0BAF 0BC2  Letter
E5  229  ரூ  0BB0 0BC2  Letter
E6  230  லூ  0BB2 0BC2  Letter
E7  231  வூ  0BB5 0BC2  Letter
E8  232  ழூ  0BB4 0BC2  Letter
E9  233  ளூ  0BB3 0BC2  Letter
EA  234  றூ  0BB1 0BC2  Letter
EB  235  னூ  0BA9 0BC2  Letter
EC  236  க்  0B95 0BCD  Letter
ED  237  ங்  0B99 0BCD  Letter
EE  238  ச்  0B9A 0BCD  Letter
EF  239  ஞ்  0B9E 0BCD  Letter
F0  240  ட்  0B9F 0BCD  Letter
F1  241  ண்  0BA3 0BCD  Letter
F2  242  த்  0BA4 0BCD  Letter
F3  243  ந்  0BA8 0BCD  Letter
F4  244  ப்  0BAA 0BCD  Letter
F5  245  ம்  0BAE 0BCD  Letter
F6  246  ய்  0BAF 0BCD  Letter
F7  247  ர்  0BB0 0BCD  Letter
F8  248  ல்  0BB2 0BCD  Letter
F9  249  வ்  0BB5 0BCD  Letter
FA  250  ழ்  0BB4 0BCD  Letter
FB  251  ள்  0BB3 0BCD  Letter
FC  252  ற்  0BB1 0BCD  Letter
FD  253  ன்  0BA9 0BCD  Letter
FE  254  இ  0B87  Letter
FF  255  undefined

  [a] U+0BE6 TAMIL DIGIT ZERO was added in Unicode version 4.1 in March 2005.
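Several cells in the layout above map a single TSCII byte to a sequence of Unicode code points (for example, 87 stands for 0B95 0BCD 0BB7). One consequence is that a Unicode-to-TSCII encoder has to match sequences rather than single characters. The following Python sketch shows one possible approach, a greedy longest match over a small sample table. The names UNICODE_TO_TSCII_SAMPLE and encode_sample are hypothetical, the table covers only a few cells, and the reordering of vowel signs into visual order is deliberately left out.

```python
# A few cells from the code page layout, keyed by Unicode sequence.
# Illustrative subset only; a real encoder needs the whole table.
UNICODE_TO_TSCII_SAMPLE = {
    "\u0B95": 0xB8,                    # KA
    "\u0BB7": 0x84,                    # SSA
    "\u0BBE": 0xA1,                    # vowel sign AA
    "\u0B95\u0BC1": 0xCC,              # KA + vowel sign U
    "\u0B95\u0BCD": 0xEC,              # KA + virama
    "\u0B95\u0BCD\u0BB7": 0x87,        # KA + virama + SSA
    "\u0B95\u0BCD\u0BB7\u0BCD": 0x8C,  # KA + virama + SSA + virama
}

# Length of the longest key, used to bound the search window.
MAX_KEY_LEN = max(len(key) for key in UNICODE_TO_TSCII_SAMPLE)


def encode_sample(text: str) -> bytes:
    """Encode text using greedy longest-match lookup (no reordering)."""
    out = bytearray()
    i = 0
    while i < len(text):
        if ord(text[i]) < 0x80:
            out.append(ord(text[i]))  # ASCII maps to itself
            i += 1
            continue
        # Try the longest candidate sequence first, then shorter ones.
        for length in range(min(MAX_KEY_LEN, len(text) - i), 0, -1):
            byte = UNICODE_TO_TSCII_SAMPLE.get(text[i:i + length])
            if byte is not None:
                out.append(byte)
                i += length
                break
        else:
            raise ValueError(f"no sample mapping for U+{ord(text[i]):04X}")
    return bytes(out)


if __name__ == "__main__":
    print(encode_sample("\u0B95\u0BCD\u0BB7").hex())
```

Trying the longest sequence first matters here: with a shortest-first lookup, the three code points 0B95 0BCD 0BB7 would be split into KA and further pieces instead of being encoded as the single byte 87.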

Conversion Tools

UTF-8 encoded documents can be converted to TSCII using the GNU iconv tool as follows:

$ iconv -f utf-8 -t tscii hello.utf8 > hello.tscii

To convert in the other direction, from TSCII to UTF-8, swap the encoding names given to the -f and -t options.
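The same conversion can be driven from a script. The following Python sketch simply wraps the iconv command shown above; it assumes that an iconv executable is on the PATH and that it was built with TSCII support, which is not guaranteed on every system. The function names iconv_args and convert_file are hypothetical.

```python
import subprocess


def iconv_args(src_encoding: str, dst_encoding: str, path: str) -> list:
    """Build the iconv command line for converting one file."""
    return ["iconv", "-f", src_encoding, "-t", dst_encoding, path]


def convert_file(src_encoding: str, dst_encoding: str,
                 in_path: str, out_path: str) -> None:
    """Run iconv and write its standard output to out_path.

    Raises subprocess.CalledProcessError if iconv exits with an error,
    for example when it does not know one of the encoding names.
    """
    with open(out_path, "wb") as out_file:
        subprocess.run(
            iconv_args(src_encoding, dst_encoding, in_path),
            stdout=out_file,
            check=True,
        )


if __name__ == "__main__":
    # Reverse direction of the shell example: TSCII back to UTF-8.
    print(" ".join(iconv_args("tscii", "utf-8", "hello.tscii")))
```

Calling convert_file("tscii", "utf-8", "hello.tscii", "hello.utf8") would then perform the reverse conversion, with the -f and -t values interchanged relative to the shell example.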

Visual Application

AnyTaFont2UTF8 is an open-source project maintained by the Isaiyini Tamil Community.

See also

  • TACE16 (Tamil All Character Encoding)

References
