CS Indic character set

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

The CS Indic character set, or the Classical Sanskrit Indic Character Set, is used by LaTex represent text used in the Romanization of Sanskrit.[1] It is used in fonts, and is based on Code Page 437.[2] Extended versions are the CSX Indic character set and the CSX+ Indic character set.[3][4]

Code page layout[edit]

CS Indic[5]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
8_
128
9_
144
A_
160
ñ
00F1
Ñ
00D1

006C 0303

1E41
B_
176
C_
192
D_
208
E_
224
ā
0101
Ā
0100
ī
012B
Ī
012A
ū
016B
Ū
016A

1E5B

1E5A

1E5D

1E5C

1E37

1E36

1E39

1E38

1E45
F_
240

1E44

1E6D

1E6C

1E0D

1E0C

1E47

1E46
ś
015B
Ś
015A

1E63

1E62

1E43

1E42

1E25

1E24

  Letter  Number  Punctuation  Symbol  Other  Undefined

History[edit]

The CS and CSX character set was defined during an informal discussion over a beer between John Smith, Dominik Wujastyk and Ronald E. Emmerick during the World Sanskrit Conference in Vienna, 1990. A few months later they were endorsed by several other Indologists including Harry Falk, Richard Lariviere, G. Jan Meulenbeld, Hideaki Nakatani, Muneo Tokunaga, and Michio Yano.[5]

References[edit]

  1. ^ Anshuman Pandey (December 1998). "Romanized Indix and LaTex" (PDF). TUGboat. TeX Users Group. 19 (4): 417.
  2. ^ "CTAN: /Tex-archive/Fonts/CSX/Fonts/Charter".
  3. ^ "Classical Sanskrit eXtended encoding for the representation of Indian languages in Roman script".
  4. ^ "The CSX+ encoding (Classical Sanskrit eXtended Plus) encoding used in (La)TeX".
  5. ^ a b Wujastyk, Dominik (1990). "HUMANIST listserv report". HUMANIST.