Jump to content

Allograph

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by AManWithNoPlan (talk | contribs) at 14:24, 21 December 2021 (doi now works). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Allography, from the Greek for "other writing", has several meanings which all relate to how words and sounds are written down.

Authorship

An allograph may be the opposite of an autograph – i.e. a person's words or name (signature) written by someone else.[1]

Script

Historical allographs of Latin letters
a

U+0061
small "A"
rendered with
a top hook
in most fonts
ɑ

U+0251
small Latin alpha
never has
a top hook
 
ɡ

U+0261
small script "G"
never has
a looptail
g

U+0067
small "G"
rendered with
a looptail
in some fonts
 
s

U+0073
small "S"
ſ

U+017F
small long "S"

In graphemics, the term allograph denotes any glyphs that are considered variants of a letter or other grapheme, like a number or punctuation. An obvious example in English (and many other writing systems) is the distinction between uppercase and lowercase letters. Allographs can vary vastly, without affecting the underlying identity of the grapheme. Even if the word "cat" is rendered as "cAt", it remains recognizable as the sequence of the three graphemes ‹c›, ‹a›, ‹t›.[2] Thus, if a group of individual glyphs (shapes that may or may not represent the same letter) are allographs (they do represent the same letter), they all represent a single grapheme (a single instance of the smallest unit of writing).

Letters and other graphemes can also have huge variations that may be missed by many readers. The letter g, for example, has two common forms (glyphs) in different typefaces, and an enormous variety in people's handwriting. A positional example of allography is the so-called long s, a symbol which was once a widely used non-final allograph of the lowercase letter s. A grapheme variant can acquire a separate meaning in a specialized writing system. Several such variants have distinct code points in Unicode and so ceased to be allographs for some applications.[3]

The fact that handwritten allographs differ so widely from person to person, and even from day to day with the same person, means that handwriting recognition software is enormously complicated.

Chinese characters

In the Chinese script, there exist several graphemes that have more than one written representation. Chinese typefaces often contain many variants of some graphemes. Different regional standards have adopted certain character variants. For instance:

Standard Allograph
Taiwan
China
Japan

Orthography

An allograph may also be a smaller fragment of writing, that is a letter or a group of letters, which represents a particular sound. In the words cat and king, the letters c and k are both allographs of the same sound. This relationship between a letter and a sound is not necessarily fixed, for example in a different word, such as city, c is instead an allograph of an s sound.

Some words use groups of letters to represent a sound. In kick both k and ck are allographs of the sound that the c in cat represents. These associations are learned as part of learning to read and write a language. However, the development of phonetic associations is more difficult when deaf children are learning written languages. For example, without auditory knowledge of words, children are unable to associate graphemes with phonemes, or the sounds of letters or groups of letters. This lack of alphabetic knowledge correlates with delayed literacy in early learning.[4]

English orthographic allographs for /k/
cat, mecca, chemistry, back, lacquer, biscuit, sgraffito, key, khan, dekko, walk, Qatar, bouquet Complicated allographs may surprise or baffle language learners, just as those in place names can continue to confuse people who are unfamiliar with a particular location, even when they are native speakers of the language. One notorious allograph in the English language is ough, which may easily represent more than 10 different sounds, depending on the word in which it is used.

Allographs have found use in humor and puns; a famous example of allographic humour is that of spelling fish as ghoti.

The primary reason that we accept all these varieties as representing the same sound or grapheme is that we have been taught to make these associations when learning to read the English language. That is to say, their meaning and correspondence is assigned arbitrarily, by conventions adopted and observed by a particular language community. Many of these associations have to be unlearned if we study a second language whose writing system is based upon, or contains many elements similar to or shared by, our own alphabet or writing system.[5] Very often, the letters one might be comfortable and familiar with are allographs of quite different sounds in the second language. For example, in written Spanish the grapheme ⟨v⟩ will often represent the phoneme /b/, whereas in English this does not occur.

Typography

Official dimensions of the euro sign
Dimensions of the sign in a selection of type faces

The term 'allograph' is used to describe the different representations of the same grapheme or character in different typefaces.[6] The resulting font elements may look quite different in shape and style from the reference character or each other, but nevertheless their meaning remains the same.[7] In Unicode, a given character is allocated a code point: all allographs of that character have the same code point and thus the essential meaning is retained irrespective of font choice at time of printing or display.

See also

References

  1. ^ Jeremy Hawthorn (2000), A Glossary of Contemporary Literary Theory, Oxford University Press, ISBN 0-340-76195-4
  2. ^ The Cambridge Encyclopedia of Language, second edition, Cambridge University Press, 1997, p. 196
  3. ^ Kumar, Sanjeev (2012-10-15). "A Comparative Study of UTF-8, UTF-16, and UTF-32 of Unicode Code Point". The IUP Journal of Telecommunications. IV (2). Rochester, NY: 50–59. SSRN 2161812.
  4. ^ Bergeron, Jessica Page; Lederberg, Amy R.; Easterbrooks, Susan R.; Miller, Elizabeth Malone; McDonald Connor, Carol (2009). "Building the Alphabetic Principle in Young Children Who Are Deaf or Hard of Hearing". Volta Review. 109 (2/3): 87–119. doi:10.17955/tvr.109.2.3.622.
  5. ^ Alco, Bonnie (2009–2010). "An L2 Reader's Word-Recognition Strategies: Transferred or Developed". The Catesol Journal. 21: 66–74.
  6. ^ Thomas Milo (2012). "Arabic Script Tutorial". nuqta.com. Retrieved 24 November 2019. In Arabic the abstract, nominal graphemes are represented by context-dependent allographs. Simplified support for Arabic handles contextual allographs according to two patterns, discontinuous and continuous assimilation. (Allographs and Ligatures)
  7. ^ David Rothlein; Brenda Rapp (3 April 2017). "The role of allograph representations in font-invariant letter identification". Journal of Experimental Psychology: Human Perception and Performance. 43 (7): 1411–1429. doi:10.1037/xhp0000384. PMC 5481478. PMID 28368166.