|This article needs additional citations for verification. (April 2013)|
Predictive text is an input technology used where one key or button represents many letters, such as on mobile phones's numeric keypads and in accessibility technologies. Each key press results in a prediction rather than repeatedly sequencing through the same group of "letters" it represents, in the same, invariable order. Predictive text could allow for an entire word to be input by single keypress. Predictive text makes efficient use of fewer device keys to input writing into a text message, an e-mail, an address book, a calendar, and the like.
The most widely used, general, predictive text systems are T9, iTap, and LetterWise/WordWise. There are many unique ways to build a device that predicts text, but all predictive text systems have initial, linguistic settings that offer predictions that are re-prioritized to adapt to each user. This learning adapts, by way of the device memory, to a user's disambiguating feedback that results in corrective key presses, such as pressing a "next" key to get to the intention. Most predictive text systems have a user database to facilitate this process.
Theoretically the number of keystrokes required per desired character in the finished writing is, on average, comparable to using a keyboard. This is approximately true providing that all words used are in its database, punctuation is ignored, and no input mistakes are made typing or spelling. In practice, these factors are found to cause tremendous variance in the efficiency gain. The theoretical keystrokes per character, KSPC, of a keyboard is KSPC=1.00, and of multi-tap is KSPC=2.03. Eatoni' LetterWise is a predictive multi-tap hybrid, which when operating on a standard telephone keypad achieves KSPC=1.15 for English.
The choice of which predictive text system is the best to use involves matching the user's preferred interface style, the user's level of learned ability to operate predictive text software, and the user's efficiency goal. There are various levels of risk in predictive text systems, versus multi-tap systems, because the predicted text that is automatically written that provide the speed and mechanical efficiency benefit, could, if the user is not careful to review, result in transmitting misinformation. Predictive text systems take time to learn to use well, and so generally, a device's system has user options to set up the choice of multi-tap or of any one of several schools of predictive text methods.
Short message service (SMS) permits a mobile phone user to send text messages, (also called messages, SMSes, texts, and txts) as a short message. The most common system of SMS text input is referred to as "multi-tap". Using multi-tap, a key is pressed multiple times to access the list of letters on that key. For instance, pressing the "2" key once displays an "a", twice displays a "b" and three times displays a "c". To enter two successive letters that are on the same key, the user must either pause or hit a "next" button. A user can type by pressing an alphanumeric keypad without looking at the electronic equipment display. Thus, multi-tap is easy to understand, and can be used without any visual feedback. However, multi-tap is not very efficient, requiring potentially many keystrokes to enter a single letter.
In ideal predictive text entry, all words used are in the dictionary, punctuation is ignored, no spelling mistakes are made, and no typing mistakes are made. The ideal dictionary would include all slang, proper nouns, abbreviations, urls, foreign-language words and other user-unique words. This ideal circumstance gives predictive text software the reduction in the number of key strokes a user is required to enter a word. The user presses the number corresponding to each letter and, as long as the word exists in the predictive text dictionary, or is correctly disambiguated by non-dictionary systems, it will appear. For instance, pressing "4663" will typically be interpreted as the word "good", provided that a linguistic database in English is currently in use, though alternatives such as home, hood, hoof are also valid interpretations of the sequence of key strokes.
The most widely used systems of predictive text are Tegic's T9, Motorola's iTap, and the Eatoni Ergonomics' LetterWise and WordWise. T9 and iTap use dictionaries, but Eatoni Ergonomics' products uses a disambiguation process, a set of statistical rules to recreate words from keystroke sequences. All predictive text systems require a linguistic database for every supported input language.
Dictionary vs. non-dictionary systems
In dictionary-based systems, as the user presses the number buttons, an algorithm searches the dictionary for a list of possible words that match the keypress combination, and offers up the most probable choice. The user can then confirm the selection and move on, or use a key to cycle through the possible combinations.
A non-dictionary system constructs words and other sequences of letters from the statistics of word parts. To attempt predictions of the intended result of keystrokes not yet entered, disambiguation may be combined with a word completion facility.
Either system (disambiguation or predictive) may include a user database, which can be further classified as a "learning" system when words or phrases are entered into the user database without direct user intervention. The user database is for storing words or phrases which are not well-disambiguated by the pre-supplied database. Some disambiguation systems further attempt to correct spelling, format text or perform other automatic rewrites, with the risky effect of either enhancing or frustrating user efforts to enter text.
Predictive entry of text from a telephone keypad has been known at least since the 1970s (Smith and Goodwin, 1971). Aspects of predictive text have been patented for instance by Kondraske(1985), while a fully functional keypad to text system for communicating with deaf people via phone was patented in 1988 by Roy Feinson #4,754,474), that included most of the features of modern predictive text systems including disambiguation and local dictionary storage. Predictive text was mainly used to look up names in directories over the phone, until mobile phone text messaging came into widespread use. Sinclair's ZX80/81 also possessed a form of predictive.
Consider a typical phone keypad:
Suppose a user wishes to type "The". In a traditional "multi-tap" keypad entry system, it would be necessary to do the following:
- Press 8 (tuv) once to select t.
- Press 4 (ghi) twice to select h.
- Press 3 (def) twice to select e.
Meanwhile, in a phone with predictive text, it is only necessary to:
- Press 8 once to select the (tuv) group for the first character.
- Press 4 once to select the (ghi) group for the second character.
- Press 3 once to select the (def) group for the third character.
The system updates the display as each keypress is entered to show the most probable entry. In this case, predictive text reduced the number of button presses from 5 to 3. The effect is even greater with longer, more complex words.
A dictionary-based predictive system is based on hope that the desired word is in the dictionary. That hope may be misplaced if the word differs in any way from common usage—in particular, if the word is not spelled or typed correctly, is slang, or is a proper noun. In these cases, some other mechanism must be used to enter the word.
Furthermore, the simple dictionary approach fails with agglutinative languages, where a single word doesn't necessarily represent a single semantic entity.
Companies and products
Predictive text is developed and marketed in a variety of competing products. Nuance Communications's T9. Other products include Motorola's iTap, Eatoni Ergonomic's LetterWise, (character, rather than word-based prediction), WordWise (word-based prediction without a dictionary), EQ3 (a Qwerty-like layout compatible with regular telephone keypads); Prevalent Devices's Phraze-It; Xrgomics' TenGO (a six-key reduced QWERTY keyboard system); Adaptxt (considers language, context, grammar and semantics); Clevertexting (statistical nature of the language, dictionary less, dynamic key allocation); and Oizea Type (temporal ambiguity); Intelab's Tauto; WordLogic's Intelligent Input Platform™ (patented, layer-based advanced text prediction, includes multi-language dictionary, spell-check, built-in Web search).
Words produced by the same combination of keypresses have been called "textonyms"; also "txtonyms"; or "T9onyms" (pronounced "tynonyms"), though they are not specific to T9. Selecting the wrong textonym can occur with no misspelling or typo, if the wrong textonym is selected by default or user error. As mentioned above, the key sequence 4663 on a telephone keypad, provided with a linguistic database in English, will generally be disambiguated as the word "good". However, the same key sequence also corresponds to other words, such as "home", "gone", "hoof", "hood" and so on. For example, "Are you home?" could be rendered as "Are you good?" if the user neglects to alter the default 4663 word. This can lead to misunderstandings; for example sequence 735328 might correspond to either "select" or its antonym "reject". A 2010 row which led to manslaughter was sparked by a textonym error. Predictive text choosing a default different from that which the user expects has similarities with the Cupertino effect, by which spell-check software changes a spelling to that of an unintended word.
Textonyms have been used as Generation Y slang; for example, the use of the word "book" to mean "cool", since "book" is the default in those predictive text systems which assume it is more frequent than "cool". This is related to cacography.
Disambiguation failure and misspelling
Textonyms in which a disambiguation systems gives more than one dictionary word for a single sequence of keystrokes, are not the only issue, or even the most important issue, limiting the effectiveness of predictive text implementations. More important, according to the above references,[which?] are words for which the disambiguation produces a single, incorrect response. The system may, for example, respond with "Blairf" upon input of 252473, when the intended word was "Blaire" or "Claire" both of which correspond to the keystroke sequence, but are not, in this example, found by the predictive text system. When typos or misspellings occur, they are very unlikely to be recognized correctly by a disambiguation system, though error correction mechanisms may mitigate that effect.
- Assistive technology
- Text entry interface
- Input method editor
- Text messaging
- SMS language
- Speech-to-Text Reporter
- I. Scott MacKenzie (2002). "KSPC (Keystrokes per Character) as a Characteristic of Text Entry Techniques". Proceedings of MobileHCI 2002.
Values [of KSPC] for English range from about 10 for methods using only cursor keys and a SELECT key to about 0.5 for word prediction techniques. It is demonstrated that KSPC is useful for a priori analyses, thereby supporting the characterisation and comparison of text-entry methods before labour-intensive implementations and evaluations
- O'Riordan et. al. "Investigating Text Input Methods for Mobile Phones". J. Computer Sci, I (2):189-199, 2005.
- "Slang early-warning alert: `Book' is the new `cat's pajamas' | Change of Subject". Blogs.chicagotribune.com. 2007-01-19. Retrieved 2009-07-08.
- Dartmelk, Jewis. "Txtonyms". University College London: Centre for Mathematics and Physics in the Life Sciences and Experimental Biology. Retrieved 5 April 2013.
- "Indefinite sentence for killing his friend". This Is Lancashire. 2 April 2011. Retrieved 5 April 2013.
- Alleyne, Richard (5 Feb 2008). "Predictive text creating secret teen language". The Daily Telegraph. Retrieved 5 April 2013.
- "Alphabetic Data Entry Via the Touch-Tone Pad: A Comment", Sidney L. Smith and Nancy C. Goodwin, The Mitre Corporation, HUMAN FACTORS, 1971, 13(2) Page 189-190