Trigram tagger

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Sammi Brie at 05:38, 26 January 2021 (Adding short description: "Statistical method for automatically identifying words by part of speech"). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In computational linguistics, a trigram tagger is a statistical method for automatically labelling words with their part of speech (noun, verb, adjective, adverb, etc.). It is based on a second-order Markov model that considers triples of consecutive words and their tags. The tagger is trained on a tagged text corpus and predicts the tag of each word by combining unigram, bigram and trigram probabilities. In speech recognition, algorithms using a trigram tagger score better than those using an HMM tagger, but less well than those using the Net tagger.
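The idea can be illustrated with a minimal Python sketch of a second-order (trigram) tagger. It is not the implementation from any of the cited works: the function names `train`, `transition`, `emission` and `tag`, the fixed interpolation weights, the add-one emission smoothing and the toy corpus are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

START, STOP = "<s>", "</s>"  # hypothetical boundary markers


def train(tagged_sentences):
    """Count tag unigrams, bigrams, trigrams, their contexts, and word emissions."""
    c = {"uni": Counter(), "bi": Counter(), "tri": Counter(),
         "ctx1": Counter(), "ctx2": Counter(), "emit": defaultdict(Counter)}
    for sent in tagged_sentences:
        tags = [START, START] + [t for _, t in sent] + [STOP]
        for word, t in sent:
            c["emit"][t][word.lower()] += 1
        for i in range(2, len(tags)):
            t1, t2, t3 = tags[i - 2], tags[i - 1], tags[i]
            c["uni"][t3] += 1
            c["bi"][(t2, t3)] += 1
            c["tri"][(t1, t2, t3)] += 1
            c["ctx1"][t2] += 1
            c["ctx2"][(t1, t2)] += 1
    return c


def transition(c, t1, t2, t3, lambdas=(0.1, 0.3, 0.6)):
    """P(t3 | t1, t2) as a weighted mix of unigram, bigram and trigram estimates.

    The weights are fixed by assumption; a real tagger would estimate them.
    """
    l1, l2, l3 = lambdas
    p1 = c["uni"][t3] / sum(c["uni"].values())
    p2 = c["bi"][(t2, t3)] / c["ctx1"][t2] if c["ctx1"][t2] else 0.0
    p3 = c["tri"][(t1, t2, t3)] / c["ctx2"][(t1, t2)] if c["ctx2"][(t1, t2)] else 0.0
    return l1 * p1 + l2 * p2 + l3 * p3


def emission(c, t, word, vocab_size):
    """P(word | t) with add-one smoothing (an assumption, chosen for brevity)."""
    counts = c["emit"][t]
    return (counts[word.lower()] + 1) / (sum(counts.values()) + vocab_size + 1)


def tag(c, words):
    """Viterbi search over states made of the two most recent tags."""
    tagset = list(c["emit"])
    vocab_size = len({w for counts in c["emit"].values() for w in counts})
    # best[(t1, t2)] = (log-probability, best tag sequence ending in t1, t2)
    best = {(START, START): (0.0, [])}
    for word in words:
        new = {}
        for (t1, t2), (score, path) in best.items():
            for t3 in tagset:
                s = (score + math.log(transition(c, t1, t2, t3))
                     + math.log(emission(c, t3, word, vocab_size)))
                if (t2, t3) not in new or s > new[(t2, t3)][0]:
                    new[(t2, t3)] = (s, path + [t3])
        best = new
    # Include the transition into the end-of-sentence marker.
    state, (score, path) = max(
        best.items(),
        key=lambda kv: kv[1][0] + math.log(transition(c, kv[0][0], kv[0][1], STOP)))
    return path


# Toy corpus, invented for this example only.
corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
]
model = train(corpus)
print(tag(model, ["the", "cat", "barks"]))
```

Each search state holds the two most recent tags, which is what makes the model second order: the probability of the next tag depends on the pair of tags before it.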

A description of the trigram tagger is given by Brants (2000).
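As a rough summary, and with notation that is a paraphrase to be checked against the paper, the model in Brants (2000) chooses the tag sequence $t_1 \ldots t_T$ for the words $w_1 \ldots w_T$ that maximises the product of transition and emission probabilities. The trigram transition probability is smoothed by linear interpolation of unigram, bigram and trigram relative frequencies $\hat{P}$:

```latex
\hat{t}_1 \ldots \hat{t}_T
  = \operatorname*{arg\,max}_{t_1 \ldots t_T}
    \left[ \prod_{i=1}^{T} P(t_i \mid t_{i-1}, t_{i-2}) \, P(w_i \mid t_i) \right]
    P(t_{T+1} \mid t_T)

P(t_3 \mid t_1, t_2)
  = \lambda_1 \hat{P}(t_3)
  + \lambda_2 \hat{P}(t_3 \mid t_2)
  + \lambda_3 \hat{P}(t_3 \mid t_1, t_2),
  \qquad \lambda_1 + \lambda_2 + \lambda_3 = 1
```

Here $t_{-1}$, $t_0$ and $t_{T+1}$ stand for beginning-of-sequence and end-of-sequence markers.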

References

  • Kempe, André (1993). "A stochastic Tagger and an Analysis of Tagging Errors". Internal paper. Institute for Computational Linguistics, Universität Stuttgart.
  • Brants, T. (2000). "TnT - A Statistical Part-of-Speech Tagger". Proc. 6th Applied Natural Language Processing Conference, ANLP-200
