Trigram tagger

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Sammi Brie (talk | contribs) at 05:38, 26 January 2021 (Adding short description: "Statistical method for automatically identifying words by part of speech" (Shortdesc helper)). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In computational linguistics, a trigram tagger is a statistical method for automatically labeling words with their part of speech (noun, verb, adjective, adverb, etc.). It is based on second-order Markov models that consider the tags of three consecutive words. The tagger is trained on a text corpus and combines unigram, bigram, and trigram probabilities to predict the tag of each word. In speech recognition, algorithms that use a trigram tagger score better than those that use an HMM tagger but less well than those that use the Net tagger.
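
The idea can be illustrated with a minimal sketch in Python. It is not the implementation described by Brants (2000): the toy corpus, the function names, the fixed interpolation weights, and the constant probability floor for unseen word/tag pairs are all assumptions made for illustration. Here the unigram, bigram, and trigram tag estimates are combined as a weighted sum (linear interpolation), and the best tag sequence is found with a Viterbi search over pairs of tags.

```python
from collections import Counter, defaultdict
import math

# Toy tagged corpus (hypothetical data, for illustration only).
CORPUS = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
    [("dogs", "NOUN"), ("bark", "VERB")],
]
START = "<s>"  # padding tag placed before each sentence


def train(corpus):
    """Count tag unigrams, bigrams, trigrams, their contexts, and emissions."""
    m = {"uni": Counter(), "bi": Counter(), "tri": Counter(),
         "ctx1": Counter(), "ctx2": Counter(), "emit": defaultdict(Counter)}
    for sent in corpus:
        tags = [START, START] + [t for _, t in sent]
        for word, t in sent:
            m["emit"][t][word] += 1
        for i in range(2, len(tags)):
            t1, t2, t3 = tags[i - 2], tags[i - 1], tags[i]
            m["uni"][t3] += 1
            m["bi"][(t2, t3)] += 1
            m["tri"][(t1, t2, t3)] += 1
            m["ctx1"][t2] += 1          # how often t2 precedes any tag
            m["ctx2"][(t1, t2)] += 1    # how often (t1, t2) precedes any tag
    m["total"] = sum(m["uni"].values())
    return m


def transition(m, t1, t2, t3, lambdas=(0.1, 0.3, 0.6)):
    """Interpolated P(t3 | t1, t2); the weights are assumed, not estimated."""
    l1, l2, l3 = lambdas
    p1 = m["uni"][t3] / m["total"]
    p2 = m["bi"][(t2, t3)] / m["ctx1"][t2] if m["ctx1"][t2] else 0.0
    p3 = m["tri"][(t1, t2, t3)] / m["ctx2"][(t1, t2)] if m["ctx2"][(t1, t2)] else 0.0
    return l1 * p1 + l2 * p2 + l3 * p3


def emission(m, word, t, floor=1e-6):
    """P(word | t), with an assumed constant floor for unseen pairs."""
    count = m["emit"][t][word]
    return count / m["uni"][t] if count else floor


def tag(m, words):
    """Viterbi search; each state is the pair of the two most recent tags."""
    tags = list(m["uni"])
    best = {(START, START): (0.0, [])}  # state -> (log-probability, tag path)
    for word in words:
        nxt = {}
        for (t1, t2), (logp, path) in best.items():
            for t3 in tags:
                score = (logp + math.log(transition(m, t1, t2, t3))
                         + math.log(emission(m, word, t3)))
                if (t2, t3) not in nxt or score > nxt[(t2, t3)][0]:
                    nxt[(t2, t3)] = (score, path + [t3])
        best = nxt
    return max(best.values(), key=lambda v: v[0])[1]


model = train(CORPUS)
print(tag(model, ["the", "dog", "sleeps"]))  # ['DET', 'NOUN', 'VERB']
```

A word that never occurs in the training data, such as "zebra" in `tag(model, ["the", "zebra", "sleeps"])`, receives the same floor probability under every tag, so its tag is decided by the interpolated tag context alone. A practical tagger would estimate the interpolation weights from data and treat unknown words more carefully than this sketch does.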

A description of the trigram tagger is given by Brants (2000).

References

Kempe, A. (1993). "A Stochastic Tagger and an Analysis of Tagging Errors". Internal paper. Institute for Computational Linguistics, Universität Stuttgart.
Brants, T. (2000). "TnT - A Statistical Part-of-Speech Tagger". Proc. 6th Applied Natural Language Processing Conference, ANLP-200

External links

TnT -- Statistical Part-of-Speech Tagging by Thorsten Brants

Retrieved from "https://en.wikipedia.org/w/index.php?title=Trigram_tagger&oldid=1002821652"
