Temporal expressions

From Wikipedia, the free encyclopedia
Jump to: navigation, search

A temporal expression in a text is a sequence of tokens (words, numbers and characters) that denote time, that is express a point in time, a duration or a frequency. Examples:

A point in time:
He was born on <TIMEX>6 May, 1980</TIMEX>.
A duration:
The show lasted <TIMEX>7 minutes</TIMEX>.
A frequency:
The pump circulates the water <TIMEX>every 2 hours</TIMEX>.

Initially, temporal expressions were considered a type of named entities and their identification was part of the named entity recognition task. Since the Automatic Content Extraction program in 2004 there has been a separate task identified and called Temporal Expression Recognition and Normalisation (TERN). Timex evaluation is now evaluated in two major temporal annotation challenges: TempEval and i2b2, both of which prefer the TimeML-level TIMEX3 standard.[1]

Approaches[edit]

Similarly to NER systems, temporal expression taggers have been created either using linguistic grammar-based techniques or statistical models. Hand-crafted grammar-based systems typically obtained better results, but at the cost of months of work by experienced linguists. There are many such systems available now,[2][3][4] so creating a temporal expression recognizer from scratch is generally an undesirable duplication of effort. Instead, current approaches focus on novel subclasses of timex.[5]

Statistical systems typically require a large amount of manually annotated training data and are usually applied to the recognition task only (although there is work done using machine learning algorithms to resolve certain ambiguities in the interpretation step).[6][7]

Notes[edit]

  1. ^ See TIMEX3 at timeml.org
  2. ^ Strötgen, Jannik; Michael Gertz (2010). "HeidelTime: High quality rule-based extraction and normalization of temporal expressions". Proceedings of the 5th International Workshop on Semantic Evaluation. ACL. 
  3. ^ Llorens, Hector; Leon Derczynski; Robert Gaizauskas; Estela Saquete (2012). "TIMEN: An Open Temporal Expression Normalisation Resource". LREC. ACL. 
  4. ^ Filannino, Michele; Gavin Brown; Goran Nenadic (2013). "ManTIME: Temporal identification and normalization in the TempEval-3 challenge". Proceedings of the 7th International Workshop on Semantic Evaluation. ACL. 
  5. ^ Brucato, Matteo; Leon Derczynski; Hector Llorens; Kalina Bontcheva; Christian S. Jensen (2013). "Recognising and Interpreting Named Temporal Expressions". Proceedings of the International Conference on Recent Advances in Natural Language Processing. ACL. 
  6. ^ See, for example, Ahn, van Rantwijk & de Rijke 2007
  7. ^ Angeli, Gabor; Christoper Manning; Daniel Jurafsky (2012). "Parsing time: Learning to interpret time expressions". Proceedings of NAACL. ACL: 446. 

References[edit]

External links[edit]

  • TimexPortal - contains information about research on temporal expression recognition and normalisation, including annotation standards, available corpora, researchers in the field, systems and literature.
  • ACE - the website of the Automatic Content Extraction program.
  • TERN - an open-source (research) TERN software.