Semantic role labeling
This article provides insufficient context for those unfamiliar with the subject. Learn how and when to remove this template message)(January 2009) (
Semantic role labeling, sometimes also called shallow semantic parsing, is a task in natural language processing consisting of the detection of the semantic arguments associated with the predicate or verb of a sentence and their classification into their specific roles. For example, given a sentence like "Mary sold the book to John", the task would be to recognize the verb "to sell" as representing the predicate, "Mary" as representing the seller (agent), "the book" as representing the goods (theme), and "John" as representing the recipient. This is an important step towards making sense of the meaning of a sentence. A semantic analysis of this sort is at a lower-level of abstraction than a syntax tree, i.e. it has more categories, thus groups fewer clauses in each category. For instance, "the book belongs to me" would need two labels such as "possessed" and "possessor" whereas "the book was sold to John" would need two other labels such as "goal" (or "theme") and "receiver" (or "recipient") even though these two clauses would be very similar as far as "subject" and "object" functions are concerned.
The FrameNet project produced the first major computational lexicon that systematically described many predicates and their corresponding roles. Daniel Gildea (University of California, Berkeley / International Computer Science Institute) and Daniel Jurafsky (currently teaching at Stanford University, but previously working at University of Colorado and UC Berkeley) developed the first automatic semantic role labeling system based on FrameNet. The PropBank corpus added manually created semantic role annotations to the Penn TreeBank corpus of Wall Street Journal texts. Many automatic semantic role labeling systems have used PropBank as a training dataset to learn how to annotate new sentences automatically.
- Automatic Labeling of Semantic Roles, Daniel Gildea and Daniel Jurafsky. In Proceedings of the 38th Annual Conference of the Association for Computational Linguistics (ACL-00), pp. 512–520, Hong Kong, October 2000.