Dependency grammar

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Dependency grammar (DG) is a class of modern syntactic theories that are all based on the dependency relation and that can be traced back primarily to the work of Lucien Tesnière. Dependency grammars are distinct from phrase structure grammars (= constituency grammars), since they lack phrasal nodes. Structure is determined by the relation between a word (a head) and its dependents. Dependency structures are flatter than constituency structures in part because they lack a finite verb phrase constituent, and they are thus well suited to languages with free word order, such as Czech and Turkish.

Contents

[edit] History

The notion of dependencies between grammatical units has existed since the earliest recorded grammars, e.g. Panini, and the dependency concept therefore arguably predates the constituency notion by many centuries.[1] Ibn Mada was the first grammarian to use the term dependency in the grammatical sense that we use it today. He was born in 1119 in Cordoba, studied in Sevilla and Ceuta, and died 1195 in Sevilla. In early modern times, the dependency concept seems to have coexisted side by side with the constituency concept, the latter having entered Latin, French, English and other grammars from the widespread study of term logic of antiquity.[2]

Modern dependency grammars, however, begin primarily with the work of Lucien Tesnière. Tesnière was a Frenchman, a polyglot, and a professor of linguistics at the universities in Strasbourg and Montpellier. His major work Éléments de syntaxe structurale was published posthumously in 1959 - he died in 1954. The basic approach to syntax he developed seems to have been seized upon independently by others in the 1960s[3] and a number of other dependency-based grammars have gained prominence since those early works.[4] Dependency grammar has generated a lot of interest in Germany[5] in both theoretical syntax and language pedagogy. In recent years, the great development surrounding dependency-based theories has come from computational linguistics. Dependency-based systems are increasingly being used to parse natural language and generate tree banks. Interest in dependency grammar is growing at present, the first international conference on dependency linguistics having taken place just recently (Depling 2011).

[edit] Dependency vs. constituency

Dependency is a one-to-one correspondence: For every element (e.g. word or morph) in the sentence, there is exactly one node in the structure of that sentence that corresponds to that element. The result of this one-to-one correspondence is that dependency grammars are word (or morph) grammars. All that exist are the elements and the dependencies that connect the elements into a structure. This situation should be compared with the constituency relation of phrase structure grammars. Constituency is a one-to-one-or-more correspondence, which means that given a sentence, for every element in that sentence, there are one or more nodes in the structure that correspond to that element. The result of this difference is that dependency structures are minimal[6] compared to their constituency structure counterparts, since they tend to contain many fewer nodes.

Dependency vs. constituency

These two trees illustrate just two possible ways to render the dependency and constituency relations (see below). The dependency tree is an "ordered" tree, i.e. it reflects actual word order. Many dependency trees abstract away from linear order and focus just on hierarchical order, which means they do not show actual word order. The constituency tree follows the conventions of bare phrase structure (BPS), whereby the words themselves are employed as the node labels.

The distinction between dependency- and constituency-based grammars derives in a large part from the initial division of the clause. The constituency relation derives from an initial binary division, whereby the clause is split into a subject noun phrase (NP) and a predicate verb phrase (VP). This division is certainly present in the basic analysis of the clause that we find in the works of, for instance, Leonard Bloomfield and Noam Chomsky. Tesnière, however, argued vehemently against this binary division, preferring instead to position the verb as the root of all clause structure. Tesnière's stance was that the subject-predicate division stems from term logic and has no place in linguistics.[7] The importance of this distinction is that if one acknowledges the initial subject-predicate division in syntax as something real, then one is likely to go down the path of constituency grammar, whereas if one rejects this division, then the only alternative is to position the verb as the root of all structure, which means one has chosen the path of dependency grammar.

[edit] Representing dependencies

The conventions vary that dependency grammars employ to represent dependencies. The following schemata (in addition to the tree above) illustrate some of these conventions:

Conventions for illustrating dependencies

The representations in (a-d) are trees, whereby the specific conventions employed in each tree vary. Solid lines are dependency edges and lightly dotted lines are projection lines. The only difference between tree (a) and tree (b) is that tree (a) employs the category class to label the nodes whereas tree (b) employs the words themselves as the node labels. Tree (c) is a reduced tree insofar as the string of words below and projection lines are deemed unnecessary and are hence omitted. Tree (d) abstracts away from linear order and reflects just hierarchical order. The arrows in (e) are an alternative convention used to show dependencies and are favored by Word Grammar. The brackets in (f) are seldom used, but are nevertheless quite capable of reflecting the dependency hierarchy; dependents appear enclosed in more brackets than their heads. And finally, the indentations like those in (g) is another convention that is sometimes employed to indicate the hierarchy of words. Dependents are placed underneath their heads and indented. Like tree (d), the indentations in (g) abstract away from linear order.

The point to these conventions is that they are just that, namely conventions. They do not influence the underlying nature of the hierarchical structures that dependency grammars acknowledge.

[edit] Strings and catenae

Tesnière emphasized that syntactic units are organized along two dimensions,[8] the horizontal (precedence) dimension and the vertical (dominance) dimension. In this respect, acknowledging dependency-based structures encourages one to isolate units in the one or the other dimension. Combinations of elements that are organized along the horizontal dimension alone are called strings (a common concept) and combinations of elements organized along the vertical dimension alone are called catenae (a novel concept).

[edit] Dependency grammars

The following frameworks are dependency-based:

Hybrid constituency/dependency tree from the Quranic Arabic Corpus

Link grammar is also based on the dependency relation, but link grammar does not include directionality in the dependencies between words, and thus does not describe head-dependent relationships. Hybrid dependency/constituency grammar uses dependencies between words, but also includes dependencies between phrasal nodes - see for example the Quranic Arabic Dependency Treebank. The derivation trees of Tree-Adjoining Grammar are dependency-based, although the full trees of TAG are constituency-based, so in this regard, it is not clear whether TAG should be viewed more as a dependency or constituency grammar.

There are major differences between the grammars just listed. In this regard, the dependency relation is compatible with other major tenets of theories of grammar. Thus like constituency grammars, dependency grammars can be mono- or multistratal, representational or derivational, construction- or rule-based.

[edit] Notes

  1. ^ Percival 1990
  2. ^ Percival 1976
  3. ^ e.g. Hays 1960, Gaifman 1965, Robinson 1970
  4. ^ e.g. Hudson 1984, Sgall, Hajičová et Panevova 1986, Mel’čuk 1988
  5. ^ e.g. Heringer 1996, Eroms 2000, Ágel et al. 2003/6
  6. ^ Osborne et al. 2011
  7. ^ Tesnière 1959:103-105
  8. ^ Tesnière 1959:16ff.

[edit] References

  • Ágel, V., Ludwig Eichinger, Hans-Werner Eroms, Peter Hellwig, Hans Heringer, and Hennig Lobin (eds.) 2003/6. Dependency and Valency: An International Handbook of Contemporary Research. Berlin: Walter de Gruyter.
  • Hans-Werner Eroms 2000. Syntax der deutschen Sprache. Berlin: de Gruyter.
  • Helbig, Gerhard and Joachim Buscha 1987. Deutsche Grammatik. Ein Handbuch für den Ausländerunterricht. Leipzig: VEB Verlag Enzyklopädie, 10. Auflage.
  • Heringer, Hans 1996. Deutsche Syntax dependentiell. Tübingen: Stauffenburg.
  • Hays, David 1960. Grouping and dependency theories. P-1910, RAND Corporation.
  • Hudson, Richard 1984. Word Grammar. Oxford: Blackwell.
  • Mel'cuk, Igor 1988. Dependency Syntax: Theory and Practice. Albany: SUNY Press.
  • Osborne, Timothy, Michael Putnam, and Thomas Gross 2011. Bare phrase structure, label-less structures, and specifier-less syntax: Is Minimalism becoming a dependency grammar? The Linguistic Review 28, 315-364.
  • Percival, Keith 1976. On the historical source of immediate-constituent analysis. In: Notes from the linguistic underground, James McCawley (ed.), Syntax and Semantics 7, 229-242. New York: Academic Press.
  • Percival, Keith 1990. Reflections on the history of dependency notions in linguistics. Historiographia Linguistica, 17, 29–47.
  • Robinson, Jane 1970. Dependency structures and transformational rules. Language 46, 259-285.
  • Sgall, Peter, Eva Hajičová, and J. Panevová 1986. The meaning of the sentence in its semantic and pragmatic aspects. Dordrecht: D. Reidel Publishing Company.
  • Tesnière, Lucien 1959. Éleménts de syntaxe structurale. Paris: Klincksieck.

[edit] See also

[edit] Implementations

  • Stanford parser A statistical phrase-structure parser which provides a tool to convert the output into a form of dependency graph called "Stanford Dependencies".
  • DeSR A statistical dependency shift/reduce dependency parser.
  • RelEx generates a dependency parse for the English language, by applying graph rewriting to the output of the link grammar parser. Open source license.
  • XDG Development Kit An Integrated Development Environment for Extensible Dependency Grammar (XDG)
  • TULE A linguistic framework that takes a natural language sentence in input (Italian) and returns a full dependency tree describing its syntactic structure.

[edit] External links

Personal tools
Namespaces
Variants
Actions
Navigation
Interaction
Toolbox
Print/export
Languages