Constituent (linguistics)

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In syntactic analysis, a constituent is a word or a group of words that functions as a single unit within a hierarchical structure. The analysis of constituent structure is associated mainly with phrase structure grammars, although dependency grammars also allow sentence structure to be broken down into constituent parts. The constituent structure of sentences is identified using constituency tests. These tests manipulate some portion of a sentence and based on the result, clues are delivered about that constituent structure of the sentence.

Contents

[edit] Constituency tests

Constituency tests are diagnostics employed to identify the constituent structure of sentences.[1] There are numerous constituency tests applied to English sentences, many of which are listed here: 1. topicalization (=fronting), 2. clefting, 3. pseudoclefting, 4. pro-form substitution, 5. answer fragments, 6. passivization, 7. omission, 8. coordination, etc. These tests are rough-and-ready tools that grammarians employ to reveal clues about syntactic structure. A word of caution is warranted when employing these tests, since they often deliver contradictory results. Some syntacticians even arrange the tests on a scale of reliability, with less-reliable tests treated as useful to confirm constituency though not sufficient on their own[2]. Failing to pass a single test does not mean that the unit is not a constituent, and conversely, passing a single test does not mean necessarily that the unit is a constituent. It is best to apply as many tests as possible to a given unit in order to prove or to rule out its status as a constituent.

[edit] Topicalization (fronting)

Topicalization involves moving the test sequence to the front of the sentence. It is a simple movement operation:

He is going to attend another language course to improve his English.
To improve his English, he is going to attend another course.

[edit] Clefting

Clefting involves placing a sequence of words X within the structure beginning with It is/was: It was X that...

She bought a pair of gloves with silk embroidery.
It was a pair of gloves with silk embroidery that she bought.

[edit] Pseudoclefting

Pseudoclefting (also preposing) is similar to clefting in that it puts emphasis on a certain phrase in a sentence. It involves inserting a sequence of words before is/are what or is/are who:

She bought a pair of gloves with silk embroidery.
A pair of gloves with silk embroidery is what she bought.

[edit] Pro-form substitution (replacement)

Pro-form substitution, or replacement, involves replacing the test constituent with the appropriate pro-form (e.g. pronoun). Substitution normally involves using definite pro-form like it, he, there, here, etc. in place of a phrase or a clause. If such a change yields a grammatical sentence where the general structure has not been altered, then the test sequence is a constituent:

I don't know the man who is sleeping in the car.
*I don't know him who is sleeping in the car. (ungrammatical)
I don't know him.

The ungrammaticality of the first changed version and the grammaticality of the second one demonstrates that the whole sequence, the man who is sleeping in the car, and not just the man is a constituent functioning as a unit.

[edit] Answer fragments (question test)

The answer fragments test refers to the ability of a sequence of words to stand alone as a reply to a question. It is often used to test the constituency of a verbal phrase but can also be applied to other phrases:

What did you do yesterday? - Worked on my new project.
What did you do yesterday? - *Worked on. (unacceptable, so worked on is not a constituent).

Linguists do not agree whether passing the answer fragment test is sufficient, though at a minimum they agree that it can help confirm the results of another constituency test[2].

[edit] Passivization

Passivization involves changing an active sentence to a passive sentence, or vice versa. The object of the active sentence is changed to the subject of the corresponding passive sentence:

A car driving at breakneck speed nearly hit the little dog.
The little dog was nearly hit by a car driving at breakneck speed.

In case passivization results in a grammatical sentence, the phrases that have been moved can be regarded as constituents.

[edit] Omission (deletion)

Omission checks whether a sequence of words can be omitted without influencing the grammaticality of the sentence — in most cases, local or temporal adverbials can be safely omitted and thus qualify as constituents.

Fred relaxes at night on his couch.
Fred relaxes on his couch.
Fred relaxes at night.

Since they can be omitted, the prepositional phrases at night and on his couch are constituents.

[edit] Coordination

The coordination test assumes that only constituents can be coordinated, i.e., joined by means of a coordinator such as "and", e.g.

He enjoys writing sentences and reading them.
He enjoys writing and she enjoys reading sentences.
He enjoys but she hates writing sentences.

Based on the fact that writing sentences and reading them are coordinated using and, one can conclude that they are constituents. The validity of the coordination test is challenged by additional data, however. The next two sentences suggest that the sequences in bold should be understood as constituents. Most grammars do not view sequences such as He enjoys to the exclusion of the VP writing sentences as a constituent. Thus while the coordination test is widely employed as a diagnostic for constituent structure, it is faced with major difficulties and is therefore perhaps the least reliable of all the tests mentioned.

[edit] Constituency tests and disambiguation

Syntactic ambiguity characterizes sentences which can be interpreted in different ways depending solely on how one perceives syntactic connections between words and arranges them into phrases. Possible interpretations of the sentence They killed the man with a gun:

'The man was shot'.
'The man who was killed had a gun with him'.

The ambiguity of this sentence results from two possible arrangements into constituents:

They killed [the man] [with a gun].
They killed [the man with a gun].

In the first sentence, with a gun is an independent constituent with instrumental meaning. In the second sentence, it is embedded into the noun phrase the man with a gun and is modifying the noun man. The autonomy of the unit with a gun in the first interpretation can be tested by the answer fragment test:

How did they kill the man? - With a gun.

However, the same test can be used to prove that the man with a gun in the second sentence should be treated as a unit:

Who(m) did they kill? - The man with a gun.

The ability of constituency tests to disambiguate certain sentence in this manner bears witness to their utility. Most if not all syntacticians employ constituency tests in some form or another to arrive at the structures that they assign to sentences.

[edit] Competing theories

Alternate theoretical approaches to syntax make different assumptions regarding what is considered a constituent. In mainstream phrase structure grammar (and its derivatives), individual words are constituents in and of themselves as well as being parts of other constituents, whereas in dependency grammar[3] certain core words in each phrase are not a constituent by themselves, but only members of a phrasal constituent. The following trees show the same sentence in two different theoretical representations, with a phrase structure representation on the left and a dependency grammar representation on the right. In both trees, a constituent is understood to be the entire tree or any complete labelled subtree (a node plus all the nodes dominated by that node)—note that words like killed and with, for instance, form subtrees (and are considered constituents) in the phrase structure representation but not in the dependency structure representation.

Illustrating constituency and dependency

[edit] See also

[edit] Notes

  1. ^ See for instance Burton-Roberts 1997:7–23 and Carnie 2002:51-53.
  2. ^ a b April 22, 2006 Language Log posting by Eric Bakovic of University of California, San Diego
  3. ^ See Ágel, et al. (eds.) 2003/2006.

[edit] References

  • Ágel, V., L. Eichinger, H.-W. Eroms, P. Hellwig, H. Heringer, and H. Lobin (eds.) 2003/6. Dependency and valency: An international handbook of contemporary research. Berlin: Walter de Gruyter.
  • Burton-Roberts, N. 1997. Analysing sentences: An introduction to English syntax. 2nd Edition. Longman.
  • Carnie, A. 2002. Syntax: A generative introduction. Oxford: Blackwell.
Personal tools
Namespaces
Variants
Actions
Navigation
Interaction
Toolbox
Print/export
Languages