Context-free language
In formal language theory, a context-free language is a language generated by some context-free grammar. The set of all context-free languages is identical to the set of languages accepted by pushdown automata.
Contents |
[edit] Examples
An archetypical context-free language is
, the language of all non-empty even-length strings, the entire first halves of which are a's, and the entire second halves of which are b's. L is generated by the grammar
, and is accepted by the pushdown automaton M = ({q0,q1,qf},{a,b},{a,z},δ,q0,z,{qf}) where δ is defined as follows:
δ(q0,a,z) = (q0,az)
δ(q0,a,a) = (q0,aa)
δ(q0,b,a) = (q1,ε)
δ(q1,b,a) = (q1,ε)
δ(q1,ε,z) = (qf,z)
δ(state1,read,pop) = (state2,push)
Context-free languages have many applications in programming languages; for example, the language of all properly matched parentheses is generated by the grammar
. Also, most arithmetic expressions are generated by context-free grammars.
[edit] Closure properties
Context-free languages are closed under the following operations. That is, if L and P are context-free languages, the following languages are context-free as well:
- the union
of L and P - the reversal of L
- the concatenation
of L and P - the Kleene star L * of L
- the image φ(L) of L under a homomorphism φ
- the image φ − 1(L) of L under an inverse homomorphism φ − 1
- the cyclic shift of L (the language
)
Context-free languages are not closed under complement, intersection, or difference. However, if L is a context-free language and D is a regular language then both their intersection
and their difference
are context-free languages.
[edit] Nonclosure under intersection and complement
The context-free languages are not closed under intersection. This can be seen by taking the languages
and
, which are both context-free. Their intersection is
, which can be shown to be non-context-free by the pumping lemma for context-free languages.
Context-free languages are also not closed under complementation, as for any languages A and B:
.
[edit] Decidability properties
The following problems are undecidable for arbitrary context-free grammars A and B:
- Equivalence: is L(A) = L(B)?
- is
? (However, the intersection of a context-free language and a regular language is context-free, so if B were a regular language, this problem becomes decidable.) - is L(A) = Σ * ?
- is
?
The following problems are decidable for arbitrary context-free languages:
- is
? - is L(A) finite?
- Membership: given any word w, does
? (membership problem is even polynomially decidable - see CYK algorithm and Earley's Algorithm)
[edit] Properties of context-free languages
- The reverse of a context-free language is context-free, but the complement need not be.
- Every regular language is context-free because it can be described by a context-free grammar.
- The intersection of a context-free language and a regular language is always context-free.
- There exist context-sensitive languages which are not context-free.
- To prove that a given language is not context-free, one may employ the pumping lemma for context-free languages.
[edit] Parsing
Determining an instance of the membership problem; i.e. given a string w, determine whether
where L is the language generated by some grammar G; is also known as parsing.
Formally, the set of all context-free languages is identical to the set of languages accepted by non-deterministic pushdown automata (NPDA).
Practical parser algorithms, i.e. software realisations of NPDA, include
For the deterministic subclass of context-free languages there are a number of well-known parsing methods.
See also parsing expression grammar as an alternative approach to grammar and parser.
[edit] See also
[edit] References
- Seymour Ginsburg (1966). The Mathematical Theory of Context-Free Languages. New York, NY, USA: McGraw-Hill, Inc..
- Michael Sipser (1997). Introduction to the Theory of Computation. PWS Publishing. ISBN 0-534-94728-X. Chapter 2: Context-Free Languages, pp. 91–122.
- Jean-Michel Autebert, Jean Berstel, Luc Boasson, Context-Free Languages and Push-Down Automata, in: G. Rozenberg, A. Salomaa (eds.), Handbook of Formal Languages, Vol. 1, Springer-Verlag, 1997, 111-174.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
of L and P
of L and P
)
? (However, the intersection of a context-free language and a regular language is context-free, so if
?
?
? (membership problem is even polynomially decidable - see