Model specification

From Wikipedia, the free encyclopedia
  (Redirected from Specification (regression))
Jump to navigation Jump to search

In regression analysis, model specification is the process of developing a regression model. This process consists of selecting an appropriate functional form for the model and choosing which variables to include. For instance, one may specify the functional relationship between personal income and human capital, with the latter proxied by schooling and on-the-job experience as[1]

where is the unexplained error term that is supposed to be independent and identically distributed. If assumptions of the regression model are correct, the least squares estimates of the parameters and will be efficient and unbiased. Hence specification diagnostics usually involve testing the first to fourth moment of the residuals.[2]

Specification error and bias[edit]

Specification error occurs when the functional form or the choice of independent variables does not coincide with that of the true underlying process. In particular, bias (the expected value of the difference of an estimated parameter and the true underlying value) occurs if an independent variable is correlated with the errors inherent in the underlying process. There are several different causes of specification error:

Detection of misspecification[edit]

The Ramsey RESET test can help test for specification error.

Model building[edit]

Building a model involves finding a set of relationships that coincide with the underlying process that is generating the data. This requires avoiding all the sources of misspecification mentioned above. It is best to start with a model in general form that relies on a theoretical understanding of the underlying process; then the model can be fit to the data and checked for the various sources of misspecification, in the process called regression validation. Theoretical understanding can then guide the modification of the model in such a way as to retain theoretical validity while removing the sources of misspecification. But if it proves impossible to find a theoretically acceptable specification that fits the data, the theoretical model may have to be rejected and replaced with another one.

See also[edit]

References[edit]

  1. ^ This particular example is known as Mincer earnings function.
  2. ^ Long, J. Scott; Trivedi, Pravin K. (1993). "Some Specification Tests for the Linear Regression Model". In Bollen, Kenneth A.; Long, J. Scott. Testing Structural Equation Models. London: Sage. pp. 66–110. ISBN 0-8039-4506-X.
  3. ^ Untitled

Further reading[edit]

  • Asteriou, Dimitrios; Hall, Stephen G. (2011). "Misspecification: Wrong Regressors, Measurement Errors and Wrong Functional Forms". Applied Econometrics (Second ed.). London: Palgrave MacMillan. pp. 172–197.
  • Gujarati, Damodar N.; Porter, Dawn C. (2009). "Econometric Modeling: Model Specification and Diagnostic Testing". Basic Econometrics (Fifth ed.). New York: McGraw-Hill Irwin. pp. 467–522. ISBN 978-0-07-337577-9.
  • Kmenta, Jan (1986). Elements of Econometrics (Second ed.). New York: Macmillan. pp. 442–455. ISBN 0-02-365070-2.
  • Leamer, Edward E. (1978). Specification Searches: Ad hoc Inference with Nonexperimental Data. New York: Wiley. ISBN 0-471-01520-2.
  • Maddala, G. S.; Lahiri, Kajal (2009). "Diagnostic Checking, Model Selection, and Specification Testing". Introduction to Econometrics (Fourth ed.). Chichester: Wiley. pp. 401–449. ISBN 978-0-470-01512-4.