Jump to content

Kitchen sink regression

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Marcocapelle (talk | contribs) at 20:37, 18 November 2016 (removed Category:Regression analysis; added Category:Regression variable selection using HotCat). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Pejoratively, a kitchen sink regression is a statistical regression which uses a long list of possible independent variables to attempt to explain variance in a dependent variable. In economics, psychology, and other social sciences, regression analysis is typically used deductively to test hypotheses, but a kitchen sink regression does not follow this norm. Instead, the analyst throws "everything but the kitchen sink" into the regression in hopes of finding some statistical pattern.[citation needed]

The results of this type of regression may be misleadingly interpreted inductively to suggest that the same pattern of relationships between independent and dependent variables will be found in other data, which can lead to hasty generalizations. The difficulty in valid interpretation is that the more independent variables are included in a regression, the greater is the possibility that one or more will be found to be statistically significant while in fact they have no causal effect on the dependent variable—that is, the more likely the results are to be afflicted with Type I error.[citation needed]

The kitchen sink regression is an example of the practice of data dredging.[citation needed]

References

  • Barreto and Howland (2005). "Chapter 17: Joint Hypothesis Testing". Introductory Econometrics: Using Monte Carlo Simulation with Microsoft Excel. Cambridge University Press. ISBN 0-521-84319-7.