In statistics, the Breusch–Pagan test (named after Trevor Breusch and Adrian Pagan) is used to test for heteroscedasticity in a linear regression model. It tests whether the estimated variance of the residuals from a regression are dependent on the values of the independent variables. In that case, we have heteroskedasticity in our model.
Suppose that we estimate the regression model
and obtain from this fitted model a set of values for , the residuals. Ordinary least squares constrains these so that their mean is 0 and so, given the assumption that their variance does not depend on the independent variables, an estimate of this variance can be obtained from the average of the squared values of the residuals. If the assumption is not held to be true, a simple model might be that the variance is linearly related to independent variables. Such a model can be examined by regressing the squared residuals on the independent variables, using an auxiliary regression equation of the form
This is the basis of the Breusch–Pagan test. If an F-test confirms that the independent variables are jointly significant then the null hypothesis of homoscedasticity can be rejected.
The Breusch–Pagan test tests for conditional heteroscedasticity. It is a chi-squared test:[clarification needed] the test statistic is nχ2 with k degrees of freedom. If the Breusch–Pagan test shows that there is conditional heteroscedasticity, the original regression can be corrected by using the Hansen method, using robust standard errors, or re-thinking the regression equation by changing and/or transforming independent variables.
Under the classical assumptions, including homoscedasticity, ordinary least squares is the best linear unbiased estimator (BLUE), i.e., it is unbiased and efficient. The efficiency is lost, however, in the presence of heteroscedastic disturbances. Before deciding upon an estimation method, one may conduct the Breusch–Pagan test to examine the presence of heteroscedasticity. The Breusch–Pagan test is based on models of the type for the variances of the observations where explain the difference in the variances. The null hypothesis is equivalent to the parameter restrictions:
This test is analogous to following the simple three-step procedure:
- Step 1: Apply OLS in the model
and compute the regression residuals.
- Step 2: Perform the auxiliary regression
- Step 3: The test statistic is the result of the coefficient of determination of the auxiliary regression in Step 2 and sample size with:
In Stata, one specifies the full regression, and then enters the command
estat hettest followed by all independent variables.
In SAS, breusch pagan can be obtained using the Proc Model option.
In Python, there is a method het_breushpagan in statsmodels.stats.diagnostic (the statsmodels package) for breusch-pagan test.
- Breusch, T. S.; Pagan, A. R. (1979). "A Simple Test for Heteroscedasticity and Random Coefficient Variation". Econometrica 47 (5): 1287–1294. JSTOR 1911963. MR 545960.
- Gujarati, Damodar N.; Porter, Dawn C. (2009). Basic Econometrics (Fifth ed.). New York: McGraw-Hill Irwin. pp. 385–86. ISBN 978-0-07-337577-9.
- Heij, C.; P. de Boer (2004). Econometric Methods with Applications in Business and Economics. Oxford University Press. pp. 334–353. ISBN 0-19-160840-8.
- Maddala, G. S.; Lahiri, Kajal (2009). Introduction to Econometrics (Fourth ed.). Chichester: Wiley. pp. 216–218. ISBN 978-0-470-01512-4.