Tukey's test of additivity

In statistics, Tukey's test of additivity,^[1] named for John Tukey, is an approach used in two-way ANOVA (regression analysis involving two qualitative factors) to assess whether the factor variables (categorical variables) are additively related to the expected value of the response variable. It can be applied when there are no replicated values in the data set, a situation in which it is impossible to directly estimate a fully general non-additive regression structure and still have information left to estimate the error variance. The test statistic proposed by Tukey has one degree of freedom under the null hypothesis, hence this is often called "Tukey's one-degree-of-freedom test."

Introduction

The most common setting for Tukey's test of additivity is a two-way factorial analysis of variance (ANOVA) with one observation per cell. The response variable Y_ij is observed in a table of cells with the rows indexed by i = 1,..., m and the columns indexed by j = 1,..., n. The rows and columns typically correspond to various types and levels of treatment that are applied in combination.

The additive model states that the expected response can be expressed as EY_ij = μ + α_i + β_j, where μ, the α_i, and the β_j are unknown constants. Under the usual identifiability constraints Σ_i α_i = Σ_j β_j = 0, the unknown model parameters are usually estimated as

{\widehat {\mu }}={\bar {Y}}_{\cdot \cdot }

{\widehat {\alpha }}_{i}={\bar {Y}}_{i\cdot }-{\bar {Y}}_{\cdot \cdot }

{\widehat {\beta }}_{j}={\bar {Y}}_{\cdot j}-{\bar {Y}}_{\cdot \cdot }

where Ȳ_i• is the mean of the ith row of the data table, Ȳ_•j is the mean of the jth column of the data table, and Ȳ_•• is the overall mean of the data table.
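The estimators above amount to a few averages over the data table. The following is a minimal Python sketch, assuming the table is stored as a list of rows; the function name `additive_fit` and the 3 × 3 table are made up for illustration and do not come from the source.

```python
def additive_fit(table):
    """Return (mu_hat, alpha_hat, beta_hat) for an m x n table with one value per cell."""
    m, n = len(table), len(table[0])
    grand_mean = sum(sum(row) for row in table) / (m * n)
    row_means = [sum(row) / n for row in table]
    col_means = [sum(table[i][j] for i in range(m)) / m for j in range(n)]
    alpha_hat = [r - grand_mean for r in row_means]  # row effects
    beta_hat = [c - grand_mean for c in col_means]   # column effects
    return grand_mean, alpha_hat, beta_hat


# Made-up 3 x 3 table, used only for illustration.
Y = [[1, 0, 2],
     [0, 4, 5],
     [2, 5, 8]]
mu_hat, alpha_hat, beta_hat = additive_fit(Y)
print(mu_hat, alpha_hat, beta_hat)
# prints: 3.0 [-2.0, 0.0, 2.0] [-2.0, 0.0, 2.0]
```

By construction the estimated row effects and the estimated column effects each sum to zero.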

The additive model can be generalized to allow for arbitrary interaction effects by setting EY_ij = μ + α_i + β_j + γ_ij. However, after fitting the natural estimator of γ_ij,

{\widehat {\gamma }}_{ij}=Y_{ij}-({\widehat {\mu }}+{\widehat {\alpha }}_{i}+{\widehat {\beta }}_{j}),

the fitted values

{\widehat {Y}}_{ij}={\widehat {\mu }}+{\widehat {\alpha }}_{i}+{\widehat {\beta }}_{j}+{\widehat {\gamma }}_{ij}\equiv Y_{ij}

fit the data exactly. Thus there are no remaining degrees of freedom to estimate the variance σ², and no hypothesis tests about the γ_ij can be performed.
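The degrees-of-freedom bookkeeping behind this statement can be written out as follows. This is the standard count for an m × n table with one observation per cell, given here only as a worked illustration:

mn=\underbrace {1} _{\mu }+\underbrace {(m-1)} _{\alpha _{i}}+\underbrace {(n-1)} _{\beta _{j}}+\underbrace {(m-1)(n-1)} _{\gamma _{ij}}

The general interaction model therefore uses all mn observations, leaving mn − mn = 0 degrees of freedom for error.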

Tukey therefore proposed a more constrained interaction model of the form

\operatorname {E} Y_{ij}=\mu +\alpha _{i}+\beta _{j}+\lambda \alpha _{i}\beta _{j}

By testing the null hypothesis that λ = 0, we are able to detect some departures from additivity based only on the single parameter λ.

Method

To carry out Tukey's test, set

SS_{A}\equiv n\sum _{i}({\bar {Y}}_{i\cdot }-{\bar {Y}}_{\cdot \cdot })^{2}

SS_{B}\equiv m\sum _{j}({\bar {Y}}_{\cdot j}-{\bar {Y}}_{\cdot \cdot })^{2}

SS_{AB}\equiv {\frac {(\sum _{ij}Y_{ij}({\bar {Y}}_{i\cdot }-{\bar {Y}}_{\cdot \cdot })({\bar {Y}}_{\cdot j}-{\bar {Y}}_{\cdot \cdot }))^{2}}{\sum _{i}({\bar {Y}}_{i\cdot }-{\bar {Y}}_{\cdot \cdot })^{2}\sum _{j}({\bar {Y}}_{\cdot j}-{\bar {Y}}_{\cdot \cdot })^{2}}}

SS_{T}\equiv \sum _{ij}(Y_{ij}-{\bar {Y}}_{\cdot \cdot })^{2}

SS_{E}\equiv SS_{T}-SS_{A}-SS_{B}-SS_{AB}

Then use the following test statistic:^[2]

{\frac {SS_{AB}/1}{MS_{E}}}.

Here MS_E = SS_E / q is the error mean square, where q = mn − (m + n) = (m − 1)(n − 1) − 1 is the degrees of freedom for estimating the error variance. Under the null hypothesis, the test statistic has an F distribution with 1 and q degrees of freedom.
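The computation can be sketched in plain Python as below. This is an illustrative sketch, not a reference implementation: the function name `tukey_additivity_test` and the 3 × 3 table are hypothetical, and MS_E is taken to be SS_E / q, the usual error mean square.

```python
def tukey_additivity_test(table):
    """Return sums of squares, F statistic and degrees of freedom for Tukey's test."""
    m, n = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (m * n)
    # Deviations of row means and column means from the overall mean.
    a = [sum(row) / n - grand for row in table]
    b = [sum(table[i][j] for i in range(m)) / m - grand for j in range(n)]
    sum_a2 = sum(x * x for x in a)
    sum_b2 = sum(x * x for x in b)

    ss_a = n * sum_a2
    ss_b = m * sum_b2
    cross = sum(table[i][j] * a[i] * b[j] for i in range(m) for j in range(n))
    # Undefined (division by zero) if all row means or all column means coincide.
    ss_ab = cross ** 2 / (sum_a2 * sum_b2)
    ss_t = sum((table[i][j] - grand) ** 2 for i in range(m) for j in range(n))
    ss_e = ss_t - ss_a - ss_b - ss_ab

    q = m * n - (m + n)   # error degrees of freedom
    ms_e = ss_e / q       # assumption: MS_E = SS_E / q
    return {"SS_A": ss_a, "SS_B": ss_b, "SS_AB": ss_ab, "SS_T": ss_t,
            "SS_E": ss_e, "F": ss_ab / ms_e, "df": (1, q)}


# Made-up 3 x 3 table, used only for illustration.
Y = [[1, 0, 2],
     [0, 4, 5],
     [2, 5, 8]]
result = tukey_additivity_test(Y)
print(result["SS_AB"], result["SS_E"], result["F"], result["df"])
# prints: 6.25 3.75 5.0 (1, 3)
```

The statistic is then compared with the upper tail of the F distribution with 1 and q degrees of freedom. If SciPy is available, `scipy.stats.f.sf(result["F"], 1, 3)` would give the corresponding p-value for this example.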

References

[1] Tukey, John (1949). "One degree of freedom for non-additivity". Biometrics. 5 (3): 232–242. doi:10.2307/3001938. JSTOR 3001938.
[2] Alin, A.; Kurt, S. (2006). "Testing non-additivity (interaction) in two-way ANOVA tables with no replication". Statistical Methods in Medical Research. 15: 63–85.
