COMMON MISTAKES IN USING STATISTICS: Spotting and Avoiding Them
Misinterpreting the Overall F-Statistic in Regression
Most software includes an "overall F-statistic" and its corresponding
p-value in the output for a least squares regression. This is the
statistic for the hypothesis test with null hypothesis
H0: All non-constant coefficients in the regression equation are zero
and alternate hypothesis
Ha: At least one of the non-constant coefficients in the regression equation is non-zero.
More explicitly, if Y is the response variable, the predictors are X1, X2, ... , Xm, and the model equation[1] assumed for the regression is
(*)   E(Y | X1, X2, ... , Xm) = β0 + β1X1 + β2X2 + ... + βmXm,
then the null and alternate hypotheses for this F-test are
H0: β1 = β2 = ... = βm = 0
and
Ha: At least one of β1, β2, ... , βm is non-zero.
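For instance, here is a minimal sketch (in Python with the statsmodels library, one of many packages that report this statistic, using made-up data and variable names) of where the overall F-statistic and its p-value appear in regression output:

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))                   # three hypothetical predictors X1, X2, X3
y = 2.0 + 1.5 * X[:, 0] + rng.normal(size=50)  # simulated response

model = sm.OLS(y, sm.add_constant(X)).fit()
print(model.fvalue)    # overall F-statistic for H0: β1 = β2 = β3 = 0
print(model.f_pvalue)  # its p-value

Other software reports the same two numbers, usually in an "ANOVA" or "model summary" table.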
Misinterpreting the output for this hypothesis test is a common error in regression; two types of mistakes occur frequently.
First type of mistake: Assuming that if the output for this hypothesis test has a small p-value, then the regression equation fits the data well.
Second type of mistake: Assuming that if the output for this hypothesis test does not show statistical significance, then Y does not depend on the variables X1, X2, ... , Xm.
Both mistakes are based on neglecting a model assumption, namely the assumption expressed by (*): that the conditional mean E(Y | X1, X2, ... , Xm) is a linear function of the variables X1, X2, ... , Xm.
Examples of each type of mistake:
1. The following graph shows DC output vs. wind speed for a windmill. Running a regression with model assumption
E(DC output | wind speed) = β0 + β1 × (wind speed)
gives overall F-statistic 160.257 with 1 degree of freedom, and corresponding p-value 7.5455E-12, which is certainly statistically significant.
However, the data clearly have a curved pattern, so a model equation expressing a suitable curved relationship will fit better than a linear model equation. (For a good way to do this, see Example 3 of Overinterpreting High R².) All that the F-statistic says is that we have strong evidence that the best-fitting line has non-zero slope (which is pretty clear from the picture anyhow).
Of course, in a case with several predictor variables, it is typically difficult (if not impossible) to tell in advance whether or not a linear model fits. Thus, unless there is other evidence that a linear model does fit, all that a statistically significant F-test can say is that the data give evidence that the best-fitting linear model of the type specified has at least one predictor with a non-zero coefficient.
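To make the first type of mistake concrete, here is a rough sketch in Python (with simulated data loosely imitating the windmill example above, not the actual data): the overall F-test is overwhelmingly significant even though the fitted straight line misses the curvature.

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
wind_speed = np.linspace(2.5, 10.0, 25)
# Hypothetical curved relationship plus noise (not the real windmill data)
dc_output = 2.3 - 5.0 / wind_speed + rng.normal(scale=0.1, size=wind_speed.size)

fit = sm.OLS(dc_output, sm.add_constant(wind_speed)).fit()
print(fit.fvalue, fit.f_pvalue)  # large F, tiny p-value: the line has non-zero slope
print(fit.resid)                 # residuals still show a systematic curved pattern

Plotting the residuals against wind speed (or against the fitted values) makes the lack of fit visible even though the F-test is highly significant.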
One method that sometimes works to get around this problem is to (attempt to) transform the variables to have a multivariate normal distribution, then work with the transformed variables. This will ensure that the conditional means are a linear function of the transformed explanatory variables, no matter which subset of explanatory variables is chosen. Such a transformation is sometimes possible with some variant of a Box-Cox transformation procedure. See, e.g., pp. 236 and 324-329 of Cook and Weisberg's text[2] for more details.
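As a very rough sketch (Python with scipy; transforming each variable separately, as below, is only a crude stand-in for the multivariate procedure Cook and Weisberg describe, and the data are made up):

import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
x = rng.lognormal(mean=0.0, sigma=0.5, size=200)  # a skewed, strictly positive predictor

# Box-Cox estimates a power lambda that makes x as close to normal as possible
x_transformed, lam = stats.boxcox(x)
print(lam)  # the estimated Box-Cox exponent; apply the analogous step to each variable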
2. The following graph shows data and the computed regression line. The fitted regression line is y = 0.35 + 0·x. The overall F-statistic is essentially 0, giving a p-value of essentially 1. However, the data are constructed so that y depends on x exactly: y = x². Thus there is a strong dependence of y on x, but the F-test for the linear model does not detect it at all.
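A small sketch in Python reproduces the flavor of this example (the exact x-values in the original graph are not specified, so the numbers below are illustrative):

import numpy as np
import statsmodels.api as sm

x = np.linspace(-1, 1, 21)
y = x ** 2                       # exact, strongly nonlinear dependence of y on x

fit = sm.OLS(y, sm.add_constant(x)).fit()
print(fit.params)    # intercept near the mean of y, slope essentially 0
print(fit.f_pvalue)  # essentially 1: the F-test for the linear model sees nothing

Plotting y against x, or adding x² as a predictor, immediately reveals the dependence that the linear F-test misses.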
Notes:
1. In the expression used above, E(Y | X1, X2, ... , Xm) refers to the mean of the conditional distribution of Y given X1, X2, ... , Xm; see also Overfitting.
Depending on the notation used, the model equation might be expressed in different ways, for example as
Y = β0 + β1X1 + β2X2 + ... + βmXm + ε
or as
yi = β0 + β1xi1 + β2xi2 + ... + βmxim + εi
2. Cook, R. D. and Weisberg, S. (1999), Applied Regression Including Computing and Graphics, Wiley.
Last updated June 13, 2014