Linear bivariate regression: One group, size of slope

A linear regression is used to estimate the parameters a, b of a linear relationship Y = a + bX between the dependent variable Y and the independent variable X. X is assumed to be a set of fixed values, whereas Yi is modeled as a random variable: Yi = a + bXi + εi, where εi denotes normally distributed random errors with mean 0 and standard deviation σi. A common assumption also adopted here is that all σi's are identical, that is σi = σ. The standard deviation of the error is also called the standard deviation of the residuals.

A common task in linear regression analysis is to test whether the slope b is identical to a fixed value b0 or not. The null and the two-sided alternative hypotheses are:

H0 : bb0 = 0
H1 : bb0 ≠ 0.

Effect size index

Slope H1, the slope b of the linear relationship assumed under H1 is used as effect size measure. To fully specify the effect size, the following additional inputs must be given:

Slope H0.

This is the slope b0 assumed under H0.

Std dev σ x.

This is the standard deviation σx of the values in X and must be > 0.
Linear-regression-size-of-slope-one-group-e01.png

Std dev σ y.

This is the standard deviation σy > 0 of the Y values. Important relationships of σy to other relevant measures are:

σy = (bσx)/ρ
σy = σ/√(1 − ρ2)
where σ denotes the standard deviation of the residuals Yi − (aX + b) and ρ the correlation coefficient between X and Y.

Pressing the Determine button on the left side of the effect size label in the main window opens the effect size drawer. The effect size drawer may be used to determine the standard deviation σy > 0 of the Y values (Std dev σ y) and/or the slope b of the linear relationship assumed under H1 (Slope H1) from other values based on the two equations given above.

Linear-regression-size-of-slope-one-group-effect-size-drawer-1.png

Below you see the effect size drawer with different combinations of input and output values in different input modes. The input variables are placed on the left side of the arrow (=>), the output variables on the right side. The input values must conform to the usual restrictions, that is, σ > 0, σx > 0, σy > 0, −1 < ρ < 1. In addition, σy = (bσx)/ρ (see above) together with the restriction on ρ implies the additional restriction −1 < b · σxy < 1.

Linear-regression-size-of-slope-one-group-effect-size-drawer-2.png

Clicking on the Calculate and transfer to main window button copies the values given in Std dev σ_x, Std dev σ_y, and Slope H1 to the corresponding input fields in the main window.

Options

This test has no options.

Examples

We replicate an example given on page 593 in Dupont and Plummer (1998). The question investigated in this example is whether the actual average time spent per day exercising is related to the body mass index (BMI) after 6 months on a training program.

The estimated standard deviation of exercise time of participants is σx = 7.5 minutes. From a previous study the standard deviation of the BMI in the group of participants is estimated to be σy = 4. The sample size is N = 100.  A standard level of α = 0.05 is assumed. We want to determine the power with which a slope b0 = −0.0667 of the regression line (corresponding to a drop of BMI by 2 per 30 min/day exercise) can be detected.

Select

Type of power analysis: Post hoc

Input

Tail(s): Two
Slope H1: -0.0667
α err prob: 0.05
Total sample size: 100
Slope H0: 0
Std dev σ x: 7.5
Std dev σ y: 4

Output

Noncentrality parameter δ: 1.260522
Critical t : -1.984467
Denominator df: 98
Power (1- β ): 0.238969

The output shows that the power of this test is about 0.24. This confirms the value estimated by Dupont and Plummer (1998, p. 596) for this example.

Relation to Multiple Regression: Omnibus

The present procedure is a special case of the Multiple Regression procedure, or better: a different interface to the same procedure using more convenient variables. To show this, we demonstrate how the Multiple Regression: Omnibus (R2 deviation from zero) procedure can be used to compute results of the example above.

First, we determine R2 = ρ2 from the relation b = ρ · σyx, which implies ρ2 = (b · σxy)2 . Entering (-0.0667*7.5/4)2 into the G*Power calculator gives ρ2 = 0.01564062. Then we perform these selections and inputs:

Select

Multiple Regression: Omnibus (R2 deviation from zero)
Type of power analysis: Post hoc

Input

Effect size f2: 0.01588914 (calculated from ρ2 = 0.01564062 using the effect size drawer)
α err prob: 0.05
Total sample size: 100
Number of  predictors: 1

Output

Noncentrality parameter λ: 1.588915
Critical F : 3.938111
Numerator df: 1
Denominator df: 98
Power (1- β ): 0.238969
Thus, we get exactly the same power that we got using the Linear Regression: Size of slope, one group procedure. Also note that λ = δ2 = 1.2605222 = 1.588915, and that F = t2 = -1.984467 = 3.93811.

Related tests

Correlation: Point biserial model
Linear multiple regression: R2 deviation from zero

Implementation notes

The H0 distribution is the central F distribution with numerator degrees of freedom df1 = 1 and denominator degrees of freedom df2 = N − 2, where N is the sample size. The H1 distribution is the noncentral F distribution with the same degrees of freedom and noncentrality parameter λ = N([σx(bb0)]/σ)2.

Validation

The results were checked against the values produced by by PASS (Hintze, 2006). Perfect correspondance was found.

References

Dupont, W. D., & Plummer, W. D., Jr. (1998). Power and sample size calculations for studies involving linear regression. Controlled Clinical Trials, 19, 589-601.

Hintze, J. (2006). NCSS, PASS, and GESS. Kaysville, Utah: NCSS.


    Freitag, 10. 02. 2012


gpicon-128.png

Questions about this website? Contact

Axel Buchner


Letzte Änderung: 29.06.2009, 16:45
Seitenende