- Regression
- Want to draw the line through the scatterplot that minimizes the error in prediction
- Intuitively
- Least squares regression line - prediction line that minimizes Σ(Y − Y')², the sum of squared prediction errors
Constructing the Least Squares Regression Line: Regression of Y on X
Remember the line formula, Y' = bX + a: we want the b and a that minimize Σ(Y − Y')²
bY = Σ(X − X̄)(Y − Ȳ) / Σ(X − X̄)²
aY = Ȳ − bY·X̄ (X̄ and Ȳ are the means of X and Y)
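A minimal sketch of the regression of Y on X, using the standard deviation-score form of the least squares slope and intercept. The data and the function name `regress_y_on_x` are made up for illustration and are not from the notes.

```python
def regress_y_on_x(xs, ys):
    """Return (b_y, a_y) for the least squares line Y' = b_y * X + a_y."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross products of deviation scores
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sum of squared deviations of X
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    b_y = sp / ss_x                # slope that minimizes the squared errors in Y
    a_y = mean_y - b_y * mean_x    # the line passes through (mean X, mean Y)
    return b_y, a_y


xs = [1, 2, 3, 4, 5]   # hypothetical X scores
ys = [2, 4, 5, 4, 5]   # hypothetical Y scores
b_y, a_y = regress_y_on_x(xs, ys)
print(f"Y' = {b_y:.2f}X + {a_y:.2f}")
```

For these five made-up points the slope comes out at about 0.6 and the intercept at about 2.2.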
Regression of X on Y
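The same equation does not work for predicting X from Y
Because the Y-on-X line minimizes errors in predicting Y, Σ(Y − Y')², while predicting X requires minimizing Σ(X − X')²
So, X' = bX·Y + aX, with bX = Σ(X − X̄)(Y − Ȳ) / Σ(Y − Ȳ)² and aX = X̄ − bX·Ȳ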
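To illustrate why the two regressions differ, the sketch below fits both lines to the same made-up data and compares the X-on-Y slope with the slope obtained by simply solving the Y-on-X line for X. The helper `least_squares` and the data are hypothetical, chosen only for illustration.

```python
def least_squares(predictor, criterion):
    """Slope and intercept of the line predicting criterion from predictor."""
    n = len(predictor)
    mean_p = sum(predictor) / n
    mean_c = sum(criterion) / n
    sp = sum((p - mean_p) * (c - mean_c) for p, c in zip(predictor, criterion))
    ss_p = sum((p - mean_p) ** 2 for p in predictor)
    slope = sp / ss_p
    return slope, mean_c - slope * mean_p


xs = [1, 2, 3, 4, 5]   # hypothetical X scores
ys = [2, 4, 5, 4, 5]   # hypothetical Y scores

b_y, a_y = least_squares(xs, ys)   # Y' = b_y * X + a_y, minimizes errors in Y
b_x, a_x = least_squares(ys, xs)   # X' = b_x * Y + a_x, minimizes errors in X

# Algebraically inverting the Y-on-X line gives slope 1 / b_y,
# which is not the least squares slope for predicting X.
inverted_slope = 1 / b_y

print(f"Y on X: Y' = {b_y:.3f}X + {a_y:.3f}")
print(f"X on Y: X' = {b_x:.3f}Y + {a_x:.3f}")
print(f"Slope from inverting the Y-on-X line: {inverted_slope:.3f}")
```

The two lines coincide only when the correlation is perfect (r = ±1); otherwise each direction of prediction needs its own line.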
The Standard Error of Estimate - average deviation of prediction errors about the regression line
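Like a standard deviation, but computed about the regression line instead of the mean
the regression line acts as a floating mean: deviations are taken from Y' rather than from the mean of Y
sum of prediction errors = 0, therefore must square them
must divide by N-2 (two degrees of freedom are lost in estimating b and a)
Then take the square root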
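Computational Formula: sY|X = √( [SSY − (SP)² / SSX] / (N − 2) ), where SSY = ΣY² − (ΣY)²/N, SSX = ΣX² − (ΣX)²/N, and SP = ΣXY − (ΣX)(ΣY)/N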
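The steps for the standard error of estimate can be sketched as follows, on made-up data. The sketch computes it twice: once from the definition (square the prediction errors, divide by N − 2, take the square root) and once from a raw-score shortcut, √((SS_Y − SP²/SS_X)/(N − 2)). The shortcut is one algebraically equivalent form and is an assumption here, not necessarily the exact formula the notes intended.

```python
import math

xs = [1, 2, 3, 4, 5]   # hypothetical X scores
ys = [2, 4, 5, 4, 5]   # hypothetical Y scores
n = len(xs)

# Raw-score sums of squares and sum of cross products
ss_x = sum(x * x for x in xs) - sum(xs) ** 2 / n
ss_y = sum(y * y for y in ys) - sum(ys) ** 2 / n
sp = sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys) / n

# Least squares line for predicting Y from X
b_y = sp / ss_x
a_y = sum(ys) / n - b_y * sum(xs) / n

# Prediction errors Y - Y'; they sum to zero (up to rounding),
# which is why they are squared before averaging.
errors = [y - (b_y * x + a_y) for x, y in zip(xs, ys)]
error_sum = sum(errors)

# Definitional form: squared errors, divided by N - 2, then square root
se_definitional = math.sqrt(sum(e ** 2 for e in errors) / (n - 2))

# Shortcut form that avoids computing each predicted score
se_computational = math.sqrt((ss_y - sp ** 2 / ss_x) / (n - 2))

print(f"sum of errors: {error_sum:.6f}")
print(f"definitional:  {se_definitional:.4f}")
print(f"computational: {se_computational:.4f}")
```

Both forms give about 0.894 for these five points.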
Homoscedasticity - assumed variability of Y remains constant across all values of X
Interpretation
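Larger SE means larger prediction errors on average: the points scatter more widely about the regression line and prediction is less accurate
Standard deviation and percentages - if the prediction errors are normally distributed, about 68% of the Y scores fall within ±1 SE of the regression line and about 95% within ±2 SE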
Considerations
Relationship of Regression and Correlation
r = regression slope for standard scores
Can derive bY and bX from r: bY = r(sY/sX) and bX = r(sX/sY)
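A short numerical check of the link between r and the regression slopes, again on made-up data. It assumes the usual Pearson r and standard deviations computed with N − 1; the slope relations hold equally with N, as long as the same divisor is used for both variables.

```python
import math

xs = [1, 2, 3, 4, 5]   # hypothetical X scores
ys = [2, 4, 5, 4, 5]   # hypothetical Y scores
n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
ss_x = sum((x - mean_x) ** 2 for x in xs)
ss_y = sum((y - mean_y) ** 2 for y in ys)

r = sp / math.sqrt(ss_x * ss_y)     # Pearson correlation
s_x = math.sqrt(ss_x / (n - 1))     # standard deviation of X
s_y = math.sqrt(ss_y / (n - 1))     # standard deviation of Y

# Slopes computed directly by least squares
b_y = sp / ss_x   # regression of Y on X
b_x = sp / ss_y   # regression of X on Y

# The same slopes derived from r
b_y_from_r = r * s_y / s_x
b_x_from_r = r * s_x / s_y

# Regression slope on standard (z) scores equals r
z_x = [(x - mean_x) / s_x for x in xs]
z_y = [(y - mean_y) / s_y for y in ys]
z_slope = sum(zx * zy for zx, zy in zip(z_x, z_y)) / sum(zx ** 2 for zx in z_x)

print(f"r = {r:.4f}, z-score slope = {z_slope:.4f}")
print(f"b_y = {b_y:.4f}, from r = {b_y_from_r:.4f}")
print(f"b_x = {b_x:.4f}, from r = {b_x_from_r:.4f}")
```

A side effect of these relations is that the product of the two slopes equals r².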