The equation for the line of regression of y on x is used to predict the value of y for a given value of x
It represents the best linear fit minimizing the sum of squared errors in the y-direction
The equation passes through the mean point \((\bar{x}, \bar{y})\) and has a slope \(b_{yx}\)
The equation is given by:
$$ (y - \bar{y}) = b_{yx} (x - \bar{x}) $$
The regression coefficient \(b_{yx}\) (slope of y on x) is calculated as:
$$ b_{yx} = r \frac{\sigma_y}{\sigma_x} $$
where \(r\) is the Pearson correlation coefficient between x and y, \(\sigma_y\) is the standard deviation of y, and \(\sigma_x\) is the standard deviation of x
Substituting the expression for the slope gives:
$$ y - \bar{y} = r \frac{\sigma_y}{\sigma_x} (x - \bar{x}) $$
This matches option (2)
Option (4) represents the regression line of x on y
Options (1) and (3) have incorrect slopes