Question
Question: The two lines of regressions are \(x + 2y - 5 = 0\) and \(2x + 3y - 8 = 0\) and the variance of \(x\...
The two lines of regressions are x+2y−5=0 and 2x+3y−8=0 and the variance of x is 12. Find the variance of y and the coefficient of correlation.
Solution
Here, it is given the regressions of two lines and the variance of x. We have to find the variance of y and coefficient of correlation. We are going to find the coefficient correlation by using the given regressions of lines and to find the variance of y, we have to use the variance of x.
Formula used: byx=rσxσy
bxy=rσyσx
Correlation coefficient,
r2=bxy×byx⇒r=bxy×byx
Complete step-by-step solution:
It is given in the question, the regressions of two lines are x+2y−5=0 and 2x+3y−8=0
The variance of x is 12.
Let us consider the equations,
x+2y−5=0−−−−−(1)
2x+3y−8=0−−−−−(2)
From equation (1), divide the equation by 2 to get the regression line of y on x,
Hence, x+2y−5=0
⇒2x+y−25=0
Rewriting the equation we get,
⇒y=−2x+25
The is equation is the regression line of y on x.
From that we can get,
byx=rσxσy=−21−−−−(∗)
From equation (2), divide the equation by 2 to get the regression line of x on y,
Hence, 2x+3y−8=0
⇒x+23y−28=0
Rewriting the equation we get,
⇒x=−23y+4
The is equation is the regression line of x on y.
From that we can get,
bxy=rσyσx=−23
Now we are going find correlation of coefficient,
Since, r2=bxy×byx⇒r=bxy×byx
Subtituting the values,
⇒r=−23×−21
Multiplying the terms
⇒43
Taking square root we get,
⇒±23
bxy and byx being both are negative,
Hence, r is also negative.
r=−23
∴ The correlation coefficient r=−23
Consider the given variance of x is 12,
That is, σx2=12,
⇒σx=12
Since already we known that,
byx=rσxσy=−21−−−−(∗)
Hence we have the values of σx and r, so substituting those in (*), we get,
⇒−23×12σy=−21
Rearranging the terms to solve for σy,
⇒σy=−21×(−32)×12
Simplifying their terms we get,
⇒σy=−21×(−32)×23
Hence,
⇒σy=2
Squaring on both sides,
⇒σy2=4
Hence we got the variance of y,
∴σy2=4
The Variance of y is 4.
Note: We have to remember that the composition of regression lines is based on the least square assumptions. Regression analysis is generally based on the summation of squares of deviations of observed values from the lines of the best fit. A line of regression gives the best average value of one variable from any given value of the other variable.