Question
Question: n=25, \(\sum{x}=125\), \(\sum{{{x}^{2}}}=650\), \(\sum{y}=100\), \(\sum{{{y}^{2}}}=460\), \(\sum{xy}...
n=25, ∑x=125, ∑x2=650, ∑y=100, ∑y2=460, ∑xy=508. It was observed that two pair of values of (x,y) were copied as (6,14) and (8,6) instead of (8,12), (6,8). The correct correlation coefficient is
A. 0.667
B. 0.87
C. −0.25
D. 0.356
Solution
In this problem we need to calculate the value of correct correlation coefficient for the given set of data with the given conditions. We can observe that some of the data is corrupted or copied wrong. So we will calculate the actual values of ∑x, ∑x2, ∑y, ∑y2, ∑xy by subtracting the respective values which are mistakenly copied and adding the actual values at the same time. After having the actual values of ∑x, ∑x2, ∑y, ∑y2, ∑xy for the true data set we will use the formula r=[n∑x2−(∑x)2][n∑y2−(∑y)2]n∑xy−(∑x)×(∑y) to find the correlation coefficient.
Complete step by step answer:
Given that, n=25, ∑x=125, ∑x2=650, ∑y=100, ∑y2=460, ∑xy=508 and
two pair of values of (x,y) were copied as (6,14) and (8,6) instead of (8,12), (6,8).
If the data set is copied as (x1,y1) instead of (x2,y2), then the respect values of ∑x changed as
∑xact=∑x−x1+x2
So, the data set is copied (6,14) and (8,6) instead of (8,12), (6,8), then the values are modified as
∑xact=∑x−6−8+8+6
Substituting the value ∑x=125 in the above equation, then we will get
∑xact=125+0⇒∑xact=125
Now the actual or correct value of ∑y is given by
∑yact=∑y−14−6+12+8⇒∑yact=∑y⇒∑yact=100
Now the actual or correct value of ∑x2 is given by
∑x2act=∑x2−62−82+62+82⇒∑x2act=∑x2⇒∑x2act=650
Now the actual or correct value of ∑y2 is given by
∑y2act=∑y2−142−62+122+82⇒∑y2act=460−24⇒∑y2act=436
Now the actual or correct value of ∑xy is given by
∑xyact=∑xy−6×8−8×6+8×12+6×8⇒∑xyact=508+12⇒∑xyact=520
From the all the above values the correlation coefficient will be calculated as
r=[n∑x2−(∑x)2][n∑y2−(∑y)2]n∑xy−(∑x)×(∑y)
Substituting all the actual or corrected values in the above equation, then we will get
r=(25×650−1252)(25×436−1002)25×520−125×100
Simplifying the above equation, then we will get
r=(16250−15625)(10900−10000)13000−12500⇒r=625×900500⇒r=25×30500⇒r=0.667
So, the correct answer is “Option A”.
Note: In type of problem needs the student attention. While calculating the corrected values one may do a lot of mistakes like substituting the wrong values or use x values instead of y values or vice versa. So, one should be careful about substitution while calculating the corrected values.