Question
Question: The sum of 100 observations and the sum of their squares are 400 and 2475, respectively. Later on, t...
The sum of 100 observations and the sum of their squares are 400 and 2475, respectively. Later on, three observations, 3,4 and 5, were found to be incorrect. If the incorrect observations are omitted, then the variance of the remaining observations is.
(a) 8.00
(b) 8.50
(c) 8.25
(d) 9.00
Solution
It is given to us that there are 100 observations and the sum of the values and sum of the squares of their values are given to us. Now, if we remove three observations, the number of observations also decreases by three, and the sum of the values of the observations and sum of the squares of the values of the observations also decrease. We will find decreased sums. From this information we will find the mean of the observations and after finding mean, we can find the variation as N∑X2−(X)2, where X is the mean of the data set.
Complete step-by-step solution:
It is given to us that there are 100 observations and the sum of the values of the observation is 400. We are also given the sum of the squares of the values of the observations as 2475.
Let x1,x2,....,x100 be the observations.
⇒i=1∑100xi=400⇒i=1∑100xi2=2475
We know that the mean is of a data set is defined as the quotient of the sum of the values of all the observations and number of observations.
Therefore, if there are 100 observations and the sum of the values of the observations is 400, the mean of the data set will be the quotient of 400 and 100. That is 4.
But we are given that the 3 observations are not correct and their values are 3, 4 and 5. These observations are removed from the data set.
The number of observations in the modified data set will thus be 100 – 3 = 97.
As these observations are removed, the modified sum of the observations will be
⇒i=1∑97xi= 400 – 3 – 4 – 5 = 388.
Therefore, the mean of the modified data set will be given as follows:
X=97i=1∑97xi=97388=4.
The sum of the squares of the values of the observations will also decrease. It will be as follows:
⇒∑x2=2475−(3)2−(4)2−(5)2⇒∑x2=2425
We know that the variance in a data set is given as N∑X2−(X)2
⇒Var(X)=972425−(4)2⇒Var(X)=25−16⇒Var(X)=9
Hence, option (d) is the correct option.
Note: The variance of a data set is defined as Var(X)=N∑(xi−X)2. We will solve the parenthesis. N∑(xi2−2xiX+X2) . Then we split the denominator. N∑(xi2)−N2∑xiX+N∑X2=N∑(xi2)−2X2+NNX2. Thus, we get the modified formula for variance as Var(X)=N∑(xi2)−X2 .