Solveeit Logo

Question

Question: A sample of boys and girls were asked to choose their favorite sport, with the following results: ...

A sample of boys and girls were asked to choose their favorite sport, with the following results:

| Football| Cricket| Hockey| Basketball
---|---|---|---|---
Boys| 8686| 6060| 4444| 1010
Girls| 4040| 3030| 2525| 55

Find the value of χ2{\chi ^2} statistics.

Explanation

Solution

In this question, we are given data of boys’ and girls’ favorite sports. We have been asked to find the χ2{\chi ^2}. At first, calculate expected frequencies using the formula Ti×TjN\dfrac{{{T_i} \times {T_j}}}{N}. After this, subtract the expected frequency from the corresponding actual observation and square the difference. Then, divide the square by their corresponding expected frequency and add all of them. You will have the final answer.

Formula used: 1) Eij=Ti×TjN{E_{ij}} = \dfrac{{{T_i} \times {T_j}}}{N}, where Eij={E_{ij}} = Expected frequency for i(th) row and j(th) column, Ti={T_i} = Total of i(th) row, Tj={T_j} = Total of j(th) column, N=N = Total cells in table.
2) χ2=(OijEij)2Eij{\chi ^2} = \dfrac{{\sum {{{\left( {{O_{ij}} - {E_{ij}}} \right)}^2}} }}{{{E_{ij}}}}

Complete step-by-step solution:
We will start by finding the expected frequencies. We have to calculate the expected frequencies of each and every cell in the given table. But first we will find the total of each row and columns.

| Football| Cricket| Hockey| Basketball| Row total (Ti)\left( {{T_i}} \right)
---|---|---|---|---|---
Boys| 8686| 6060| 4444| 1010| 200200
Girls| 4040| 3030| 2525| 55| 100100
Column Total (Tj)\left( {{T_j}} \right)| 126126| 9090| 6969| 1515| 300300

Now, we will find the expected frequencies using the formula Eij=Ti×TjN{E_{ij}} = \dfrac{{{T_i} \times {T_j}}}{N}.
Expected frequency for i=1i = 1 and j=1j = 1,
E11=200×126300=84{E_{11}} = \dfrac{{200 \times 126}}{{300}} = 84
Expected frequency for i=1i = 1 and j=2j = 2,
E12=200×90300=60{E_{12}} = \dfrac{{200 \times 90}}{{300}} = 60
Expected frequency for i=1i = 1 and j=3j = 3,
E13=200×69300=46{E_{13}} = \dfrac{{200 \times 69}}{{300}} = 46
Expected frequency for i=1i = 1 and j=4j = 4,
E14=200×15300=10{E_{14}} = \dfrac{{200 \times 15}}{{300}} = 10
Expected frequency for i=2i = 2 and j=1j = 1,
E21=100×126300=42{E_{21}} = \dfrac{{100 \times 126}}{{300}} = 42
Expected frequency for i=2i = 2 and j=2j = 2,
E22=100×90300=30{E_{22}} = \dfrac{{100 \times 90}}{{300}} = 30
Expected frequency for i=2i = 2 and j=3j = 3,
E23=100×69300=23{E_{23}} = \dfrac{{100 \times 69}}{{300}} = 23
Expected frequency for i=2i = 2 and j=4j = 4,
E24=100×15300=5{E_{24}} = \dfrac{{100 \times 15}}{{300}} = 5
The table of expected frequencies is below:

| Football| Cricket| Hockey| Basketball| Row total (Ti)\left( {{T_i}} \right)
---|---|---|---|---|---
Boys| 8484| 6060| 4646| 1010| 200200
Girls| 4242| 3030| 2323| 55| 100100
Column Total (Tj)\left( {{T_j}} \right)| 126126| 9090| 6969| 1515| 300300

Next step is to subtract the expected frequency from the corresponding actual observation and then we have to square the difference. The final square of the difference is divided by the expected frequency of that cell. Each cell’s final answer is added to each other.
It is done like below:
(8684)284+(6060)260+(4446)246+(1010)210+(4042)242+(3030)230+(2523)223+(55)25\Rightarrow \dfrac{{{{\left( {86 - 84} \right)}^2}}}{{84}} + \dfrac{{{{\left( {60 - 60} \right)}^2}}}{{60}} + \dfrac{{{{\left( {44 - 46} \right)}^2}}}{{46}} + \dfrac{{{{\left( {10 - 10} \right)}^2}}}{{10}} + \dfrac{{{{\left( {40 - 42} \right)}^2}}}{{42}} + \dfrac{{{{\left( {30 - 30} \right)}^2}}}{{30}} + \dfrac{{{{\left( {25 - 23} \right)}^2}}}{{23}} + \dfrac{{{{\left( {5 - 5} \right)}^2}}}{5}
On simplifying the above equation, we will get –
484+0+446+0+442+0+423+0\Rightarrow \dfrac{4}{{84}} + 0 + \dfrac{4}{{46}} + 0 + \dfrac{4}{{42}} + 0 + \dfrac{4}{{23}} + 0
Hence,
0.048+0.087+0.095+0.174\Rightarrow 0.048 + 0.087 + 0.095 + 0.174
On adding,
χ2=0.404\Rightarrow {\chi ^2} = 0.404

The required value of χ2{\chi ^2} is 0.404.

Note: We can observe that the expected frequency is a probability count that appears in contingency table calculations including the Chi-square test. Expected frequencies also used to calculate standardized residuals, where the expected count is subtracted from the observed count in the numerator. An observed frequency are counts made from experimental data. In other words, you actually observe the data happening and take measurements. An expected frequencies are counts calculated using probability theory.