3.4 Contingency Tables
A contingency table provides a way of portraying data that can facilitate calculating probabilities. The table helps in determining conditional probabilities quite easily. The table displays sample values in relation to two different variables that may be dependent or contingent on one another. Later on, we will use contingency tables again, but in another manner.
The following video shows an example of finding the probability of an event from a table.
Example 1
Suppose a study of speeding violations and drivers who use cell phones produced the following fictional data:
Cell phone use | Speeding violation in the last year | No speeding violation in the last year | Total |
---|---|---|---|
Cell phone user | 25 | 280 | 305 |
Not a cell phone user | 45 | 405 | 450 |
Total | 70 | 685 | 755 |
The total number of people in the sample is 755. The row totals are 305 and 450. The column totals are 70 and 685. Notice that 305 + 450 = 755 and 70 + 685 = 755.
Calculate the following probabilities using the table.
- Find P(Person is a cell phone user).
Show Answer
[latex]\displaystyle\frac{{\text{number of cell phone users}}}{{\text{total number in study}}}=\frac{{305}}{{755}}[/latex]
- Find P(Person had no violation in the last year).
Show Answer
[latex]\displaystyle\frac{{\text{number that had no violation}}}{{\text{total number in study}}}=\frac{{685}}{{755}}[/latex]
- Find P(Person had no violation in the last year AND was a cell phone user).
Show Answer
[latex]\displaystyle\frac{{280}}{{755}}[/latex]
- Find P(Person is a cell phone user OR person had no violation in the last year).
Show Answer
[latex]\displaystyle{(\frac{{305}}{{755}}+\frac{{685}}{{755}})}-\frac{{280}}{{755}}=\frac{{710}}{{755}}[/latex]
- Find P(Person is a cell phone user GIVEN person had a violation in the last year).
Show Answer
[latex]\displaystyle\frac{{25}}{{70}}[/latex] (The sample space is reduced to the number of persons who had a violation.)
- Find P(Person had no violation in the last year GIVEN person was not a cell phone user).
Show Answer
[latex]\displaystyle\frac{{405}}{{450}}[/latex] (The sample space is reduced to the number of persons who were not cell phone users.)
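The six answers in Example 1 can also be reproduced with a short script. The Python sketch below is an illustration, not part of the original example. It stores the table's four cell counts in a dictionary and derives every probability from them. The variable names, dictionary keys, and helper functions are assumptions chosen for readability.

```python
from fractions import Fraction

# Cell counts from the Example 1 table.
# Keys are (phone status, violation status); the labels are illustrative.
counts = {
    ("cell", "violation"): 25,
    ("cell", "no_violation"): 280,
    ("no_cell", "violation"): 45,
    ("no_cell", "no_violation"): 405,
}
total = sum(counts.values())  # grand total of the table


def row_total(phone):
    """Sum the counts for one phone-status row."""
    return sum(n for (p, _), n in counts.items() if p == phone)


def col_total(violation):
    """Sum the counts for one violation-status column."""
    return sum(n for (_, v), n in counts.items() if v == violation)


# Marginal probabilities: a row or column total over the grand total.
p_cell = Fraction(row_total("cell"), total)
p_no_violation = Fraction(col_total("no_violation"), total)

# Joint (AND) probability: a single cell over the grand total.
p_both = Fraction(counts[("cell", "no_violation")], total)

# Union (OR) probability: add the marginals, subtract the overlap once.
p_either = p_cell + p_no_violation - p_both

# Conditional probabilities: the denominator shrinks to the given group.
p_cell_given_violation = Fraction(
    counts[("cell", "violation")], col_total("violation")
)
p_no_violation_given_no_cell = Fraction(
    counts[("no_cell", "no_violation")], row_total("no_cell")
)

print(p_cell, p_no_violation, p_both, p_either)
print(p_cell_given_violation, p_no_violation_given_no_cell)
```

`Fraction` reduces each result to lowest terms, so 710/755 is displayed as 142/151. The values are the same as the unreduced fractions shown in the answers.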
This video shows an example of how to determine the probability of an AND event using a contingency table.
Try It
This table shows the number of athletes who stretch before exercising and how many had injuries within the past year.
Stretching habit | Injury in last year | No injury in last year | Total |
---|---|---|---|
Stretches | 55 | 295 | 350 |
Does not stretch | 231 | 219 | 450 |
Total | 286 | 514 | 800 |
- What is P(athlete stretches before exercising)?
Show Answer
P(athlete stretches before exercising) = [latex]\displaystyle\frac{{350}}{{800}}[/latex] = 0.4375
- What is P(athlete stretches before exercising|no injury in the last year)?
Show Answer
P(athlete stretches before exercising|no injury in the last year) = [latex]\displaystyle\frac{{295}}{{514}}[/latex] = 0.5739
Example 2
This table shows a random sample of 100 hikers and the areas of hiking they prefer.
Hiking Area Preference
Sex | The Coastline | Near Lakes and Streams | On Mountain Peaks | Total |
---|---|---|---|---|
Female | 18 | 16 | ___ | 45 |
Male | ___ | ___ | 14 | 55 |
Total | ___ | 41 | ___ | ___ |
- Complete the table.
Show Answer
Hiking Area Preference
Sex | The Coastline | Near Lakes and Streams | On Mountain Peaks | Total |
---|---|---|---|---|
Female | 18 | 16 | 11 | 45 |
Male | 16 | 25 | 14 | 55 |
Total | 34 | 41 | 25 | 100 |
- Are the events “being female” and “preferring the coastline” independent events?
Hint:
Let F = being female and let C = preferring the coastline.
Check if P(F AND C) = P(F) * P(C).
If P(F AND C) = P(F) * P(C), then F and C are independent.
If P(F AND C) [latex]\ne[/latex] P(F) * P(C), then F and C are not independent.
Show Answer
P(F AND C) = [latex]\displaystyle\frac{{18}}{{100}}[/latex] = 0.18
P(F) * P(C) = [latex]\displaystyle(\frac{{45}}{{100}})(\frac{{34}}{{100}})[/latex] = (0.45)(0.34) = 0.153
P(F AND C) [latex]\ne[/latex] P(F) * P(C), so the events F and C are not independent.
- Find the probability that a person is male given that the person prefers hiking near lakes and streams.
Hint:
Let M = being male, and let L = prefers hiking near lakes and streams.
- What word tells you this is a conditional?
- Fill in the blanks and calculate the probability: P(___|___) = ___.
- Is the sample space for this problem all 100 hikers? If not, what is it?
Show Answer
The word given tells you that this is a conditional.
[latex]\displaystyle{P}{({M}|{L})}=\frac{{25}}{{41}}[/latex]
No, the sample space for this problem is the 41 hikers who prefer lakes and streams.
- Find the probability that a person is female or prefers hiking on mountain peaks.
Hint:
Let F = being female, and let P = prefers mountain peaks.
- Find P(F).
- Find P(P).
- Find P(F AND P).
- Find P(F OR P).
Show Answer
The probability that a person is female or prefers hiking on mountain peaks = [latex]\frac{59}{100}[/latex]
- P(F) = [latex]\displaystyle\frac{{45}}{{100}}[/latex]
- P(P) = [latex]\displaystyle\frac{{25}}{{100}}[/latex]
- P(F AND P) = [latex]\displaystyle\frac{{11}}{{100}}[/latex]
- P(F OR P) = [latex]\displaystyle\frac{{45}}{{100}} + \frac{{25}}{{100}} - \frac{{11}}{{100}} = \frac{{59}}{{100}}[/latex]
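The steps of Example 2 can be sketched in Python as a check on the arithmetic. This is an illustrative sketch rather than part of the original example, and all variable names are assumptions. It fills in the blank cells from the row and column totals, then repeats the independence, conditional, and OR calculations with exact fractions.

```python
from fractions import Fraction

# Entries given in the Example 2 table (a sample of 100 hikers).
n = 100
female_total, male_total = 45, 55
female_coast, female_lakes = 18, 16
male_peaks = 14
lakes_total = 41

# Fill in the blanks by subtracting known cells from row and column totals.
female_peaks = female_total - female_coast - female_lakes
male_lakes = lakes_total - female_lakes
male_coast = male_total - male_lakes - male_peaks
coast_total = female_coast + male_coast
peaks_total = female_peaks + male_peaks

# Independence check: F and C are independent only if
# P(F AND C) equals P(F) * P(C).
p_f = Fraction(female_total, n)
p_c = Fraction(coast_total, n)
p_f_and_c = Fraction(female_coast, n)
f_and_c_independent = (p_f_and_c == p_f * p_c)

# Conditional probability: the sample space is the hikers who prefer lakes.
p_m_given_l = Fraction(male_lakes, lakes_total)

# OR probability: add the marginals and subtract the overlap.
p_p = Fraction(peaks_total, n)
p_f_and_p = Fraction(female_peaks, n)
p_f_or_p = p_f + p_p - p_f_and_p

print(female_peaks, male_coast, male_lakes, coast_total, peaks_total)
print(f_and_c_independent, p_m_given_l, p_f_or_p)
```

Using `Fraction` instead of floating-point numbers keeps the equality test in the independence check exact.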
Try It
This table shows a random sample of 200 cyclists and the routes they prefer. Let M = being male and H = preferring the hilly path.
Gender | Lake Path | Hilly Path | Wooded Path | Total |
---|---|---|---|---|
Female | 45 | 38 | 27 | 110 |
Male | 26 | 52 | 12 | 90 |
Total | 71 | 90 | 39 | 200 |
- Out of the males, what is the probability that the cyclist prefers a hilly path?
Show Answer
P(H|M) = [latex]\displaystyle\frac{{52}}{{90}}[/latex] = 0.5778
- Are the events “being male” and “preferring the hilly path” independent events?
Show Answer
M and H are independent only if P(H|M) = P(H).
P(H|M) = 0.5778, P(H) = [latex]\displaystyle\frac{{90}}{{200}}[/latex] = 0.45
P(H|M) [latex]\ne[/latex] P(H), so M and H are not independent.
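The answer above tests independence by comparing P(H|M) with P(H), while Example 2 compared P(F AND C) with P(F) * P(C). The two tests always agree. The Python sketch below (an illustration with assumed variable names, not part of the original exercise) applies both to the cyclist table.

```python
from fractions import Fraction

# Counts from the cyclist table: rows are gender, columns are preferred path.
table = {
    "Female": {"Lake": 45, "Hilly": 38, "Wooded": 27},
    "Male": {"Lake": 26, "Hilly": 52, "Wooded": 12},
}
total = sum(sum(row.values()) for row in table.values())

males = sum(table["Male"].values())                  # row total for M
hilly = sum(row["Hilly"] for row in table.values())  # column total for H

p_m = Fraction(males, total)
p_h = Fraction(hilly, total)
p_h_and_m = Fraction(table["Male"]["Hilly"], total)
p_h_given_m = Fraction(table["Male"]["Hilly"], males)

# Test 1: compare the conditional probability with the marginal.
independent_by_conditional = (p_h_given_m == p_h)

# Test 2: compare the joint probability with the product of the marginals.
independent_by_product = (p_h_and_m == p_h * p_m)

print(float(p_h_given_m), float(p_h))
print(independent_by_conditional, independent_by_product)
```

Both tests report that M and H are not independent, because P(H|M) = P(H AND M) / P(M).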
Example 3
Muddy Mouse lives in a cage with three doors.
If Muddy goes out the first door, the probability that he gets caught by Alissa the cat is [latex]\displaystyle\frac{{1}}{{5}}[/latex] and the probability he is not caught is [latex]\displaystyle\frac{{4}}{{5}}[/latex].
If he goes out the second door, the probability he gets caught by Alissa is [latex]\displaystyle\frac{{1}}{{4}}[/latex] and the probability he is not caught is [latex]\displaystyle\frac{{3}}{{4}}[/latex].
The probability that Alissa catches Muddy coming out of the third door is [latex]\displaystyle\frac{{1}}{{2}}[/latex] and the probability she does not catch Muddy is [latex]\displaystyle\frac{{1}}{{2}}[/latex].
It is equally likely that Muddy will choose any of the three doors so the probability of choosing each door is [latex]\displaystyle\frac{{1}}{{3}}[/latex].
Door Choice
Caught or Not | Door One | Door Two | Door Three | Total |
---|---|---|---|---|
Caught | [latex]\displaystyle\frac{{1}}{{15}}[/latex] | [latex]\displaystyle\frac{{1}}{{12}}[/latex] | [latex]\displaystyle\frac{{1}}{{6}}[/latex] | ____ |
Not Caught | [latex]\displaystyle\frac{{4}}{{15}}[/latex] | [latex]\displaystyle\frac{{3}}{{12}}[/latex] | [latex]\displaystyle\frac{{1}}{{6}}[/latex] | ____ |
Total | ____ | ____ | ____ | 1 |
- The first entry [latex]\displaystyle\frac{{1}}{{15}}={(\frac{{1}}{{5}})}{(\frac{{1}}{{3}})}[/latex] is P(Door One AND Caught)
- The entry [latex]\displaystyle\frac{{4}}{{15}}={(\frac{{4}}{{5}})}{(\frac{{1}}{{3}})}[/latex] is P(Door One AND Not Caught)
Verify the remaining entries.
- Complete the probability contingency table. Calculate the entries for the totals. Verify that the lower-right corner entry is 1.
Show Answer
Door Choice
Caught or Not | Door One | Door Two | Door Three | Total |
---|---|---|---|---|
Caught | [latex]\displaystyle\frac{{1}}{{15}}[/latex] | [latex]\displaystyle\frac{{1}}{{12}}[/latex] | [latex]\displaystyle\frac{{1}}{{6}}[/latex] | [latex]\displaystyle\frac{{19}}{{60}}[/latex] |
Not Caught | [latex]\displaystyle\frac{{4}}{{15}}[/latex] | [latex]\displaystyle\frac{{3}}{{12}}[/latex] | [latex]\displaystyle\frac{{1}}{{6}}[/latex] | [latex]\displaystyle\frac{{41}}{{60}}[/latex] |
Total | [latex]\displaystyle\frac{{5}}{{15}}[/latex] | [latex]\displaystyle\frac{{4}}{{12}}[/latex] | [latex]\displaystyle\frac{{2}}{{6}}[/latex] | 1 |
- What is the probability that Alissa does not catch Muddy?
Show Answer
[latex]\displaystyle\frac{{41}}{{60}}[/latex]
- What is the probability that Muddy chooses Door One OR Door Two given that Muddy is caught by Alissa?
Show Answer
[latex]\displaystyle\frac{{9}}{{19}}[/latex]
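The probability table in Example 3 is built by multiplying each door's conditional probability by the probability of choosing that door. The Python sketch below is illustrative only, and its dictionary keys and variable names are assumptions. It rebuilds the table and recomputes the two answers above.

```python
from fractions import Fraction

# Each door is equally likely to be chosen.
p_door = Fraction(1, 3)

# P(Caught | door), as stated in the example.
p_caught_given_door = {
    "one": Fraction(1, 5),
    "two": Fraction(1, 4),
    "three": Fraction(1, 2),
}

# Joint probabilities: P(door AND Caught) = P(Caught | door) * P(door).
caught = {door: p * p_door for door, p in p_caught_given_door.items()}
not_caught = {door: (1 - p) * p_door for door, p in p_caught_given_door.items()}

# Row totals of the table.
p_caught = sum(caught.values())
p_not_caught = sum(not_caught.values())

# Column totals of the table; each should equal P(door).
door_totals = {door: caught[door] + not_caught[door] for door in caught}

# P(Door One OR Door Two | Caught): restrict the sample space to "Caught".
p_one_or_two_given_caught = (caught["one"] + caught["two"]) / p_caught

print(p_caught, p_not_caught, p_one_or_two_given_caught)
```

The column totals each reduce to 1/3, and the two row totals sum to 1, which matches the lower-right corner of the table.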
Example 4
This table contains the number of crimes per 100,000 inhabitants from 2008 to 2011 in the U.S.
United States Crime Index Rates Per 100,000 Inhabitants 2008–2011
Year | Robbery | Burglary | Rape | Vehicle | Total |
---|---|---|---|---|---|
2008 | 145.7 | 732.1 | 29.7 | 314.7 | |
2009 | 133.1 | 717.7 | 29.1 | 259.2 | |
2010 | 119.3 | 701 | 27.7 | 239.1 | |
2011 | 113.7 | 702.2 | 26.8 | 229.6 | |
Total | | | | | |
Total each column and each row. The grand total of all entries is 4,520.7.
- Find P(2009 AND Robbery).
Show Answer
0.0294
- Find P(2010 AND Burglary).
Show Answer
0.1551
- Find P(2010 OR Burglary).
Show Answer
0.7165
- Find P(2011|Rape).
Show Answer
0.2365
- Find P(Vehicle|2008).
Show Answer
0.2575
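The answers in Example 4 depend on row and column totals that the table leaves blank. The Python sketch below (an illustration with assumed variable names, not part of the original example) computes those totals and the five probabilities. The rates are stored as floating-point numbers, so results are rounded to four decimal places for display.

```python
# Crime rates per 100,000 inhabitants, copied from the Example 4 table.
rates = {
    2008: {"Robbery": 145.7, "Burglary": 732.1, "Rape": 29.7, "Vehicle": 314.7},
    2009: {"Robbery": 133.1, "Burglary": 717.7, "Rape": 29.1, "Vehicle": 259.2},
    2010: {"Robbery": 119.3, "Burglary": 701.0, "Rape": 27.7, "Vehicle": 239.1},
    2011: {"Robbery": 113.7, "Burglary": 702.2, "Rape": 26.8, "Vehicle": 229.6},
}

# Row totals (per year), column totals (per crime), and the grand total.
row_totals = {year: sum(row.values()) for year, row in rates.items()}
col_totals = {
    crime: sum(row[crime] for row in rates.values()) for crime in rates[2008]
}
grand_total = sum(row_totals.values())

# AND: a single cell divided by the grand total.
p_2009_and_robbery = rates[2009]["Robbery"] / grand_total
p_2010_and_burglary = rates[2010]["Burglary"] / grand_total

# OR: row total plus column total minus the shared cell, over the grand total.
p_2010_or_burglary = (
    row_totals[2010] + col_totals["Burglary"] - rates[2010]["Burglary"]
) / grand_total

# Conditional: the denominator is the total of the given row or column.
p_2011_given_rape = rates[2011]["Rape"] / col_totals["Rape"]
p_vehicle_given_2008 = rates[2008]["Vehicle"] / row_totals[2008]

for name, value in [
    ("P(2009 AND Robbery)", p_2009_and_robbery),
    ("P(2010 AND Burglary)", p_2010_and_burglary),
    ("P(2010 OR Burglary)", p_2010_or_burglary),
    ("P(2011|Rape)", p_2011_given_rape),
    ("P(Vehicle|2008)", p_vehicle_given_2008),
]:
    print(name, round(value, 4))
```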
This video gives an example of determining an “OR” probability given a table.
Try It
This table relates the weights and heights of a group of individuals participating in an observational study.
Weight/Height | Tall | Medium | Short | Totals |
---|---|---|---|---|
Obese | 18 | 28 | 14 | |
Normal | 20 | 51 | 28 | |
Underweight | 12 | 25 | 9 | |
Totals | | | | |
- Find the total for each row and column.
Show Answer
Weight/Height | Tall | Medium | Short | Totals |
---|---|---|---|---|
Obese | 18 | 28 | 14 | 60 |
Normal | 20 | 51 | 28 | 99 |
Underweight | 12 | 25 | 9 | 46 |
Totals | 50 | 104 | 51 | 205 |
- Find the probability that a randomly chosen individual from this group is Tall.
Show Answer
P(Tall) = [latex]\displaystyle\frac{{50}}{{205}}[/latex] = 0.244
- Find the probability that a randomly chosen individual from this group is Obese and Tall.
Show Answer
P(Obese AND Tall) = [latex]\displaystyle\frac{{18}}{{205}}[/latex] = 0.088
- Find the probability that a randomly chosen individual from this group is Tall given that the individual is Obese.
Show Answer
P(Tall|Obese) = [latex]\displaystyle\frac{{18}}{{60}}[/latex] = 0.3
- Find the probability that a randomly chosen individual from this group is Obese given that the individual is Tall.
Show Answer
P(Obese|Tall) = [latex]\displaystyle\frac{{18}}{{50}}[/latex] = 0.36
- Find the probability a randomly chosen individual from this group is Tall and Underweight.
Show Answer
P(Tall AND Underweight) = [latex]\displaystyle\frac{{12}}{{205}}[/latex] = 0.0585
- Are the events Obese and Tall independent?
Show Answer
No. P(Tall) = 0.244 and P(Tall|Obese) = 0.3, so P(Tall) [latex]\ne[/latex] P(Tall|Obese).
References
“Blood Types.” American Red Cross, 2013. Available online at http://www.redcrossblood.org/learn-about-blood/blood-types (accessed May 3, 2013).
Data from the National Center for Health Statistics, part of the United States Department of Health and Human Services.
Data from United States Senate. Available online at www.senate.gov (accessed May 2, 2013).
Haiman, Christopher A., Daniel O. Stram, Lynn R. Wilkens, Malcom C. Pike, Laurence N. Kolonel, Brien E. Henderson, and Loīc Le Marchand. “Ethnic and Racial Differences in the Smoking-Related Risk of Lung Cancer.” The New England Journal of Medicine, 2013. Available online at http://www.nejm.org/doi/full/10.1056/NEJMoa033250 (accessed May 2, 2013).
“Human Blood Types.” United Blood Services, 2011. Available online at http://www.unitedbloodservices.org/learnMore.aspx (accessed May 2, 2013).
Samuel, T. M. “Strange Facts about RH Negative Blood.” eHow Health, 2013. Available online at http://www.ehow.com/facts_5552003_strange-rh-negative-blood.html (accessed May 2, 2013).
“United States: Uniform Crime Report – State Statistics from 1960–2011.” The Disaster Center. Available online at http://www.disastercenter.com/crime/ (accessed May 2, 2013).
Concept Review
There are several tools you can use to help organize and sort data when calculating probabilities. Contingency tables help display data and are particularly useful when calculating probabilities that involve two variables that may be dependent on one another.