Friday, July 27, 2012

TEACHING STATISTICS AND THE LONDON SUMMER OLYMPICS


Probably few if any popular events offer such an abundance of statistics teaching topics as the Olympics. The closest rival I can think of would be the upcoming presidential election, which will be covered in a separate post later this year. The only downside about the Olympics is that it will be history by the time the fall semester starts. L

Here is my list – I do not claim completeness and will update it as more ideas cross my mind:

1)      There will certainly be an abundance of graphs related to the Olympics. Let the students find “good” and “bad” ones and discuss them.
2)      I know that in ski jumping the min and max of the judges’ scores is eliminated. I am sure there will be a discipline in the summer Olympics as well where that applies.
3)      The most successful nation will be the one with the most “total medals” (U.S. custom) or most “gold medals” (more European custom) – this is just another example for the mode.
4)      The numeric score that determines the final rankings comes in all types of scales . . . time and distance and other measures are obviously on the ratio scale. Basketball teams’ points (2 for a win, 1 for a loss) during the preliminary round are on the interval scale. Decathlon scoring is on the ordinal scale (there is an exponent involved that makes it not on the interval scale). I just hope there is no winner determined by something measured on the nominal scale. J
I am sure among gymnastics, dressage, synchronized swimming, etc., experts can find additional interesting scales.
5)      Certain “mass” events like the marathon can be used to teach creating histograms and to show skewness. You can also use it to compute mean, median, percentiles, standard deviation and variance. It is also a good starting point to discuss the uselessness of modes in certain situations.
6)      Drug testing allows for all kinds of probability related questions – simple and conditional probability, independence, etc. come to mind. You are a drug-using cyclist. They will randomly test 5000 of the 12000 athletes. What is the chance they will catch you? There are 500 cyclists at the Olympics and they test 300 of them. What is the chance they will catch you now? You are getting tested 4 times and thanks to your superior drugs, the chance that a test returns a positive result is 25% each time you get tested. What is the chance they will never catch you?
7)      The hypergeometric distribution can also be used in the context of drug testing. 40 of the 290 weight lifters use drugs. If 20 of them are randomly chosen for drug tests that never fail to detect a drug user, what is the chance that exactly 7 drug users will get caught?
8)      Guessing the medalist in some sport where you have no prior knowledge allows for the coverage of permutations vs. combinations. What is the chance you will guess the medalists correctly in the men’s 400m free-style final if you don’t know anything about swimming? Or: The top three runners in the women’s 200 meter semi-final advance to the final. If you don’t know anything about track, what is the chance you will guess the three women that advance correctly?
9)      The expected traffic chaos can be used to apply the normal distribution: It takes you X minutes to get to the team handball arena and X ~N(60, 152). What is the chance you will be late for the game if you leave 78 minutes before the start?
10)   You can use the public transportation to frame questions about the uniform distribution. The bus from the athletes’ village runs to the stadium runs every 8 minutes, but not according to a printed schedule. What is the chance a random athlete needs to wait between 3 and 5 minutes?
11)   Points scored in many sports (e.g. basketball) can be assumed to be normally distributed:  In basketball points for the USA~N(98,122). What is the chance they will score more than 90 points? The same works for most track and field disciplines.
12)   Differences in normally distributed variables to determine the winner. E.g. Points for the USA are X~N(90,102) and for Argentina are Y~N(85,202) . . .  what is the chance that Argentin wins the basketball game?
13)   In some disciplines you should be able to find correlations between certain physical attributes of the athletes (height, weight, . . .) and their results. E.g. between the height of high jumpers and the height they jump. You can use that also to discuss cause-and-effect and thus simple regression analysis.
14)   You can find correlations between results in the heats and semifinals or semifinals and finals (swimming, running, . . .).
15)   Proportions can be found in basketball, skeet, etc… and allow for the binomial distribution: If each time I shoot, the chance I hit the disk is 99%; what is the chance I hit 49 out of 50 disks?
16)   There is some research about a country’s expected medal haul using multiple regression analysis using population, GDP/capita and other variables. For more, see:

My list is nowhere near complete, but it shows that virtually every topic that is traditionally covered in business statistics can be applied to the Olympics . . . Enjoy. J

No comments:

Post a Comment