1. The mean is the measure of central tendency that is most influenced by outliers. If a data set includes a number that is either very large or very small, it will make the mean either much higher or lower than a more “normal” number would. For example, the mean of 3, 4, 5 and 6 is 4.5. The mean of 3, 4, 5 and 30, however, is 10.5. The outlier, 30, makes the mean disproportionately higher than it ought to be. Accordingly, unusually high or low temperature values will affect the mean more than they “should”. Errors in measurement, as well as systematic events such as taking a cold drink or a hot shower, probably caused these extreme values. We cannot rely on these extreme data values because they are “exceptional” cases and, as such, are not representative of normal temperature values. The correct average body temperature is 98.2. According to the article, the original value of 98.6 is incorrect because, for example, the thermometers used to obtain this value were unreliable, and this value was obtained 100 years ago.
Emily’s arithmetic average temperature is 98.01 degrees F. The difference between her temperature and the correct average temperature is 0.19 degrees F. The standard deviation given in the article is 0.73. Her body temperature is well within 0.73 degrees of the correct average temperature and, as such, is not particularly low because it is much smaller than the “average difference” between a random temperature value and the mean.
Sadie’s mean was 98.20588, while the correct body temperature average was 98.2. The difference was that Sadie’s body temperature was 0.00588 points higher. This shows that she does not have an unusual body temperature average. Also, if you look at her median and mode for the data, they were also 98.2. Thus her data show that her average was not only close to the mean (standard deviation was 0.73 and so Sadie’s average was well within the realm between the mean and the first standard deviation) but also that she had the most occurrence of 98.2 and the exact middle of her data was 98.2. This was a surprising discovery due to the fact that she didn’t feel her temperature was very stable throughout the week of data collection. Not only did she have have a severe cold but she spent most nights at work, where she often took her temperature while she was working (regardless of the strange looks she received from customers). So the fact that her temperature was not unique in the slightest is very interesting. Part of the explanation could be the fever reducing medication she took throughout the week that might have leveled out her temperature and kept it a little more stabilized.
Sadie’s data are not very representative of what her actual temperatures are. Throughout the data collection she was sick with a very severe cold and so not only was she taking medication (several different types of cold medicine although obviously not at the same time) but also her sleeping and eating patterns were thrown off. However, the interesting discovery is that her mean temperature (as well as the mode and median) was very close to the average correct temperature. Also, her standard deviation was half that of what the standard deviation for the correct temperature was. This means that her data on average deviated from the mean much less than normal, which is surprising as one might expect her temperature to deviate more wildly since this cold disrupted the pattern of her life so severely. However, one might take into account the fact that she was on a fever reducer, which may have leveled out her temperature. The mean of her data could be affected by the one outlier she had that raised the mean. Also, the medicine Sadie took may have regulated her body processes enough so that she received more temperatures closer to her mean. If she continued to take measurements throughout the year, she would have a more representative data collection. However, the data would have a higher standard deviation due to the fact that her temperature would fluctuate more and also, it would have a lower mean due to the fact that she usually has a lower temperature than that of 98.2. It’s usually around 97.8 or so when she takes it normally.
Emily’s temperature data are likely not very representative. Emily only took her temperature over the course of five days, often immediately after being exposed to extreme cold. Other systematic events such as eating very hot or cold food immediately before obtaining a temperature reading also had a likely impact on her data. If a substantial amount of the data are slightly “off”, the measures of central tendency, and standard deviation, will be “off” as well. Her data would probably be more accurate if she continued to take measurements throughout the semester, because the weather would become warmer and largely cancel out any temperature readings that were lower than normal. Additionally, the greater the number of readings, the more representative the mean will be. A larger sample size tends to mitigate the effects of outliers.
Sadie’s arithmetic average is 36.78104 degrees Celsius. Emily’s arithmetic average is 36.67708 degrees Celsius. We found these temperatures using the equation: Tc= (5/9)*(Tf – 32). To convert from Fahrenheit to Celsius, you subtract 32 from the Fahrenheit temperature and multiply by the fraction 5/9.
2. This relates to the statistical topic from class because as we see when we graph the data using a frequency distribution, the data follow a rough normal distribution. By finding the mode, median, mean, standard deviation, and variance we can show how well our data follow the average “correct” temperature data. This allows us to apply order to the chaos of our random data, which allows us to better predict where the next temperature will fall. By finding the mode, median, and mean we create an understanding of what the average temperature would be (and also how any outliers will affect the data especially by comparing the median with the mean). By finding the standard deviation and variance, we see how on average a temperature will deviate from the mean.
3. The following represent the mean, median, mode and standard deviation of Emily’s temperature values. Both the median and the mode are 98.3 degrees F. Interestingly, this value is even closer to the temperature that, according to the article, is the “true” average temperature (98.2) than her own average temperature. Had there not been an outlier in her data set (95.6), her average temperature would likely be very close to 98.3 degrees, given the frequency with which it appears in the data.
Calculated using a graphing calculator:
Mean: 98.01875
Median: 98.3(The calculator does not calculate mode automatically but I was able to obtain it from the ordered list I made on the calculator.)
Mode: 98.3
Standard deviation: (n-1) 0.729
Standard deviation (n): 0.717
Using SPSS:
Mean: 98.019
Median: 98.3
Mode: 98.3
Standard Deviation: 0.7293
Variance: 0.532
A histogram of Emily’s data: 
The following is Sadie’s measures using a hand calculator and SPSS. As you can see, her data do not deviate very much from the mean (which is almost the same as the median and mode) except for the one outlier in her data. Attached is also the frequency distribution from SPSS.
Using a hand calculator:
Mean: 98.20588
Median: 98.2
Mode: 98.2
Standard deviation (using n-1): 0.37331
Variance (using n-1): 0.13936
Standard deviation (using n): 0.36778
Variance (using n): 0.13526
Using SPSS:
Mean: 98.2059
Median: 98.2
Mode: 98.2
Standard Deviation: 0.36778
Variance: 0.135
And this is what her frequency distribution showed: 
As you can see her data were well grouped around the mean. Also it roughly shows a normal distribution (or at least the normal distribution model since it can’t ever be a true normal distribution since your temperature could never go on into infinity or negative infinity).
4. 24 January 2008. Standard deviation-Wikipedia, the free encyclopedia. Retrieved 24 January 2008 from http://en.wikipedia.org/wiki/Standard_deviation
Source: (for information about Fahrenheit to Celsius)
Converting between Fahrenheit and Celsius temperature scales. (2008). Retrieved January 28, 2008 from http://www.usatoday.com/weather/wtempcf.htm
Shoemaker, Allen L. (1996). What’s Normal? — Temperature, Gender and Heart Rate. Journal of Statistics Education, 4, 2. Retrieved 25 January 2008, from http://www.amstat.org/publications/jse/v4n2/datasets.shoemaker.html
5. The data have some limitations. The presence of an outlier in Emily’s data (95.6) made her average temperature much lower than it would have been if the temperature had not been an outlier. Therefore, the mean that Emily calculated probably does not represent her true “average” temperature; it may, in fact, be significantly higher than the one obtained in this experiment. One strength of the data is that Emily obtained enough temperature values to provide a reasonable estimate of what her average temperature could be. While this is still not perfect, obtaining 32 temperature values provides a much more accurate approximation of her “true” average temperature than, say, a collection of 5 temperature values. The mean of a larger data set will be less affected by outliers than the mean of a smaller data set.
Sadie also had an outlier in her data (99.4). It slightly affected her mean, raising it to 98.20588, while her median and mode showed that 98.2 would have been a slightly more accurate representation of her data.
Done by Sadie Tyree and Emily Vorek