4.4.1 - StatKey: Standard Error Method

4.4.1 - StatKey: Standard Error Method

The following examples use StatKey to construct bootstrap sampling distributions. When the sampling distribution is approximately normally distributed, we can use the standard error method to construct a confidence interval from the bootstrap sampling distribution. Recall for a 95% confidence interval: \(statistic \pm 2 (standard\ error)\). This method can only be used when the sampling distribution is approximately normal. If the sampling distribution is not approximately normal, then the percentile method must be used. 

To construct a bootstrapped confidence interval using the standard error method follow these steps:

  1. Determine what type of variable(s) you have and what parameters you want to estimate. StatKey will bootstrap a confidence interval for a mean, median, standard deviation, proportion, difference in two means, difference in two proportions, simple linear regression slope, and correlation (Pearson's r). 
  2. Get your sample data into StatKey. There are some built-in datasets and you have the ability to enter in your own data. This procedure varies depending on the test you're conducting. For a proportion you need to enter the number of successes and number of trials. For anything involving quantitative data you will need to copy and paste your data into StatKey (this is the recommended method) or upload it as a txt, csv, or tsv file. 
  3. Generate at least 5,000 bootstrap samples.
  4. Confirm that your bootstrap distribution is approximately normal. If it's not approximately normal you should consider using the percentile method.
  5. Use your original sample statistic and the standard error from your bootstrap distribution to construct a confidence interval.
    For a 95% confidence interval the formula is \(statistic \pm 2 (standard\ error)\)

It is possible to use the standard error method to construct confidence intervals at levels other than 95% if you have the appropriate multiplier.  Later in the course, in Lesson 7, we will learn more about how other multipliers can be found. 


4.4.1.1 - Example: Proportion of Lactose Intolerant German Adults

4.4.1.1 - Example: Proportion of Lactose Intolerant German Adults

In a representative sample of 500 German adults, 78 are lactose intolerant. Use StatKey to construct a 95% confidence interval to estimate the proportion of all German adults who are lactose intolerant.


4.4.1.2 - Example: Difference in Mean Commute Times

4.4.1.2 - Example: Difference in Mean Commute Times

The following example uses StatKey to compare the average commute times in Atlanta and St. Louis. Commute time is a quantitative variable, and we are examining the difference in two independent (i.e., not match/paired) groups.


Legend
[1]Link
Has Tooltip/Popover
 Toggleable Visibility