Estimating Data Accuracy with the Bootstrap Tool

Bootstrapping is a simple technique that estimates the reliability or accuracy of forecast statistics or other sample data. Classical methods rely on mathematical formulas to describe the accuracy of sample statistics. When a statistic’s sampling distribution is not normal or cannot easily be derived, these classical methods are difficult to use or are invalid.

Bootstrapping analyzes sample statistics by repeatedly sampling the data and creating distributions of the different statistics from each sampling. The term bootstrap comes from the saying, “to pull oneself up by one’s own bootstraps”, since this method uses the distribution of statistics itself to analyze the statistics’ accuracy.
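The resampling idea described above can be sketched in a few lines of Python. This is an illustration only, not the tool's implementation: the function name bootstrap_distribution, the synthetic sample, and the choice of 1,000 resamples are all assumptions made for the example.

```python
import random
import statistics

def bootstrap_distribution(sample, statistic, n_resamples=1000, seed=0):
    """Resample `sample` with replacement and collect `statistic` from each resample."""
    rng = random.Random(seed)
    n = len(sample)
    return [statistic(rng.choices(sample, k=n)) for _ in range(n_resamples)]

# Synthetic data standing in for forecast values (an assumption for this sketch).
data_rng = random.Random(42)
original_sample = [data_rng.gauss(100.0, 15.0) for _ in range(200)]

# Distribution of the mean across the resamples.
boot_means = bootstrap_distribution(original_sample, statistics.mean)

# The spread of that distribution estimates the accuracy (standard error) of the mean.
standard_error = statistics.stdev(boot_means)
print(f"sample mean: {statistics.mean(original_sample):.2f}")
print(f"bootstrap standard error of the mean: {standard_error:.2f}")
```

A narrow spread in boot_means suggests the statistic is estimated precisely; a wide spread suggests that it is not.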

Two bootstrap methods are available with this tool: the one-simulation method and the multiple-simulation method.

Note:

When you use the multiple-simulation method, the tool temporarily turns off the Use Same Sequence Of Random Numbers option. In the statistics literature, the one-simulation method is also called the non-parametric bootstrap, and the multiple-simulation method is also called the parametric bootstrap.

Figure 58. Bootstrap Simulation Methods

This figure displays the one-simulation and multiple-simulation methods.
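The difference between the two methods can be sketched as follows. The sketch assumes that the one-simulation method runs the model once and then resamples those trials with replacement, and that the multiple-simulation method reruns the model for every sample. The function run_simulation is a hypothetical stand-in for a model, and none of these names come from the tool itself.

```python
import random
import statistics

def run_simulation(rng, n_trials):
    """Hypothetical stand-in for one simulation run; returns n_trials forecast values."""
    return [rng.gauss(10.0, 2.0) * rng.uniform(0.8, 1.2) for _ in range(n_trials)]

def one_simulation_method(statistic, n_trials, n_samples, seed=1):
    """Simulate once, then resample the resulting trials with replacement."""
    rng = random.Random(seed)
    original_sample = run_simulation(rng, n_trials)
    return [statistic(rng.choices(original_sample, k=n_trials))
            for _ in range(n_samples)]

def multiple_simulation_method(statistic, n_trials, n_samples, seed=1):
    """Rerun the simulation for every sample.

    The generator is never reseeded between runs, so each simulation draws
    different random numbers. Reusing the same sequence would make every run
    identical and the resulting distribution would collapse to a single value.
    """
    rng = random.Random(seed)
    return [statistic(run_simulation(rng, n_trials)) for _ in range(n_samples)]

one_sim = one_simulation_method(statistics.median, n_trials=500, n_samples=200)
multi_sim = multiple_simulation_method(statistics.median, n_trials=500, n_samples=200)

print(f"one-simulation spread of the median:      {statistics.stdev(one_sim):.3f}")
print(f"multiple-simulation spread of the median: {statistics.stdev(multi_sim):.3f}")
```

The comment in multiple_simulation_method illustrates one plausible reason a fixed random number sequence would need to be switched off for that method.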

Since the bootstrap technique does not assume that the sampling distribution is normally distributed, you can use it to estimate the sampling distribution of any statistic, even an unconventional one such as the minimum or maximum value of a forecast. You can also easily estimate complex statistics, such as the correlation coefficient of two data sets, or combinations of statistics, such as the ratio of a mean to a variance.
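As a rough illustration of this flexibility, the sketch below bootstraps two statistics that have no convenient classical formula: the ratio of a mean to a variance, and the maximum. The data are synthetic, and the helper names (mean_to_variance_ratio, bootstrap, percentile_interval) are assumptions made for the example. The simple percentile interval is one common choice, not necessarily what the tool reports.

```python
import random
import statistics

def mean_to_variance_ratio(values):
    """A combined statistic: the sample mean divided by the sample variance."""
    return statistics.mean(values) / statistics.variance(values)

def bootstrap(sample, statistic, n_resamples=500, seed=7):
    """Collect `statistic` from resamples of `sample` drawn with replacement."""
    rng = random.Random(seed)
    n = len(sample)
    return [statistic(rng.choices(sample, k=n)) for _ in range(n_resamples)]

def percentile_interval(values, lower=0.025, upper=0.975):
    """Simple percentile interval taken from the sorted bootstrap values."""
    ordered = sorted(values)
    last = len(ordered) - 1
    return ordered[int(lower * last)], ordered[int(upper * last)]

# Synthetic forecast values (an assumption for this sketch).
data_rng = random.Random(3)
sample = [data_rng.gauss(50.0, 10.0) for _ in range(100)]

boot_ratio = bootstrap(sample, mean_to_variance_ratio)
boot_max = bootstrap(sample, max)

ratio_low, ratio_high = percentile_interval(boot_ratio)
print(f"mean/variance ratio, 95% interval: {ratio_low:.3f} to {ratio_high:.3f}")
print(f"spread of the bootstrapped maximum: {statistics.stdev(boot_max):.3f}")
```

Note that a resampled maximum can never exceed the largest value in the original sample, so bootstrap estimates for extreme statistics such as the minimum or maximum should be read with some care.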

To estimate the accuracy of Latin Hypercube statistics, you must use the multiple-simulation method.