To use distribution fitting:
Choose Define, then Define Assumption, .
(In Microsoft Excel 2007 or later, click the upper half of the Define Assumption icon.)
In the Distribution Gallery, click Fit to select the source of the fitted data.
If the historical data is in a worksheet in the active workbook, choose Range, and then enter the data’s cell range. If the range has a name, you can enter the name, preceded by an = sign.
If the historical data is in a separate text file, click Text File, and then either enter the path and name of the file or click Browse to search for the file. If you want, you can check Column and enter the number of columns in the text file.
When you use a file as the source of data, each data value in the file must be separated by either a comma, a tab character, a space character, or a list separator defined in Windows’ Regional and Language Options panel. If actual values in the file contain commas or the designated list separator, those values must be enclosed in quotes. Allowable formats for values are identical to those allowed within the assumption parameter dialog, including date, time, currency, and numbers.
Specify which distributions are to be fitted:
AutoSelect performs a basic analysis of the data to choose a distribution fitting option and ranking method. If the data includes only integers, fitting to all discrete distributions (with the exception of Yes-No) is completed using the Chi-square ranking statistic choice.
All Continuous fits the data to all of the built-in continuous distributions (these distributions are displayed as solid shapes on the Distribution Gallery).
All Discrete fits to all discrete distributions except yes-no and uses the Chi-square ranking statistic.
Choose displays another dialog where you can select a subset of the distributions to include in the fitting.
The final setting selects the distribution that was highlighted on the Distribution Gallery when you clicked the Fit button.
If you try to fit negative data to a distribution that can only accept positive data, that distribution will not be fitted to the data.
Specify how the distributions should be ranked.
In ranking the distributions, you can use any one of three standard goodness-of-fit tests:
Anderson-Darling. This method closely resembles the Kolmogorov-Smirnov method, except that it weights the differences between the two distributions at their tails greater than at their mid-ranges. This weighting of the tails helps to correct the Kolmogorov-Smirnov method’s tendency to over-emphasize discrepancies in the central region.
Chi-Square. This test is the oldest and most common of the goodness-of-fit tests. It gauges the general accuracy of the fit. The test breaks down the distribution into areas of equal probability and compares the data points within each area to the number of expected data points. The chi-square test in Crystal Ball does not use the associated p-value the way other statistical tests (e.g., t or F) do.
Kolmogorov-Smirnov. The result of this test is essentially the largest vertical distance between the two cumulative distributions.
The first setting, AutoSelect,, selects the ranking statistic automatically based on several factors. If all data values are integers, Chi-Square is selected.
If you know the data corresponds to certain shape, location, or other special parameter values for some distributions, you can choose to lock parameters to those values.
Check Lock Parameters and enter appropriate values in the Lock Parameters dialog. For details, see Locking Parameters When Fitting Distributions.
By default, only values for the selected ranking statistic are displayed in the Comparison Chart dialog. To show values for all three statistics, check Show All Goodness-of-fit Statistics at the bottom of the Fit Distribution dialog.
The Comparison Chart opens.