Summarize Starting Data

Not all data is truly needed. As discussed earlier, the use of summarization points in data before it is loaded is one of the best tools. Account detail is a common example. Rather than loading expenses at the lowest level of detail, use aggregate cost pools instead. Use this strategy for every dimension where possible in your data. Refer to the earlier questions regarding detail needed for reporting or allocation process.

Ask these questions to determine whether details are needed for the reporting or allocation process:

  • Is the detail needed for reporting?

  • Is the detail needed to differentiate data to support allocation logic?

This step alone can shrink starting data size by one or more orders of scale.