Data Sources and Subject Areas

You can combine data sources with subject areas to explore and analyze the data.

A subject area either extends a dimension by adding attributes or extends facts by adding measures and optional attributes. You can't define hierarchies in data sources.

A subject area organizes attributes into dimensions, often with hierarchies, and a set of measures, often with complex calculations, that can be analyzed against the dimension attributes, for example, the measure net revenue by customer segment for the current quarter, and the same quarter a year ago.

When you use data from a source such as an Excel file, the file adds information that is new to the subject area. For example, suppose you purchased demographic information for postal areas or credit risk information for customers and want to use this data in an analysis before adding the data to the data warehouse or an existing subject area.

Using a source as standalone means that the data from the source is used independently of a subject area. It’s either a single file used by itself or it’s several files used together and in both cases a subject area isn't involved.

You can extend a dimension by adding attributes from a data source to a subject area:

  • You can only make matches to a single dimension.

  • The set of values in matched columns are unique in the data source. For example, if the data source matches on ZIP code, then ZIP codes in the source are unique.

  • You make matches between one or composite columns. An example of a one column match is that product key matches product key. For composite columns, an example is that company matches company and business unit matches business unit.

  • All other columns must be attributes.

You can add measures from a data source to a subject area:

  • You make matches to one or more dimensions.

  • The set of values in matched columns aren't necessarily unique in the data source. For example, if the data source is a set of sales matched to date, customer, and product, then you can have multiple sales of a product to a customer on the same day.

  • You make matches between one or composite columns. An example of a one column match is that product key matches product key. For composite columns, an example is that company matches company and business unit matches business unit.

A data source that adds measures can include attributes. You can use these attributes alongside external measures and not alongside curated measures in visualizations. For example, when you add a source with the sales figures for a new business, you can match these new business sales to an existing time dimension and nothing else. The data might include information about the products sold by this new business. You can show the sales for the existing business with those of the new business by time, but you can’t show the old business revenue by new business products, nor can you show new business revenue by old business products. You can show new business revenue by time and new business products.