1 Data Catalog Overview

Data Catalog is the logical representation of the underlying Data Model, which is contextualized by the Metadata to enable a better understanding of the Data Model and the enterprise-wide data. For example, understanding the End of Period Book Balance in the context of Loans and Securities may require two definitions of the term in discovering. A further analysis of the Metadata helps discovering, the sources, current business uses of the element, validation checks, and any privacy aspects.

The Data Catalog comprises of elements called Business Terms supporting business needs of the Banking and Financial Services Industry across the Finance, Risk, and Regulatory Compliance Functions. Data Catalog includes sourced, calculated, and master elements. Elements that require conformation to the standards will have a list of expected values. A combination of Business Terms and Entities form the underlying Data Model. The search capability allows you to explore data by different dimensions.

The Data Catalog helps you understand the business relevance of an Element and the associated Data Definition, grain through Entities and Subject Areas. You can group the Data Catalog by Subject Area and this subset is narrowed down to a business use case (For example, Basel Credit Risk).

In a multi-domain Data Catalog environment, you need to select the required Financial Domain that filters the Subject Areas relevant to the selected Domain or Service, Entities under the Subject Area, and Elements mapped to the Entities.

The Data Catalog is the gateway to create, view, or manage the physical instance of the Data Model.

You can use out-of-the-box define Pipelines (table to table process, Connector) to load data into the Entity, and execute and manage the Process.

The fundamental objectives of the Data Catalog are as follows:
  • To provide a unified logical view of the Enterprise Data Model.
  • To enable data discovery using predefined Metadata that is modifiable by Users.
  • To support end-to-end data lineage as it connects data sources and uses.