2.1 External Data Descriptor (EDD)
- It enables:
- Ingestion of external data into DFCS.
Extraction of data from DFCS.
- Control information management during data ingestion.
- Ingestion of external data into DFCS.
Each EDS can have multiple EDDs, each tailored to a specific data format or control logic.
- On the Financial Services Data Foundation Service for Banking home page,
click External Data Descriptors.The Summary page is displayed.
- Click Search to search the existing Data Descriptor from
the available list. Sort by clicking on the Name drop
down.
- Name
- Last Modified Date
- Click Add to configure additional EDDs.
- Enter the Specify Data Descriptor name and Describe the Data Descriptor details.
- Configure File Settings (under the Data tab):
Table 2-1 Data Tab Description
Fields Description Toggle Is the file archived? By enabling this option, you can upload an archived data file to the object store as input to the DFCS data ingestion process. You can also combine multiple files into an archive and use it to load data into DFCS. Note:
The Archive file name field appears only when Is the file archived option is enabled. Enter the archive file name.Example: td_contracts%#MISDATE%.zip
Only archives with.zip extension using standard DEFLATE algorithm is supported.Specify data file names Click on Add More Files. Enter the file name and click Ok. You can add multiple data files to an EDD. For example, if you need to add the Term Deposits Contracts data file. There are Term Deposits Contracts data files for retail as well as corporate accounts. Therefore, to get both these details, you first add the Term Deposits Contracts data file for retail accounts, such as td_contracts%#MISDATE%_1.csv, and as the next record, add the Term Deposits Contracts data file for corporate accounts.In case the file sizes are large, it is recommended you break the file into smaller files. Optimize file size is 3 GB for each file. Example: td_contracts%#MISDATE%_1.csv Select File Format There are two options: - Fixed Length: The file has records and columns with a fixed length. Each column has a predetermined and unchanging size, set when the record layout is designed, and the sum of the column sizes add up to the record size.
- Delimited: There is a separation of the records and columns using a delimiter character like a comma, semicolon, hyphen, and so on.
For example, select Delimited.
Select Record delimiter The records are stored differently in different operating systems. The available options are: - MS-DOS
- Unix
- No Record Delimiter
- Other
For example, select Unix.
Specify Text Qualifier (Optional) A character that identifies text. Generally, double quotes are prefixed and suffixed to identify text. This is optional. Specify the number of records to skip Provide the number of records to be skipped. The records are skipped from the top. Generally, this is used to skip headers. Specify Decimal Separator The character used to identify the decimal and fractional part. Usually point (.) or comma (,) Do you want to read from template? (Optional) Enable Do you want to read from template? to auto-populate values in the Excel file format. If the template is not available, create it manually by clicking Add, under the Data Elements. If the template is available, you can browse for the template. See the File EDD Template. You can also drop the template in the area “Drop template here or click to select “.
Select Template (*.xls,*.xlsx,*.csv Files Only) Click Browse and select the required template. Data Elements If not using a template, manually add data elements using the Add option. Name Name of the field in EDD. Example: Field name in a file or column name in a table.
Note:
The Field name of XML type must not be more than 25 characters and for others must not be more than 30 characters.Type This shows the Data type. Example: String, Number, and Date.
Length This is applicable only for the EBCDIC format. This is the length of the EBCDIC data type. In the case of a file, it is length only. Scale This is used to specify the number of digits after the decimal point. Example: 10.3.
Format Specify the format for columns of type date here. If left blank, a default format of DD/MM/YYYY is assumed and used.
Note:
- The default format is fixed and does not change with database or system language settings.
- For data ingestion, enter the format in which date fields are provided.
- For data extraction, enter the format in which extracted date fields must be recorded.
For example, in the file extract, to represent the date, 31st January 2020, as “31/01/2020”, specify the format as DD/MM/YYYY for the date columns.
Record Type Code This identifies the Record type in a file where Header, Trailer, and Data are of different record length and type. The values can be any string available in the text file. This value is only possible for the first field in a file. Example: The values can be DATA; CTRL to specify it is a control record.
- Steps to Configure Control Settings in EDD: Reconcile between source data received and data loaded into Financial Services Data Foundation Cloud Service for Banking, using the control information available from the source system. This feature is used to implement record count validation or amount reconciliation from a source file. It also allows specifying a threshold for validation. The specified threshold is compared with the difference (record count or amount) to perform the validation. The execution fails only if the difference is more than the threshold value.
- The threshold can be an absolute or a percentage value. The connector execution process considers the threshold type while performing reconciliation.
- Generate control information for file extracts from DFCS. Specifying control
for extract EDDs generates control details, based on the configuration in the
EDD. Both record count and amount value (sum, average, max, and so on) for
specific columns are recorded into the control file.
Note:
This option is applicable only for file type EDDs (ASCII and EBCDIC).Table 2-2 Control Tab Description
Field Description NOTE: The Reconciliation details present in separate file option cannot be modified if the option Is the file archived in Data Tab is enabled. When Archive option is enabled:
If you want to perform reconciliation, control file must always be present in the same archive as input data files and should be a separate file.
The control file should have only two columns that are Control Name and Expected Value.
When Separate File is selected as Yes. File Name Specify the name of the file. File Format There are two options:
- Fixed Length: The file has records and columns with a fixed length. Each column has a predetermined and unchanging size, set when the record layout is designed, and the sum of the column sizes add up to the record size.
- Delimited: There is a separation of the records and columns using a delimiter character like a comma, semicolon, hyphen, or so on.
In the previous example, select Delimited.
Column Delimiter If the File Format is selected as Fixed Length, the Column Delimiter would be Other by default. If the File format is selected as Delimited, the following options are available in the drop-down list.
- Other
- Space
- Semicolon
- Comma
- Tab
In the previous example, select Comma.
Record Type Code Used to uniquely identify a record within a file. A financial institution sometimes provides files that have data and control records within the same file. In such a case, to distinguish between data record and control record, the first field is Record Type. It has a specific value to identify that. Here, specify the value that identifies the data. Values can be ‘DATA’ and so on. For the Control record, the value is specified under the Control tab. Only the first field of a file is used for Record Type. Record Delimiter The records are stored differently in different operating systems. The following options are available: - MS-DOS
- Unix
- No Record Delimiter
- Other
For example, select Unix.
Skip number of records Provide the number of records to be skipped. The records are skipped from the top. Generally, this is used to skip Headers. Example: If you specify this as 1, the first row in the file will be ignored. Text Qualifier A character that identifies text. Generally, double quotes are prefixed and suffixed to identify text. This is optional. Decimal separator Specify up to which decimal digit you want to view the result. Record Type Length The length of the record type value to pick up the correct record. For example, if the control record is “DATATotal Records 400” and DATA is the Record type, the length is ‘4’. This is applicable only for Control records that are of fixed length. Control Name Length Based on the previous example, the Control name is “Total Records”. Therefore, the Control Name Length is ‘13’. Control Value Length Based on the previous example, the Control value is 400. Hence, the length of the control value is ‘3’. When Are Reconciliation details present in Separate File option is disabled, the below fields will be enabled. Record Type Code Used to uniquely identify a record within a file. A financial institution sometimes provides files that have data and control records within the same file. In such a case, to distinguish between data record and control record, the first field is Record Type. It has a specific value to identify that. Here, specify the value that identifies the data. Values can be ‘DATA’ and so on. For the Control record, the value is specified under the Control tab. Only the first field of a file is used for Record Type. Control Value Length Based on the previous example, the Control value is 400. Hence, the length of the control value is ‘3’ Control Name Length Based on the previous example, the Control name is “Total Records”. Hence, the Control Name Length is ‘13’. Controls Control Name Specify the name of the control. Aggregation Method Select either Aggregation Method or Count. The supported aggregation methods are as follows:
- Min
- Max
- Average
- Sum
Aggregation Column Name Select the column on which the aggregation method is applied. Note:
For count, no column needs to be selected.Threshold Type This field is optional. There are two available options for threshold: percentage or absolute. If you select percentage, the reconciliation difference in percent is matched against this threshold value.
If you select absolute, the absolute percent difference is matched against this threshold value.
Threshold Value Specify the difference value in percent or absolute. - Click on the Transformation tab at the top of the EDD configuration
screen.
Table 2-3 Transformation Tab Description
Fields Description Name Name of the derived field in the EDD. Note:
Field names must not be more than 30 characters.Type Shows the Data type of thefield.
Example: Varchar2, Number, Date, and so on.
Expression When you select the Add option, the Specify Expression window is displayed. Here, you can select the required entities, functions, and operators. That is, you can write your expression. Enter the field name and click OK. Then the newly created field name is listed. If the Transformation Type is selected as Aggregation:
Derived Data Elements Name Name of the derived field in the EDD. Note:
Field names must not be more than 30 characters.Type This shows the Data type of thefield. Example: Varchar2, Number, Date, and so on.
Expression When you select the Add option, the Specify Expression window is displayed. Here, you can select the required entities, functions, and operators. That is, you can write your expression. Enter the field name and click OK. Now the newly created field name is listed. Aggregation Properties Group By This is available when Aggregation is selected. Having This is available when Aggregation is selected. - Save the Descriptor: Click Save as Draft to store your progress or Save to finalize the descriptor once all required fields are completed.