5.6.4 Obtaining SAR Labels for Customer Risk Scoring
Obtain the SAR Information for Sandbox
Disposition/SAR information of the historical alerts that are required in sandbox for the purpose of supervised machine learning model training. SAR information acts as a target/depended variable for the model training.
Populate Investigated Entity Details
Obtain the SAR from CRR/ECM
Use
aif.load_sar_data ()
API to load the Suspicious Activity Report (SAR) entities details from the
Compliance Regulatory Reporting (CRR) application and Non-SAR entities from ECM into
Compliance Studio.
Note:
“aif” is just a package that is available as part of compliance Studio.The data will be loaded into the
aif_investigated_entity
table.
The following parameters are the input value for the
paragraph:
- from_date: From date range in YYYYMMDD format for SAR/Alert creation date.
- to_date: To date range in YYYYMMDD format for SAR/Alert creation date.
- CRR_conn: CRR Connection object.
- ECM_conn: ECM Connection object.
Note:
- Register Oracle wallet entries/aliases for CRR & ECM Atomic schema to connect within Compliance Studio
- Use the aliases mentioned here to create/register entries. If aliases are being created with some other name, use them accordingly in the Admin Notebook.
Obtain the SAR from the CSV file
Use
aif.load_sars_from_csv()
API to load the SAR and
Non-SAR entities into a CSV file.
The following parameters are the input value for the
paragraph:
- filename: Complete path of the CSV file.
- headerIncluded: This parameter has two options: Y
or N. If the file has data with the header, then Y or N.
Note:
- The date should be in
YYYYMMDD HH24:MI:SS
format. - Records should be comma-separated (CSV).
- The date should be in
Ensure that the following columns are available in the CSV
files with the required values:
- ENTITY_ID: Customer Id or Account Id
- SUSPICIOUS_FLAG: This parameter has two options: Y or N. If E-file for Regulatory body has been sent for Customer or Account, then Y or N.
- ALERT_DATE: SAR/EVENT generated to date from Customers and Accounts
- CREATED_ON: CSV file creation date
- CREATED_BY: CSV file created by
- UPDATED_ON: CSV file updated date
- UPDATED_BY: CSV file updated by
- LABELLED_SCENARIO: This value has the following
options:
- CUST: For customer-level SAR
- ACCT: For account level SAR
- ENTITY_CD: This value has the following options:
- If entity type is customer
- If entity type is the account
Obtain the SAR classification from the CRR database
The
aif.get_case_data_and_sar_classification()
API gets SAR
classification from CRR schema, merge with entity ID (Customer ID) in ECM, and
stores as metadata in Compliance Studio schema table,
aif_case_information
.
The
aif_case_information
table columns
are as follows:
- ENTITY_ID
- CASE_ID
- SAR_CLASSIFICATION
- FILING_AM
- CONTINUING_SAR
- FILING_DATE
The following parameters are the input value for the
paragraph:
- from_date: From date range in YYYYMMDD format.
- to_date: To date range in YYYYMMDD format.
- CRR_conn: CRR Connection object.
- ECM_conn: ECM Connection object.
- AIF_conn: AIF Connection object.
Format:
cx_Oracle.connect(<db_user/db_password@tns>)
On successful execution of the paragraph, the details will be loaded in
the
aif_case_information
table.
Note:
- Register Oracle wallet entries/aliases for CRR and ECM Atomic schema to connect within Compliance Studio.
- Use the aliases mentioned here to create/register entries. If aliases are being created with some other name, use them accordingly in the Admin Notebook.