E.6 Alternatives to Standard Data Management Practices

The following sections provide alternative data management practices.

Data Management Archiving

During ingestion processing, the system moves processed files into an archive directory. Firms can use these files to recover from processing malfunctions, and they can copy these files to off-line media for backup purposes.

The Pre-processor moves files in the /inbox directory. All other components move their input files to date-labeled subdirectories within the /backup directory.

Periodically, an Oracle client can run the runIMC.sh script to perform the Ingestion Manager cleanup activities. This script deletes old files from the archive area based on a configurable retention date. Periodic running of the cleanup script ensures that archive space is available to archive more recent data.

Fuzzy Name Matcher Utility

During Datamap processing, the Fuzzy Name Matcher utility is used to match names of individuals and corporations (candidates) against a list of names (targets). The utility calculates a score that indicates how strongly the candidate name matches the target name. All matches are case-insensitive.