Configure Classification Jobs

You can create classification jobs to automate detection of languages and classification of documents when the documents are received. After classification jobs are committed, asset languages are set in Oracle Content Management. The results can then be used to create appropriate custom digital assets and assign the correct language to the source asset.

Note:

If your Oracle Content Management instance was created before mid-February 2023, you need to enable OCI services content sharing deployment options required for using advanced Content Capture features. For more information, see Edit Your Oracle Content Management Instance in Administering Oracle Content Management.
  1. In the procedures pane on the left, select your procedure.

    The configuration pages for the selected procedure appear on the right.

  2. Open the Processing tab.
  3. In the Classification Jobs table, click Add classification job, or to edit an existing job, click Edit classification job.

    You can also copy a classification job by selecting it, clicking Copy classification job, and entering a new name when prompted. Copying a job allows you to quickly duplicate and modify it.

  4. On the Document Selection page:

    1. In the Job Details section:
      • Enter a name and description for the job.

      • Select the Online check box to make the available for processing.
      • Select Language Detection and Document Classification actions. Actions is a required field. One of the two actions must be selected to proceed further. However, for efficiency, it is recommended that you create a single job to automate language detection and document classification.
    2. In the Document Processing section:
      • Select the Process Documents check box if applicable to the choices you made in the previous steps.
      • To process the classification job for specific document profiles, select one or more document profiles listed in the Restrict to Document Profiles field, or select All to process documents for all defined document profiles. Not selecting any is also equivalent to All.

    3. If you selected the Document Classification action check box in the Job Details section, the Attachment Processing section is enabled for you:
      1. Select the Process Attachments check box if applicable to the choices you made in the previous steps. This option leverages auto language detection and auto classification for attachments that are received from Content Capture Client.
      2. Restrict to the required attachment types by selecting the check boxes for the available attachment types. You can also select all of them.
  5. If you selected the Language Detection action in the Job Details section on the Document Selection page, the Language Detection page is enabled for you:
    • Selecting primary language is mandatory. Select the required language from the Primary Language drop-down list. Primary language has the maximum weightage in the document. This is also the language that is the most prominently used in the document.
    • Set the minimum threshold by dragging the Minimum Threshold bar. It isn't not mandatory to set the threshold as it will at least be zero. The minimum threshold ties with the primary language. The threshold indicates that if there is no language above this value, the Primary Language field remains blank.
    • Select the required languages from the Document Language drop-down list. A document can have multiple languages.
  6. If you selected the Document Classification action in the Job Details section on the Document Selection page, the Document Classification page is enabled for you. On this page:
    1. In the Document Classification section:
      • Define classifications types out-of-the-box by document understanding in the Document Profile Mappings section, if required.
      • Set the minimum threshold to indicate the confidence score for classification types. This score applies across all classification types. The minimum threshold applies to the mappings. This threshold must be met and the mapped document profile is assigned.

        However, this also pertains to the classification type that has the highest value. For example, document understanding may give the type of invoice with the highest value. If that is mapped and above the threshold, then the document profile is assigned. If there are no matches or the threshold isn't met, then the default document profile is used.

    2. In the Attachment Classification section,
      • Define classifications types out-of-the-box by document understanding in the Attachment Type Mappings section, if required.
      • Set the minimum threshold to indicate the confidence score for classification types. This score applies across all classification types. The minimum threshold applies to the mappings. This threshold must be met and the mapped attachment type is assigned. However, this also pertains to the classification type that has the highest value. If there are no matches or the threshold isn't met, then the default attachment type is used.
  7. On the Post-Processing page, specify based on the following what happens after your classification job completes:
    • No system error situations are cases in which all the criteria on the previous page were met. A successful transformation can flow to commit.
    • System errors are any cases in which the transformation fails: no records found, too many records found, and so on. For unsuccessful transformations, the batch returns to the Content Capture Client for repair.
  8. Review settings on the Summary page and click Submit to save the job.

  9. Configure how batches flow to your classification job. See Configure Batch Flow to a Classification Job.

  10. Test the classification job you created.

Configure Post-Processing and Monitoring of a Classification Job

Use post-processing options of a classification job to specify what happens after processing completes.

To configure post-processing and monitoring of a classification job:
  1. In the procedures pane on the left, select your procedure.
    The configuration pages for the selected procedure appear on the right.
  2. Open the Processor tab.
  3. In the Classification Jobs table, add or edit a classification job.
  4. Click the Post-Processing page to display the processing options for successful processing (no system errors) and unsuccessful processing (one or more system errors). The next steps apply to both sections: When there is no system error and When there are system errors.
  5. In the Batch Processor and Batch Processor Job fields, specify which processing step, if any, occurs after the classification job completes. Only those batch processor jobs that you have defined for your procedure are available for selection. For example, if you didn't define a recognition processor job, then the recognition processor option will not be available in the Batch Processor field. If you choose recognition processor or conversions to TIFF/PDF in the Batch Processor field, then specify the corresponding processor job.
  6. In the email address field, optionally enter an address to which to send an email after processing completes successfully or fails. While configuring and testing a classification job, you might set yourself to receive email notifications upon system errors, then later automatically alert an administrator of processing errors.
  7. In the remaining fields, specify how to change processed batches.
    • Rename batches by adding a prefix. For example, rename batches that were unsuccessful with the prefix ERR for follow-up.

    • Change batch status or priority. For example, you might change the status of batches with system errors, then create a client profile with batch filtering set to this status to allow qualified users to manually edit and complete batches that encountered errors.

  8. Click Submit to save the job.

Configure Batch Flow to a Classification Job

To run a classification job, you must configure batches to flow to the job for processing. You do this by setting the classification job as a post-processing step in a client profile, an import processor job, or a processing job discussed under processing settings. See the post-processing information for the job or client profile from which you want to configure a batch flow, under Configure Processing Settings or Configure Capture Settings.

Deactivate or Delete a Classification Job

When you delete a classification job, it no longer remains available for batches for which it is set as a post-processing step. If a job specified for post-processing is not available, an error results for the batch. You may want to change a job to offline for a time before deleting it, allowing you to resolve unexpected issues with its deletion. Online classification jobs run when they are selected in a client profile or on the Post-Processing page of a processor job. You can temporarily stop a job (take it offline) or change a deactivated job to run again. You cannot delete batch processing jobs if they are configured as a post processing job in another batch processor.

To deactivate or delete a classification job:
  1. In the procedures pane on the left, select your procedure.
    The configuration pages for the selected procedure are displayed on the right.
  2. Open the Processing tab.
  3. To deactivate a job, select it in the Classification Jobs table and click Deactivate or activate a classification job.

    You can also deactivate or activate a classification job by selecting or deselecting the Online field on the Document Selection page.

  4. Select the deactivated job in the Classification Jobs table and click Delete a classification job.
  5. When prompted, click Yes to confirm that you want to delete this job.