Download Extract Output from Oracle WebCenter Content

To download extracted content from Oracle WebCenter Content:
  1. Search for manifest files where the DOCTITLE starts with MANIFEST_EXTRACT_ and sort the results by DOCDATE in descending order.
  2. Download each manifest file using the DOCID.
  3. Parse the manifest file to identify output ZIP files and their DOCID.
  4. For each output file listed in the manifest:
    • Download the ZIP file using the DOCID.
    • Compute the SHA-256 checksum of the downloaded file.
    • Verify the file content using the checksum provided in the manifest file.
  5. Extract the downloaded files and process the files based on their file extension, such as .csv or .json
  6. After successful processing, rename the corresponding manifest file by prefixing it with a timestamp in the format <timestamp>_<original_manifest_file_name>.mf to avoid reprocessing.
  7. Delete or expire the manifest file and processed files after 30 days to manage storage.