File ID is part of Oracle's family of OEM products known as Outside In Technology, a powerful document extraction, conversion and viewing technology that can access the information in more than 600 file formats.
There may be references to other Outside In Technology SDKs within this manual. To obtain complete documentation for any other Outside In product, see:
http://www.oracle.com/technetwork/indexes/documentation/index.html#middleware
and click on Outside In Technology.
This chapter includes the following sections:
The updated list of supported formats is linked from the page http://www.outsideinsdk.com/. Look for the data sheet with the latest supported formats.
The following new formats are supported:
Microsoft Word 2016
Microsoft Excel 2016
Microsoft PowerPoint 2016
MS Outlook 2011 for Mac (OLM and EML)
Corel WordPerfect X7
Corel Quattro Pro X7
Corel Presentations X7
Corel Draw X7
iWork KeyNote (text only)
AutoCAD 2015
The following new options are introduced:
A new option, SCCOPT_PDF_FILTER_MAX_EMBEDDED_OBJECTS, is added that allows you to limit the number of embedded objects produced in PDF files.
A new option, SCCOPT_PDF_FILTER_MAX_VECTOR_PATHS, is added that allows you to limit the number of vector paths produced in PDF files.
A new option, SCCOPT_PDF_FILTER_WORD_DELIM_FRACTION, is added. This allows you to control the spacing threshold in PDF input documents.
Support for the following general accuracy and fidelity features is provided:
MS Word table styles supported
MS Office Chart data label styles extended
Font selection algorithm improvements implemented
Outlook MSG “best body” algorithm implemented
PPTX Master slide Transparency provided
Four Color (CMYK) progressive JPEG supported
Processing of very large spreadsheets containing large areas of white space are optimized for improved performance supported
Strict Open XML versions of office formats identified
Apple iWork Pages and Numbers identified
WebP raster identified
The following Operating System support is provided:
Windows 10
SLES 12
The Outside In File ID API allows developers to identify files using the same technology that all Outside In products use internally. This specification uses a 16-bit value called the ID or type ID to identify different file formats. These IDs are defined in sccfi.h.
This API includes the following functions:
FIIdFile: Returns an ID given a file.
FIIdFileEx: Returns an ID and an ID name given a file.
FIGetFirstId: Returns the first ID in the range of IDs used by this API.
FIGetNextId: Returns the next ID in the range of IDs used by the API.
Each Outside In product has an sdk directory, under which there is a subdirectory for each platform on which the product ships (for example, fi/sdk/fi_win-x86-32_sdk). Under each of these directories are the following three subdirectories:
redist: Contains only the files that the customer is allowed to redistribute. These include all the compiled modules, filter support files, .xsd and .dtd files, cmmap000.bin, and third-party libraries, like freetype.
sdk: Contains the other subdirectories that used to be at the root-level of an sdk: common, lib (windows only), resource, samplefiles, and samplecode (previously samples). In addition, one new subdirectory has been added, demo, that holds all of the compiled sample apps and other files that are needed to demo the products. These are files that the customer should not redistribute (.cfg files, exportmaps, and so forth.).
In the root platform directory (for example, fi/sdk/fi_win-x86-32_sdk), there are two files:
README: Explains the contents of the sdk, and that makedemo must be run in order to use the sample applications.
makedemo (either .bat or .sh – platform-based): This script will either copy (on Windows) or Symlink (on UNIX) the contents of …/redist into …/sdk/demo, so that sample applications can then be run out of the demo directory.