13 Search Export .NET Classes
The following classes are covered:
13.1 ArchiveNode Class
ArchiveNode provides information about an archive node. This is a read-only class where the technology fills in all the values.
Namespace
OutsideIn
Properties
-
IsDirectory (Boolean) A value of true indicates that the record is an archive node.
-
FileSize (Int32) File size of the archive node
-
NodeTime (Int32) Time the archive node was created
-
NodeNum (Int32) Serial number of the archive node in the archive
-
NodeName (String) The name of the archive node
13.2 Exporter Interface
This section describes the properties and methods of Exporter.
All of Outside In's Exporter functionality can be accessed through the Exporter Interface. The object returned by OutsideIn class is an implementation of this interface. This class derives from the Document Interface, which in turn is derived from the OptionsCache Interface.
Namespace
OutsideIn
Methods
-
GetExportStatus
ExportStatus GetExportStatus()
This function is used to determine if there were conversion problems during an export. The ExportStatus object returned may have information about sub-document failures, areas of a conversion that may not have high fidelity with the original document. When applicable the number of pages in the output is also provided.
-
NewSubDocumentExporter
Exporter NewSubDocumentExporter( int SubDocId, SubDocumentIdentifierTypeValue idType )
Create a new Exporter for a subdocument.
SubDocId: Identifier of the subdocument
idType: Type of subdocument
SubDocumentIdentifierTypeValue: This is an enumeration for the type of subdocument being opened.
-
IDTYPE_XX: Subdocument to be opened is based on output of XML Export (SubdocId is the value of the object_id attribute of a locator element.)
-
IDTYPE_ATTACHMENT_LOCATOR: Subdocument to be opened is based on the locator value provided by the one of the Export SDKs.
-
IDTYPE_ATTACHMENT_INDEX: Subdocument to be opened is based on the index of the attachment from an email message. (SubdocId is the zero-based index of the attachment from an email message file. The first attachment presented by OutsideIn has the index value 0, the second has the index value 1, etc.)
Returns: A new Exporter object for the subdocument
-
-
NewSubObjectExporter
Exporter NewSubObjectExporter( SubObjectTypeValue objType, uint data1, uint data2, uint data3, uint data4 )
Create a new Exporter for a subobject.
objType: Type of subobject
data1: Data identifying the subobject from SearchML
data2: Data identifying the subobject from SearchML
data3: Data identifying the subobject from SearchML
data4: Data identifying the subobject from SearchML
Returns: A new Exporter object for the subobject
SubObjectTypeValue: An enumeration to describe the type of SubObject to open.
-
LinkedObject
-
EmbeddedObject
-
CompressedFile
-
Attachment
-
-
NewArchiveNodeExporter
Exporter NewArchiveNodeExporter( int dwRecordNum )
Create a new Exporter for an archive node. You may get the number of nodes in an archive using getArchiveNodeCount. The nodes are numbered from 0 to getArchiveNodeCount -1.
dwRecordNum: The number of the record to retrieve information about. The first node is node 0 and the total number of nodes may be obtained from GetArchiveNodeCount.
Returns: A new Exporter object for the archive node
-
NewArchiveNodeExporter with Search Export Data
Exporter NewArchiveNodeExporter( uint flags, uint params1, uint params2 )
Create a new Exporter for an archive node. To use this function, you must first process the archive with Search Export and save the Node data for later use in this function. setOpenForNonSequentialAccess must be set to TRUE to use this function.
Flags: Special flags value from Search Export
Params1: Data1 from Search Export
Params2: Data2 from Search Export
Returns: A new Exporter object for the archive node
-
Export
void Export()
Perform the conversion and close the Export process.
-
SetExportTemplate
SetExportTemplate(FileInfo template)
This method sets the template file to be used for export.
template: A FileInfo object representing the template to be used for export.
-
SetExportTimeout
OptionsCache SetExportTimeout(int millisecondsTimeout);
This method sets the time that the export process should wait for a response from the Outside In export engine to complete the export of a document, setting an upper limit on the time that will elapse during a call to Export(). If the specified length of time or the default timeout amount is reached before the export has completed, the export operation is terminated and an OutsideInException is thrown. If this option is not set, the default timeout is 5 minutes.
-
Close
Close()
This function closes the current Export process.
-
NewLocalExporter
static Exporter NewLocalExporter(Exporter source)
This method creates and returns an instance of an Exporter object based on the source Exporter. All the options of source are copied to the new Exporter. The source and destination file information will not be copied.
13.2.1 Document Interface
All of the Outside In document-related methods are accessed through the Document Interface.
Namespace
OutsideIn
Methods
-
Close
void Close()
Closes the currently open document
-
GetArchiveNodeCount
Int32 GetArchiveNodeCount()
Retrieves the number of nodes in an archive file.
Returns the number of nodes in the archive file or 0 if the file is not an archive file.
-
GetFileId
FileFormat GetFileId(FileIdInfoFlagValue dwFlags)
Gets the format of the file based on the technology's content-based file identification process.
dwFlags: Option to retrieve the file identification pre-Extended or post-Extended Test
Returns the format identifier of the file.
-
GetObjectInfo
ObjectInfo GetObjectInfo()
Retrieves the information about an embedded object.
Return: An ObjectInfo object with the information about the embedded object
-
GetArchiveNode
TreeRecord GetArchiveNode(Int32 nNodeNum)
Retrieves information about a record in an archive file. You may get the number of nodes in an archive using getArchiveNodeCount.
nNodeNum: The number of the record to retrieve information about. The first node is node 0.
Return Value: An ArchiveNode object with the information about the record
-
SaveArchiveNode
void SaveArchiveNode( Int32 nNodeNum, FileInfo fileinfo) void SaveArchiveNode( Int32 nNodeNum, string strFileName)
Extracts a record in an archive file to disk.
nNodeNumType: The number of the record to retrieve information about. The first node is node 0.
strFileNameType/fileinfo: Full path of the destination file to which the file will be extracted
-
SaveArchiveNode with ArchiveNode
void SaveArchiveNode( ArchiveNode arcNode, FileInfo fileinfo) void SaveArchiveNode( ArchiveNode arcNode, string strFileName)
Extracts a record in an archive file to disk.
arcNode: An ArchiveNode object retrieved from GetArchiveNodeInfo with information about the node to extract
strFileNameType/fileinfo: Full path of the destination file to which the file will be extracted
-
SaveArchiveNode with Search Export Flags
void SaveArchiveNode( uint flags, uint params1, uint params2, FileInfo fileinfo) void SaveArchiveNode( uint flags, uint params1, uint params2, string strFileName)
Extracts a record in an archive file to disk without reading the data for all nodes in the archive in a sequential order. To use this function, you must first process the archive with Search Export and save the Node data for later use in this function. setOpenForNonSequentialAccess must be set to TRUE to use this function.
flagsType: Special flags value from Search Export
params1: Data1 from Search Export
params2: Data2 from Search Export
strFileNameType/fileinfo: Full path of the destination file to which the file will be extracted
13.2.2 OptionsCache Class
This section describes the OptionsCache class.
The options that configure the way outputs are generated are accessed through the OptionsCache class.
All of the options described in the following subsections are available through this interface. Other methods in this interface are described below.
Namespace
OutsideIn
Methods
-
OptionsCache SetSourceFile(FileInfo file)
Sets the source document to be opened.
file: Full path to source file
-
OptionsCache SetSourceFile(string filename)
Set the source document.
filename: Full path of the source document
Returns: The options cache object associated with this document
-
OptionsCache AddSourceFile(FileInfo file)
Sets the next source document file to be exported in sequence. This allows multiple documents to be exported to the same output destination.
file: Full path to source file
-
OptionsCache AddSourceFile(string filename)
Set the next source document file to be exported in sequence. This allows multiple documents to be exported to the same output destination.
filename: Full path to the source file
returns: The updated options object
-
OptionsCache SetSourceFormat(FileFormat fileId)
Sets the source format to process the input file as, ignoring the algorithmic detection of the file type.
fileId: the format to treat the input document as.
-
OptionsCache SetDestinationFile(FileInfo file)
Sets the location of the destination file.
file: Full path to the destination file
-
OptionsCache SetDestinationFile(string filename)
Set the location of the destination file.
filename: Full path to the destination file
returns: The updated options object
-
OptionsCache SetDestinationFormat(FileFormat fileId)
Sets the destination file format to which the file should be converted to.
fileId: the format to convert the input document(s) to.
-
OptionsCache SetCallbackHandler(Callback callback)
Sets the object to use to handle callbacks.
callback: the callback handling object.
-
OptionsCache SetPasswordsList(List<String> Passwords)
Provides a list of strings to use as passwords for encrypted documents. The technology will cycle through this list until a successful password is found or the list is exhausted.
Passwords: List of strings to be used as passwords.
-
OptionsCache SetLotusNotesId(String NotesIdFile)
Sets the Lotus Notes ID file location.
NotesIdFile: Full path to the Notes ID file.
-
OptionsCache SetOpenForNonSequentialAccess(bool bOpenForNonSequentialAccess)
Setting this option causes the technology to open archive files in a special mode that is only usable for non-sequential access of nodes.
bOpenForNonSequentialAccess : If set to true would open the archive file in the special access mode. Note that turning this flag on a non-archive file will throw an exception at RunExport time.
-
OptionsCache SetSourceFile(Stream file)
Set an input stream as the source document. Exporting a file using this method may have issues with files that require the original name of the file (example: extension of the file for identification purposes or name of a secondary file dependent on the name/path of this file).
-
OptionsCache SetSourceFile(Stream file, String Filename)
Set an input stream as the source document and provide information about the filename (fully qualified path or file name that may be used to derive the extension of the file or name of a secondary file dependent on the name/path of this file).
-
OptionsCache SetNextSourceFile(Stream file)
Set an input stream as the next source document to be exported to the original destination file. This method has the same limitations as the similar SetSourceFile(Stream file) method.
-
OptionsCache SetNextSourceFile(Stream file, String Filename)
Set an input stream as the next source document to be exported to the original destination file. The file name provided is used as in the method SetSourceFile(Stream file, String Filename)
-
OptionsCache SetNextSourceFile(FileInfo file)
Set an input stream as the next source document to be exported to the original destination file.
-
OptionsCache SetDestinationFile(Stream file)
Set an output stream as the destination for an export.
13.2.2.1 DefaultInputCharacterSet
OIT Option ID: SCCOPT_DEFAULTINPUTCHARSET
This option is used in cases where Outside In cannot determine the character set used to encode the text of an input file. When all other means of determining the file's character set are exhausted, Outside In will assume that an input document is encoded in the character set specified by this option. This is most often used when reading plain-text files, but may also be used when reading HTML or PDF files.
Data Type
DefaultInputCharacterSetValue
DefaultInputCharacterSetValue Enumeration
DefaultInputCharacterSetValue can be one of the following enumerations:
SystemDefault
Unicode
BigEndianUnicode
LittleEndianUnicode
Utf8
Utf7
Ascii
UnixJapanese
UnixJapaneseEuc
UnixChineseTrad1
UnixChineseEucTrad1
UnixChineseTrad2
UnixChineseEucTrad2
UnixKorean
UnixChineseSimple
Ebcdic37
Ebcdic273
Ebcdic274
Ebcdic277
Ebcdic278
Ebcdic280
Ebcdic282
Ebcdic284
Ebcdic285
Ebcdic297
Ebcdic500
Ebcdic1026
Dos437
Dos737
Dos850
Dos852
Dos855
Dos857
Dos860
Dos861
Dos863
Dos865
Dos866
Dos869
Windows874
Windows932
Windows936
Windows949
Windows950
Windows1250
Windows1251
Windows1252
Windows1253
Windows1254
Windows1255
Windows1256
Windows1257
Iso8859_1
Iso8859_2
Iso8859_3
Iso8859_4
Iso8859_5
Iso8859_6
Iso8859_7
Iso8859_8
Iso8859_9
MacRoman
MacCroatian
MacRomanian
MacTurkish
MacIcelandic
MacCyrillic
MacGreek
MacCE
MacHebrew
MacArabic
MacJapanese
HPRoman8
BiDiOldCode
BiDiPC8
BiDiE0
RussianKOI8
JapaneseX0201
Default
SystemDefault
13.2.2.2 DocumentMemoryMode
OIT Option ID: SCCOPT_DOCUMENTMEMORYMODE
This option determines the maximum amount of memory that the chunker may use to store the document's data, from 4 MB to 1 GB. The more memory the chunker has available to it, the less often it needs to re-read data from the document.
Data
-
SMALLEST: 1 - 4MB
-
SMALL: 2 - 16MB
-
MEDIUM: 3 - 64MB
-
LARGE: 4 - 256MB
-
LARGEST: 5 - 1 GB
Default
LARGE: 4 - 256MB
13.2.2.3 DropPDFHyphens
This option controls whether or not the PDF filter will drop hyphens at the end of a line. Since most PDF-generating tools create them as generic dashes, it's impossible for Outside In to know if the hyphen is a syllable hyphen or part of a hyphenated word.When this option is set to true, all hyphens at the end of lines will be dropped from the extracted text.
Data Type
bool
Default
false
13.2.2.4 EnableAllSubObjects
Oracle Outside In has an internal flag that is used to optimize several of the input filters for searching. One of the side effects of this optimization is that many embedded bitmaps, including Progressive JPEG, aren't output by the filter. This option can override this internal optimization.
Data Type
bool
Default
false
13.2.2.5 ExportPerformanceMode
This option allows for skipping the processing of some or all style information when possible. This should result in better performance, but certain output will no longer be available.
Data Type
ExportPerformanceModeValue
ExportPerformanceModeValue Enumeration
ExportPerformanceModeValue can be one or more of the following set of values:
-
Normal - Process the style information normally.
-
TextAndFonts - Process only the font and character set information within a style.
-
TextOnly - Skip processing all style information.
Default
Normal
13.2.2.6 ExtractXMPMetadata
OIT Option ID: SCCOPT_EXTRACTXMPMETADATA
Adobe's Extensible Metadata Platform (XMP) is a labeling technology that allows you to embed data about a file, known as metadata, into the file itself. This option enables the XMP feature, which does not interpret the XMP metadata, but passes it straight through without any interpretation. This option will be ignored if the ParseXMPMetadata option is enabled.
Data Type
bool
Data
-
true: This setting enables XMP extraction.
-
false: This setting disables XMP extraction.
Default
-
false
13.2.2.7 FallbackFormat
This option controls how files are handled when their specific application type cannot be determined. This normally affects all plain-text files, because plain-text files are generally identified by process of elimination, for example, when a file isn't identified as having been created by a known application, it is treated as a plain-text file. It is recommended that None be set to prevent the conversion from exporting unidentified binary files as though they were text, which could generate many pages of "garbage" output.
Data Type
FallbackFormatValue
FallbackFormatValue Enumeration
-
Text: Unidentified file types will be treated as text files.
-
None: Outside In will not attempt to process files whose type cannot be identified
Default
Text
13.2.2.8 IECondCommentMode
OIT Option ID: SCCOPT_HTML_COND_COMMENT_MODE
Some HTML input files may include "conditional comments", which are HTML comments that mark areas of HTML to be interpreted in specific versions of Internet Explorer, while being ignored by other browsers. This option allows you to control how the content contained within conditional comments will be interpreted by Outside In's HTML parsing code.
Data
-
NONE: Don't output any conditional comment
-
IE5: Include the IE5 comments
-
IE6: Include the IE6 comments
-
IE7: Include the IE7 comments
-
IE8: Include the IE8 comments
-
IE9: Include the IE9 comments
-
ALL: Include all conditional comments
13.2.2.9 IgnorePassword
OIT Option ID: SCCOPT_IGNORE_PASSWORD
This option can disable the password verification of files where the contents can be processed without validation of the password. If this option is not set, the filter should prompt for a password if it handles password-protected files.
Data Type
bool
13.2.2.10 IncludeCharacterAttributes
This option allows the developer to track character attributes contained in the input document and choose which are output to tags in the XML document produced.
Data Type
IncludeCharacterAttributeValues
Data
An IncludeCharacterAttributeValues object with the character attributes to be included
IncludeCharacterAttributeValues Enumeration
The following set of flags:
AllOff
RevisionDelete
Bold
Italic
Underline
DoubleUnderline
Outline
Hidden
Strikeout
SmallCaps
Allcaps
OCE
RevisionAdd
Default
AllOff
13.2.2.11 IncludeSearchMLOffset
The value of this option is a boolean that if set to true will include offset information in the SearchML output according to the schema. If the option is set to false, no offset information is produced.
Data Type
bool
Default
false
13.2.2.12 ISODateTimes
OIT Option ID: SCCOPT_FORMATFLAGS
When this flag is set, all Date and Time values are converted to the ISO 8601 standard. This conversion can only be performed using dates that are stored as numeric data within the original file.
Data
bool
Default
false
13.2.2.13 LotusNotesDirectory
OIT Option ID: SCCOPT_LOTUSNOTESDIRECTORY
This option allows the developer to specify the location of a Lotus Notes or Domino installation for use by the NSF filter. A valid Lotus installation directory must contain the file nnotes.dll.
Data
A path to the Lotus Notes directory.
Default
If this option isn't set, then OIT will first attempt to load the Lotus library according to the operating system's PATH environment variable, and then attempt to find and load the Lotus library as indicated in HKEY_CLASSES_ROOT\Notes.Link.
13.2.2.14 NullReplacementCharacter
This option specifies a two-byte Unicode character that will be used to replace null characters if null path separators are being used. This option defaults to '/' and is valid for the SearchML 3.x, SearchHTML and SearchText output formats.
Data Type
short
Default
47
13.2.2.15 PageMLFlags
This option allows the developer to set flags that enable options unique to the PageML schema.
Data Type
PageMLFlagValues
PageMLFlagValues Enumeration
The following set of flags:
-
NoXMLDeclaration: Do not generate xml declaration
-
IncludeText: Include text in PageML output
Default
0 - all flags off
13.2.2.16 ParseXMPMetadata
OIT Option ID: SCCOPT_PARSEXMPMETADATA
Adobe's Extensible Metadata Platform (XMP) is a labeling technology that allows you to embed data about a file, known as metadata, into the file itself. This option enables parsing of the XMP data into normal OIT document properties. Enabling this option may cause the loss of some regular data in premium graphics filters (such as Postscript), but won't affect most formats (such as PDF).
Data Type
bool
Data
-
true: This setting enables parsing XMP.
-
false: This setting disables parsing XMP.
Default
false
13.2.2.17 PDFInputMaxEmbeddedObjects
This option allows the user to limit the number of embedded objects that are produced in a PDF file.
Data Type
UInt32
Data
The maximum number of embedded objects to produce in PDF output. Setting this to 0 would produce an all embedded objects in the input document.
Default
0 – produce all objects.
13.2.2.18 PDFInputMaxVectorPaths
This option allows the user to limit the number of vector paths that are produced in a PDF file.
Data
The maximum number of paths to produce in PDF output. Setting this to 0 would produce an all vector objects in the input document.
Default
0 – produce all vector objects.
13.2.2.19 PDFReorderBiDi
OIT Option ID: SCCOPT_PDF_FILTER_REORDER_BIDI
This option controls whether or not the PDF filter will attempt to reorder bidirectional text runs so that the output is in standard logical order as used by the Unicode 2.0 and later specification. This additional processing will result in slower filter performance according to the amount of bidirectional data in the file.
PDFReorderBiDiValue Enumeration
This enumeration defines the type of Bidirection text reordering the PDF filter should perform.
-
StandardBiDi: Do not attempt to reorder bidirectional text runs.
-
ReorderedBiDi: Attempt to reorder bidirectional text runs.
13.2.2.20 PDFWordSpacingFactor
This option controls the spacing threshold in PDF input documents. Most PDF documents do not have an explicit character denoting a word break. The PDF filter calculates the distance between two characters to determine if they are part of the same word or if there should be a word break inserted. The space between characters is compared to the length of the space character in the current font multiplied by this fraction. If the space between characters is larger, then a word break character is inserted into the text stream. Otherwise, the characters are considered to be part of the same word and no word break is inserted.
Data Type
float
Data
A value representing the percentage of the space character used to trigger a word break. Valid values are positive values less than 2.
Default
0.85
13.2.2.21 PerformExtendedFI
OIT Option ID: SCCOPT_FIFLAGS
This option affects how an input file's internal format (application type) is identified when the file is first opened by the Outside In technology. When the extended test flag is in effect, and an input file is identified as being either 7-bit ASCII, EBCDIC, or Unicode, the file's contents will be interpreted as such by the export process.
The extended test is optional because it requires extra processing and cannot guarantee complete accuracy (which would require the inspection of every single byte in a file to eliminate false positives.)
Data Type
bool
Data
One of the following values:
-
false: When this is set, standard file identification behavior occurs.
-
true: If set, the File Identification code will run an extended test on all files that are not identified.
Default
true
13.2.2.22 PrinterName
This option is Windows-specific. It is used to set which device context to use to render the pages.
It specifies, as a byte string, the name of the printer whose metrics should be used to calculate pagination information. If unspecified, the default printer will be used. The screen metrics of the system will be used if a printer is not specified and a default printer does not exist. As pagination is affected by the metrics of the device context and installed fonts, PageML XML output can vary between different systems and configurations.
Data Type
string
Default
None - PageML uses the Windows default printer
13.2.2.23 ProcessOLEEmbeddingMode
OIT Option ID: SCCOPT_PROCESS_OLE_EMBEDDINGS
Microsoft Powerpoint versions from 1997 through 2003 had the capability to embed OLE documents in the Powerpoint files. This option controls which embeddings are to be processed as native (OLE) documents and which are processed using the alternate graphic.
Note:
The Microsoft Powerpoint application sometimes does embed known Microsoft OLE embeddings (such as Visio, Project) as an "Unknown" type. To process these embeddings, the ProcessOLEEmbedAll option is required. Post Office-2003 products such as Office 2007 embeddings also fall into this category.
Data
-
Standard: Process embeddings that are known standard embeddings. These include Office 2003 versions of Word, Excel, Visio, etc.
-
All: Process all embeddings in the file.
-
None: Process none of the embeddings in the file.
Default
Standard
13.2.2.24 RenderEmbeddedFonts
This option allows you to disable the use of embedded fonts in PDF input files. If the option is set to true, the embedded fonts in the PDF input are used to render text; if the option is set to false, the embedded fonts are not used and the fallback is to use fonts available to Outside In to render text.
Data Type
bool
Default
true
13.2.2.25 SearchMLFlags
This option allows the developer to set flags that enable options unique to the following SearchML formats: SearchML 3.x, SearchHTML and SearchText.
Data Type
SearchMLFlagValues
SearchMLFlagValues Enumeration
The following set of flags:
-
ShowParagraphStyleNames: Add paragraph style name reference to p tags
-
ProcessEmbeddings: Process embeddings
-
NoXMLDeclaration: Don't generate xml declaration
-
SuppressProperties: Suppress processing of document properties in all indexing related products.
-
GenerateText: Produce generated text in all indexing related products.
-
SuppressAttachments: Suppress processing of attachments.
-
SuppressArchiveSubDocs: Suppress processing of sub-documents in archives
-
MetadataOnly: Produce only metadata.
-
Annotations: Annotation text should be noted as such
-
ProduceURLs: Produce URLs for hyperlinks
-
ProduceObjectInfo: Produce information allowing for reference of sub-document objects.
-
EnableErrorInfo: Output sub-document error information.
-
ProduceCellInfo: Output spreadsheet row and column information.
-
GenerateSystemMetadata: Generate system metadata
-
SkipStyles: Skip style information for performance reasons. This option overrides other style related directives.
-
ProduceHiddenCells: Produce hidden cell attribute
-
AllOff
Default
AllOff
13.2.2.26 SearchMLParaAttributes
This option allows the developer to track paragraph attributes contained in the input document and, optionally, include them in the XML output. This option only affects SearchML output. The option is not valid for the SearchHTML, SearchText and PageML output flavors.
Data Type
SearchMLParaAttributeValues
Enumeration
SearchMLParaAttributeValues can be one or more of the following enumerations ORed together:
- ParagraphSpacing: Track paragraph spacing
- CharacterHeight: Track Character Height
- LeftIndent: Track left indent (in twips)
- RightIndent: Track right indent (in twips)
- FirstIndent: Track first line indent (in twips)
- AllOff: All tracking turned off
Default
AllOff
13.2.2.27 ShowArchiveFullPath
OIT Option ID: SCCOPT_ARCFULLPATH
This option causes the full path of a node to be returned in "GetArchiveNodeInfo" and "GetObjectInfo".
Data Type
bool
Data
-
true: Provide the full path.
-
false: Do not provide the path.
Default
false
13.2.2.28 StrictFile
When an embedded file or URL can't be opened with the full path, OutsideIn will sometimes try and open the referenced file from other locations, including the current directory. When this option is set, it will prevent OutsideIn from trying to open the file from any location other than the fully qualified path or URL.
Data Type
bool
Default
false
13.2.2.29 TimeZoneOffset
OIT Option ID: SCCOPT_TIMEZONE
This option allows the user to define an offset to GMT that will be applied during date formatting, allowing date values to be displayed in a selectable time zone. This option affects the formatting of numbers that have been defined as date values. This option will not affect dates that are stored as text. To query the operating system for the time zone set on the machine, specify TimeZoneOffset_UseNative.
Note:
Daylight savings is not supported. The sent time in msg files when viewed in Outlook can be an hour different from the time sent when an image of the msg file is created.
Data Type
Int32
Data
Integer parameter from -96 to 96, representing 15-minute offsets from GMT. To query the operating system for the time zone set on the machine, specify SCC_TIMEZONE_USENATIVE.
Default
-
0: GMT time
13.2.2.30 UnmappableCharacter
OIT Option ID: SCCOPT_UNMAPPABLECHAR
This option selects the character used when a character cannot be found in the output character set. This option takes the Unicode value for the replacement character. It is left to the user to make sure that the selected replacement character is available in the output character set.
Data Type
UShort
Data
The Unicode value for the character to use.
Default
-
0x002a = "*"
13.2.2.31 UnmappedText
This option allows for the production of unmapped text (the original code points from the input document). A new <unmapped> element will be produced to enclose this text. The <unmapped> element will contain base64-encoded text. It will also contain two attributes. "OCE" will contain a hex value representing the character set. "font" will contain a string value of the original font name. This is necessary for non-standard encodings such as wingdings or webdings. This option is only valid in the SearchML 3.2 (and higher) schema.
Data Type
UnmappedTextValue
UnmappedTextValue Enumeration
The following set of values:
-
OnlyUnmapped: Output just the unmapped text
-
NoUnmappedText: No unmapped text is output
-
Both: Both original and unmapped text are output
Default
NoUnmappedText
13.2.2.32 XMLDefinitionReference
This option determines whether the converted file will reference a specified schema, DTD, or no reference when generating output.
Data Type
XMLReference
Data
A XMLReference object that defines the XML Definition Reference to be used.
Default
No reference defined
13.2.2.33 DSTTimezone
This option uses the time zone of the system (computer) and calculates the time based on the system time offset.
Data Type
Boolean
Default
False
13.2.2.34 GenerateExcelRevisions
This option controls the extraction of tracked changes from Excel files.
Data Type
Boolean
Default
False
13.2.2.35 EnableAlphaBlending
This option allows the user to enable alpha-channel blending (transparency) in rendering vector images. This is primarily useful for improving fidelity when vector images are rendered with a slower graphics engine such as X-Windows, over a network where performance is not an issue.
Data
Boolean
Default
False
13.2.2.36 InternalRendering
Note:
This option is no longer relevant. Outside In no longer performs graphic rendering through X11 on Linux/Unix platforms.The internal rendering engine is available on all of these platforms. If this option is set, the results will always use the internal rendering engine regardless of the value of this option. The $GDFONTPATH environment variable must be set to specify where to reference fonts. On Windows systems, the Windows graphical rendering engine is always used.
13.3 ExportStatus Class
The ExportStatus class provides access to information about a conversion. This information may include information about sub-document failures, areas of a conversion that may not have high fidelity with the original document. When applicable the number of pages in the output is also provided.
Namespace
OutsideIn
Properties
-
PageCount (Int32) - A count of all of the output pages produced during an export operation.
-
StatusFlags (ExportStatusFlags) - Gets the information about possible fidelity issues with the original document.
-
SubDocsFailed (Int32) - Number of subdocuments that were not converted.
-
SubDocsPassed (Int32) - Number of subdocuments that were successfully converted.
ExportStatusFlags Enumeration
This enumeration is the set of possible known problems that can occur during an export process.
-
NoInformationAvailable: No Information is available
-
MissingMap: A PDF text run was missing the toUnicode table
-
VerticalText: A vertical text run was present
-
TextEffects: A run that had unsupported text effects applied. One example is Word Art
-
UnsupportedCompression: A graphic had an unsupported compression
-
UnsupportedColorSpace: A graphic had an unsupported color space
-
Forms: A sub documents had forms
-
RightToLeftTables: A table had right to left columns
-
Equations: A file had equations
-
AliasedFont: The desired font was missing, but a font alias was used
-
MissingFont: The desired font wasn't present on the system
-
SubDocFailed: a sub-document was not converted
-
TypeThreeFont: A type 3 font was encountered.
-
UnsupportedShading: An unsupported shading pattern was encountered.
-
InvalidHTML: An HTML parse error, as defined by the W3C, was encountered.
13.4 FileFormat Class
This class defines the identifiers for file formats.
Namespace
OutsideIn
Methods
-
getDescription
String getDescription()
This method returns the description of the format.
-
getId
int getId()
This method returns the numeric identifier of the format.
-
forId
FileFormat forId(int id)
This method returns the FileFormat object for the given identifier.
id : The numeric identifier for which the corresponding FileFormat object is returned.
13.5 ObjectInfo Class
ObjectInfo provides all the information available about an Outside In Object (object may be an embedded object, a linked object, or a compressed file). This is a read only class where the technology fills in all the values.
Namespace
OutsideIn.Options
Properties
-
Compression (Int32) The type of compression used to store the object, if known.
-
Flags (ObjectInfoFlags) Flags indicating attributes of the object.
-
FormatId (FileFormat) The format Identifier of the object.
-
Name (String) Name of the object.
ObjectInfoFlags Enumeration
Bit fields to describe information about an object.
-
PartialFile: Object would not normally exist outside the source document
-
ProtectedFile: Object is encrypted or password protected
-
UnsupportedCompression: Object uses an unsupported compression mechanism
-
DRMFile: Object uses Digital Rights Management protection
-
UnidentifiedFile: Object is extracted, but can not successfully identified
-
LinkToFile: Object links to file, it can not be extracted
-
EncryptedFile: Object is encrypted and can be decrypted with the known password
13.6 Option Interface
The Option Interface provides the methods and properties to retrieve information about an Outside In Option.
Namespace
Outside In
Properties
-
Name — Name of the option
-
Description — Description of the option
-
DataType — The type of the option value
-
SupportingProducts — The list of products that support this option
Methods
void Set(OptionsCache exporter, Object objValue);
This method sets the option to the exporter object.
-
exporter — The exporter object
-
objValue — Value of the option
Note:
If the type of objValue cannot be converted to the data type the option is expecting, an OutsideInCastException is thrown.
void Get(OptionsCache exporter)
This method gets the currently set value for the option.
-
exporter — The exporter object who’s option value is requested.
OutsideInProducts Enumeration
-
HTMLExport — Outside In HTML Export
-
ImageExport — Outside In Image Export
-
PDFExport — Outside In PDF Export
-
SearchExport — Outside In Search Export
-
WebViewExport — Outside In Web View Export
-
XMLExport — Outside In XML Export
-
AllExports — All Outside In export products
13.7 OutsideIn Class
This is a utility class that creates an instance of an Exporter object on request.
Namespace
OutsideIn
Methods
static Exporter NewLocalExporter()
This method creates an instance of an Exporter object. It returns a newly created Exporter object.
static Exporter NewLocalExporter(Exporter source)
This method creates and returns an instance of an Exporter object based on the source Exporter. All the options of source are copied to the new Exporter. The source and destination file information will not be copied.
OutsideInVersion GetCoreVersion()
This static method returns an OutsideInVersion object with information of the Outside In Core Technology used.
13.8 OutsideInConfig Class
The OutsideInConfig Class is used to describe the Outside In Configuration Options.
Namespace
OutsideIn
Constructors
OutsideInConfig()
Creates a OutsideInConfig instance with default values.
OutsideInConfig(DirectoryInfo InstallLocation, UInt32 IdleWorkerTimeout, UInt32 MinimumWorkerCount)
Creates a OutsideInConfig instance with specified values.
Properties
DirectoryInfo InstallLocation: The Location of the technology directory.
UInt32 IdleWorkerTimeout: value indicating the number of milliseconds that an idle process in excess of the minimum worker count is kept alive before being terminated. This timeout only applies to worker processes created beyond the number of MinimumWorkerCount processes.
UInt32 MinimumWorkerCount: Specifies the minimum number of running worker processes kept available for export operations. If there is a higher number of exporter objects performing simultaneous export operations, additional worker processes will be created. Those additional worker processes will be terminated according to the IdleWorkerTimeout setting. If any of these processes are terminated due to errors, they will be replaced by a new process to maintain this minimum count of loaded worker processes.
13.9 OutsideInVersion Class
The OutsideIn Class is used to describe the version of the Outside In Core Module.
Namespace
OutsideIn
Methods
String GetVersion()
This method returns the version information as a string in the format of “MajorVersion.MinorVersion.DotVersion”.
Properties
-
int MajorVersion: The major version component
-
int MinorVersion: The minor version component
-
int DotVersion: The dot version component
13.10 OutsideInException Class
This is the exception that is thrown when an Outside In Technology error occurs.
This class derives from the Exception class. This class has no public methods or properties except those of the parent Exception class.
Namespace
OutsideIn
13.11 XMLReference Class
The XMLReference class is a data class used to define the XML definition reference to be used.
Namespace
OutsideIn.Options
Properties
-
ReferenceMethod (ReferenceMethodValue): The type of reference.
-
DefinitionReference (String): The DTD or Schema referenced.
Constructors
XMLReference()
Create an instance of a XMLReference object using No XML definition reference
XMLReference(XMLReference.ReferenceMethodValue, String)
Create an instance of a XMLReference object to provide a DTD/XSD
ReferenceMethodValue Enumeration
This enumeration is used to set whether Export will reference a schema, a DTD, or no reference when generating output.
-
DTD: Document Type Definition (DTD)
-
XSD: Extensible Schema Definition
-
NONE: No definition reference