Skip Headers
Oracle® Content Database Administrator's Guide for Oracle WebCenter Suite
10g (10.1.3.2)

Part Number B32191-01
Go to Documentation Home
Home
Go to Book List
Book List
Go to Table of Contents
Contents
Go to Index
Index
Go to Feedback page
Contact Us

Go to previous page
Previous
Go to next page
Next
View PDF

10 Managing Oracle Content DB Formats

Oracle Content DB associates a format (also known as a MIME type) with each document. You can add, modify, and delete formats using the Application Server Control.

This chapter provides information about the following topics:

About Formats

The format of a document indicates the file type (for example, .doc or .zip). Oracle Content DB needs to know the format of documents to determine how to index their content.

A format contains the following information:

Indexing a format type is the basis of content searching in Oracle Content DB. If a format is not indexed, content searches will fail. Content searches can also fail when formats are indexed incorrectly.

See Appendix B, "Oracle Text Supported Document Formats" in Oracle Text Reference for information about which formats can be indexed by Oracle Text.

Adding Formats

You can add more formats to Oracle Content DB for special types of content. See "Default Formats" for a list of default formats.

To add a format:

  1. Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.

  2. On the Content DB Home page, click the Administration tab.

  3. In the Formats table row, click the Go to Task icon.

  4. On the Formats page, click New Format. The New Format page appears.

    Figure 10-1 shows the New Format page.

    Figure 10-1 New Format Page

    Description of Figure 10-1 follows
    Description of "Figure 10-1 New Format Page"

  5. Enter the following information:

    • Name: Provide a name for the format (for example, FrameMaker or Jar).

    • MIME Type: Specify the type of content stored in Oracle Content DB, such as text/plain or text/html. Click the Flashlight icon to select from a list of MIME types.

    • Extension: Specify the default extension for files that use this format, such as .fm or .jar. Click the Flashlight icon to select from a list of file extensions.

      The names of files uploaded from UNIX or Linux clients are case-sensitive. If the case of the extension for your format (for example, .ZIP) does not match the case of the extension for the file uploaded from UNIX or Linux (for example, .zip), the uploaded file will be classified as the Unknown format, and the content will not be indexed. Files that are not indexed do not show up in content search results.

    • Binary: Specify whether files that use this format are of binary type.

    • Omitted From Antivirus Scan: Specify whether files that use this format need to be omitted from antivirus scans.

    • Indexed: Specify whether files that use this format need to be indexed.

  6. Click OK.

Modifying Formats

You can modify formats using the Application Server Control. The Unknown format is a required system format and cannot be modified.

To modify a format:

  1. Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.

  2. On the Content DB Home page, click the Administration tab.

  3. In the Formats table row, click the Go to Task icon.

  4. On the Formats page, click the name of the format you want to modify.

  5. On the Edit Format page, you can change the following information:

    • MIME Type: Specify the type of content stored in Oracle Content DB, such as text/plain or text/html. Click the Flashlight icon to select from a list of MIME types.

    • Extension: Specify the default extension for files that use this format, such as .fm or .jar. Click the Flashlight icon to select from a list of file extensions.

      The names of files uploaded from UNIX or Linux clients are case-sensitive. If the case of the extension for your format (for example, .ZIP) does not match the case of the extension for the file uploaded from UNIX or Linux (for example, .zip), the uploaded file will be classified as the Unknown format, and the content will not be indexed. Files that are not indexed do not show up in content search results.

    • Binary: Specify whether files that use this format are of binary type.

    • Omitted From Antivirus Scan: Specify whether files that use this format need to be omitted from antivirus scans.

    • Indexed: Specify whether files that use this format need to be indexed. Changing this setting only affects new documents that are uploaded to Oracle Content DB; the index setting for existing documents that use this format will not be changed. To force indexing of existing documents, upload the documents again after changing this setting.

  6. Click OK.

Some formats must be indexed. For these formats, the index setting cannot be changed.

Deleting Formats

You can delete formats using the Application Server Control. The Unknown format is a required system format and cannot be deleted.

To delete a format:

  1. Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.

  2. On the Content DB Home page, click the Administration tab.

  3. In the Formats table row, click the Go to Task icon.

  4. On the Formats page, select the format you want to delete.

  5. Click Delete.

  6. On the Warning page, click Yes.

Default Formats

Table 10-1 provides a list of default formats.

Table 10-1 Default System Formats

Format Name Extension Indexed by Default? Can Change Index Setting?Foot 1 

Advanced Stream Redirector File

asx

No

Yes

Advanced Streaming Format

asf

No

Yes

Apple Quicktime

mov

Yes

No

Apple Quicktime (qt)

qt

Yes

No

Audio Interchange File (aif)

aif

Yes

No

Audio Interchange File (aifc)

aifc

Yes

No

Audio Interchange File (aiff)

aiff

Yes

No

Basic audio

au

Yes

No

Bitmap image

bmp

Yes

No

c file

c

Yes

Yes

C Header

h

Yes

Yes

C++ Header (h++)

h++

Yes

Yes

C++ Header (hh)

hh

Yes

Yes

C++ Header (hpp)

hpp

Yes

Yes

C++ Header (hxx)

hxx

Yes

Yes

C++ Source Code (C++)

c++

Yes

Yes

C++ Source Code (cc)

cc

Yes

Yes

C++ Source Code (cpp)

cpp

Yes

Yes

CC++ Source Code (cxx)

cxx

Yes

Yes

Comma-Separated Values

csv

Yes

Yes

Compiled WML Document

wmlc

No

Yes

Compiled WML Script

wmlsc

No

Yes

Compressed File

taz

No

Yes

Corel Photo-Paint Image

cpt

No

Yes

Corel Vector Graphic Drawing

cdr

No

Yes

Corel Vector Pattern

pat

No

Yes

CorelDraw Template

cdt

No

Yes

Debian Linux Package

deb

No

Yes

Difference File

diff

Yes

Yes

Email Message

eml

Yes

No

Encapsulated PostScript

eps

Yes

Yes

Extensible HyperText Markup Language File

xhtml

Yes

Yes

Extensible Markup Language

xml

Yes

Yes

FileMaker Pro Spreadsheet

fm

Yes

Yes

FrameMaker Book

book

Yes

Yes

FrameMaker FBDOC

fbdoc

Yes

Yes

FrameMaker FRAME

frame

Yes

Yes

FrameMaker FRM

frm

Yes

Yes

FrameMaker MAKER

maker

Yes

Yes

GIF

gif

Yes

No

GNU tar Compressed File Archive (GNU Tape Archive)

gtar

No

Yes

GZIP

gz

No

Yes

HTML

htm

Yes

Yes

HTML unix

html

Yes

No

Hypertext Cascading Style Sheet

css

Yes

Yes

JAR

jar

No

Yes

Java Bytecode

class

No

Yes

java file

java

Yes

Yes

Java Serialized Object File

ser

No

Yes

JavaScript Source Code

js

Yes

Yes

JNLP

jnlp

No

Yes

JPEG

jpg

Yes

No

JPEG (jpe)

jpe

Yes

No

JPEG (jpeg)

jpeg

Yes

No

JSP

jsp

Yes

Yes

Lotus 123 Spreadsheet

wk

Yes

Yes

Macintosh Sound Resource

snd

No

Yes

Macromedia Director Movie

dir

No

Yes

Macromedia Director Protected Movie File

dxr

No

Yes

Macromedia Flash Format File

swf

No

Yes

Macromedia Flash Format File - swfl

swfl

No

Yes

MHTML Document mhtm

mht

Yes

Yes

MHTML Document mhtml

mhtml

Yes

Yes

Microsoft AVI

avi

Yes

No

Microsoft PowerPoint

ppt

Yes

Yes

Microsoft Powerpoint (pot)

pot

Yes

Yes

Microsoft Powerpoint Show

pps

Yes

Yes

Microsoft Wave Audio

wav

Yes

No

MIDI

mid

No

Yes

Money Data File

mny

No

Yes

MP3 Playlist File

m3u

No

Yes

MPEG

mpg

No

Yes

MPEG (mpe)

mpe

No

Yes

MPEG (mpeg)

mpeg

No

Yes

MPEG - mpega

mpega

Yes

No

MPEG Layer 2

mp2

Yes

No

MPEG Layer 3 Audio

mp3

Yes

No

MPEG Layer 3 Audio Stream

mpga

Yes

No

MS Access

mdb

Yes

Yes

MS DOS Batch Processing

bat

Yes

Yes

MS Excel

xls

Yes

Yes

MS Excel (xlb)

xlb

Yes

Yes

MS Executable File

exe

No

Yes

MS Windows Dynamic Link Library

dll

No

Yes

MS Word

doc

Yes

Yes

MS Word (dot)

dot

Yes

Yes

MS Works

msw

Yes

Yes

Object File

o

No

Yes

OpenOffice.org Drawing

sda

No

Yes

OpenOffice.org Presentation

sdd

Yes

Yes

Outlook Express News File

nws

No

Yes

PCX

pcx

No

Yes

PDF

pdf

Yes

Yes

PERL Program File

pl

Yes

Yes

Portable (Public) Network Graphic

png

No

Yes

portable pixmap

ppm

No

Yes

Postscript

ps

No

Yes

postscript-ai

ai

No

Yes

Project File

mpp

Yes

Yes

Real Audio (ra)

ra

Yes

No

Real Audio (ram)

ram

Yes

Yes

Real Media (rm)

rm

Yes

No

Real Video

rv

Yes

No

RedHat Package Manager

rpm

No

Yes

RichText

rtf

Yes

Yes

RichText (rtx)

rtx

Yes

Yes

Schedule/Schedule+ Data

scd

No

Yes

SGI Video

movie

No

Yes

Shell Script

sh

Yes

Yes

Shockwave Movie

dcr

No

Yes

Sourcecode

src

Yes

Yes

Standard General Markup Language

sgml

Yes

Yes

Tab Separated Values File

tsv

Yes

Yes

Tar

tar

No

Yes

Tcl (Tool Command Language) Language Script

tcl

Yes

Yes

Text

txt

Yes

Yes

Text Document (text)

text

Yes

Yes

TIFF

tif

Yes

No

TIFF (tiff)

tiff

Yes

No

Tk Language Script

tk

Yes

Yes

UNIX Compressed Archive File

z

No

Yes

UNIX csh Shell Script

csh

Yes

Yes

UNIX Tar File Gzipped

tgz

No

Yes

Unknown

(N/A)

No

No

Unknown Binary

bin

No

Yes

URL Reference

url

No

Yes

vCalendar File

vcs

No

Yes

vCard File

vcf

Yes

Yes

Visio Drawing

vsd

Yes

Yes

VRML

vrml

No

Yes

Windows Help File

hlp

No

Yes

Windows Icon

ico

No

Yes

Wireless Markup Language File

wml

Yes

Yes

WML Script

wmls

Yes

Yes

Word Perfect

wpd

Yes

Yes

Wordperfect 5.1 Document

wp5

Yes

Yes

XFIG Graphic File

fig

No

Yes

xpixmap

xpm

No

Yes

xpixmap pm

pm

No

Yes

Zip

zip

No

Yes


Footnote 1 Some formats must be indexed. For these formats, the index setting cannot be changed.