Oracle® Content Database Administrator's Guide for Oracle WebCenter Suite 10g (10.1.3.2) Part Number B32191-01 |
|
|
View PDF |
Oracle Content DB associates a format (also known as a MIME type) with each document. You can add, modify, and delete formats using the Application Server Control.
This chapter provides information about the following topics:
The format of a document indicates the file type (for example, .doc or .zip). Oracle Content DB needs to know the format of documents to determine how to index their content.
A format contains the following information:
MIME type: Specifies the type of content stored in Oracle Content DB, such as text/plain
or text/html
.
Extension type: Specifies the default extension for files that use this format, such as .fm
or .jar
.
Binary setting: Determines whether files that use this format are of binary type.
Index setting: Determines whether files that use this format need to be indexed.
Omitted From Antivirus Scan: Determines whether files that use this format need to be omitted from antivirus scans.
Indexing a format type is the basis of content searching in Oracle Content DB. If a format is not indexed, content searches will fail. Content searches can also fail when formats are indexed incorrectly.
See Appendix B, "Oracle Text Supported Document Formats" in Oracle Text Reference for information about which formats can be indexed by Oracle Text.
You can add more formats to Oracle Content DB for special types of content. See "Default Formats" for a list of default formats.
To add a format:
Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.
On the Content DB Home page, click the Administration tab.
In the Formats table row, click the Go to Task icon.
On the Formats page, click New Format. The New Format page appears.
Figure 10-1 shows the New Format page.
Enter the following information:
Name: Provide a name for the format (for example, FrameMaker or Jar).
MIME Type: Specify the type of content stored in Oracle Content DB, such as text/plain
or text/html
. Click the Flashlight icon to select from a list of MIME types.
Extension: Specify the default extension for files that use this format, such as .fm
or .jar
. Click the Flashlight icon to select from a list of file extensions.
The names of files uploaded from UNIX or Linux clients are case-sensitive. If the case of the extension for your format (for example, .ZIP
) does not match the case of the extension for the file uploaded from UNIX or Linux (for example, .zip
), the uploaded file will be classified as the Unknown format, and the content will not be indexed. Files that are not indexed do not show up in content search results.
Binary: Specify whether files that use this format are of binary type.
Omitted From Antivirus Scan: Specify whether files that use this format need to be omitted from antivirus scans.
Indexed: Specify whether files that use this format need to be indexed.
Click OK.
You can modify formats using the Application Server Control. The Unknown format is a required system format and cannot be modified.
To modify a format:
Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.
On the Content DB Home page, click the Administration tab.
In the Formats table row, click the Go to Task icon.
On the Formats page, click the name of the format you want to modify.
On the Edit Format page, you can change the following information:
MIME Type: Specify the type of content stored in Oracle Content DB, such as text/plain
or text/html
. Click the Flashlight icon to select from a list of MIME types.
Extension: Specify the default extension for files that use this format, such as .fm
or .jar
. Click the Flashlight icon to select from a list of file extensions.
The names of files uploaded from UNIX or Linux clients are case-sensitive. If the case of the extension for your format (for example, .ZIP
) does not match the case of the extension for the file uploaded from UNIX or Linux (for example, .zip
), the uploaded file will be classified as the Unknown format, and the content will not be indexed. Files that are not indexed do not show up in content search results.
Binary: Specify whether files that use this format are of binary type.
Omitted From Antivirus Scan: Specify whether files that use this format need to be omitted from antivirus scans.
Indexed: Specify whether files that use this format need to be indexed. Changing this setting only affects new documents that are uploaded to Oracle Content DB; the index setting for existing documents that use this format will not be changed. To force indexing of existing documents, upload the documents again after changing this setting.
Click OK.
Some formats must be indexed. For these formats, the index setting cannot be changed.
You can delete formats using the Application Server Control. The Unknown format is a required system format and cannot be deleted.
To delete a format:
Connect to the Application Server Control and go to the Content DB Home page. See "Accessing the Oracle Content DB Home Page" for information about how to do this.
On the Content DB Home page, click the Administration tab.
In the Formats table row, click the Go to Task icon.
On the Formats page, select the format you want to delete.
Click Delete.
On the Warning page, click Yes.
Table 10-1 provides a list of default formats.
Table 10-1 Default System Formats
Format Name | Extension | Indexed by Default? | Can Change Index Setting?Foot 1 |
---|---|---|---|
Advanced Stream Redirector File |
asx |
No |
Yes |
Advanced Streaming Format |
asf |
No |
Yes |
Apple Quicktime |
mov |
Yes |
No |
Apple Quicktime (qt) |
qt |
Yes |
No |
Audio Interchange File (aif) |
aif |
Yes |
No |
Audio Interchange File (aifc) |
aifc |
Yes |
No |
Audio Interchange File (aiff) |
aiff |
Yes |
No |
Basic audio |
au |
Yes |
No |
Bitmap image |
bmp |
Yes |
No |
c file |
c |
Yes |
Yes |
C Header |
h |
Yes |
Yes |
C++ Header (h++) |
h++ |
Yes |
Yes |
C++ Header (hh) |
hh |
Yes |
Yes |
C++ Header (hpp) |
hpp |
Yes |
Yes |
C++ Header (hxx) |
hxx |
Yes |
Yes |
C++ Source Code (C++) |
c++ |
Yes |
Yes |
C++ Source Code (cc) |
cc |
Yes |
Yes |
C++ Source Code (cpp) |
cpp |
Yes |
Yes |
CC++ Source Code (cxx) |
cxx |
Yes |
Yes |
Comma-Separated Values |
csv |
Yes |
Yes |
Compiled WML Document |
wmlc |
No |
Yes |
Compiled WML Script |
wmlsc |
No |
Yes |
Compressed File |
taz |
No |
Yes |
Corel Photo-Paint Image |
cpt |
No |
Yes |
Corel Vector Graphic Drawing |
cdr |
No |
Yes |
Corel Vector Pattern |
pat |
No |
Yes |
CorelDraw Template |
cdt |
No |
Yes |
Debian Linux Package |
deb |
No |
Yes |
Difference File |
diff |
Yes |
Yes |
Email Message |
eml |
Yes |
No |
Encapsulated PostScript |
eps |
Yes |
Yes |
Extensible HyperText Markup Language File |
xhtml |
Yes |
Yes |
Extensible Markup Language |
xml |
Yes |
Yes |
FileMaker Pro Spreadsheet |
fm |
Yes |
Yes |
FrameMaker Book |
book |
Yes |
Yes |
FrameMaker FBDOC |
fbdoc |
Yes |
Yes |
FrameMaker FRAME |
frame |
Yes |
Yes |
FrameMaker FRM |
frm |
Yes |
Yes |
FrameMaker MAKER |
maker |
Yes |
Yes |
GIF |
gif |
Yes |
No |
GNU tar Compressed File Archive (GNU Tape Archive) |
gtar |
No |
Yes |
GZIP |
gz |
No |
Yes |
HTML |
htm |
Yes |
Yes |
HTML unix |
html |
Yes |
No |
Hypertext Cascading Style Sheet |
css |
Yes |
Yes |
JAR |
jar |
No |
Yes |
Java Bytecode |
class |
No |
Yes |
java file |
java |
Yes |
Yes |
Java Serialized Object File |
ser |
No |
Yes |
JavaScript Source Code |
js |
Yes |
Yes |
JNLP |
jnlp |
No |
Yes |
JPEG |
jpg |
Yes |
No |
JPEG (jpe) |
jpe |
Yes |
No |
JPEG (jpeg) |
jpeg |
Yes |
No |
JSP |
jsp |
Yes |
Yes |
Lotus 123 Spreadsheet |
wk |
Yes |
Yes |
Macintosh Sound Resource |
snd |
No |
Yes |
Macromedia Director Movie |
dir |
No |
Yes |
Macromedia Director Protected Movie File |
dxr |
No |
Yes |
Macromedia Flash Format File |
swf |
No |
Yes |
Macromedia Flash Format File - swfl |
swfl |
No |
Yes |
MHTML Document mhtm |
mht |
Yes |
Yes |
MHTML Document mhtml |
mhtml |
Yes |
Yes |
Microsoft AVI |
avi |
Yes |
No |
Microsoft PowerPoint |
ppt |
Yes |
Yes |
Microsoft Powerpoint (pot) |
pot |
Yes |
Yes |
Microsoft Powerpoint Show |
pps |
Yes |
Yes |
Microsoft Wave Audio |
wav |
Yes |
No |
MIDI |
mid |
No |
Yes |
Money Data File |
mny |
No |
Yes |
MP3 Playlist File |
m3u |
No |
Yes |
MPEG |
mpg |
No |
Yes |
MPEG (mpe) |
mpe |
No |
Yes |
MPEG (mpeg) |
mpeg |
No |
Yes |
MPEG - mpega |
mpega |
Yes |
No |
MPEG Layer 2 |
mp2 |
Yes |
No |
MPEG Layer 3 Audio |
mp3 |
Yes |
No |
MPEG Layer 3 Audio Stream |
mpga |
Yes |
No |
MS Access |
mdb |
Yes |
Yes |
MS DOS Batch Processing |
bat |
Yes |
Yes |
MS Excel |
xls |
Yes |
Yes |
MS Excel (xlb) |
xlb |
Yes |
Yes |
MS Executable File |
exe |
No |
Yes |
MS Windows Dynamic Link Library |
dll |
No |
Yes |
MS Word |
doc |
Yes |
Yes |
MS Word (dot) |
dot |
Yes |
Yes |
MS Works |
msw |
Yes |
Yes |
Object File |
o |
No |
Yes |
OpenOffice.org Drawing |
sda |
No |
Yes |
OpenOffice.org Presentation |
sdd |
Yes |
Yes |
Outlook Express News File |
nws |
No |
Yes |
PCX |
pcx |
No |
Yes |
|
|
Yes |
Yes |
PERL Program File |
pl |
Yes |
Yes |
Portable (Public) Network Graphic |
png |
No |
Yes |
portable pixmap |
ppm |
No |
Yes |
Postscript |
ps |
No |
Yes |
postscript-ai |
ai |
No |
Yes |
Project File |
mpp |
Yes |
Yes |
Real Audio (ra) |
ra |
Yes |
No |
Real Audio (ram) |
ram |
Yes |
Yes |
Real Media (rm) |
rm |
Yes |
No |
Real Video |
rv |
Yes |
No |
RedHat Package Manager |
rpm |
No |
Yes |
RichText |
rtf |
Yes |
Yes |
RichText (rtx) |
rtx |
Yes |
Yes |
Schedule/Schedule+ Data |
scd |
No |
Yes |
SGI Video |
movie |
No |
Yes |
Shell Script |
sh |
Yes |
Yes |
Shockwave Movie |
dcr |
No |
Yes |
Sourcecode |
src |
Yes |
Yes |
Standard General Markup Language |
sgml |
Yes |
Yes |
Tab Separated Values File |
tsv |
Yes |
Yes |
Tar |
tar |
No |
Yes |
Tcl (Tool Command Language) Language Script |
tcl |
Yes |
Yes |
Text |
txt |
Yes |
Yes |
Text Document (text) |
text |
Yes |
Yes |
TIFF |
tif |
Yes |
No |
TIFF (tiff) |
tiff |
Yes |
No |
Tk Language Script |
tk |
Yes |
Yes |
UNIX Compressed Archive File |
z |
No |
Yes |
UNIX csh Shell Script |
csh |
Yes |
Yes |
UNIX Tar File Gzipped |
tgz |
No |
Yes |
Unknown |
(N/A) |
No |
No |
Unknown Binary |
bin |
No |
Yes |
URL Reference |
url |
No |
Yes |
vCalendar File |
vcs |
No |
Yes |
vCard File |
vcf |
Yes |
Yes |
Visio Drawing |
vsd |
Yes |
Yes |
VRML |
vrml |
No |
Yes |
Windows Help File |
hlp |
No |
Yes |
Windows Icon |
ico |
No |
Yes |
Wireless Markup Language File |
wml |
Yes |
Yes |
WML Script |
wmls |
Yes |
Yes |
Word Perfect |
wpd |
Yes |
Yes |
Wordperfect 5.1 Document |
wp5 |
Yes |
Yes |
XFIG Graphic File |
fig |
No |
Yes |
xpixmap |
xpm |
No |
Yes |
xpixmap pm |
pm |
No |
Yes |
Zip |
zip |
No |
Yes |
Footnote 1 Some formats must be indexed. For these formats, the index setting cannot be changed.