Index
A B C D E F G H I J K L M O P Q R S T U W
A
- access URL, 3.1.2, 7.3.1.1, 7.3.2.2
- ACLs
-
- defined, 4.1.3.1
- policies, 4.1.3.1, 4.1.3.1, 4.2.1.4
-
- restrictions, 4.1.3.2, 4.1.3.2
- Active Directory
-
- activating the plug-in, 4.1.3.3.1
- IDM systems, 5.24.1
- administration tool, 1.2.2
- administrative user
-
- eqsys, 4.1.2, 4.1.4.1
- AJP13 protocol, 4.4.4, 4.4.4.2, 5.24.1
-
- from remote hosts, 4.1.1
- with OC4J, 4.3.1
- with Oracle HTTP Server, 4.4.4.1, 4.4.4.1, 4.4.4.1, 4.4.4.2
- alternate words, 2.2.2
- Apache Axis
-
- license, F.1
- Apache log4j
-
- license, F.1
- APIs
-
- Authorization Plug-in, 4.1.3.1, 7.1, 7.3.3.2
- Crawler Plug-in, 1.3.4, 7.1, 7.3.1
- Identity Plug-in, 7.1, 7.3.3.1
- Query-time Authorization, 7.1, 7.3.4
- URL Rewriter, 7.1, 7.3.2
- Web Services, 1.3.3, 7.2
-
- Admin Web Service, 7.2
- Query Web Service, 7.2
- Application Server Control Console
-
- overview, 6.10
- authorization
-
- ACLs, 4.2.1.1
- crawler plug-in, 4.2.1.2
- query-time filtering, 4.2.1.4
- self service, 4.2.1.5
- Authorization Plug-in API, 4.1.3.1, 7.1, 7.3.3.2
B
- boundary control of Web crawling, 3.2
- boundary rules, 2.2.3, 3.5.1
-
- defined, 3.2.2
- example using regular expression, 3.2.2.3
- exclusion rules, 3.2.2.2
- inclusion rules, 3.2.2.1
- permanent redirect, 6.5.8
- tuning, 6.5.3
- with dynamic pages, 6.5.4
- with file sources, 6.5.3.1
- with Portal sources, 6.4.4
- with symbolic links, 6.4.2.2
C
- caching documents, 3.4.1.1
- character set detection, 3.2.8
- crawler, 3.1
-
- crawler plug-ins, 3.1.3
- crawling multimedia files, 3.2.2.2.1
- crawling process, 3.3, 6.3
- depth, 3.2.3, 6.5.5
- log file, 3.5.2, 6.5.11, 7.3.2.3
-
- crawler.dat configuration file, 3.5.2
- enabling character set detection, 3.2.8
- setting default document titles, 3.2.7, 3.2.7, 3.2.7
- setting the logging level, 3.5.2.1.1
- maintenance crawls, 3.4.2
- monitoring the crawling process, 3.5
- overview, 3.1
- settings, 3.2
- URL status codes, C
- crawler configuration, 2.2.3
- Crawler Plug-in API, 1.2.3, 1.3.4, 1.3.4, 3.2.1, 4.2.1.2, 7.1, 7.3.1, 7.3.1.1
-
- APIs and classes, 7.3.1.2.4
- crawler.dat configuration file, 3.2.7, 3.2.8, 3.5.2.1
- crawling mode, 3.2.1
D
- database sources
-
- benefits over table sources, 6.4.1.1.3
- limitations, 6.4.1.1.4
- tips, 6.4.1.1.2
- debug mode, 6.9
- display URL, 3.1.2, 7.3.1.1, 7.3.2.2
- document attributes, 3.3, 6.3
- domain rules, 3.2.2
- duplicate documents, 6.5.7
-
- dupMarked, 7.2.4.3.1, 7.2.5.3.1, 7.2.5.3.3, 7.2.5.3.5
- dupRemoved, 7.2.4.3.1, 7.2.5.3.1, 7.2.5.3.3, 7.2.5.3.5
- hasDuplicate, 7.2.4.3.2
- isDuplicate, 7.2.4.3.2
- versus near duplicate documents, 6.5.7
- dynamic pages, 6.5.4
E
- eqsys
-
- administrative user, 4.1.2, 4.1.4.1
- error messages, D
F
- failed schedules, 2.2.1, 6.5.1
- federated search, 1.3.2
-
- characteristics, 6.4.6.1
- example, 5.24.2
- limitations, 6.4.6.2
- setting up, 5.24
- trusted entities, 5.24.1
- federation trusted entities, 5.24.1
- file sources
-
- crawling file URLs, 6.4.2.3
- multibyte environments, 6.4.2.1
- tips, 6.4.2.2
- URL boundary rules
-
- with file sources, 6.5.3.1
- with symbolic links, 6.4.2.2
G
- Google Desktop for Enterprise
-
- integrating with, 6.7
H
- HTML forms, 4.1.2.1
- HTTP authentication, 4.1.2.1, 4.1.4
- HTTP protocol, 3.1.2, 4.1.1, 4.4.4.1, 6.4.2.3
- HTTP proxy server, 2.1, 6.5.2
- HTTP status codes, 3.5.2.1.1, 6.5.8, 6.5.8, 6.5.8, 6.8, C
- HTTPS protocol, 3.1.2, 4.1.1, 4.4, 4.4.4, 5.24.1
- http-web-site.xml file, 4.3.1, 4.4.4.1
I
- identity management systems, 2.2.3, 4.1.1, 4.1.3.1, 4.1.4.1, 4.2.1.1, 5.1.1
- Identity Plug-in API, 7.1, 7.3.3.1
- identity plug-ins, 2.2.3, 5.1.1
-
- ACLs, 4.1.3.1
- activating, 4.1.3.3
- define, 4.1.1
- re-registering, 4.1.3.4
- restrictions, 4.1.3.5
- user authentication, 4.1.3.1
- IMAP server, 4.2.1.5
-
- mailing list sources, 6.4.3
- index
-
- documents, 3.4.1.2
- index memory size, 6.6.4
- index optimization, 6.6.2
- indexing batch size, 6.6.3
J
- Java virtual machine, 6.6.6
- JDBC, 4.1.1, 5.4
- JVM, 6.6.6
K
- keyword in context, 3.6.1
- KWIC, 3.6.1
L
- list of values (LOV), 3.3
- log files
-
- crawler log file, 6.5.11, 7.3.2.3
- OC4J log file, 6.9
M
- mailing list sources
-
- tips, 6.4.3
- metadata, 3.3, 6.3
- multimedia files
-
- crawling, 3.2.2.2.1
O
- OC4J server, 7.2.1, 7.2.3
- optimizing
-
- index, 6.6.2
- Oracle Calendar sources
-
- secure, 5.18
- Oracle Content Database sources, 1.1, 5.19
-
- tips, 1.1, 5.19.1
- Oracle Content Services, 1.1, 5.19.1
- Oracle HTTP Server
-
- channel with Oracle SES, 4.1.4.2
- communicating with, 4.4.4.1
- configuration, 4.4.4.2
- earlier than 10.1.2, 4.4.4.2
- front-ending, 4.1.4.2, 4.3, 4.4.4.1, 4.4.4.1, 4.4.4.1
- mod_oc4j, 4.3.1, 4.3.1
- restart, 4.3
- SSL certificate, 4.4.4.1
- SSL-protect, 5.24.1
- with AJP13 port, 5.24.1
- Oracle Internet Directory
-
- identity plug-in, 4.1.4.1
-
- restrictions, 4.1.3.5
- IDM systems, 5.24.1
- login attribute, 5.18.2
- overview, 4.1.4.1
- Oracle Secure Enterprise Search
-
- accessing Application Server Control Console, 6.10
- administration tool, 1.2.2, 2.2
- backup and recovery, 6.2
- components, 1.2
- crawler, 1.2.1, 3.1
- debug mode, 6.9
- error messages, D
- getting started, 2.1
- global settings, 2.2.3
- integration with Oracle Internet Directory, 4.1.4.1
- overview, 1.1
- security, 4.1
- statistics, 2.2.1
- third party licenses
-
- Apache Axis, F.1
- Apache log4j, F.1
- tuning crawl performance, 6.5
- upgrading, B
- what's new in 10.1.7, Preface
- Oracle undo space, 6.6.7
- OracleAS Portal sources, 4.1.2.1
-
- tips, 6.4.4
- user privileges, 6.4.4
- OracleAS Single Sign-On, 4.1.2.1, 4.1.4.2
P
- passwords
-
- changing, 4.1.2
- temporary, 4.1.2.1
- path rules, 3.2.2
Q
- query configuration, 2.2.3
- query-time authorization
-
- comparison with ACLs, 4.1.3.1
- configuration, 4.2.1.4
R
- relevancy boosting, 2.2.2
-
- limitations, 6.6.5.1
- result filter, 5.19.3, 7.3.3.2, 7.3.4
- ResultFilterPlugin class, 4.2.1.4.1
- ResultFilterPlugin interface
-
- API, 7.3.4
- thread-safety, 7.3.4.7
- robots META tag, 3.2.4, 6.5.6
- robots.txt file, 3.2.4, 6.5.6, 7.3.2.1
- robots.txt protocol, 3.2.4, 6.5.6
- rules
-
- domain, 3.2.2
- path, 3.2.2
S
- schedules, 2.2.1
-
- understanding, 6.5.1
- search attributes
-
- default, 3.3
- search performance, 2.2.2
- searchctl commands, 4.2.1.4.1, 6.4.2.1, 6.11
- searching
-
- advanced search, 3.6.2
- basic search, 3.6.1
- overview, 3.6
- restricting, 3.6.2.2
- source groups, 3.6.1, 3.6.3
- secure search, 1.3.1
-
- identity plug-ins, 2.2.3
- security filters, 2.2.3
- self service authorization, 4.2.1.5
- SOAP, 7.2, 7.2.2, 7.2.2.2
-
- client applications using, 7.2.3.1
- development environment, 7.2.4.2
- message body, 7.2.3
- messages, 7.2.8
- source groups, 2.2.2, 3.6.3
- source hierarchy, 3.6.3
- sources
-
- synchronizing, 3.1, 3.1.3
- types, 1.1
-
- e-mail, 1.1
- EMC Documentum Content Server, 5.5
- federated, 1.1, 1.1, 2.2.3, 5.24
- file, 1.1
- FileNet Content Engine, 5.7
- FileNet Image Services, 5.8
- Lotus Notes, 5.11
- mailing list, 1.1
- Microsoft Exchange, 5.12
- Microsoft SharePoint, 5.13
- NTFS for UNIX, 5.16
- NTFS for Windows, 5.15
- Open Text Livelink, 5.17
- Oracle Calendar, 1.1, 5.18
- Oracle Content Database, 1.1, 5.19
- Oracle E-Business Suite 11i, 5.20.1
- OracleAS Portal, 1.1
- Siebel 8, 5.23
- table, 1.1
- Web, 1.1
- user-defined, 3.1.3
- spell checking, 2.2.3
- SQL*Plus
-
- connecting using, 4.1.1
- SSL, 4.1.1, 4.4.1
-
- certificates, 4.4.1
- crawling Web site with SSL certificates, 4.4.3
- importing certificates, 4.4.3
- in Oracle SES, 4.4
- JSSE, 4.4
- keystore, 4.4.1
- statistics, 2.2.1
- submit URL, 3.6.4
- suggested content, 6.1
-
- example with Google OneBox, 6.1.1
- security options, 6.1
- suggested links, 2.2.2, 6.6.1
T
- table sources
-
- benefits over database sources, 6.4.1.1.1
- limitations, 6.4.1.1.2
- tips, 6.4.1.1.2
- temporary passwords, 4.1.2.1
- tips
-
- using database sources, 6.4.1.1.2
- using file sources, 6.4.2
- using mailing list sources, 6.4.3
- using Oracle Calendar sources, 5.18
- using Oracle Content Database sources, 1.1, 5.19.1
- using OracleAS Portal sources, 6.4.4
- using table sources, 6.4.1.1.2
- using user-defined sources, 6.4.5
- titles, changing, 3.2.7, 3.2.7, 3.2.7
- trusted entities, 5.24.1
U
- undo space, 6.6.7
- UNDO_RETENTION parameter, 6.6.7
- upgrade support, B
- URL boundary rules, 2.2.3, 3.5.1
-
- defined, 3.2.2
- permanent redirect, 6.5.8
- tuning, 6.5.3
- with dynamic pages, 6.5.4
- with Portal sources, 6.4.4
- with symbolic links, 6.4.2.2
- URL crawler status codes, C
- URL link filtering, 7.3.2.1
- URL link rewriting, 7.3.2.2
- URL looping, 6.5.9
- URL queue, 3.1.1
- URL rewriter
-
- creating, 7.3.2.3
- using, 7.3.2.3
- URL Rewriter API, 3.2.6
- URL submission, 3.6.4
- UrlRewriter, 7.3.2
- user authentication, 4.1.3.1
- user authorization, 4.1.3.1
- user-defined sources, 2.2.1
-
- tips, 6.4.5
W
- Web crawling, 7.3.2
-
- boundary control, 3.2
- Web Services API, 1.3.3, 7.1, 7.2
-
- architecture, 7.2.3
- concepts, 7.2.2
-
- SOAP, 7.2.2.2
- WSDL, 7.2.2.3
- data types, 7.2.4
- example, 7.2.7
- installation, 7.2.1
- operations, 7.2.5.1
- query syntax, 7.2.6
- URL, 7.2.1
- WSDL specification, 7.2.2.3, E