Testing a Content Crawler
Before a content crawler imports content into the public folders
of your portal, test it by running a job that crawls document
records into a temporary test folder.
Create a test folder, and on the folder's Security page remove
the Everyone group and any other public groups
so that users cannot access the test content.
- Make sure the content crawler creates the correct links.
Examine the target folder and ensure the content crawler has
generated records and links for the content you want and has not
created unwanted records and links.
If you repeat this testing step after modifying the content
crawler configuration, first delete the contents of the test
folder and clear the deletion history for the content crawler.
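The comparison in this step can be done by eye, but for larger crawls it can help to write it down. The following is a minimal sketch, assuming you have compiled two lists by hand: the document titles you expect and the titles that appear in the test folder. The function name compare_links and the sample titles are hypothetical, and the script does not call any portal API.

```python
def compare_links(expected_titles, actual_titles):
    """Return (missing, unwanted) titles as two sorted lists."""
    expected = set(expected_titles)
    actual = set(actual_titles)
    missing = sorted(expected - actual)    # wanted, but not created by the crawl
    unwanted = sorted(actual - expected)   # created by the crawl, but not wanted
    return missing, unwanted


# Hypothetical titles, typed in by hand for illustration only.
expected = ["Benefits Overview.doc", "Travel Policy.pdf"]
actual = ["Travel Policy.pdf", "Thumbs.db"]

missing, unwanted = compare_links(expected, actual)
print("Missing:", missing)
print("Unwanted:", unwanted)
```

An empty result in both lists suggests the crawl created exactly the links you listed. Any entry under "Unwanted" points to a record you may want to exclude in the content crawler configuration.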
- Make sure the content crawler creates correct metadata.
Make sure that all documents are given the right content types,
and that these content types correctly map properties to source document
attributes.
Go to the Knowledge Directory, and look at the properties and
content types of a few of the documents this content crawler imported
to see if they are the properties and content types you expected.
To view the properties and content type for a document:
- Click Directory and navigate to the folder
that contains the document whose properties and content type you want
to view.
- Click Properties under the document to
display the information about the document. The properties are displayed
in a table along with their values. The content type is displayed
at the bottom of the page.
If you repeat this testing step after modifying the content
crawler configuration, make sure you configure the content crawler
to refresh these links.
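If you spot-check more than a few documents, recording what you see on each Properties page makes the results easier to compare between test runs. The following is a minimal sketch, assuming you copy the content type and property values into a dictionary by hand. The function name check_metadata, the dictionary layout, and the sample content type and property names are all hypothetical and are not part of the portal.

```python
def check_metadata(document, expected_type, required_properties):
    """Return a list of problems found for one hand-recorded document."""
    problems = []
    actual_type = document.get("content_type")
    if actual_type != expected_type:
        problems.append(
            f"content type is {actual_type!r}, expected {expected_type!r}"
        )
    properties = document.get("properties", {})
    for name in required_properties:
        # Treat a property that is absent or blank as not mapped correctly.
        if properties.get(name) in (None, ""):
            problems.append(f"property {name!r} is missing or empty")
    return problems


# Hypothetical values copied from a document's Properties page.
sample = {
    "content_type": "MS Word Document",
    "properties": {"Title": "Travel Policy", "Author": ""},
}

problems = check_metadata(sample, "MS Word Document", ["Title", "Author"])
for problem in problems:
    print(problem)
```

In this sample the Author value is blank, so the check reports it. A blank value like this may indicate that the content type does not map that property to a source document attribute.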
- Test properties, filters, and search.
To test that document properties have been configured to enable
filters and search, browse to the test folder and perform a search
using the same expression as the filter you are testing. Either
copy and paste the text from the filter into the portal search box,
or use the Advanced Search tool to enter expressions involving properties.
Select Search Only in this Folder. The links
returned by the search correspond to the documents that will pass
your filter.