Troubleshooting the Results of a Crawl
If your content crawler does not import the expected content, check
the following:
- Make sure your folder filters are correctly filtering content.
To learn about testing your filters, see Testing Filters.
- Make sure your content crawler did not place unwanted content
into the target folder.
If a document does not filter into any subfolders, your content crawler
might place the document in the target folder. This is determined by a
setting on the Main Settings page of the Folder Editor.
- Make sure the content crawler did not place content into the
Unclassified Documents folder.
If a document cannot be placed in any target folders or subfolders, your
content crawler might place the document in the Unclassified Documents
folder. This is determined by a setting on the Advanced Settings page of
the Content Crawler Editor.
If you have the correct permissions, you can view the Unclassified
Documents folder when you are editing the Knowledge Directory, or by
clicking Administration and then selecting Access Unclassified Documents
in the Select Utility drop-down list.
- Make sure you have at least Edit access to the target folder.
- For web content crawlers, make sure that robot exclusion protocols
(robots.txt rules) or any exclusions or inclusions are not keeping your
content crawler from importing the expected content.
This is determined by a setting on the Web Page Exclusions page of the
Content Crawler Editor.
- Make sure the authentication information specified in the associated
content source allows the portal to access content.
- Review the job history for additional information.
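
For the web content crawler item above, it can help to check outside the portal whether a site's robots.txt rules would block a given URL. The following is a minimal sketch using Python's standard urllib.robotparser module. It is not the portal's own mechanism. The function name is_allowed, the sample rules, the URLs, and the user agent string "ExampleCrawler" are all hypothetical; substitute the site's real robots.txt content and the user agent your crawler actually sends.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules for illustration only.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for url in (
        "http://example.com/private/report.html",
        "http://example.com/public/index.html",
    ):
        verdict = "allowed" if is_allowed(SAMPLE_ROBOTS, "ExampleCrawler", url) else "blocked"
        print(f"{url}: {verdict}")
```

If a URL you expected to import is reported as blocked, the robots.txt rules are a likely cause, and the setting on the Web Page Exclusions page is the place to review.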