Initializes internal crawler resources. From the Data Source, the Crawler gets the supported data formats from the Data Source Provider Registry, then goes to the Top Level Document Type Map and picks out the sections that are relevant to it. This initializes the Crawler's Document Type Map (which the user can now edit). Also from the Data Source, the Crawler determines its DataSourceCrawlProvider.
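The initialization sequence described above can be pictured roughly as follows. This is only a minimal sketch under stated assumptions: every type, field, and method name here (`DataSource`, `ProviderInfo`, `Crawler.initialize`, the map keyed by format string) is a hypothetical stand-in invented for illustration, not part of the actual com.plumtree.server API, and the real registry and map structures may differ.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in types; none of these are the real com.plumtree.server API.
class CrawlerInitSketch {

    /** Hypothetical Data Source: only records which provider it belongs to. */
    static final class DataSource {
        final String providerName;
        DataSource(String providerName) { this.providerName = providerName; }
    }

    /** Hypothetical registry entry: supported formats plus the crawl provider name. */
    static final class ProviderInfo {
        final Set<String> supportedFormats;
        final String crawlProviderName;
        ProviderInfo(Set<String> supportedFormats, String crawlProviderName) {
            this.supportedFormats = supportedFormats;
            this.crawlProviderName = crawlProviderName;
        }
    }

    /** Hypothetical Crawler holding its own, user-editable Document Type Map. */
    static final class Crawler {
        final Map<String, String> documentTypeMap = new LinkedHashMap<>();
        String crawlProviderName;

        void initialize(DataSource dataSource,
                        Map<String, ProviderInfo> providerRegistry,
                        Map<String, String> topLevelDocumentTypeMap) {
            // Step 1: look up the Data Source's provider in the registry.
            ProviderInfo info = providerRegistry.get(dataSource.providerName);
            if (info == null) {
                throw new IllegalArgumentException(
                        "Unknown provider: " + dataSource.providerName);
            }
            // Step 2: copy only the top-level sections whose format is supported.
            documentTypeMap.clear();
            for (Map.Entry<String, String> e : topLevelDocumentTypeMap.entrySet()) {
                if (info.supportedFormats.contains(e.getKey())) {
                    documentTypeMap.put(e.getKey(), e.getValue());
                }
            }
            // Step 3: remember which crawl provider serves this Data Source.
            crawlProviderName = info.crawlProviderName;
        }
    }

    /** Builds sample (made-up) data and runs the initialization once. */
    static Crawler demo() {
        Map<String, ProviderInfo> registry = new LinkedHashMap<>();
        registry.put("web", new ProviderInfo(
                new HashSet<>(Arrays.asList("text/html", "application/pdf")),
                "WebCrawlProvider"));

        Map<String, String> topLevel = new LinkedHashMap<>();
        topLevel.put("text/html", "Web Page");
        topLevel.put("application/pdf", "PDF Document");
        topLevel.put("application/msword", "MS Word Document");

        Crawler crawler = new Crawler();
        crawler.initialize(new DataSource("web"), registry, topLevel);
        return crawler;
    }

    public static void main(String[] args) {
        Crawler crawler = demo();
        System.out.println(crawler.documentTypeMap);
        System.out.println(crawler.crawlProviderName);
    }
}
```

In this sketch the Crawler receives a copy of the relevant sections rather than a view of the top-level map, so later user edits to the Crawler's Document Type Map would not affect the top-level map. Whether the real implementation copies or shares the entries is not stated in the description above.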
IPTCrawler Interface | com.plumtree.server Namespace