Package org.apache.nutch.net

Interface Summary
URLFilter Interface used to limit which URLs enter Nutch.
URLNormalizer Interface used to convert URLs to normal form and optionally perform substitutions
 

Class Summary
URLFilterChecker Checks one given filter or all filters.
URLFilters Creates and caches URLFilter implementing plugins.
URLNormalizers This class uses a "chained filter" pattern to run defined normalizers.
URLScopeFilter Crawl scoping class A user can filter the urls by defining crawlscope.mode.
 

Enum Summary
URLScopeFilter.Mode  
 

Exception Summary
URLFilterException  
 



Copyright © 2007, 2012, Oracle and/or its affiliates. All rights reserved.