| Interface | Description |
|---|---|
| URLFilter |
Interface used to limit which URLs enter Nutch.
|
| URLNormalizer |
Interface used to convert URLs to normal form and optionally perform substitutions
|
| Class | Description |
|---|---|
| URLFilterChecker |
Checks one given filter or all filters.
|
| URLFilters |
Creates and caches
URLFilter implementing plugins. |
| URLNormalizers |
This class uses a "chained filter" pattern to run defined normalizers.
|
| URLScopeFilter |
Crawl scoping class
A user can filter the urls by defining crawlscope.mode.
|
| Enum | Description |
|---|---|
| URLScopeFilter.Mode |
| Exception | Description |
|---|---|
| URLFilterException |
Copyright © 2007, 2014, Oracle and/or its affiliates. All rights reserved.