org.apache.nutch.net
Class URLScopeFilter
java.lang.Object
org.apache.nutch.net.URLScopeFilter
public class URLScopeFilter
- extends Object
Crawl scoping class
A user can filter the urls by defining crawlscope.mode. This filter is applied before the reg-ex filter
- Author:
- aliu
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
URLScopeFilter
public URLScopeFilter(Configuration conf)
getMode
public static URLScopeFilter.Mode getMode(String str)
filter
public String filter(String urlString,
URL seed,
URL seedRedirect)
Copyright © 2007, 2012, Oracle and/or its affiliates. All rights reserved.