org.apache.nutch.net
Class URLScopeFilter

java.lang.Object
  extended by org.apache.nutch.net.URLScopeFilter

public class URLScopeFilter
extends Object

Crawl scoping class. A user can restrict which URLs are crawled by defining crawlscope.mode. This filter is applied before the regex filter.

Author:
aliu

Nested Class Summary
static class URLScopeFilter.Mode
           
 
Constructor Summary
URLScopeFilter(Configuration conf)
           
 
Method Summary
 String filter(String urlString, URL seed, URL seedRedirect)
           
static URLScopeFilter.Mode getMode(String str)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

URLScopeFilter

public URLScopeFilter(Configuration conf)
Method Detail

getMode

public static URLScopeFilter.Mode getMode(String str)

filter

public String filter(String urlString,
                     URL seed,
                     URL seedRedirect)
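
This page does not document the behaviour of filter or getMode, so the following is only a minimal sketch of how a scope filter with this shape might work. It does not use the Nutch classes. The class name ScopeFilterSketch, the mode names HOST and ALL, the fallback to ALL, the helper url, and the convention of returning null for a rejected URL are all assumptions made for illustration.

```java
import java.net.MalformedURLException;
import java.net.URL;
import java.util.Locale;

public class ScopeFilterSketch {

    // Hypothetical modes; the real URLScopeFilter.Mode constants are not listed on this page.
    public enum Mode { HOST, ALL }

    // Hypothetical parser: maps a configuration string to a Mode.
    // Falling back to ALL for null or unknown input is an assumption.
    public static Mode getMode(String str) {
        if (str == null) {
            return Mode.ALL;
        }
        try {
            return Mode.valueOf(str.trim().toUpperCase(Locale.ROOT));
        } catch (IllegalArgumentException e) {
            return Mode.ALL;
        }
    }

    private final Mode mode;

    public ScopeFilterSketch(Mode mode) {
        this.mode = mode;
    }

    // Returns urlString when it is in scope, otherwise null (assumed convention).
    // In HOST mode a URL is in scope when its host matches the seed host
    // or the host the seed redirected to.
    public String filter(String urlString, URL seed, URL seedRedirect) {
        if (mode == Mode.ALL) {
            return urlString;
        }
        URL candidate;
        try {
            candidate = new URL(urlString);
        } catch (MalformedURLException e) {
            return null; // unparsable URLs are treated as out of scope
        }
        String host = candidate.getHost();
        if (seed != null && host.equalsIgnoreCase(seed.getHost())) {
            return urlString;
        }
        if (seedRedirect != null && host.equalsIgnoreCase(seedRedirect.getHost())) {
            return urlString;
        }
        return null;
    }

    // Small helper so callers do not have to handle the checked exception.
    public static URL url(String s) {
        try {
            return new URL(s);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        ScopeFilterSketch filter = new ScopeFilterSketch(getMode("host"));
        URL seed = url("http://example.com/");
        URL redirect = url("http://www.example.com/");
        System.out.println(filter.filter("http://example.com/page", seed, redirect));
        System.out.println(filter.filter("http://www.example.com/page", seed, redirect));
        System.out.println(filter.filter("http://other.org/page", seed, redirect));
    }
}
```

Passing both the seed and its redirect target lets a host-based scope accept links on either host, which is one plausible reason the real method takes both arguments.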


Copyright © 2007, 2012, Oracle and/or its affiliates. All rights reserved.