Packages that use CrawlDatum | |
---|---|
org.apache.nutch.crawl | Crawl control code. |
org.apache.nutch.fetcher | The Nutch robot. |
org.apache.nutch.protocol | |
Uses of CrawlDatum in org.apache.nutch.crawl |
---|
Methods in org.apache.nutch.crawl that return CrawlDatum | |
---|---|
static CrawlDatum | CrawlDatum.read(DataInput in) |
Methods in org.apache.nutch.crawl with parameters of type CrawlDatum | |
---|---|
static boolean | CrawlDatum.hasDbStatus(CrawlDatum datum) |
static boolean | CrawlDatum.hasFetchStatus(CrawlDatum datum) |
void | CrawlDatum.set(CrawlDatum that): Copy the contents of another instance into this instance. |
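
The two signatures above, a static `read(DataInput in)` that returns a new instance and an instance-level `set(...)` that copies another instance's contents, can be illustrated with a minimal sketch. This is not Nutch code: `DatumSketch`, its two fields, its `write` method, and its byte layout are hypothetical stand-ins invented for the example, and the real `CrawlDatum` fields and serialized format are not shown on this page.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in for CrawlDatum; fields and wire format are assumptions.
class DatumSketch {
    private byte status;
    private long fetchTime;

    DatumSketch() {}

    DatumSketch(byte status, long fetchTime) {
        this.status = status;
        this.fetchTime = fetchTime;
    }

    // Same shape as the static CrawlDatum.read(DataInput in):
    // build a fresh instance from a binary stream.
    static DatumSketch read(DataInput in) throws IOException {
        DatumSketch d = new DatumSketch();
        d.status = in.readByte();
        d.fetchTime = in.readLong();
        return d;
    }

    // Counterpart of read(); writes the fields in the same order.
    void write(DataOutput out) throws IOException {
        out.writeByte(status);
        out.writeLong(fetchTime);
    }

    // Same shape as CrawlDatum.set(CrawlDatum that):
    // copy every field of another instance into this one.
    void set(DatumSketch that) {
        this.status = that.status;
        this.fetchTime = that.fetchTime;
    }

    byte getStatus() { return status; }
    long getFetchTime() { return fetchTime; }
}

class DatumSketchDemo {
    // Serializes a datum, reads it back, then copies it into a reusable instance.
    static DatumSketch roundTripAndCopy(DatumSketch original) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            original.write(new DataOutputStream(bytes));
            DataInput in = new DataInputStream(
                    new ByteArrayInputStream(bytes.toByteArray()));
            DatumSketch decoded = DatumSketch.read(in);
            DatumSketch reused = new DatumSketch();
            reused.set(decoded);
            return reused;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        DatumSketch copy = roundTripAndCopy(new DatumSketch((byte) 2, 1000L));
        System.out.println(copy.getStatus() + " " + copy.getFetchTime());
    }
}
```

Copying into an existing instance with `set` rather than allocating a new object is a common way to reuse one object across many records; whether Nutch callers use `CrawlDatum.set` this way is not stated here.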
Uses of CrawlDatum in org.apache.nutch.fetcher |
---|
Methods in org.apache.nutch.fetcher that return CrawlDatum | |
---|---|
CrawlDatum | FetcherOutput.getCrawlDatum() |
Constructors in org.apache.nutch.fetcher with parameters of type CrawlDatum | |
---|---|
FetcherOutput(CrawlDatum crawlDatum, Content content, ParseImpl parse) | |
Uses of CrawlDatum in org.apache.nutch.protocol |
---|
Methods in org.apache.nutch.protocol with parameters of type CrawlDatum | |
---|---|
ProtocolOutput | Protocol.getProtocolOutput(Text url, CrawlDatum datum): Returns the Content for a fetchlist entry. |
RobotRules | Protocol.getRobotRules(Text url, CrawlDatum datum): Retrieve robot rules applicable for this url. |