| Packages that use CrawlDatum | |
|---|---|
| org.apache.nutch.crawl | Crawl control code. |
| org.apache.nutch.fetcher | The Nutch robot. |
| org.apache.nutch.protocol | |
| Uses of CrawlDatum in org.apache.nutch.crawl |
|---|
| Methods in org.apache.nutch.crawl that return CrawlDatum | |
|---|---|
| static CrawlDatum | CrawlDatum.read(DataInput in) |
| Methods in org.apache.nutch.crawl with parameters of type CrawlDatum | |
|---|---|
| static boolean | CrawlDatum.hasDbStatus(CrawlDatum datum) |
| static boolean | CrawlDatum.hasFetchStatus(CrawlDatum datum) |
| void | CrawlDatum.set(CrawlDatum that) Copy the contents of another instance into this instance. |
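
The `read(DataInput in)` factory and the `set(CrawlDatum that)` copy method listed above follow a common serialization pattern, which the sketch below illustrates. It is only a sketch: `DatumSketch`, its two fields, `readFields`, `write`, and `roundTrip` are hypothetical stand-ins chosen for illustration, not the actual fields or implementation of `org.apache.nutch.crawl.CrawlDatum`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in; NOT the real org.apache.nutch.crawl.CrawlDatum.
public class DatumSketch {
    private byte status;     // assumed field, for illustration only
    private long fetchTime;  // assumed field, for illustration only

    public DatumSketch() {}

    public DatumSketch(byte status, long fetchTime) {
        this.status = status;
        this.fetchTime = fetchTime;
    }

    // Static factory in the style of CrawlDatum.read(DataInput in):
    // create an empty instance, then populate it from the stream.
    public static DatumSketch read(DataInput in) throws IOException {
        DatumSketch result = new DatumSketch();
        result.readFields(in);
        return result;
    }

    // Reads the fields in the same order that write() emits them.
    public void readFields(DataInput in) throws IOException {
        status = in.readByte();
        fetchTime = in.readLong();
    }

    public void write(DataOutput out) throws IOException {
        out.writeByte(status);
        out.writeLong(fetchTime);
    }

    // In the style of CrawlDatum.set(CrawlDatum that):
    // copy every field of the other instance into this one.
    public void set(DatumSketch that) {
        this.status = that.status;
        this.fetchTime = that.fetchTime;
    }

    public byte getStatus() { return status; }

    public long getFetchTime() { return fetchTime; }

    // Helper for the demo: serialize to a byte array and read it back.
    public static DatumSketch roundTrip(DatumSketch datum) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            datum.write(new DataOutputStream(buffer));
            return read(new DataInputStream(
                    new ByteArrayInputStream(buffer.toByteArray())));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        DatumSketch decoded = roundTrip(new DatumSketch((byte) 2, 1000L));
        DatumSketch copy = new DatumSketch();
        copy.set(decoded);  // reuse an existing instance instead of allocating
        System.out.println(copy.getStatus() + " " + copy.getFetchTime());
    }
}
```

A `set`-style copy lets a caller reuse one instance across many records, which is one plausible reason for offering it alongside a static `read` factory.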
| Uses of CrawlDatum in org.apache.nutch.fetcher |
|---|
| Methods in org.apache.nutch.fetcher that return CrawlDatum | |
|---|---|
| CrawlDatum | FetcherOutput.getCrawlDatum() |
| Constructors in org.apache.nutch.fetcher with parameters of type CrawlDatum | |
|---|---|
| FetcherOutput(CrawlDatum crawlDatum, Content content, ParseImpl parse) | |
| Uses of CrawlDatum in org.apache.nutch.protocol |
|---|
| Methods in org.apache.nutch.protocol with parameters of type CrawlDatum | |
|---|---|
| ProtocolOutput | Protocol.getProtocolOutput(Text url, CrawlDatum datum) Returns the Content for a fetchlist entry. |
| RobotRules | Protocol.getRobotRules(Text url, CrawlDatum datum) Retrieve robot rules applicable for this url. |