Package | Description |
---|---|
org.apache.nutch.parse |
Modifier and Type | Method and Description |
---|---|
Outlink[] |
ParseData.getOutlinks()
The outlinks of the page.
|
static Outlink[] |
OutlinkExtractor.getOutlinks(String plainText,
Configuration conf)
Extracts
Outlink from given plain text. |
static Outlink[] |
OutlinkExtractor.getOutlinks(String plainText,
String anchor,
Configuration conf)
Extracts
Outlink from given plain text and adds anchor
to the extracted Outlink s |
static Outlink |
Outlink.read(DataInput in) |
Constructor and Description |
---|
ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta) |
ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta,
Metadata parseMeta) |
ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta,
Metadata parseMeta,
DocumentFragment root,
HTMLMetaTags metaTags) |
Copyright © 2007, 2014, Oracle and/or its affiliates. All rights reserved.