|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use Parse | |
---|---|
com.endeca.itl.web.process | |
org.apache.nutch.crawl | Crawl control code. |
org.apache.nutch.parse |
Uses of Parse in com.endeca.itl.web.process |
---|
Methods in com.endeca.itl.web.process with parameters of type Parse | |
---|---|
List<com.endeca.itl.record.Record> |
EndecaRecordGenerator.generate(Content content,
Parse parse)
|
Uses of Parse in org.apache.nutch.crawl |
---|
Methods in org.apache.nutch.crawl with parameters of type Parse | |
---|---|
byte[] |
TextProfileSignature.calculate(Content content,
Parse parse)
|
abstract byte[] |
Signature.calculate(Content content,
Parse parse)
|
byte[] |
MD5Signature.calculate(Content content,
Parse parse)
|
Uses of Parse in org.apache.nutch.parse |
---|
Classes in org.apache.nutch.parse that implement Parse | |
---|---|
class |
ParseImpl
The result of parsing a page's raw content. |
Methods in org.apache.nutch.parse that return Parse | |
---|---|
Parse |
ParseFilters.filter(Content content,
Parse parse)
Run all defined filters. |
Parse |
ParseFilter.filter(Content content,
Parse parse)
Adds metadata or modifies parse |
Parse |
HtmlParseFilters.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Deprecated. Run all defined filters. |
Parse |
HtmlParseFilter.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page. |
Parse |
ParseStatus.getEmptyParse(Configuration conf)
A convenience method. |
Parse |
Parser.getParse(Content c)
Creates the parse for some content. |
Parse |
ParseUtil.parse(Content content)
Performs a parse by iterating through a List of preferred Parser s
until a successful parse is performed and a Parse object is
returned. |
Parse |
ParseUtil.parseByExtensionId(String extId,
Content content)
Method parses a Content object using the Parser specified
by the parameter extId , i.e., the Parser's extension ID. |
Methods in org.apache.nutch.parse with parameters of type Parse | |
---|---|
Parse |
ParseFilters.filter(Content content,
Parse parse)
Run all defined filters. |
Parse |
ParseFilter.filter(Content content,
Parse parse)
Adds metadata or modifies parse |
Parse |
HtmlParseFilters.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Deprecated. Run all defined filters. |
Parse |
HtmlParseFilter.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page. |
Constructors in org.apache.nutch.parse with parameters of type Parse | |
---|---|
ParseImpl(Parse parse)
|
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |