| Packages that use Parse | |
|---|---|
| com.endeca.itl.web.process | |
| org.apache.nutch.crawl | Crawl control code. |
| org.apache.nutch.parse | |
| Uses of Parse in com.endeca.itl.web.process |
|---|
| Methods in com.endeca.itl.web.process with parameters of type Parse | |
|---|---|
| List<com.endeca.itl.record.Record> | EndecaRecordGenerator.generate(Content content, Parse parse) |
| Uses of Parse in org.apache.nutch.crawl |
|---|
| Methods in org.apache.nutch.crawl with parameters of type Parse | |
|---|---|
| byte[] | TextProfileSignature.calculate(Content content, Parse parse) |
| abstract byte[] | Signature.calculate(Content content, Parse parse) |
| byte[] | MD5Signature.calculate(Content content, Parse parse) |
| Uses of Parse in org.apache.nutch.parse |
|---|
| Classes in org.apache.nutch.parse that implement Parse | |
|---|---|
| class | ParseImpl: The result of parsing a page's raw content. |
| Methods in org.apache.nutch.parse that return Parse | |
|---|---|
| Parse | ParseFilters.filter(Content content, Parse parse): Run all defined filters. |
| Parse | ParseFilter.filter(Content content, Parse parse): Adds metadata or modifies parse. |
| Parse | HtmlParseFilters.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc): Deprecated. Run all defined filters. |
| Parse | HtmlParseFilter.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc): Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page. |
| Parse | ParseStatus.getEmptyParse(Configuration conf): A convenience method. |
| Parse | Parser.getParse(Content c): Creates the parse for some content. |
| Parse | ParseUtil.parse(Content content): Performs a parse by iterating through a List of preferred Parsers until a successful parse is performed and a Parse object is returned. |
| Parse | ParseUtil.parseByExtensionId(String extId, Content content): Parses a Content object using the Parser specified by the parameter extId, i.e., the Parser's extension ID. |
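
The ParseUtil.parse description above (try each preferred Parser in turn until one succeeds) can be illustrated with a minimal sketch. It does not use the Nutch classes: SimpleParser, parseWithFallback, and the two example parsers are hypothetical stand-ins, and the real ParseUtil may differ in how it selects parsers and reports failure.

```java
import java.util.List;
import java.util.Optional;

public class PreferredParserFallbackSketch {

    /** Hypothetical stand-in for a parser; NOT the Nutch Parser interface. */
    interface SimpleParser {
        /** Returns the parsed text, or empty if this parser cannot handle the input. */
        Optional<String> tryParse(String rawContent);
    }

    /** Tries each parser in preference order and returns the first successful result. */
    static Optional<String> parseWithFallback(List<SimpleParser> preferred, String rawContent) {
        for (SimpleParser parser : preferred) {
            Optional<String> result = parser.tryParse(rawContent);
            if (result.isPresent()) {
                return result; // stop at the first parser that succeeds
            }
        }
        return Optional.empty(); // no parser in the list could handle the content
    }

    public static void main(String[] args) {
        // Assumed example parsers: one accepts only HTML-looking input, one accepts anything.
        SimpleParser htmlOnly = raw -> raw.startsWith("<html>")
                ? Optional.of(raw.replaceAll("<[^>]+>", ""))
                : Optional.empty();
        SimpleParser plainText = raw -> Optional.of(raw.trim());

        List<SimpleParser> preferred = List.of(htmlOnly, plainText);
        System.out.println(parseWithFallback(preferred, "<html>Hello</html>"));
        System.out.println(parseWithFallback(preferred, "  plain text  "));
    }
}
```

Returning an empty Optional when every parser fails is a choice made for this sketch only; the table above indicates that the Nutch method returns a Parse object.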
| Methods in org.apache.nutch.parse with parameters of type Parse | |
|---|---|
| Parse | ParseFilters.filter(Content content, Parse parse): Run all defined filters. |
| Parse | ParseFilter.filter(Content content, Parse parse): Adds metadata or modifies parse. |
| Parse | HtmlParseFilters.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc): Deprecated. Run all defined filters. |
| Parse | HtmlParseFilter.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc): Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page. |
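
The filter methods listed above take a Parse and return a Parse, which allows the output of one filter to feed the next. The following sketch illustrates that chaining pattern under stated assumptions: SimpleFilter and runAll are hypothetical names, a plain Map stands in for the parse metadata, and nothing here reproduces the actual ParseFilters implementation.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ParseFilterChainSketch {

    /** Hypothetical stand-in for a filter; NOT the Nutch ParseFilter interface. */
    interface SimpleFilter {
        /** Receives the raw content and the current metadata, returns the updated metadata. */
        Map<String, String> filter(String rawContent, Map<String, String> parseMetadata);
    }

    /** Applies every filter in order, passing each filter's result to the next one. */
    static Map<String, String> runAll(List<SimpleFilter> filters,
                                      String rawContent,
                                      Map<String, String> initial) {
        Map<String, String> current = initial;
        for (SimpleFilter filter : filters) {
            current = filter.filter(rawContent, current);
        }
        return current;
    }

    public static void main(String[] args) {
        // Assumed example filter: records the content length as metadata.
        SimpleFilter addLength = (raw, meta) -> {
            Map<String, String> out = new LinkedHashMap<>(meta);
            out.put("length", String.valueOf(raw.length()));
            return out;
        };
        // Assumed example filter: depends on metadata written by the previous filter.
        SimpleFilter flagEmpty = (raw, meta) -> {
            Map<String, String> out = new LinkedHashMap<>(meta);
            out.put("empty", String.valueOf("0".equals(meta.get("length"))));
            return out;
        };

        Map<String, String> result =
                runAll(List.of(addLength, flagEmpty), "some page text", new LinkedHashMap<>());
        System.out.println(result);
    }
}
```

Because each filter sees the previous filter's output, the order of the list matters in this sketch.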
| Constructors in org.apache.nutch.parse with parameters of type Parse | |
|---|---|
| ParseImpl(Parse parse) | |