Uses of Interface
org.apache.nutch.parse.Parse

Packages that use Parse
com.endeca.itl.web.process   
org.apache.nutch.crawl Crawl control code. 
org.apache.nutch.parse   
 

Uses of Parse in com.endeca.itl.web.process
 

Methods in com.endeca.itl.web.process with parameters of type Parse
 List<com.endeca.itl.record.Record> EndecaRecordGenerator.generate(Content content, Parse parse)
           
 

Uses of Parse in org.apache.nutch.crawl
 

Methods in org.apache.nutch.crawl with parameters of type Parse
 byte[] TextProfileSignature.calculate(Content content, Parse parse)
           
abstract  byte[] Signature.calculate(Content content, Parse parse)
           
 byte[] MD5Signature.calculate(Content content, Parse parse)
           
 

Uses of Parse in org.apache.nutch.parse
 

Classes in org.apache.nutch.parse that implement Parse
 class ParseImpl
          The result of parsing a page's raw content.
 

Methods in org.apache.nutch.parse that return Parse
 Parse ParseFilters.filter(Content content, Parse parse)
          Run all defined filters.
 Parse ParseFilter.filter(Content content, Parse parse)
          Adds metadata or modifies parse
 Parse HtmlParseFilters.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
          Deprecated. Run all defined filters.
 Parse HtmlParseFilter.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
          Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
 Parse ParseStatus.getEmptyParse(Configuration conf)
          A convenience method.
 Parse Parser.getParse(Content c)
          Creates the parse for some content.
 Parse ParseUtil.parse(Content content)
          Performs a parse by iterating through a List of preferred Parsers until a successful parse is performed and a Parse object is returned.
 Parse ParseUtil.parseByExtensionId(String extId, Content content)
          Method parses a Content object using the Parser specified by the parameter extId, i.e., the Parser's extension ID.
 

Methods in org.apache.nutch.parse with parameters of type Parse
 Parse ParseFilters.filter(Content content, Parse parse)
          Run all defined filters.
 Parse ParseFilter.filter(Content content, Parse parse)
          Adds metadata or modifies parse
 Parse HtmlParseFilters.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
          Deprecated. Run all defined filters.
 Parse HtmlParseFilter.filter(Content content, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
          Deprecated. Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
 

Constructors in org.apache.nutch.parse with parameters of type Parse
ParseImpl(Parse parse)
           
 



Copyright © 2007, 2012, Oracle and/or its affiliates. All rights reserved.