|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use IDocumentParser | |
---|---|
com.norconex.importer.parser | |
com.norconex.importer.parser.impl |
Uses of IDocumentParser in com.norconex.importer.parser |
---|
Methods in com.norconex.importer.parser that return IDocumentParser | |
---|---|
protected IDocumentParser |
DefaultDocumentParserFactory.getFallbackParser()
|
IDocumentParser |
IDocumentParserFactory.getParser(String documentReference,
ContentType contentType)
Gets a document parser, optionally based on its reference or content type. |
IDocumentParser |
DefaultDocumentParserFactory.getParser(String documentReference,
ContentType contentType)
Gets a parser based on content type, regardless of document reference (ignoring it). |
Methods in com.norconex.importer.parser with parameters of type IDocumentParser | |
---|---|
protected void |
DefaultDocumentParserFactory.registerFallbackParser(IDocumentParser parser)
|
protected void |
DefaultDocumentParserFactory.registerNamedParser(ContentType contentType,
IDocumentParser parser)
|
Uses of IDocumentParser in com.norconex.importer.parser.impl |
---|
Classes in com.norconex.importer.parser.impl that implement IDocumentParser | |
---|---|
class |
AbstractTikaParser
Base class wrapping Apache Tika parser for use by the importer. |
class |
FallbackParser
Parser using auto-detection of document content-type to figure out which specific parser to invoke to best parse a document. |
class |
HTMLParser
HTML parser based on Apache Tika HtmlParser . |
class |
PDFParser
HTML parser based on Apache Tika org.apache.tika.parser.pdf.PDFParser . |
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |