com.norconex.importer.parser.impl
Class PDFParser

java.lang.Object
  extended by com.norconex.importer.parser.impl.AbstractTikaParser
      extended by com.norconex.importer.parser.impl.PDFParser
All Implemented Interfaces:
IDocumentParser, Serializable

public class PDFParser
extends AbstractTikaParser

HTML parser based on Apache Tika org.apache.tika.parser.pdf.PDFParser.

Author:
Pascal Essiembre
See Also:
Serialized Form

Nested Class Summary
 
Nested classes/interfaces inherited from class com.norconex.importer.parser.impl.AbstractTikaParser
AbstractTikaParser.RecursiveMetadataParser
 
Field Summary
 
Fields inherited from interface com.norconex.importer.parser.IDocumentParser
RDF_BASE_URI, RDF_SUBJECT_CONTENT
 
Constructor Summary
PDFParser(String format)
           
 
Method Summary
 
Methods inherited from class com.norconex.importer.parser.impl.AbstractTikaParser
addTikaMetadata, parseDocument
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PDFParser

public PDFParser(String format)


Copyright © 2009-2013 Norconex Inc.. All Rights Reserved.