com.norconex.importer.transformer
Interface IDocumentTransformer

All Superinterfaces:
IImportHandler, Serializable
All Known Implementing Classes:
AbstractCharStreamTransformer, AbstractRestrictiveTransformer, AbstractStringTransformer, StripAfterTransformer, StripBeforeTransformer, StripBetweenTransformer

public interface IDocumentTransformer
extends IImportHandler

Transformers allow to manipulate and convert extracted text and save the modified text back.

Author:
Pascal Essiembre

Method Summary
 void transformDocument(String reference, InputStream input, OutputStream output, Properties metadata, boolean parsed)
          Transforms document content and metadata.
 

Method Detail

transformDocument

void transformDocument(String reference,
                       InputStream input,
                       OutputStream output,
                       Properties metadata,
                       boolean parsed)
                       throws IOException
Transforms document content and metadata.

Parameters:
reference - document reference (e.g. URL)
input - document to transform
output - transformed document
metadata - document metadata
parsed - whether the document has been parsed already or not (a parsed document should normally be text-based)
Throws:
IOException


Copyright © 2009-2013 Norconex Inc.. All Rights Reserved.