de.aitools.aq.textextraction
Class PsConverter

java.lang.Object
  extended by de.aitools.aq.textextraction.TextExtractor
      extended by de.aitools.aq.textextraction.PsConverter

public class PsConverter
extends TextExtractor

An implementation of TextExtractor to convert PS files to plain text.

On Linux there are several ways or programs to convert PS files to plain text. The following is a list of these methods and some notes about whether their usage is recommended or not and why.

TODO PDF to PS conversion: pdf2ps or pdftops > take ps2pdf and why

Version:
$Id: PsConverter.java,v 1.1 2011/04/16 02:18:05 trenkman Exp $
Author:
martin.trenkmann@uni-weimar.de

Constructor Summary
PsConverter()
           
 
Method Summary
 void extract(java.io.File from, java.io.File to)
          Extracts plain text from the given inputFile.
 java.util.Set<org.apache.tika.mime.MediaType> getSupportedMediaTypes()
          Returns a the MIME type supported by this converter.
 
Methods inherited from class de.aitools.aq.textextraction.TextExtractor
extract
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PsConverter

public PsConverter()
Method Detail

getSupportedMediaTypes

public java.util.Set<org.apache.tika.mime.MediaType> getSupportedMediaTypes()
Description copied from class: TextExtractor
Returns a the MIME type supported by this converter.

Specified by:
getSupportedMediaTypes in class TextExtractor
Returns:
The supported MIME type.

extract

public void extract(java.io.File from,
                    java.io.File to)
             throws TextExtractorException
Description copied from class: TextExtractor
Extracts plain text from the given inputFile. The retrieved content will be stored in outputFile.

Specified by:
extract in class TextExtractor
Parameters:
from - The input file to extract plain text from
to - The path of the plain text file to be created
Throws:
TextExtractorException