|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectde.aitools.aq.textextraction.TextExtractor
public abstract class TextExtractor
An abstract class to extract plain text from arbitrary documents.
Derived classes have to implement extract(File, File)
.
Constructor Summary | |
---|---|
TextExtractor()
|
Method Summary | |
---|---|
java.io.File |
extract(java.io.File inputFile)
Extracts plain text from the given inputFile. |
abstract void |
extract(java.io.File inputFile,
java.io.File outputFile)
Extracts plain text from the given inputFile. |
abstract java.util.Set<org.apache.tika.mime.MediaType> |
getSupportedMediaTypes()
Returns a the MIME type supported by this converter. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public TextExtractor()
Method Detail |
---|
public abstract java.util.Set<org.apache.tika.mime.MediaType> getSupportedMediaTypes()
public abstract void extract(java.io.File inputFile, java.io.File outputFile) throws TextExtractorException
inputFile
- The input file to extract plain text fromoutputFile
- The path of the plain text file to be created
TextExtractorException
public java.io.File extract(java.io.File inputFile) throws TextExtractorException
inputFile
- The input file to extract plain text from
TextExtractorException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |