de.aitools.ie.languagedetection.builder
Class LanguageModelBuilder

java.lang.Object
  extended by de.aitools.ie.languagedetection.builder.LanguageModelBuilder

public class LanguageModelBuilder
extends java.lang.Object

Training of the language models for the language detection framework.

Author:
fabian.loose@uni-weimar.de, martin.potthast@uni-weimar.de

Constructor Summary
LanguageModelBuilder(java.util.Locale language)
           
 
Method Summary
 void addText(java.lang.String text)
          Adds an example text, which must be written in this instance's language.
static void buildFromLanguageDirectories(java.io.File directory, boolean overwrite)
           
static void buildFromTextDirectory(java.io.File directory, java.util.Locale language)
          Builds the language model from a folder containing text files.
static void deleteLanguageModelIndexSerialization()
          To speed up language detection, the language model index in LanguageDetector is serialized.
static void main(java.lang.String[] args)
          Usage example.
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

LanguageModelBuilder

public LanguageModelBuilder(java.util.Locale language)

Method Detail

addText

public void addText(java.lang.String text)
Adds an example text, which must be written in this instance's language.
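
To illustrate how a builder could accumulate statistics from example texts, here is a minimal sketch. The source does not say what kind of model is trained, so character trigram counts are assumed purely for illustration. The class TrigramModelSketch and its count method are hypothetical and are not part of de.aitools; only the constructor argument and the addText signature mirror the documentation above.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch, NOT the de.aitools implementation: it counts the
// character trigrams of every example text added for a single language.
class TrigramModelSketch {
    private final Locale language;
    private final Map<String, Integer> counts = new HashMap<>();

    // Mirrors the documented constructor LanguageModelBuilder(Locale).
    TrigramModelSketch(Locale language) {
        this.language = language;
    }

    // Mirrors the documented addText(String) signature.
    void addText(String text) {
        // Lower-case with the model's locale so casing does not split counts.
        String normalized = text.toLowerCase(language);
        for (int i = 0; i + 3 <= normalized.length(); i++) {
            counts.merge(normalized.substring(i, i + 3), 1, Integer::sum);
        }
    }

    // Assumed helper for inspection: how often a trigram has been seen.
    int count(String trigram) {
        return counts.getOrDefault(trigram, 0);
    }
}
```

Under this assumption, calling addText repeatedly on one instance simply adds to the same counts, which is why every text passed to an instance has to be in that instance's language.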


buildFromTextDirectory

public static void buildFromTextDirectory(java.io.File directory,
                                          java.util.Locale language)
Builds the language model from a folder containing text files.
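
The directory traversal behind such a method might look like the following sketch. The class TextDirectoryReaderSketch and the method readTexts are hypothetical names. The sketch also assumes that files are UTF-8 encoded and that subdirectories are ignored; the source does not specify either.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, NOT part of de.aitools: collects the contents of the
// text files in one folder so they can be handed to a model builder.
class TextDirectoryReaderSketch {
    // Returns the content of every regular file directly inside the
    // directory, ordered by path. Subdirectories are skipped (assumption).
    static List<String> readTexts(File directory) throws IOException {
        File[] files = directory.listFiles(File::isFile);
        if (files == null) {
            throw new IOException("Not a readable directory: " + directory);
        }
        Arrays.sort(files); // deterministic order across platforms
        List<String> texts = new ArrayList<>();
        for (File file : files) {
            // UTF-8 is an assumption; the source does not name an encoding.
            texts.add(new String(Files.readAllBytes(file.toPath()), StandardCharsets.UTF_8));
        }
        return texts;
    }
}
```

Each returned string could then be passed to addText on a builder created for the given language.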


buildFromLanguageDirectories

public static void buildFromLanguageDirectories(java.io.File directory,
                                                boolean overwrite)

deleteLanguageModelIndexSerialization

public static void deleteLanguageModelIndexSerialization()
To speed up language detection, the language model index in LanguageDetector is serialized. When the language models are rebuilt, this serialization becomes invalid and must be deleted, which is what this method does.
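
The invalidation step described here can be sketched as deleting a cached file so that it is regenerated on next use. The class IndexCacheSketch, the method invalidate, and the idea that the serialization lives in a single file at a caller-supplied path are all assumptions; the source does not say where or how the index is stored.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch, NOT the de.aitools implementation: treats the
// serialized index as one cache file that is removed after a rebuild.
class IndexCacheSketch {
    // Deletes the cached index file if present. Returns true when a stale
    // file was actually removed and false when there was nothing to delete.
    static boolean invalidate(Path serializedIndex) throws IOException {
        return Files.deleteIfExists(serializedIndex);
    }
}
```

Using deleteIfExists keeps the call safe to repeat, so it can run after every rebuild without first checking whether a cache is present.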


main

public static void main(java.lang.String[] args)
Usage example.