de.aitools.ie.keyphraseextraction
Class FrequentPhraseExtractor

java.lang.Object
  extended by de.aitools.ie.keyphraseextraction.KeyphraseExtractor
      extended by de.aitools.ie.keyphraseextraction.FrequentPhraseExtractor

public class FrequentPhraseExtractor
extends KeyphraseExtractor

A simple KeyphraseExtractor to extract frequent phrases. Phrases that contain at least one stopword will be discarded.

Version:
$Id: FrequentPhraseExtractor.java,v 1.1 2011/05/10 19:12:56 trenkman Exp $
Author:
martin.trenkmann@uni-weimar.de

Constructor Summary
FrequentPhraseExtractor(java.util.Locale locale)
           
 
Method Summary
 java.util.SortedSet<Phrase> extract(java.lang.String text)
          Extracts phrases of size one, also known as unigrams.
 java.util.SortedSet<Phrase> extract(java.lang.String text, int k)
          Extracts phrases of size one, also known as unigrams.
 java.util.SortedSet<Phrase> extract(java.lang.String text, int k, boolean normalize)
          Extracts phrases of size one, also known as unigrams.
 
Methods inherited from class de.aitools.ie.keyphraseextraction.KeyphraseExtractor
extract, getLocale, getTopPhrases, setLocale
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

FrequentPhraseExtractor

public FrequentPhraseExtractor(java.util.Locale locale)
Method Detail

extract

public java.util.SortedSet<Phrase> extract(java.lang.String text)
Extracts phrases of size one, also known as unigrams.

Overrides:
extract in class KeyphraseExtractor
Parameters:
text - the text to extract key phrases from.
Returns:
a ranked set of the highest scored key phrases.
See Also:
KeyphraseExtractor.extract(String, int, int, boolean)

extract

public java.util.SortedSet<Phrase> extract(java.lang.String text,
                                           int k)
Extracts phrases of size one, also known as unigrams.

Overrides:
extract in class KeyphraseExtractor
Parameters:
text - the text to extract key phrases from.
k - the maximal number of key phrases to extract.
Returns:
a ranked set of the highest scored key phrases.
See Also:
KeyphraseExtractor.extract(String, int, int, boolean)

extract

public java.util.SortedSet<Phrase> extract(java.lang.String text,
                                           int k,
                                           boolean normalize)
Extracts phrases of size one, also known as unigrams.

Overrides:
extract in class KeyphraseExtractor
Parameters:
text - the text to extract key phrases from.
k - the maximal number of key phrases to extract.
normalize - enables/disables score normalization.
Returns:
a ranked set of the highest scored key phrases.
See Also:
KeyphraseExtractor.extract(String, int, int, boolean)