de.aitools.ie.keyphraseextraction
Class FrequentPhraseExtractor
java.lang.Object
de.aitools.ie.keyphraseextraction.KeyphraseExtractor
de.aitools.ie.keyphraseextraction.FrequentPhraseExtractor
public class FrequentPhraseExtractor
- extends KeyphraseExtractor
A simple KeyphraseExtractor
to extract frequent phrases.
Phrases that contain at least one stopword will be discarded.
- Version:
- $Id: FrequentPhraseExtractor.java,v 1.1 2011/05/10 19:12:56 trenkman Exp $
- Author:
- martin.trenkmann@uni-weimar.de
Method Summary |
java.util.SortedSet<Phrase> |
extract(java.lang.String text)
Extracts phrases of size one, also known as unigrams. |
java.util.SortedSet<Phrase> |
extract(java.lang.String text,
int k)
Extracts phrases of size one, also known as unigrams. |
java.util.SortedSet<Phrase> |
extract(java.lang.String text,
int k,
boolean normalize)
Extracts phrases of size one, also known as unigrams. |
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
FrequentPhraseExtractor
public FrequentPhraseExtractor(java.util.Locale locale)
extract
public java.util.SortedSet<Phrase> extract(java.lang.String text)
- Extracts phrases of size one, also known as unigrams.
- Overrides:
extract
in class KeyphraseExtractor
- Parameters:
text
- the text to extract key phrases from.
- Returns:
- a ranked set of the highest scored key phrases.
- See Also:
KeyphraseExtractor.extract(String, int, int, boolean)
extract
public java.util.SortedSet<Phrase> extract(java.lang.String text,
int k)
- Extracts phrases of size one, also known as unigrams.
- Overrides:
extract
in class KeyphraseExtractor
- Parameters:
text
- the text to extract key phrases from.k
- the maximal number of key phrases to extract.
- Returns:
- a ranked set of the highest scored key phrases.
- See Also:
KeyphraseExtractor.extract(String, int, int, boolean)
extract
public java.util.SortedSet<Phrase> extract(java.lang.String text,
int k,
boolean normalize)
- Extracts phrases of size one, also known as unigrams.
- Overrides:
extract
in class KeyphraseExtractor
- Parameters:
text
- the text to extract key phrases from.k
- the maximal number of key phrases to extract.normalize
- enables/disables score normalization.
- Returns:
- a ranked set of the highest scored key phrases.
- See Also:
KeyphraseExtractor.extract(String, int, int, boolean)