de.aitools.ir.retrievalmodels.representer
Class TFPDF
java.lang.Object
de.aitools.ir.retrievalmodels.representer.AbstractRepresenter<java.lang.String,Vector>
de.aitools.ir.retrievalmodels.representer.TFPDF
- All Implemented Interfaces:
- Representer<java.lang.String,Vector>, java.io.Serializable
public class TFPDF
- extends AbstractRepresenter<java.lang.String,Vector>
Implementation of TFPDF, a variant of TFIDF in which the document frequency
is weighted exponentially in order to favor terms that occur in many
documents.
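The weighting described above can be sketched as a small stand-alone class. This is a minimal illustration, not the code of this class: it assumes a single document collection, whitespace tokenization, and the weight w(t) = (tf(t) / ||tf||) * exp(df(t) / N) following Bun and Ishizuka, where df(t) is the number of training texts containing t and N is the number of training texts. The class name TfPdfSketch, the tokenize helper, and the Map return type (used here in place of Vector) are hypothetical.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-alone sketch of TF-PDF weighting; not the aitools TFPDF class. */
class TfPdfSketch {
    private final Map<String, Integer> documentFrequency = new HashMap<>();
    private int documentCount = 0;

    /** Counts, for each term, how many training texts contain it. */
    void train(Iterable<String> texts) {
        documentFrequency.clear();
        documentCount = 0;
        for (String text : texts) {
            documentCount++;
            // A set ensures each term is counted once per text.
            for (String term : new HashSet<>(tokenize(text))) {
                documentFrequency.merge(term, 1, Integer::sum);
            }
        }
    }

    boolean isTrained() {
        return documentCount > 0;
    }

    /** Maps a text to term weights: L2-normalized tf times exp(df / N). */
    Map<String, Double> represent(String text) {
        if (!isTrained()) {
            throw new IllegalStateException("call train() first");
        }
        Map<String, Integer> tf = new HashMap<>();
        for (String term : tokenize(text)) {
            tf.merge(term, 1, Integer::sum);
        }
        double norm = 0.0;
        for (int f : tf.values()) {
            norm += (double) f * f;
        }
        norm = Math.sqrt(norm);

        Map<String, Double> weights = new HashMap<>();
        for (Map.Entry<String, Integer> e : tf.entrySet()) {
            int df = documentFrequency.getOrDefault(e.getKey(), 0);
            // The exponential grows with the share of texts containing the term.
            double pdf = Math.exp((double) df / documentCount);
            weights.put(e.getKey(), e.getValue() / norm * pdf);
        }
        return weights;
    }

    /** Assumed tokenizer: lower-case, split on whitespace. */
    private static List<String> tokenize(String text) {
        return Arrays.asList(text.toLowerCase().trim().split("\\s+"));
    }
}
```

In this sketch a term found in every training text receives the factor exp(1), while an unseen term receives exp(0) = 1, so widely shared terms are favored rather than penalized as they would be under TFIDF.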
References:
K. Bun and M. Ishizuka. Emerging topic tracking system in WWW.
In Proceedings of WI 2001, pages 125-130.
- Version:
- $Id: TFPDF.java,v 1.1 2012/04/23 13:44:42 hoppe Exp $
- Author:
- dennis.hoppe(/\t)uni-weimar.de
- See Also:
- Serialized Form
Method Summary
boolean isTrained()
Vector represent(java.lang.String text)
void train(java.lang.Iterable<java.lang.String> texts, boolean forceTraining)
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
TFPDF
public TFPDF(java.util.Locale locale)
TFPDF
public TFPDF(TermFrequency tf)
Method Detail
represent
public Vector represent(java.lang.String text)
train
public void train(java.lang.Iterable<java.lang.String> texts,
boolean forceTraining)
isTrained
public boolean isTrained()