public class TfIdfCounter extends KeywordExtractor
defaultSegment| Constructor and Description |
|---|
TfIdfCounter() |
TfIdfCounter(boolean filterStopWord) |
TfIdfCounter(Segment defaultSegment) |
TfIdfCounter(Segment defaultSegment,
boolean filterStopWord) |
| Modifier and Type | Method and Description |
|---|---|
void |
add(List<Term> termList) |
void |
add(Object id,
List<Term> termList) |
void |
add(Object id,
String text)
添加文档
|
int |
add(String text)
添加文档,自动分配id
|
Map<String,Double> |
allTf() |
Map<Object,Map<String,Double>> |
compute() |
Set<Object> |
documents() |
List<String> |
getKeywords(List<Term> termList,
int size) |
List<Map.Entry<String,Double>> |
getKeywordsOf(Object id) |
List<Map.Entry<String,Double>> |
getKeywordsOf(Object id,
int size) |
List<Map.Entry<String,Double>> |
getKeywordsWithTfIdf(List<Term> termList,
int size) |
List<Map.Entry<String,Double>> |
getKeywordsWithTfIdf(String document,
int size) |
Map<Object,Map<String,Double>> |
getTfMap() |
void |
loadIdfFile(String idfPath)
加载自定义idf文件
|
List<Map.Entry<String,Double>> |
sortedAllTf() |
List<Map.Entry<String,Integer>> |
sortedAllTfInt() |
filter, getKeywords, getKeywords, getSegment, setSegment, shouldIncludepublic TfIdfCounter()
public TfIdfCounter(boolean filterStopWord)
public TfIdfCounter(Segment defaultSegment, boolean filterStopWord)
public TfIdfCounter(Segment defaultSegment)
public List<String> getKeywords(List<Term> termList, int size)
getKeywords in class KeywordExtractorpublic List<Map.Entry<String,Double>> getKeywordsWithTfIdf(String document, int size)
public List<Map.Entry<String,Double>> getKeywordsWithTfIdf(List<Term> termList, int size)
public int add(String text)
text - public void loadIdfFile(String idfPath)
idfPath - Copyright © 2014–2021 码农场. All rights reserved.