public abstract class KeywordExtractor extends Object
| Modifier and Type | Field and Description |
|---|---|
protected Segment |
defaultSegment
默认分词器
|
| Constructor and Description |
|---|
KeywordExtractor() |
KeywordExtractor(Segment defaultSegment) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
filter(List<Term> termList) |
abstract List<String> |
getKeywords(List<Term> termList,
int size) |
List<String> |
getKeywords(String document)
提取关键词(top 10)
|
List<String> |
getKeywords(String document,
int size)
提取关键词
|
Segment |
getSegment() |
KeywordExtractor |
setSegment(Segment segment)
设置关键词提取器使用的分词器
|
protected boolean |
shouldInclude(Term term)
是否应当将这个term纳入计算,词性属于名词、动词、副词、形容词
|
protected Segment defaultSegment
public KeywordExtractor(Segment defaultSegment)
public KeywordExtractor()
protected boolean shouldInclude(Term term)
term - public KeywordExtractor setSegment(Segment segment)
segment - 任何开启了词性标注的分词器public Segment getSegment()
public List<String> getKeywords(String document, int size)
document - 关键词size - 需要几个关键词public List<String> getKeywords(String document)
document - 文章Copyright © 2014–2021 码农场. All rights reserved.