public class DocVectorModel extends AbstractVectorModel<Integer>
| Constructor and Description |
|---|
DocVectorModel(WordVectorModel wordVectorModel) |
DocVectorModel(WordVectorModel wordVectorModel,
Segment segment,
boolean filter) |
| Modifier and Type | Method and Description |
|---|---|
Vector |
addDocument(int id,
String content)
添加文档
|
int |
dimension()
模型中的词向量维度
|
void |
enableFilter(boolean filter)
激活/关闭停用词过滤器
|
Segment |
getSegment() |
boolean |
isFilterEnabled()
是否激活了停用词过滤器
|
List<Map.Entry<Integer,Float>> |
nearest(String query)
查询最相似的前10个文档
|
List<Map.Entry<Integer,Float>> |
nearest(String query,
int n)
查询最相似的前n个文档
|
Vector |
query(String content)
将一个文档转为向量
|
void |
setSegment(Segment segment) |
float |
similarity(String what,
String with)
文档相似度计算
|
nearest, nearest, nearest, nearest, remove, similarity, size, vectorpublic DocVectorModel(WordVectorModel wordVectorModel)
public DocVectorModel(WordVectorModel wordVectorModel, Segment segment, boolean filter)
public Vector addDocument(int id, String content)
id - 文档idcontent - 文档内容public List<Map.Entry<Integer,Float>> nearest(String query)
query - 查询语句(或者说一个文档的内容)public List<Map.Entry<Integer,Float>> nearest(String query, int n)
query - 查询语句(或者说一个文档的内容)public Vector query(String content)
query in class AbstractVectorModel<Integer>content - 文档public int dimension()
AbstractVectorModeldimension in class AbstractVectorModel<Integer>public float similarity(String what, String with)
what - with - public Segment getSegment()
public void setSegment(Segment segment)
public boolean isFilterEnabled()
public void enableFilter(boolean filter)
filter - Copyright © 2014–2021 码农场. All rights reserved.