public class Document extends Object implements Serializable
| Modifier and Type | Field and Description |
|---|---|
| `List<Sentence>` | `sentenceList` |
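| Modifier and Type | Method and Description |
|---|---|
| `static Document` | `create(File file)` |
| `static Document` | `create(String param)` |
| `List<List<IWord>>` | `getComplexSentenceList()` Gets the list of complex sentences; each word in a sentence may be either a compound word or a simple word. |
| `List<List<Word>>` | `getSimpleSentenceList()` Gets the list of simple sentences, in which compound words are split into simple words. |
| `List<List<Word>>` | `getSimpleSentenceList(boolean spilt)` Gets the list of simple sentences. |
| `List<List<Word>>` | `getSimpleSentenceList(Set<String> labelSet)` Gets the list of simple sentences; a compound word is split into simple words if its label is one of those specified in the set. |
| `List<Word>` | `getSimpleWordList()` |
| `List<IWord>` | `getWordList()` Gets the word sequence. |
| `String` | `toString()` |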
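
The difference between the `getSimpleSentenceList` variants is easier to see on a tiny example. The following is a minimal, self-contained sketch of the splitting rule described in the summary above; it does not call the library. The `IWord`, `Word`, and `CompoundWord` types here are simplified hypothetical stand-ins, and `flatten` is an illustrative helper, not a method of `Document`. Collapsing an unsplit compound word into a single word that keeps the compound's label is an assumption about the non-split case.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Set;

class SimpleSentenceSketch {

    // Hypothetical stand-in for IWord: anything that carries a label.
    interface IWord {
        String getLabel();
    }

    // Hypothetical stand-in for a simple word: surface text plus a label.
    static final class Word implements IWord {
        final String value;
        final String label;

        Word(String value, String label) {
            this.value = value;
            this.label = label;
        }

        public String getLabel() {
            return label;
        }

        @Override
        public String toString() {
            return value + "/" + label;
        }
    }

    // Hypothetical stand-in for a compound word: simple words grouped under one label.
    static final class CompoundWord implements IWord {
        final List<Word> innerList;
        final String label;

        CompoundWord(List<Word> innerList, String label) {
            this.innerList = innerList;
            this.label = label;
        }

        public String getLabel() {
            return label;
        }

        // Assumption: an unsplit compound becomes one word carrying the compound's label.
        Word toWord() {
            StringBuilder sb = new StringBuilder();
            for (Word w : innerList) {
                sb.append(w.value);
            }
            return new Word(sb.toString(), label);
        }
    }

    // Flattens one sentence; a compound is split only if its label is in labelSet.
    static List<Word> flatten(List<IWord> sentence, Set<String> labelSet) {
        List<Word> out = new ArrayList<Word>();
        for (IWord w : sentence) {
            if (w instanceof CompoundWord) {
                CompoundWord cw = (CompoundWord) w;
                if (labelSet.contains(cw.getLabel())) {
                    out.addAll(cw.innerList);
                } else {
                    out.add(cw.toWord());
                }
            } else {
                out.add((Word) w);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<IWord> sentence = Arrays.<IWord>asList(
                new CompoundWord(Arrays.asList(
                        new Word("Shanghai", "ns"),
                        new Word("Huaan", "nz"),
                        new Word("Company", "n")), "nt"),
                new Word("chairman", "n"));

        // Label "nt" is in the set, so the compound is split.
        System.out.println(flatten(sentence, Collections.singleton("nt")));
        // prints [Shanghai/ns, Huaan/nz, Company/n, chairman/n]

        // Empty set: the compound is kept as a single word.
        System.out.println(flatten(sentence, Collections.<String>emptySet()));
        // prints [ShanghaiHuaanCompany/nt, chairman/n]
    }
}
```

In this sketch, the no-argument and `boolean` variants would correspond to always splitting (or never splitting) compound words, while the `Set<String>` variant decides per label.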
public List<List<Word>> getSimpleSentenceList()
public List<List<IWord>> getComplexSentenceList()
public List<List<Word>> getSimpleSentenceList(boolean spilt)
spilt - if true, compound words are split into simple words
public List<List<Word>> getSimpleSentenceList(Set<String> labelSet)
labelSet -
Copyright © 2014–2021 码农场. All rights reserved.