public abstract class CharacterBasedGenerativeModelSegment extends Segment

| Constructor and Description |
|---|
| CharacterBasedGenerativeModelSegment() |
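
| Modifier and Type | Method and Description |
|---|---|
| protected static List<Term> | convert(List<Vertex> vertexList, boolean offsetEnabled): Converts a path into the final result. |
| static CoreDictionary.Attribute | guessAttribute(Term term): Looks up or guesses the attribute of a word. The dictionary is checked first, then letter and digit strings are classified, and finally an attribute is guessed for out-of-vocabulary words. |
| protected abstract List<Term> | roughSegSentence(char[] sentence): Implemented by pure word-segmentation models; outputs words only. |
| protected List<Term> | segSentence(char[] sentence): Segments a sentence. |
| protected List<Vertex> | toVertexList(List<Term> wordList, boolean appendStart): Converts the intermediate result into word-lattice vertices, so that features built on Vertex, such as part-of-speech tagging and NER, can be applied. |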
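
The method summary suggests a template-method design: a concrete model implements only roughSegSentence, and segSentence builds the final terms from that output. The following is a minimal sketch of that pattern, not HanLP's actual implementation. SketchTerm, SketchGenerativeSegment, and CharClassSegment are hypothetical stand-ins for Term, this class, and a concrete subclass; the assumption that segSentence delegates to roughSegSentence is inferred from the descriptions above, and the character-class splitter is a toy model chosen only to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;

// Demo entry point: segments a mixed letter/digit string.
class SegmentSketchDemo {
    public static void main(String[] args) {
        // prints [abc@0, 123@3, def@6]
        System.out.println(new CharClassSegment().seg("abc123def"));
    }
}

// Hypothetical stand-in for Term: a word plus its character offset.
final class SketchTerm {
    final String word;
    final int offset;

    SketchTerm(String word, int offset) {
        this.word = word;
        this.offset = offset;
    }

    @Override
    public String toString() {
        return word + "@" + offset;
    }
}

// Hypothetical stand-in for the abstract class documented above.
abstract class SketchGenerativeSegment {
    // Subclasses only output words (mirrors roughSegSentence).
    protected abstract List<String> roughSegSentence(char[] sentence);

    // Template method (mirrors segSentence, by assumption):
    // takes the rough words and attaches offsets to build terms.
    protected List<SketchTerm> segSentence(char[] sentence) {
        List<SketchTerm> result = new ArrayList<>();
        int offset = 0;
        for (String word : roughSegSentence(sentence)) {
            result.add(new SketchTerm(word, offset));
            offset += word.length();
        }
        return result;
    }

    // Public convenience wrapper around the protected template method.
    public List<SketchTerm> seg(String text) {
        return segSentence(text.toCharArray());
    }
}

// Toy model: splits the sentence into runs of digits, letters, or other characters.
final class CharClassSegment extends SketchGenerativeSegment {
    private static int charClass(char c) {
        if (Character.isDigit(c)) return 0;
        if (Character.isLetter(c)) return 1;
        return 2;
    }

    @Override
    protected List<String> roughSegSentence(char[] sentence) {
        List<String> words = new ArrayList<>();
        int start = 0;
        for (int i = 1; i <= sentence.length; i++) {
            // Close the current run at the end of input or when the class changes.
            if (i == sentence.length
                    || charClass(sentence[i]) != charClass(sentence[start])) {
                words.add(new String(sentence, start, i - start));
                start = i;
            }
        }
        return words;
    }
}
```

Because every character ends up in exactly one word, the running sum of word lengths gives a correct offset for each term. A real subclass would replace the toy splitter with a statistical model while leaving the shared post-processing in the base class.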

Methods inherited from class Segment:
atomSegment, combineByCustomDictionary, combineByCustomDictionary, enableAllNamedEntityRecognize, enableCustomDictionary, enableCustomDictionaryForcing, enableIndexMode, enableIndexMode, enableJapaneseNameRecognize, enableMultithreading, enableMultithreading, enableNameRecognize, enableNumberQuantifierRecognize, enableOffset, enableOrganizationRecognize, enablePartOfSpeechTagging, enablePlaceRecognize, enableTranslatedNameRecognize, mergeNumberQuantifier, quickAtomSegment, seg, seg, seg2sentence, simpleAtomSegment

public static CoreDictionary.Attribute guessAttribute(Term term)
Parameters: term

protected List<Term> segSentence(char[] sentence)
Overrides: segSentence in class Segment
Parameters: sentence - the sentence to segment

protected abstract List<Term> roughSegSentence(char[] sentence)
Parameters: sentence

protected List<Vertex> toVertexList(List<Term> wordList, boolean appendStart)
Parameters: wordList, appendStart

Copyright © 2014–2017 码农场. All rights reserved.