public abstract class CharacterBasedSegment extends Segment
config, customDictionary| Constructor and Description |
|---|
CharacterBasedSegment() |
| Modifier and Type | Method and Description |
|---|---|
static CoreDictionary.Attribute |
guessAttribute(Term term)
查询或猜测一个词语的属性,
先查词典,然后对字母、数字串的属性进行判断,最后猜测未登录词
|
protected abstract List<Term> |
roughSegSentence(char[] sentence)
单纯的分词模型实现该方法,仅输出词
|
protected List<Term> |
segSentence(char[] sentence)
以下方法用于纯分词模型
分词、词性标注联合模型则直接重载segSentence
|
protected List<Vertex> |
toVertexList(List<Term> wordList,
boolean appendStart)
将中间结果转换为词网顶点,
这样就可以利用基于Vertex开发的功能, 如词性标注、NER等
|
atomSegment, combineByCustomDictionary, combineByCustomDictionary, combineByCustomDictionary, combineByCustomDictionary, convert, enableAllNamedEntityRecognize, enableCustomDictionary, enableCustomDictionary, enableCustomDictionaryForcing, enableIndexMode, enableIndexMode, enableJapaneseNameRecognize, enableMultithreading, enableMultithreading, enableNameRecognize, enableNumberQuantifierRecognize, enableOffset, enableOrganizationRecognize, enablePartOfSpeechTagging, enablePlaceRecognize, enableTranslatedNameRecognize, mergeNumberQuantifier, quickAtomSegment, seg, seg, seg2sentence, seg2sentence, simpleAtomSegmentpublic static CoreDictionary.Attribute guessAttribute(Term term)
term - protected List<Term> segSentence(char[] sentence)
segSentence in class Segmentsentence - 待分词句子protected abstract List<Term> roughSegSentence(char[] sentence)
sentence - Copyright © 2014–2021 码农场. All rights reserved.