| Package | Description |
|---|---|
| com.hankcs.hanlp.corpus.dictionary | |
| com.hankcs.hanlp.corpus.document | |
| com.hankcs.hanlp.corpus.document.sentence | |
| com.hankcs.hanlp.dictionary.other | |
| com.hankcs.hanlp.model.crf | |
| com.hankcs.hanlp.model.hmm | |
| com.hankcs.hanlp.model.perceptron |
感知机在线学习算法的线性序列标注模型。基于这套框架实现了一整套分词、词性标注和命名实体识别功能。
理论参考邓知龙 《基于感知器算法的高效中文分词与词性标注系统设计与实现》,
简介:http://www.hankcs.com/nlp/segment/implementation-of-word-segmentation-device-java-based-on-structured-average-perceptron.html
|
| com.hankcs.hanlp.model.perceptron.instance | |
| com.hankcs.hanlp.model.perceptron.utility | |
| com.hankcs.hanlp.tokenizer |
一些常用的配置好的静态分词器,其中一些具备特殊的预处理功能
可供用户参考以实现自己的预处理逻辑. |
| com.hankcs.hanlp.tokenizer.lexical | |
| com.hankcs.hanlp.tokenizer.pipe | |
| com.hankcs.hanlp.utility |
| Modifier and Type | Method and Description |
|---|---|
void |
CommonDictionaryMaker.learn(Sentence... sentences)
同compute
|
| Modifier and Type | Method and Description |
|---|---|
void |
CommonDictionaryMaker.learn(List<Sentence> sentenceList)
同compute
|
| Modifier and Type | Field and Description |
|---|---|
List<Sentence> |
Document.sentenceList |
| Constructor and Description |
|---|
Document(List<Sentence> sentenceList) |
| Modifier and Type | Method and Description |
|---|---|
static Sentence |
Sentence.create(String param)
以人民日报2014语料格式的字符串创建一个结构化句子
|
Sentence |
Sentence.mergeCompoundWords() |
Sentence |
Sentence.translateCompoundWordLabels()
按照 PartOfSpeechTagDictionary 指定的映射表将复合词词语词性翻译过去
|
Sentence |
Sentence.translateLabels()
按照 PartOfSpeechTagDictionary 指定的映射表将词语词性翻译过去
|
| Modifier and Type | Method and Description |
|---|---|
static void |
CharTable.normalize(Sentence sentence) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
CRFNERecognizer.convertCorpus(Sentence sentence,
BufferedWriter bw) |
protected void |
CRFSegmenter.convertCorpus(Sentence sentence,
BufferedWriter bw) |
protected abstract void |
CRFTagger.convertCorpus(Sentence sentence,
BufferedWriter bw) |
protected void |
CRFPOSTagger.convertCorpus(Sentence sentence,
BufferedWriter bw) |
| Modifier and Type | Method and Description |
|---|---|
protected List<String[]> |
HMMSegmenter.convertToSequence(Sentence sentence) |
protected List<String[]> |
HMMNERecognizer.convertToSequence(Sentence sentence) |
protected abstract List<String[]> |
HMMTrainer.convertToSequence(Sentence sentence) |
protected List<String[]> |
HMMPOSTagger.convertToSequence(Sentence sentence) |
| Modifier and Type | Method and Description |
|---|---|
protected Instance |
PerceptronSegmenter.createInstance(Sentence sentence,
FeatureMap featureMap) |
protected Instance |
NERTrainer.createInstance(Sentence sentence,
FeatureMap featureMap) |
protected abstract Instance |
InstanceConsumer.createInstance(Sentence sentence,
FeatureMap featureMap) |
protected Instance |
CWSTrainer.createInstance(Sentence sentence,
FeatureMap mutableFeatureMap) |
protected Instance |
PerceptronPOSTagger.createInstance(Sentence sentence,
FeatureMap featureMap) |
protected Instance |
PerceptronNERecognizer.createInstance(Sentence sentence,
FeatureMap featureMap) |
protected Instance |
POSTrainer.createInstance(Sentence sentence,
FeatureMap featureMap) |
boolean |
PerceptronTagger.learn(Sentence sentence)
在线学习
|
boolean |
PerceptronLexicalAnalyzer.learn(Sentence sentence)
在线学习
|
| Modifier and Type | Method and Description |
|---|---|
static CWSInstance |
CWSInstance.create(Sentence sentence,
FeatureMap featureMap) |
static POSInstance |
POSInstance.create(Sentence sentence,
FeatureMap featureMap) |
boolean |
InstanceHandler.process(Sentence instance) |
| Constructor and Description |
|---|
NERInstance(Sentence sentence,
FeatureMap featureMap) |
| Modifier and Type | Method and Description |
|---|---|
static List<String[]> |
Utility.convertSentenceToNER(Sentence sentence,
NERTagSet tagSet)
将句子转换为 (单词,词性,NER标签)三元组
|
static void |
Utility.normalize(Sentence sentence) |
| Modifier and Type | Method and Description |
|---|---|
static Sentence |
NLPTokenizer.analyze(String sentence)
词法分析
|
| Modifier and Type | Method and Description |
|---|---|
Sentence |
LexicalAnalyzer.analyze(String sentence)
对句子进行词法分析
|
Sentence |
AbstractLexicalAnalyzer.analyze(String sentence) |
| Modifier and Type | Method and Description |
|---|---|
Sentence |
LexicalAnalyzerPipeline.analyze(String sentence) |
| Modifier and Type | Method and Description |
|---|---|
static String |
TextUtility.combine(Sentence sentence) |
Copyright © 2014–2021 码农场. All rights reserved.