| Modifier and Type | Method and Description |
|---|---|
| void | DictionaryMaker.add(IWord word): Adds a word to the dictionary. |
| void | NGramDictionaryMaker.addPair(IWord first, IWord second) |
| Item | DictionaryMaker.get(IWord word) |

| Modifier and Type | Method and Description |
|---|---|
| protected void | NSDictionaryMaker.addToDictionary(List<List<IWord>> sentenceList) |
| protected void | NRDictionaryMaker.addToDictionary(List<List<IWord>> sentenceList) |
| protected void | NTDictionaryMaker.addToDictionary(List<List<IWord>> sentenceList) |
| protected void | NatureDictionaryMaker.addToDictionary(List<List<IWord>> sentenceList) |
| protected abstract void | CommonDictionaryMaker.addToDictionary(List<List<IWord>> sentenceList): Adds entries to the dictionary. Subclasses may customize filtering and similar steps, which keeps this flexible. |
| void | CommonDictionaryMaker.compute(List<List<IWord>> sentenceList): Processes the corpus and prepares the dictionary. |
| protected void | NSDictionaryMaker.roleTag(List<List<IWord>> sentenceList) |
| protected void | NRDictionaryMaker.roleTag(List<List<IWord>> sentenceList) |
| protected void | NTDictionaryMaker.roleTag(List<List<IWord>> sentenceList) |
| protected void | NatureDictionaryMaker.roleTag(List<List<IWord>> sentenceList) |
| protected abstract void | CommonDictionaryMaker.roleTag(List<List<IWord>> sentenceList): Role tagging. A subclass that needs to adjust labels or add new head and tail markers can do so here. |

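The two abstract hooks in the table above suggest a template-method design: `compute` drives the process while each subclass supplies its own `roleTag` and `addToDictionary`. The sketch below illustrates that pattern only. `Token`, `Maker`, `PlaceMaker`, and `sampleCorpus` are hypothetical stand-ins rather than library classes, and the call order inside `compute` (tag first, then add) is an assumption.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class DictionaryMakerSketch {
    /** Hypothetical stand-in for IWord: a surface form plus a mutable label. */
    static final class Token {
        final String value;
        String label;
        Token(String value, String label) { this.value = value; this.label = label; }
    }

    /** Template: compute() fixes the order, subclasses fill in the two hooks. */
    abstract static class Maker {
        final Map<String, Integer> counts = new TreeMap<>();

        // Assumed order: tag roles first, then add the tagged words to the dictionary.
        final void compute(List<List<Token>> sentenceList) {
            roleTag(sentenceList);
            addToDictionary(sentenceList);
        }

        protected abstract void roleTag(List<List<Token>> sentenceList);

        protected abstract void addToDictionary(List<List<Token>> sentenceList);
    }

    /** Example subclass: keeps place names ("ns") and filters everything else out. */
    static final class PlaceMaker extends Maker {
        @Override
        protected void roleTag(List<List<Token>> sentenceList) {
            for (List<Token> sentence : sentenceList) {
                for (Token t : sentence) {
                    // Relabel every token that is not a place name as "Z" (other).
                    if (!"ns".equals(t.label)) t.label = "Z";
                }
            }
        }

        @Override
        protected void addToDictionary(List<List<Token>> sentenceList) {
            for (List<Token> sentence : sentenceList) {
                for (Token t : sentence) {
                    // Custom filter: skip tokens that roleTag marked as "Z".
                    if (!"Z".equals(t.label)) counts.merge(t.value, 1, Integer::sum);
                }
            }
        }
    }

    /** Two tiny hand-written sentences used only for illustration. */
    static List<List<Token>> sampleCorpus() {
        return Arrays.asList(
            Arrays.asList(new Token("Alice", "nr"), new Token("visits", "v"), new Token("Beijing", "ns")),
            Arrays.asList(new Token("Beijing", "ns"), new Token("is", "v"), new Token("large", "a")));
    }

    public static void main(String[] args) {
        PlaceMaker maker = new PlaceMaker();
        maker.compute(sampleCorpus());
        System.out.println(maker.counts);
    }
}
```

Because the filter in `addToDictionary` relies on labels written by `roleTag`, the order fixed in `compute` matters in this sketch.
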
| Modifier and Type | Method and Description |
|---|---|
| static List<List<IWord>> | CorpusLoader.convert2SentenceList(String path) |
| List<List<IWord>> | Document.getComplexSentenceList(): Gets the list of complex sentences. Each word in a sentence may be either a compound word or a simple word. |
| List<IWord> | Document.getWordList(): Gets the word sequence. |
| static List<List<IWord>> | CorpusLoader.loadSentenceList(String path) |

| Modifier and Type | Method and Description |
|---|---|
| static boolean | CorpusLoader.saveSentenceList(List<List<IWord>> sentenceList, String path) |

| Modifier and Type | Field and Description |
|---|---|
| List<IWord> | Sentence.wordList: The word list (a list of compound or simple words). |

| Modifier and Type | Method and Description |
|---|---|
| IWord | Sentence.findFirstWordByLabel(String label): Finds the first word whose part of speech is label (simple words inside compound words are not checked). |

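| Modifier and Type | Method and Description |
|---|---|
| ListIterator<IWord> | Sentence.findFirstWordIteratorByLabel(String label): Returns an iterator positioned at the first word whose part of speech is label (simple words inside compound words are not checked). To inspect that word, call previous; to delete it, call remove. |
| List<IWord> | Sentence.findWordsByLabel(String label): Finds all words whose part of speech is label (simple words inside compound words are not checked). |
| Iterator<IWord> | Sentence.iterator() |
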
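The note on `findFirstWordIteratorByLabel` relies on standard `java.util.ListIterator` behavior: once `next()` has returned the match, the cursor sits just after it, so `previous()` returns that same element and `remove()` deletes the element last returned. The following is a minimal sketch of that usage pattern. `Token`, `findFirstIteratorByLabel`, and `removeFirstByLabel` are hypothetical stand-ins, and returning `null` when nothing matches is an assumption rather than documented behavior.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.ListIterator;

class LabelIteratorSketch {
    /** Hypothetical stand-in for IWord: a surface form plus a part-of-speech label. */
    static final class Token {
        final String value;
        final String label;
        Token(String value, String label) { this.value = value; this.label = label; }
    }

    /** Returns an iterator positioned just after the first match, or null if none (assumed). */
    static ListIterator<Token> findFirstIteratorByLabel(List<Token> tokens, String label) {
        ListIterator<Token> it = tokens.listIterator();
        while (it.hasNext()) {
            if (label.equals(it.next().label)) {
                return it; // cursor now sits right after the matched token
            }
        }
        return null;
    }

    /** Removes the first token with the given label and returns its value, or null. */
    static String removeFirstByLabel(List<Token> tokens, String label) {
        ListIterator<Token> it = findFirstIteratorByLabel(tokens, label);
        if (it == null) return null;
        Token hit = it.previous(); // step back to read the matched token
        it.remove();               // deletes the element last returned by previous()
        return hit.value;
    }

    public static void main(String[] args) {
        List<Token> tokens = new ArrayList<>(Arrays.asList(
            new Token("Alice", "nr"), new Token("visits", "v"), new Token("Beijing", "ns")));
        System.out.println(removeFirstByLabel(tokens, "v"));
        System.out.println(tokens.size());
    }
}
```

The backing list has to be mutable (here an `ArrayList`) for `remove()` to succeed.
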
| Constructor and Description |
|---|
| Sentence(List<IWord> wordList) |

| Modifier and Type | Class and Description |
|---|---|
| class | CompoundWord: A compound word, made up of two or more words. |
| class | Word: A single word. |

| Modifier and Type | Method and Description |
|---|---|
| static IWord | WordFactory.create(String param): Creates the corresponding word from the parameter string. |

| Modifier and Type | Method and Description |
|---|---|
| static IWord | CorpusUtil.compile(IWord word): Compiles a word. |

| Modifier and Type | Method and Description |
|---|---|
| static List<List<IWord>> | CorpusUtil.convert2CompatibleList(List<List<Word>> simpleSentenceList): Converts lists of Word into compatible lists of IWord. |
| static List<IWord> | CorpusUtil.spilt(List<IWord> wordList) |

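A conversion such as `convert2CompatibleList` is needed because Java generics are invariant: a `List<List<Word>>` cannot be assigned to a `List<List<IWord>>` even though `Word` is an `IWord`. The sketch below shows one way such a conversion could be written, by copying each inner list. `Term`, `SimpleTerm`, and `toCompatible` are hypothetical stand-ins, and this is not necessarily how the library implements it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class CompatibleListSketch {
    /** Hypothetical stand-in for the IWord interface. */
    interface Term {
        String value();
    }

    /** Hypothetical stand-in for the concrete Word class. */
    static final class SimpleTerm implements Term {
        private final String value;
        SimpleTerm(String value) { this.value = value; }
        @Override public String value() { return value; }
    }

    /** Copies every inner list into a list typed by the interface. */
    static List<List<Term>> toCompatible(List<List<SimpleTerm>> simpleSentenceList) {
        List<List<Term>> result = new ArrayList<>(simpleSentenceList.size());
        for (List<SimpleTerm> sentence : simpleSentenceList) {
            // ArrayList's copy constructor accepts Collection<? extends Term>.
            result.add(new ArrayList<Term>(sentence));
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<SimpleTerm>> simple = Arrays.asList(
            Arrays.asList(new SimpleTerm("hello"), new SimpleTerm("world")));
        // List<List<Term>> direct = simple;   // would not compile: generics are invariant
        List<List<Term>> compatible = toCompatible(simple);
        System.out.println(compatible.get(0).get(1).value());
    }
}
```

The elements themselves are shared, not cloned; only the list containers are new.
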
| Modifier and Type | Method and Description |
|---|---|
| static Word | Precompiler.compile(IWord word) |
| static IWord | CorpusUtil.compile(IWord word): Compiles a word. |

| Modifier and Type | Method and Description |
|---|---|
| static void | Precompiler.compileWithoutNS(List<IWord> wordList): Precompiles while ignoring ns. |
| static void | Precompiler.compileWithoutNT(List<IWord> wordList): Precompiles while ignoring nt. |
| static List<IWord> | CorpusUtil.spilt(List<IWord> wordList) |

| Modifier and Type | Method and Description |
|---|---|
| Pipe<List<IWord>,List<IWord>> | SegmentPipeline.get(int index) |
| Iterator<Pipe<List<IWord>,List<IWord>>> | SegmentPipeline.iterator() |
| ListIterator<Pipe<List<IWord>,List<IWord>>> | SegmentPipeline.listIterator() |
| ListIterator<Pipe<List<IWord>,List<IWord>>> | SegmentPipeline.listIterator(int index) |
| Pipe<List<IWord>,List<IWord>> | SegmentPipeline.remove(int index) |
| Pipe<List<IWord>,List<IWord>> | SegmentPipeline.set(int index, Pipe<List<IWord>,List<IWord>> element) |
| List<Pipe<List<IWord>,List<IWord>>> | SegmentPipeline.subList(int fromIndex, int toIndex) |

| Modifier and Type | Method and Description |
|---|---|
| void | SegmentPipeline.add(int index, Pipe<List<IWord>,List<IWord>> element) |
| boolean | SegmentPipeline.add(Pipe<List<IWord>,List<IWord>> pipe) |
| boolean | SegmentPipeline.addAll(Collection<? extends Pipe<List<IWord>,List<IWord>>> c) |
| boolean | SegmentPipeline.addAll(int index, Collection<? extends Pipe<List<IWord>,List<IWord>>> c) |
| Pipe<List<IWord>,List<IWord>> | SegmentPipeline.set(int index, Pipe<List<IWord>,List<IWord>> element) |

| Modifier and Type | Method and Description |
|---|---|
| List<IWord> | RegexRecognizePipe.flow(List<IWord> input) |
| List<IWord> | LexicalAnalyzerPipe.flow(List<IWord> input) |

| Constructor and Description |
|---|
| LexicalAnalyzerPipeline(Pipe<String,List<IWord>> first, Pipe<List<IWord>,List<IWord>> last) |

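The signatures above describe a staged design: a first stage turns a `String` into a word list, and each later stage maps a word list to a word list through `flow`. The sketch below shows how such stages could be chained, under stated assumptions. The `Pipe` interface, `whitespaceSplitter`, `numberTagger`, and `run` are hypothetical stand-ins, plain `String` replaces `IWord`, and the `/m` suffix is only an illustrative tag.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class PipelineSketch {
    /** Hypothetical stand-in for the Pipe type: one stage from M to N. */
    interface Pipe<M, N> {
        N flow(M input);
    }

    /** First stage: splits raw text on whitespace. */
    static Pipe<String, List<String>> whitespaceSplitter() {
        return text -> new ArrayList<>(Arrays.asList(text.trim().split("\\s+")));
    }

    /** Later stage: appends "/m" to tokens made only of digits, leaving others unchanged. */
    static Pipe<List<String>, List<String>> numberTagger() {
        return words -> {
            List<String> out = new ArrayList<>(words.size());
            for (String w : words) {
                out.add(w.matches("\\d+") ? w + "/m" : w);
            }
            return out;
        };
    }

    /** Runs the first stage, then feeds the result through each later stage in order. */
    static List<String> run(Pipe<String, List<String>> first,
                            List<Pipe<List<String>, List<String>>> stages,
                            String text) {
        List<String> words = first.flow(text);
        for (Pipe<List<String>, List<String>> stage : stages) {
            words = stage.flow(words);
        }
        return words;
    }

    public static void main(String[] args) {
        List<Pipe<List<String>, List<String>>> stages = new ArrayList<>();
        stages.add(numberTagger());
        System.out.println(run(whitespaceSplitter(), stages, "room 42 open"));
    }
}
```

Keeping the stages in an ordinary `List` is what makes positional operations such as `add(int index, ...)`, `set`, and `remove` natural for a pipeline of this shape.
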
Copyright © 2014–2021 码农场. All rights reserved.