- a - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Adjective
- A - Static variable in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
bad case
- AbstractClassifier - Class in com.hankcs.hanlp.classification.classifiers
-
- AbstractClassifier() - Constructor for class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- AbstractClosestVectors - Class in com.hankcs.hanlp.mining.word2vec
-
- AbstractClosestVectors(String) - Constructor for class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors
-
- AbstractClosestVectors.Result - Class in com.hankcs.hanlp.mining.word2vec
-
- AbstractDataSet - Class in com.hankcs.hanlp.classification.corpus
-
- AbstractDataSet(AbstractModel) - Constructor for class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
Constructs a test set
- AbstractDataSet() - Constructor for class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- AbstractDependencyParser - Class in com.hankcs.hanlp.dependency
-
- AbstractDependencyParser(Segment) - Constructor for class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- AbstractDependencyParser() - Constructor for class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- AbstractInexactAction - Class in com.hankcs.hanlp.dependency.nnparser.action
-
- AbstractInexactAction() - Constructor for class com.hankcs.hanlp.dependency.nnparser.action.AbstractInexactAction
-
- AbstractLexicalAnalyzer - Class in com.hankcs.hanlp.tokenizer.lexical
-
Base class for lexical analyzers (Chinese word segmentation, part-of-speech tagging, and named entity recognition)
- AbstractLexicalAnalyzer() - Constructor for class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- AbstractLexicalAnalyzer(Segmenter) - Constructor for class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- AbstractLexicalAnalyzer(Segmenter, POSTagger) - Constructor for class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- AbstractLexicalAnalyzer(Segmenter, POSTagger, NERecognizer) - Constructor for class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- AbstractModel - Class in com.hankcs.hanlp.classification.models
-
Base class for all text classification models; holds the basic data that needs to be serialized
- AbstractModel() - Constructor for class com.hankcs.hanlp.classification.models.AbstractModel
-
- AbstractTrainer - Class in com.hankcs.hanlp.mining.word2vec
-
- AbstractTrainer() - Constructor for class com.hankcs.hanlp.mining.word2vec.AbstractTrainer
-
- AbstractVectorModel<K> - Class in com.hankcs.hanlp.mining.word2vec
-
Abstract vector model that maps abstract objects to vectors
- AbstractVectorModel(Map<K, Vector>) - Constructor for class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
- AbstractVectorModel() - Constructor for class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
- acceptCustomWord(int, int, CoreDictionary.Attribute) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- accuracy - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Classification accuracy
- Action - Class in com.hankcs.hanlp.dependency.nnparser.action
-
- Action() - Constructor for class com.hankcs.hanlp.dependency.nnparser.action.Action
-
- Action(int, int) - Constructor for class com.hankcs.hanlp.dependency.nnparser.action.Action
-
Creates an action
- action - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.BeamElement
-
- Action - Enum in com.hankcs.hanlp.dependency.perceptron.transition.parser
-
Created by Mohammad Sadegh Rasooli.
- action - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.parser.LabeledAction
-
- actionCost(Action, int, State) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
For the cost of an action given the gold dependencies
For more information see:
Yoav Goldberg and Joakim Nivre.
- ActionFactory - Class in com.hankcs.hanlp.dependency.nnparser.action
-
- ActionFactory() - Constructor for class com.hankcs.hanlp.dependency.nnparser.action.ActionFactory
-
- actionHistory - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- ActionType - Interface in com.hankcs.hanlp.dependency.nnparser.action
-
Actions used by the arc-standard system (Nivre, 2004), similar to the parsing actions proposed by Yamada and Matsumoto
- ActionUtils - Class in com.hankcs.hanlp.dependency.nnparser.action
-
- ActionUtils() - Constructor for class com.hankcs.hanlp.dependency.nnparser.action.ActionUtils
-
- ad - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Adverbial adjective (an adjective used as an adverbial)
- ada_alpha - Variable in class com.hankcs.hanlp.dependency.nnparser.option.LearnOption
-
- ada_eps - Variable in class com.hankcs.hanlp.dependency.nnparser.option.LearnOption
-
- AdaOption - Class in com.hankcs.hanlp.dependency.nnparser.option
-
- AdaOption() - Constructor for class com.hankcs.hanlp.dependency.nnparser.option.AdaOption
-
- add(E) - Method in class com.hankcs.hanlp.algorithm.MaxHeap
-
Adds an element
- add(K) - Method in class com.hankcs.hanlp.classification.collections.FrequencyMap
-
Increments the frequency of a word
- add(Map<String, String[]>) - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- add(String, String) - Method in class com.hankcs.hanlp.classification.corpus.FileDataSet
-
- add(String, String) - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Adds a document to the training set
- add(Map<String, String[]>) - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
- add(String, String) - Method in class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- add(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- add(SString) - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- add(String) - Method in class com.hankcs.hanlp.collection.set.UnEmptyStringSet
-
- add(String, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Adds in non-overwrite mode
- add(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Adds in non-overwrite mode; the value defaults to the current size of the collection
- add(String) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonSuffixExtractor
-
- add(IWord) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Adds a word to the dictionary
- add(String, String) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- add(Item) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Inserts an entry
- add(String) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- add(String, V) - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
Adds a word to the dictionary
- add(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
Adds a word
- add(String) - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
- add(String) - Method in class com.hankcs.hanlp.corpus.occurrence.TermOccurrence
-
- add(String, String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
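Inserts a new word into the custom dictionary (non-overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.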
- add(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
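Inserts a new word into the custom dictionary (non-overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.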
- add(String, String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
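Inserts a new word into the custom dictionary (non-overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.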
- add(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
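Inserts a new word into the custom dictionary (non-overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.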
- add(String) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Adds a stop word to the stop-word dictionary
- add(String) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- add(List<Term>) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- add(TermFrequency) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- add(Object, List<Term>) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- add(List<Term>) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- add(Object, String) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
Adds a document
- add(String) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
Adds a document and assigns an id automatically
- add(Vector) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- add(Node, Node) - Method in class com.hankcs.hanlp.model.crf.crfpp.Path
-
- add(String[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- add(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- add(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- add(String[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- add(String) - Method in class com.hankcs.hanlp.model.perceptron.common.FrequencyMap
-
- add(String) - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- add(int, char[]...) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- add(int, Collection<char[]>) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- add(int, Vertex) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Adds a vertex
- add(int, List<AtomNode>) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Adds vertices built from atomic segmentation nodes
- add(Pipe<List<IWord>, List<IWord>>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- add(int, Pipe<List<IWord>, List<IWord>>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- add(Pipe<M, M>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- add(int, Pipe<M, M>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- addAction(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- addAll(Collection<E>) - Method in class com.hankcs.hanlp.algorithm.MaxHeap
-
Adds multiple elements
- addAll(Collection<? extends String>) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- addAll(List<Item>) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Inserts all entries
- addAll(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
- addAll(String[]) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
- addAll(String[]) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- addAll(List<Term>) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- addAll(String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- addAll(List<String>) - Method in class com.hankcs.hanlp.corpus.occurrence.TermOccurrence
-
- addAll(Collection<? extends TermFrequency>) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- addAll(List<Vertex>) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Adds vertices fully automatically
- addAll(Collection<? extends Pipe<List<IWord>, List<IWord>>>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- addAll(int, Collection<? extends Pipe<List<IWord>, List<IWord>>>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- addAll(Collection<? extends Pipe<M, M>>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- addAll(int, Collection<? extends Pipe<M, M>>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- addAllKeyword(Collection<String>) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
- addAllNotCombine(List<Item>) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Inserts new entries without merging
- addArc(int, int, int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- addAvailableBeamElements(ArrayList<BeamElement>, float, boolean, boolean, boolean, boolean, Object[], AveragedPerceptron, boolean, int, ArrayList<Integer>) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.PartialTreeBeamScorerThread
-
- addCategory(String) - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- addChild(BaseNode) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Adds a child node
- addChild(BaseNode) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- addChild(BaseNode) - Method in class com.hankcs.hanlp.collection.trie.bintrie.Node
-
- addDefaultNERLabels() - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- addDocument(K, String) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Adds a document
- addDocument(K, List<String>) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Adds a document
- addDocument(int, String) - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
Adds a document
- addEmit(String) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Adds a matched pattern string (this state corresponds to that pattern)
- addEmit(Collection<String>) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Adds several matched pattern strings
- addEmit(int) - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Adds a matched pattern string (this state corresponds to that pattern)
- addEmit(Collection<Integer>) - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Adds several matched pattern strings
- addFeature(CharSequence, List<Integer>, FeatureMap) - Static method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
- addFeature(String, FeatureMap, List<Integer>) - Static method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Inserts a feature into the feature vector
- addFeatureThenClear(StringBuilder, List<Integer>, FeatureMap) - Static method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
Adds a feature and clears the buffer
- addFirst(Pipe<M, M>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
Adds a pipe with the highest priority
- addKeyword(String) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
- addLabel(E) - Method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- addLabel(E, Integer) - Method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- addLabel(String) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- addLabel(String, Integer) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
Adds a label and its frequency
- addLast(Pipe<M, M>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
Adds a pipe with the lowest priority
- addNERLabels(String) - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- addNotCombine(Item) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Inserts an entry without merging; the entry is ignored if it already exists
- addOutgoingTransition(char, boolean) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Creates a new transition target.
Creates an outgoing transition labeled with a
given char that has a new node as its target.
- addOutgoingTransition(char, MDAGNode) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Creates an edge (starting from this node)
- addPair(IWord, IWord) - Method in class com.hankcs.hanlp.corpus.dictionary.NGramDictionaryMaker
-
- addPair(String, String) - Method in class com.hankcs.hanlp.corpus.dictionary.TMDictionaryMaker
-
Adds a transition example; the statistics are computed internally
- addPair(String, String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Adds a co-occurrence
- addScore(float) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- addSentence(String) - Method in interface com.hankcs.hanlp.suggest.ISuggester
-
- addSentence(String) - Method in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
- addSentence(String) - Method in interface com.hankcs.hanlp.suggest.scorer.IScorer
-
Adds a candidate sentence
- addSentence(String) - Method in class com.hankcs.hanlp.suggest.Suggester
-
- addState(Character) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
- addState(Character) - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- addString(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Adds a string to the MDAG.
- addStrings(Collection<String>) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Adds a Collection of Strings to the MDAG.
- addTerm(String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Counts word frequency
- addToDictionary(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
Adds to the dictionary; subclasses may customize filtering and similar steps, which keeps this flexible
- addToDictionary(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NatureDictionaryMaker
-
- addToDictionary(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NRDictionaryMaker
-
- addToDictionary(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NSDictionaryMaker
-
- addToDictionary(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NTDictionaryMaker
-
- addToOverlaps(Intervalable, List<Intervalable>, List<Intervalable>) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalNode
-
Adds to the list of overlapping intervals
- addToSelf(Vector) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- addTria(String, String, String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- addWord(String) - Method in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- addWordToVocab(String) - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
Adds a word to the vocabulary
- ag - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Adjectival morpheme
- aggregation - Variable in class com.hankcs.hanlp.mining.word.WordInfo
-
Mutual information
- AhoCorasickDoubleArrayTrie<V> - Class in com.hankcs.hanlp.collection.AhoCorasick
-
Aho-Corasick automaton based on a double-array trie
- AhoCorasickDoubleArrayTrie() - Constructor for class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
- AhoCorasickDoubleArrayTrie(TreeMap<String, V>) - Constructor for class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Creates the automaton from a dictionary
- AhoCorasickDoubleArrayTrie.Hit<V> - Class in com.hankcs.hanlp.collection.AhoCorasick
-
A hit result
- AhoCorasickDoubleArrayTrie.IHit<V> - Interface in com.hankcs.hanlp.collection.AhoCorasick
-
Handler invoked when a pattern string is hit
- AhoCorasickDoubleArrayTrie.IHitFull<V> - Interface in com.hankcs.hanlp.collection.AhoCorasick
-
- AhoCorasickDoubleArrayTrieSegment - Class in com.hankcs.hanlp.seg.Other
-
Longest-match segmenter implemented with AhoCorasickDoubleArrayTrie.
The user must call setTrie() to supply an AhoCorasickDoubleArrayTrie.
- AhoCorasickDoubleArrayTrieSegment() - Constructor for class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- AhoCorasickDoubleArrayTrieSegment(TreeMap<String, CoreDictionary.Attribute>) - Constructor for class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- AhoCorasickDoubleArrayTrieSegment(AhoCorasickDoubleArrayTrie<CoreDictionary.Attribute>) - Constructor for class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- AhoCorasickDoubleArrayTrieSegment(String...) - Constructor for class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
Loads your own dictionaries and constructs the segmenter
- al - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Adjectival idiom
- algorithm - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- all() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
Frequencies of all words
- allLabels() - Method in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
- allLabels() - Method in class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- allTags() - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
Gets all tags and their indices
- allTf() - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- alpha - Variable in class com.hankcs.hanlp.mining.word2vec.Config
-
- alpha - Variable in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- alpha(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- alpha(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- alpha_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- Alphabet - Class in com.hankcs.hanlp.dependency.nnparser
-
Bidirectional map between int and String
- Alphabet() - Constructor for class com.hankcs.hanlp.dependency.nnparser.Alphabet
-
- alphaFloat_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- an - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Nominal adjective (an adjective that functions as a noun)
- analogy(String, String, String) - Method in class com.hankcs.hanlp.mining.word2vec.WordVectorModel
-
Returns the words most similar to A - B + C; for example, 中国 - 北京 + 东京 = 日本 (China - Beijing + Tokyo = Japan). Pass the arguments in the order 中国, 北京, 东京.
- analogy(String, String, String, int) - Method in class com.hankcs.hanlp.mining.word2vec.WordVectorModel
-
Returns the words most similar to A - B + C; for example, 中国 - 北京 + 东京 = 日本 (China - Beijing + Tokyo = Japan). Pass the arguments in the order 中国, 北京, 东京.
- analyze(String) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- analyze(String) - Method in interface com.hankcs.hanlp.tokenizer.lexical.LexicalAnalyzer
-
Performs lexical analysis on a sentence
- analyze(String) - Static method in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
Lexical analysis
- analyze(String) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- ANALYZER - Static variable in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
Preset segmenter
- analyzer - Variable in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipe
-
The lexical analyzer being delegated to
- answer(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- answer(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- append(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
Appends an element to the end of the array
- apply(List<Term>) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Applies filtering to the segmentation result
- applyRule(String, int, TaggerImpl) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- ArcEager - Class in com.hankcs.hanlp.dependency.perceptron.transition.parser
-
- ArcEagerBeamTrainer - Class in com.hankcs.hanlp.dependency.perceptron.transition.trainer
-
- ArcEagerBeamTrainer(String, AveragedPerceptron, Options, ArrayList<Integer>, int, IndexMaps) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.trainer.ArcEagerBeamTrainer
-
- arcs - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
Keeps dependent->head information
- argPos(String, String[]) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractTrainer
-
- argPos(String, String[], boolean) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractTrainer
-
- Args - Class in com.hankcs.hanlp.model.perceptron.cli
-
- Args() - Constructor for class com.hankcs.hanlp.model.perceptron.cli.Args
-
- Args.ValueCreator - Interface in com.hankcs.hanlp.model.perceptron.cli
-
- Argument - Annotation Type in com.hankcs.hanlp.model.perceptron.cli
-
- ArrayCompare - Class in com.hankcs.hanlp.algorithm
-
Compares two arrays
- ArrayCompare() - Constructor for class com.hankcs.hanlp.algorithm.ArrayCompare
-
- ArrayDistance - Class in com.hankcs.hanlp.algorithm
-
Finds the two closest numbers across two sets
- ArrayDistance() - Constructor for class com.hankcs.hanlp.algorithm.ArrayDistance
-
- arrayLeftDivide(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element left division, C = A.\B
- arrayLeftDivideEquals(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element left division in place, A = A.\B
- arrayRightDivide(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element right division, C = A./B
- arrayRightDivideEquals(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element right division in place, A = A./B
- arrayTimes(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element multiplication, C = A.*B
- arrayTimesEquals(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Element-by-element multiplication in place, A = A.*B
- ArrayTool - Class in com.hankcs.hanlp.collection.trie.bintrie.util
-
- ArrayTool() - Constructor for class com.hankcs.hanlp.collection.trie.bintrie.util.ArrayTool
-
- AtomNode - Class in com.hankcs.hanlp.seg.NShort.Path
-
Atomic segmentation node
- AtomNode(String, int) - Constructor for class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
- AtomNode(char, int) - Constructor for class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
- atomSegment(char[]) - Static method in class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- atomSegment(char[], int, int) - Static method in class com.hankcs.hanlp.seg.Segment
-
Atomic segmentation
- atomSegmentToTable(char[]) - Static method in class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- Attribute(int) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- Attribute(Nature[], int[]) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- Attribute(Nature, int) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- Attribute(Nature) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
Constructs with a single part of speech and a default frequency of 1000
- Attribute(int) - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- Attribute(Nature[], int[]) - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- Attribute(Nature, int) - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- Attribute(Nature[], int[], int) - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- Attribute(Nature) - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
Constructs with a single part of speech and a default frequency of 1000
- ATTRIBUTE - Static variable in class com.hankcs.hanlp.dictionary.nr.NRConstant
-
Attribute of the words this dictionary focuses on
- ATTRIBUTE - Static variable in class com.hankcs.hanlp.dictionary.nr.PersonDictionary
-
- attribute - Variable in class com.hankcs.hanlp.seg.common.Vertex
-
The word's attribute. Modify its internal data with caution, because doing so affects the dictionary.
To change it, create a new Attribute instead.
- average(double[], int[], int) - Method in class com.hankcs.hanlp.model.perceptron.model.AveragedPerceptron
-
- average(double[]) - Static method in class com.hankcs.hanlp.utility.MathUtility
-
- average_accuracy - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Average accuracy
- average_f1 - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Average F1
- average_precision - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Average precision
- average_recall - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Average recall
- AveragedPerceptron - Class in com.hankcs.hanlp.dependency.perceptron.learning
-
- AveragedPerceptron(int, int) - Constructor for class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- AveragedPerceptron(ParserModel) - Constructor for class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- AveragedPerceptron - Class in com.hankcs.hanlp.model.perceptron.model
-
Linear model learned by the averaged perceptron algorithm
- AveragedPerceptron(FeatureMap, float[]) - Constructor for class com.hankcs.hanlp.model.perceptron.model.AveragedPerceptron
-
- AveragedPerceptron(FeatureMap) - Constructor for class com.hankcs.hanlp.model.perceptron.model.AveragedPerceptron
-
- c - Variable in class com.hankcs.hanlp.classification.models.NaiveBayesModel
-
Number of categories
- c - Variable in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
The character represented by this node
- c - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Conjunction
- c - Variable in class com.hankcs.hanlp.dictionary.BaseSearcher
-
The chars of the text to be segmented
- CacheCorpus - Class in com.hankcs.hanlp.mining.word2vec
-
Loads the corpus from the disk cache
- CacheCorpus(Corpus) - Constructor for class com.hankcs.hanlp.mining.word2vec.CacheCorpus
-
- cacheFile - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- calcAlpha() - Method in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- calcBeta() - Method in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- calcCost(Node) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
Computes the cost of the state feature functions
- calcCost(Path) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
Computes the cost of the transition feature functions
- calcExpectation(double[], double, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Node
-
Computes the node expectation
- calcExpectation(double[], double, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Path
-
Computes the edge expectation
- calculateWeight(Vertex, Vertex) - Static method in class com.hankcs.hanlp.utility.MathUtility
-
Cost of going from one word to another
- call() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.BeamScorerThread
-
- call() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ParseTaggedThread
-
- call() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ParseThread
-
- call() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.PartialTreeBeamScorerThread
-
- call() - Method in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- canDo(Action, State) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
- CanGetNext() - Method in class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
Whether there is still a next element available to read
- catalog - Variable in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- Catalog - Class in com.hankcs.hanlp.classification.corpus
-
Mapping between category names and ids
- Catalog() - Constructor for class com.hankcs.hanlp.classification.corpus.Catalog
-
- Catalog(String[]) - Constructor for class com.hankcs.hanlp.classification.corpus.Catalog
-
- catalog - Variable in class com.hankcs.hanlp.classification.models.AbstractModel
-
Category table
- catalog - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Category names
- categorize(Document) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Predicts the category
- categorize(Document) - Method in class com.hankcs.hanlp.classification.classifiers.NaiveBayesClassifier
-
- category - Variable in class com.hankcs.hanlp.classification.corpus.Document
-
The category the document belongs to
- categoryCounts - Variable in class com.hankcs.hanlp.classification.features.BaseFeatureData
-
Number of documents in each category
- cbow - Variable in class com.hankcs.hanlp.mining.word2vec.Config
-
- cc - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Coordinating conjunction
- changeFeatureWeight(HashMap<Object, CompactArray>, HashMap<Object, CompactArray>, Object, int, float, int) - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- changePunc(String) - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- changeWeight(Action, int, Object, int, float) - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- CharacterBasedGenerativeModel - Class in com.hankcs.hanlp.model.trigram
-
Character-based generative model (in effect a trigram model, also known as a second-order hidden Markov model)
- CharacterBasedGenerativeModel() - Constructor for class com.hankcs.hanlp.model.trigram.CharacterBasedGenerativeModel
-
- CharacterBasedSegment - Class in com.hankcs.hanlp.seg
-
Base class for segmenters that use the character-based word formation approach (building words from characters)
- CharacterBasedSegment() - Constructor for class com.hankcs.hanlp.seg.CharacterBasedSegment
-
- CharacterHelper - Class in com.hankcs.hanlp.utility
-
Helper utility class for character set recognition
- CharacterHelper() - Constructor for class com.hankcs.hanlp.utility.CharacterHelper
-
- CharacterMapping - Interface in com.hankcs.hanlp.collection.trie.datrie
-
Character mapping interface
- charArray - Variable in class com.hankcs.hanlp.seg.common.WordNet
-
The array corresponding to the original sentence
- CharArray - Class in com.hankcs.hanlp.suggest.scorer.editdistance
-
Wrapper around a char array that can be used in place of String
- CharArray(char[]) - Constructor for class com.hankcs.hanlp.suggest.scorer.editdistance.CharArray
-
- charAt(int) - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- CharTable - Class in com.hankcs.hanlp.dictionary.other
-
Character normalization table
- CharTable() - Constructor for class com.hankcs.hanlp.dictionary.other.CharTable
-
- CharTablePath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Character normalization table (full-width to half-width, Traditional to Simplified Chinese)
- charToBytes(char) - Static method in class com.hankcs.hanlp.utility.ByteUtil
-
Converts a char into a byte array (2 bytes); b[0] stores the high-order byte (big-endian)
- charTreeSet - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Alphabet
- CharType - Class in com.hankcs.hanlp.dictionary.other
-
Character type
- CharType() - Constructor for class com.hankcs.hanlp.dictionary.other.CharType
-
- charType(char) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
- charType(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines the character type
- CharTypePath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Character type mapping table
- check - Variable in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
The check array of the double-array trie
- check - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- checkForOverlaps(Intervalable, IntervalNode.Direction) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalNode
-
Finds overlaps
- checkForOverlapsToTheLeft(Intervalable) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalNode
-
Finds overlaps to the left
- checkForOverlapsToTheRight(Intervalable) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalNode
-
Finds overlaps to the right
- checkMaxXsize_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- chi_square(BaseFeatureData) - Method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
Performs feature selection using the non-parametric chi-square test
https://nlp.stanford.edu/IR-book/html/htmledition/feature-selectionchi2-feature-selection-1.html
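As a rough illustration of chi-square feature selection, the sketch below computes the statistic for a 2x2 term/class contingency table using the standard closed form. It is an assumption-laden example, not the body of chi_square(BaseFeatureData): the class ChiSquareSketch, its methods, and the count parameters are hypothetical. The threshold 10.83 is the chi-square critical value for one degree of freedom at p = 0.001.

```java
// Illustrative sketch only; not the ChiSquareFeatureExtractor implementation.
final class ChiSquareSketch {
    /** Chi-square critical value for 1 degree of freedom at p = 0.001. */
    static final double CRITICAL_VALUE = 10.83;

    /**
     * Chi-square statistic for a 2x2 contingency table of document counts.
     * n11: in class, contains term      n10: not in class, contains term
     * n01: in class, lacks term         n00: not in class, lacks term
     */
    static double chiSquare(double n11, double n10, double n01, double n00) {
        double n = n11 + n10 + n01 + n00;
        double diff = n11 * n00 - n10 * n01;
        double denom = (n11 + n01) * (n11 + n10) * (n10 + n00) * (n01 + n00);
        // An empty row or column carries no evidence of dependence.
        return denom == 0 ? 0 : n * diff * diff / denom;
    }

    /** Keeps a feature when independence is rejected at the chosen level. */
    static boolean isSelected(double chiSquare) {
        return chiSquare >= CRITICAL_VALUE;
    }

    public static void main(String[] args) {
        double score = chiSquare(49, 27652, 141, 774106);
        System.out.println(score + " selected=" + isSelected(score));
    }
}
```

A term distributed identically inside and outside the class scores 0, while a term that occurs only inside the class scores the full document count, so a larger value means stronger dependence between term and class.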
- child - Variable in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Child nodes
- CHINESE_NUMBERS - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- ChisquareCdf(double, int) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
Given a chi-square value and the degrees of freedom, computes the cumulative distribution function value from 0 to x
- chisquareCriticalValue - Variable in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
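Chi-square critical value at a p-value (probability of a Type I error, i.e. rejecting a true hypothesis) of 0.001; used by the feature selection algorithm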
- ChiSquareFeatureExtractor - Class in com.hankcs.hanlp.classification.features
-
Feature extraction; generates FeatureStats and selects the best features using the chi-square test
- ChiSquareFeatureExtractor() - Constructor for class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
- ChisquareInverseCdf(double, int) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
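Given a p-value of the chi-square distribution and the degrees of freedom, returns the chi-square value. Implemented internally with binary search; ported from the JavaScript code: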
http://www.fourmilab.ch/rpkp/experiments/analysis/chiCalc.js
- cint(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Converts a string that represents a number into an integer
- classifier - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.parser.TransitionBasedParser
-
Any kind of classifier that can give us scores
- classify(String) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
Predicts the category using a trained classifier
- classify(Document) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- classify(String) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Predicts the most likely category
- classify(Document) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Predicts the most likely category
- clean() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Clears the from attribute
- clear() - Method in class com.hankcs.hanlp.classification.corpus.FileDataSet
-
- clear() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Clears the data set
- clear() - Method in class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- clear() - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- clear() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- clear() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- clear() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- clear() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.LbfgsOptimizer
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.Path
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- clear() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- clear() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Clears the word graph
- clear() - Method in class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
Removes all elements
- clear() - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- clear() - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- clear() - Static method in class com.hankcs.hanlp.utility.GlobalObjectPool
-
Clears the global variables
- clearNodes() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- clearStoredHashCode() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Clears this node's stored hash value
- clone() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Clones a state.
Creates an MDAGNode possessing the same accept state status and outgoing transitions as this node.
- clone(MDAGNode, char) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Clones a state.
soleParentNode, which previously transitioned to this state, now transitions to the cloned state.
Creates an MDAGNode possessing the same accept state status and transition set
(incoming & outgoing) as this node.
- clone() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Clone the Matrix object.
- clone() - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Edge
-
- clone() - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- clone() - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Pair
-
- clone() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- clone() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- close() - Method in class com.hankcs.hanlp.corpus.io.ByteArray
-
Signals that the close/destroy operation should be performed
- close() - Method in class com.hankcs.hanlp.corpus.io.ByteArrayFileStream
-
- close() - Method in class com.hankcs.hanlp.corpus.io.ByteArrayOtherStream
-
- close() - Method in class com.hankcs.hanlp.corpus.io.IOUtil.LineIterator
-
- close() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
Closes the corpus; it can no longer be read afterwards.
- close() - Method in class com.hankcs.hanlp.model.crf.crfpp.Model
-
- close() - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- close() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- Cluster<K> - Class in com.hankcs.hanlp.mining.cluster
-
- Cluster() - Constructor for class com.hankcs.hanlp.mining.cluster.Cluster
-
- Cluster(List<Document<K>>) - Constructor for class com.hankcs.hanlp.mining.cluster.Cluster
-
- ClusterAnalyzer<K> - Class in com.hankcs.hanlp.mining.cluster
-
Text clustering
- ClusterAnalyzer() - Constructor for class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
- clusterFile - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- clusterId(String) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
Gets the cluster id
- clustering() - Method in class com.hankcs.hanlp.mining.word2vec.KMeansClustering
-
- col(int) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Extracts column j as a column vector
- CollectionUtility - Class in com.hankcs.hanlp.classification.utilities
-
- CollectionUtility() - Constructor for class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
- collins(List<Double>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- cols() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
- com.hankcs.hanlp - package com.hankcs.hanlp
-
- com.hankcs.hanlp.algorithm - package com.hankcs.hanlp.algorithm
-
- com.hankcs.hanlp.algorithm.ahocorasick.interval - package com.hankcs.hanlp.algorithm.ahocorasick.interval
-
- com.hankcs.hanlp.algorithm.ahocorasick.trie - package com.hankcs.hanlp.algorithm.ahocorasick.trie
-
- com.hankcs.hanlp.classification.classifiers - package com.hankcs.hanlp.classification.classifiers
-
- com.hankcs.hanlp.classification.collections - package com.hankcs.hanlp.classification.collections
-
- com.hankcs.hanlp.classification.corpus - package com.hankcs.hanlp.classification.corpus
-
- com.hankcs.hanlp.classification.features - package com.hankcs.hanlp.classification.features
-
- com.hankcs.hanlp.classification.models - package com.hankcs.hanlp.classification.models
-
- com.hankcs.hanlp.classification.statistics - package com.hankcs.hanlp.classification.statistics
-
- com.hankcs.hanlp.classification.statistics.evaluations - package com.hankcs.hanlp.classification.statistics.evaluations
-
- com.hankcs.hanlp.classification.tokenizers - package com.hankcs.hanlp.classification.tokenizers
-
- com.hankcs.hanlp.classification.utilities - package com.hankcs.hanlp.classification.utilities
-
- com.hankcs.hanlp.classification.utilities.io - package com.hankcs.hanlp.classification.utilities.io
-
- com.hankcs.hanlp.collection.AhoCorasick - package com.hankcs.hanlp.collection.AhoCorasick
-
- com.hankcs.hanlp.collection.dartsclone - package com.hankcs.hanlp.collection.dartsclone
-
- com.hankcs.hanlp.collection.dartsclone.details - package com.hankcs.hanlp.collection.dartsclone.details
-
- com.hankcs.hanlp.collection.MDAG - package com.hankcs.hanlp.collection.MDAG
-
- com.hankcs.hanlp.collection.sequence - package com.hankcs.hanlp.collection.sequence
-
- com.hankcs.hanlp.collection.set - package com.hankcs.hanlp.collection.set
-
- com.hankcs.hanlp.collection.trie - package com.hankcs.hanlp.collection.trie
-
- com.hankcs.hanlp.collection.trie.bintrie - package com.hankcs.hanlp.collection.trie.bintrie
-
- com.hankcs.hanlp.collection.trie.bintrie.util - package com.hankcs.hanlp.collection.trie.bintrie.util
-
- com.hankcs.hanlp.collection.trie.datrie - package com.hankcs.hanlp.collection.trie.datrie
-
A mutable double-array trie that can be used as a Map. If V is int, MutableDoubleArrayTrieInteger can be used directly.
- com.hankcs.hanlp.corpus.dependency.CoNll - package com.hankcs.hanlp.corpus.dependency.CoNll
-
- com.hankcs.hanlp.corpus.dependency.model - package com.hankcs.hanlp.corpus.dependency.model
-
- com.hankcs.hanlp.corpus.dictionary - package com.hankcs.hanlp.corpus.dictionary
-
- com.hankcs.hanlp.corpus.dictionary.item - package com.hankcs.hanlp.corpus.dictionary.item
-
- com.hankcs.hanlp.corpus.document - package com.hankcs.hanlp.corpus.document
-
- com.hankcs.hanlp.corpus.document.sentence - package com.hankcs.hanlp.corpus.document.sentence
-
- com.hankcs.hanlp.corpus.document.sentence.word - package com.hankcs.hanlp.corpus.document.sentence.word
-
- com.hankcs.hanlp.corpus.io - package com.hankcs.hanlp.corpus.io
-
- com.hankcs.hanlp.corpus.occurrence - package com.hankcs.hanlp.corpus.occurrence
-
- com.hankcs.hanlp.corpus.synonym - package com.hankcs.hanlp.corpus.synonym
-
- com.hankcs.hanlp.corpus.tag - package com.hankcs.hanlp.corpus.tag
-
- com.hankcs.hanlp.corpus.util - package com.hankcs.hanlp.corpus.util
-
- com.hankcs.hanlp.dependency - package com.hankcs.hanlp.dependency
-
- com.hankcs.hanlp.dependency.common - package com.hankcs.hanlp.dependency.common
-
- com.hankcs.hanlp.dependency.nnparser - package com.hankcs.hanlp.dependency.nnparser
-
- com.hankcs.hanlp.dependency.nnparser.action - package com.hankcs.hanlp.dependency.nnparser.action
-
- com.hankcs.hanlp.dependency.nnparser.option - package com.hankcs.hanlp.dependency.nnparser.option
-
- com.hankcs.hanlp.dependency.nnparser.util - package com.hankcs.hanlp.dependency.nnparser.util
-
- com.hankcs.hanlp.dependency.perceptron - package com.hankcs.hanlp.dependency.perceptron
-
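This package wraps and optimizes Yara Parser. The main improvements are:
- Code refactoring to increase reuse (because the dynamic oracle needs to create features incrementally and dynamically during training,
HanLP's perceptron framework cannot be reused; this is one of the reasons for wrapping this module directly rather than reimplementing it).
- Interface adjustments and integration with the lexical analyzer
- Debugging
- Documentation comments
The copyright and license information for Yara Parser is as follows: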
© Copyright 2014-2015, Yahoo! Inc.
- com.hankcs.hanlp.dependency.perceptron.accessories - package com.hankcs.hanlp.dependency.perceptron.accessories
-
- com.hankcs.hanlp.dependency.perceptron.learning - package com.hankcs.hanlp.dependency.perceptron.learning
-
- com.hankcs.hanlp.dependency.perceptron.parser - package com.hankcs.hanlp.dependency.perceptron.parser
-
- com.hankcs.hanlp.dependency.perceptron.structures - package com.hankcs.hanlp.dependency.perceptron.structures
-
- com.hankcs.hanlp.dependency.perceptron.transition.configuration - package com.hankcs.hanlp.dependency.perceptron.transition.configuration
-
- com.hankcs.hanlp.dependency.perceptron.transition.features - package com.hankcs.hanlp.dependency.perceptron.transition.features
-
- com.hankcs.hanlp.dependency.perceptron.transition.parser - package com.hankcs.hanlp.dependency.perceptron.transition.parser
-
- com.hankcs.hanlp.dependency.perceptron.transition.trainer - package com.hankcs.hanlp.dependency.perceptron.transition.trainer
-
- com.hankcs.hanlp.dictionary - package com.hankcs.hanlp.dictionary
-
- com.hankcs.hanlp.dictionary.common - package com.hankcs.hanlp.dictionary.common
-
- com.hankcs.hanlp.dictionary.nr - package com.hankcs.hanlp.dictionary.nr
-
- com.hankcs.hanlp.dictionary.ns - package com.hankcs.hanlp.dictionary.ns
-
- com.hankcs.hanlp.dictionary.nt - package com.hankcs.hanlp.dictionary.nt
-
- com.hankcs.hanlp.dictionary.other - package com.hankcs.hanlp.dictionary.other
-
- com.hankcs.hanlp.dictionary.py - package com.hankcs.hanlp.dictionary.py
-
- com.hankcs.hanlp.dictionary.stopword - package com.hankcs.hanlp.dictionary.stopword
-
- com.hankcs.hanlp.dictionary.ts - package com.hankcs.hanlp.dictionary.ts
-
- com.hankcs.hanlp.mining - package com.hankcs.hanlp.mining
-
This package contains text mining tools (mainly unsupervised methods)
- com.hankcs.hanlp.mining.cluster - package com.hankcs.hanlp.mining.cluster
-
Text clustering module (k-means and repeated bisection)
Reference: Steinbach M, Karypis G, Kumar V.
- com.hankcs.hanlp.mining.phrase - package com.hankcs.hanlp.mining.phrase
-
- com.hankcs.hanlp.mining.word - package com.hankcs.hanlp.mining.word
-
- com.hankcs.hanlp.mining.word2vec - package com.hankcs.hanlp.mining.word2vec
-
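A Java port of word2vec that stays as close to the original as possible. Most of the code comes from https://github.com/kojisekig/word2vec-lucene , with some additional performance optimizations.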
- com.hankcs.hanlp.model - package com.hankcs.hanlp.model
-
- com.hankcs.hanlp.model.bigram - package com.hankcs.hanlp.model.bigram
-
- com.hankcs.hanlp.model.crf - package com.hankcs.hanlp.model.crf
-
- com.hankcs.hanlp.model.crf.crfpp - package com.hankcs.hanlp.model.crf.crfpp
-
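This package contains CRF++ as ported by Zhifa Chen.
Some comments, modifications, and debugging were added. The original plan was to port it from scratch, but a port already existed, so no time was spent reinventing the wheel.
For the theory, see 《CRF++代码分析》 ("CRF++ Code Analysis").
This code (including CRF++ and darts-java) is licensed under LGPL & Modified BSD, and the following copyright notice must be included: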
- com.hankcs.hanlp.model.hmm - package com.hankcs.hanlp.model.hmm
-
- com.hankcs.hanlp.model.maxent - package com.hankcs.hanlp.model.maxent
-
- com.hankcs.hanlp.model.perceptron - package com.hankcs.hanlp.model.perceptron
-
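A linear sequence labeling model trained with the perceptron online learning algorithm. A complete suite of word segmentation, part-of-speech tagging, and named entity recognition features is implemented on top of this framework.
For the theory, see Deng Zhilong (邓知龙), 《基于感知器算法的高效中文分词与词性标注系统设计与实现》 ("Design and Implementation of an Efficient Chinese Word Segmentation and Part-of-Speech Tagging System Based on the Perceptron Algorithm").
Introduction: http://www.hankcs.com/nlp/segment/implementation-of-word-segmentation-device-java-based-on-structured-average-perceptron.html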
- com.hankcs.hanlp.model.perceptron.cli - package com.hankcs.hanlp.model.perceptron.cli
-
- com.hankcs.hanlp.model.perceptron.common - package com.hankcs.hanlp.model.perceptron.common
-
- com.hankcs.hanlp.model.perceptron.feature - package com.hankcs.hanlp.model.perceptron.feature
-
- com.hankcs.hanlp.model.perceptron.instance - package com.hankcs.hanlp.model.perceptron.instance
-
- com.hankcs.hanlp.model.perceptron.model - package com.hankcs.hanlp.model.perceptron.model
-
- com.hankcs.hanlp.model.perceptron.tagset - package com.hankcs.hanlp.model.perceptron.tagset
-
- com.hankcs.hanlp.model.perceptron.utility - package com.hankcs.hanlp.model.perceptron.utility
-
- com.hankcs.hanlp.model.trigram - package com.hankcs.hanlp.model.trigram
-
- com.hankcs.hanlp.model.trigram.frequency - package com.hankcs.hanlp.model.trigram.frequency
-
- com.hankcs.hanlp.recognition.nr - package com.hankcs.hanlp.recognition.nr
-
- com.hankcs.hanlp.recognition.ns - package com.hankcs.hanlp.recognition.ns
-
- com.hankcs.hanlp.recognition.nt - package com.hankcs.hanlp.recognition.nt
-
- com.hankcs.hanlp.seg - package com.hankcs.hanlp.seg
-
- com.hankcs.hanlp.seg.common - package com.hankcs.hanlp.seg.common
-
- com.hankcs.hanlp.seg.common.wrapper - package com.hankcs.hanlp.seg.common.wrapper
-
- com.hankcs.hanlp.seg.CRF - package com.hankcs.hanlp.seg.CRF
-
- com.hankcs.hanlp.seg.Dijkstra - package com.hankcs.hanlp.seg.Dijkstra
-
- com.hankcs.hanlp.seg.Dijkstra.Path - package com.hankcs.hanlp.seg.Dijkstra.Path
-
- com.hankcs.hanlp.seg.HMM - package com.hankcs.hanlp.seg.HMM
-
- com.hankcs.hanlp.seg.NShort - package com.hankcs.hanlp.seg.NShort
-
- com.hankcs.hanlp.seg.NShort.Path - package com.hankcs.hanlp.seg.NShort.Path
-
- com.hankcs.hanlp.seg.Other - package com.hankcs.hanlp.seg.Other
-
- com.hankcs.hanlp.seg.Viterbi - package com.hankcs.hanlp.seg.Viterbi
-
- com.hankcs.hanlp.seg.Viterbi.Path - package com.hankcs.hanlp.seg.Viterbi.Path
-
- com.hankcs.hanlp.suggest - package com.hankcs.hanlp.suggest
-
- com.hankcs.hanlp.suggest.scorer - package com.hankcs.hanlp.suggest.scorer
-
- com.hankcs.hanlp.suggest.scorer.editdistance - package com.hankcs.hanlp.suggest.scorer.editdistance
-
- com.hankcs.hanlp.suggest.scorer.lexeme - package com.hankcs.hanlp.suggest.scorer.lexeme
-
- com.hankcs.hanlp.suggest.scorer.pinyin - package com.hankcs.hanlp.suggest.scorer.pinyin
-
- com.hankcs.hanlp.summary - package com.hankcs.hanlp.summary
-
- com.hankcs.hanlp.tokenizer - package com.hankcs.hanlp.tokenizer
-
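Commonly used, preconfigured static tokenizers. Some of them have special preprocessing features
that users can refer to when implementing their own preprocessing logic.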
- com.hankcs.hanlp.tokenizer.lexical - package com.hankcs.hanlp.tokenizer.lexical
-
- com.hankcs.hanlp.tokenizer.pipe - package com.hankcs.hanlp.tokenizer.pipe
-
- com.hankcs.hanlp.utility - package com.hankcs.hanlp.utility
-
- combine(String, String) - Static method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Merges two dictionaries
- combine(String...) - Static method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Merges multiple dictionaries
- combine(SimpleItem) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
Merges two entries; their label maps are merged
- combine(SimpleDictionary<V>) - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
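Merges a secondary dictionary into this (primary) dictionary; entries already present here are not overwritten by the secondary dictionary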
- combine(StringDictionary...) - Static method in class com.hankcs.hanlp.corpus.dictionary.StringDictionaryMaker
-
Merges dictionaries; the first is the primary dictionary
- combine(String...) - Static method in class com.hankcs.hanlp.corpus.dictionary.StringDictionaryMaker
-
- combine(TFDictionary, int, boolean) - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
Merges this (primary) dictionary with a term-frequency dictionary
- combine(String...) - Static method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
Merges multiple dictionaries
- combine(String...) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
- combine(Sentence) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
- combine(List<Word>) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
- combineByCustomDictionary(List<Vertex>) - Method in class com.hankcs.hanlp.seg.Segment
-
Merges the coarse segmentation result using the user dictionary
- combineByCustomDictionary(List<Vertex>, DoubleArrayTrie<CoreDictionary.Attribute>) - Method in class com.hankcs.hanlp.seg.Segment
-
Merges the coarse segmentation result using the user dictionary
- combineByCustomDictionary(List<Vertex>, WordNet) - Method in class com.hankcs.hanlp.seg.Segment
-
Merges the coarse segmentation result using the user dictionary and collects the user words into the full word graph
- combineByCustomDictionary(List<Vertex>, DoubleArrayTrie<CoreDictionary.Attribute>, WordNet) - Method in class com.hankcs.hanlp.seg.Segment
-
Merges the coarse segmentation result using the user dictionary and collects the user words into the full word graph
- combineNER(String[], NERTagSet) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- combineWhenNotInclude(String[]) - Static method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Merges; only adds words from the other dictionaries that are not already in the first one
- combineWithCustomDictionary(List<String>) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
Merges the coarse segmentation result using the user dictionary
- combineWithNormalization(String[]) - Static method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Normalizes every dictionary except the first, then merges them
- commit(int, float, int, Configuration) - Method in interface com.hankcs.hanlp.dependency.perceptron.transition.parser.IAction
-
- commitAction(int, int, float, ArrayList<Integer>, Configuration) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
- CommonAhoCorasickDoubleArrayTrieSegment<V> - Class in com.hankcs.hanlp.seg.Other
-
A wrapper around the longest-match segmenter in CommonAhoCorasickSegmentUtil
- CommonAhoCorasickDoubleArrayTrieSegment(AhoCorasickDoubleArrayTrie<V>) - Constructor for class com.hankcs.hanlp.seg.Other.CommonAhoCorasickDoubleArrayTrieSegment
-
- CommonAhoCorasickDoubleArrayTrieSegment(TreeMap<String, V>) - Constructor for class com.hankcs.hanlp.seg.Other.CommonAhoCorasickDoubleArrayTrieSegment
-
- CommonAhoCorasickSegmentUtil - Class in com.hankcs.hanlp.seg.Other
-
A general-purpose longest-match segmenter implemented with AhoCorasickDoubleArrayTrie
- CommonAhoCorasickSegmentUtil() - Constructor for class com.hankcs.hanlp.seg.Other.CommonAhoCorasickSegmentUtil
-
- CommonDictionary<V> - Class in com.hankcs.hanlp.dictionary.common
-
A general-purpose dictionary for fixed-format dictionary files, with a generic label type
- CommonDictionary() - Constructor for class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
- CommonDictionaryMaker - Class in com.hankcs.hanlp.corpus.dictionary
-
- CommonDictionaryMaker(EasyDictionary) - Constructor for class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
- commonPrefixSearch(String, int, int) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
Prefix search
- commonPrefixSearch(String) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- commonPrefixSearch(byte[], int, int) - Method in class com.hankcs.hanlp.collection.dartsclone.DoubleArray
-
Returns the keys that are prefixes of the given key, together with their corresponding values.
- commonPrefixSearch(String, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Finds all words that coincide with a prefix of the query
- commonPrefixSearch(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- commonPrefixSearch(String, int, int, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Prefix search
- commonPrefixSearch(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Prefix search
- commonPrefixSearch(char[], int) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Prefix search
- commonPrefixSearch(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Prefix search
- commonPrefixSearch(char[], int) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Prefix search
- commonPrefixSearch(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- commonPrefixSearch(String, int, int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- commonPrefixSearchWithValue(char[], int) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
Prefix search
- commonPrefixSearchWithValue(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
Prefix search
- commonPrefixSearchWithValue(String) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Prefix search, including values
- commonPrefixSearchWithValue(char[], int) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Prefix search; representing the string as a character array improves running speed
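To make the idea of a prefix search over a character array concrete, here is a minimal sketch built on a plain HashMap-based trie. It is not BinTrie: the class PrefixTrieSketch and its internals are hypothetical, and only the method name and the (char[], int) parameter shape mirror the entry above. A null value is assumed to mean "no key ends here".

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only; not the BinTrie implementation.
final class PrefixTrieSketch<V> {
    private static final class Node<V> {
        final Map<Character, Node<V>> children = new HashMap<>();
        V value; // non-null only when a stored key ends at this node
    }

    private final Node<V> root = new Node<>();

    /** Stores a key with a non-null value. */
    void put(String key, V value) {
        Node<V> node = root;
        for (int i = 0; i < key.length(); i++) {
            node = node.children.computeIfAbsent(key.charAt(i), c -> new Node<>());
        }
        node.value = value;
    }

    /** Returns every stored key that is a prefix of chars[begin..], shortest first. */
    List<Map.Entry<String, V>> commonPrefixSearchWithValue(char[] chars, int begin) {
        List<Map.Entry<String, V>> result = new ArrayList<>();
        Node<V> node = root;
        for (int i = begin; i < chars.length; i++) {
            node = node.children.get(chars[i]);
            if (node == null) {
                break; // no stored key continues along this path
            }
            if (node.value != null) {
                String key = new String(chars, begin, i - begin + 1);
                result.add(new AbstractMap.SimpleEntry<>(key, node.value));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        PrefixTrieSketch<Integer> trie = new PrefixTrieSketch<>();
        trie.put("ab", 1);
        trie.put("abc", 2);
        System.out.println(trie.commonPrefixSearchWithValue("xabcd".toCharArray(), 1));
    }
}
```

Passing the same char[] with a moving begin offset lets a caller scan a whole sentence without allocating a substring per position, which is the kind of saving the character-array overload hints at.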
- commonPrefixSearchWithValue(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- commonPrefixSearchWithValue(char[], int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Optimized prefix search that can reuse the character array
- CommonStringDictionary - Class in com.hankcs.hanlp.dictionary.common
-
The simplest dictionary: one word per line and nothing else
- CommonStringDictionary() - Constructor for class com.hankcs.hanlp.dictionary.common.CommonStringDictionary
-
- CommonSuffixExtractor - Class in com.hankcs.hanlp.corpus.dictionary
-
Common suffix extraction tool
- CommonSuffixExtractor() - Constructor for class com.hankcs.hanlp.corpus.dictionary.CommonSuffixExtractor
-
- CommonSynonymDictionary - Class in com.hankcs.hanlp.dictionary.common
-
A general-purpose synonym dictionary with no fixed resource location
- CommonSynonymDictionary.SynonymItem - Class in com.hankcs.hanlp.dictionary.common
-
An entry in the dictionary
- CommonSynonymDictionaryEx - Class in com.hankcs.hanlp.dictionary.common
-
A general-purpose synonym dictionary with no fixed resource location
- CommonSynonymDictionaryEx.SynonymItem - Class in com.hankcs.hanlp.dictionary.common
-
An entry in the dictionary
- CompactArray - Class in com.hankcs.hanlp.dependency.perceptron.structures
-
A sparse array in which only one contiguous range is actually allocated in memory
- CompactArray(int, float[]) - Constructor for class com.hankcs.hanlp.dependency.perceptron.structures.CompactArray
-
- CompactTree - Class in com.hankcs.hanlp.dependency.perceptron.transition.configuration
-
- CompactTree(HashMap<Integer, Pair<Integer, String>>, ArrayList<String>) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.CompactTree
-
- comparator() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- compare(Intervalable, Intervalable) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalableComparatorByPosition
-
- compare(Intervalable, Intervalable) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalableComparatorBySize
-
- compare(Long[], Long[]) - Static method in class com.hankcs.hanlp.algorithm.ArrayCompare
-
Compares arrays A and B to determine their ordering
- compare(String, String) - Method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Compares the gold standard with the segmentation result
- compareTo(Object) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- compareTo(SString) - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- compareTo(BaseNode) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
- compareTo(char) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Overload; comparison with a character
- compareTo(BaseNode<V>.TrieEntry) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode.TrieEntry
-
- compareTo(TermFrequency) - Method in class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
- compareTo(State) - Method in class com.hankcs.hanlp.dependency.common.State
-
- compareTo(AbstractInexactAction) - Method in class com.hankcs.hanlp.dependency.nnparser.action.AbstractInexactAction
-
- compareTo(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Pair
-
- compareTo(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- compareTo(BeamElement) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.BeamElement
-
- compareTo(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- compareTo(Cluster<K>) - Method in class com.hankcs.hanlp.mining.cluster.Cluster
-
- compareTo(VocabWord) - Method in class com.hankcs.hanlp.mining.word2vec.VocabWord
-
- compareTo(State) - Method in class com.hankcs.hanlp.seg.Dijkstra.Path.State
-
- compareTo(QueueElement) - Method in class com.hankcs.hanlp.seg.NShort.Path.QueueElement
-
- compareTo(CharArray) - Method in class com.hankcs.hanlp.suggest.scorer.editdistance.CharArray
-
- compareTo(IdVector) - Method in class com.hankcs.hanlp.suggest.scorer.lexeme.IdVector
-
- compareTo(PinyinKey) - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinKey
-
- compile(String, String) - Static method in class com.hankcs.hanlp.corpus.dependency.CoNll.PosTagCompiler
-
Compiles; for example, converts a word whose part of speech is numeral into ##数##
- compile(IWord) - Static method in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
Compiles a word
- compile(Word) - Static method in class com.hankcs.hanlp.corpus.util.Precompiler
-
Compiles a word into an equivalent string
- compile(IWord) - Static method in class com.hankcs.hanlp.corpus.util.Precompiler
-
- compiledWord - Variable in class com.hankcs.hanlp.dependency.common.Node
-
- compileWithoutNS(List<IWord>) - Static method in class com.hankcs.hanlp.corpus.util.Precompiler
-
Precompiles while ignoring ns
- compileWithoutNT(List<IWord>) - Static method in class com.hankcs.hanlp.corpus.util.Precompiler
-
Precompiles while ignoring nt
- CompoundWord - Class in com.hankcs.hanlp.corpus.document.sentence.word
-
A compound word, made up of two or more words
- CompoundWord(List<Word>, String) - Constructor for class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- compress(double) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Model compression
- compress(double, double) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
- compute(Graph) - Static method in class com.hankcs.hanlp.algorithm.Dijkstra
-
- compute(List<CommonSynonymDictionary.SynonymItem>, List<CommonSynonymDictionary.SynonymItem>) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
- compute(long[], long[]) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
- compute(int[], int[]) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
- compute(String, String) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
Edit distance
- compute(char[], char[]) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
Edit distance
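For readers unfamiliar with edit distance, the sketch below shows the textbook Levenshtein recurrence (insertion, deletion, and substitution each cost 1) using two rolling rows. It is an assumption that this matches what EditDistance.compute does; the library may use a different variant or cost scheme. The class name EditDistanceSketch is hypothetical.

```java
// Illustrative sketch only; plain Levenshtein distance, not the EditDistance source.
final class EditDistanceSketch {
    /** Minimum number of single-character insertions, deletions, or substitutions turning a into b. */
    static int compute(char[] a, char[] b) {
        int[] prev = new int[b.length + 1]; // distances for the previous row (prefix of a)
        int[] curr = new int[b.length + 1]; // distances for the row being filled
        for (int j = 0; j <= b.length; j++) {
            prev[j] = j; // turning the empty prefix into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length; i++) {
            curr[0] = i; // turning a[0..i) into the empty string takes i deletions
            for (int j = 1; j <= b.length; j++) {
                int substitution = prev[j - 1] + (a[i - 1] == b[j - 1] ? 0 : 1);
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                curr[j] = Math.min(substitution, Math.min(insertion, deletion));
            }
            int[] tmp = prev; // swap rows instead of allocating a full matrix
            prev = curr;
            curr = tmp;
        }
        return prev[b.length];
    }

    static int compute(String a, String b) {
        return compute(a.toCharArray(), b.toCharArray());
    }

    public static void main(String[] args) {
        System.out.println(compute("kitten", "sitting"));
    }
}
```

Keeping only two rows brings memory down from O(|a| * |b|) to O(|b|) while leaving the result unchanged.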
- compute(char[], char[]) - Static method in class com.hankcs.hanlp.algorithm.LongestCommonSubsequence
-
- compute(String, String) - Static method in class com.hankcs.hanlp.algorithm.LongestCommonSubsequence
-
- compute(char[], char[]) - Static method in class com.hankcs.hanlp.algorithm.LongestCommonSubstring
-
- compute(String, String) - Static method in class com.hankcs.hanlp.algorithm.LongestCommonSubstring
-
- compute(int[], int[], double[], double[][], double[][]) - Static method in class com.hankcs.hanlp.algorithm.Viterbi
-
Solves an HMM; take the logarithm of all probabilities beforehand
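Why the probabilities should be logarithms can be seen in a small sketch of Viterbi decoding: products of many small probabilities underflow, while sums of logarithms do not. The code below is a generic illustration under stated assumptions, not the Viterbi.compute source. The class ViterbiSketch, the method decode, and its parameter layout are hypothetical, and it assumes plain log probabilities that are maximized; the library's sign convention may differ.

```java
// Illustrative sketch only; generic log-space Viterbi, not the HanLP implementation.
final class ViterbiSketch {
    /**
     * Returns the most likely state sequence for the observations.
     * logStart[s]: log P(first state = s)
     * logTrans[p][s]: log P(next state = s | current state = p)
     * logEmit[s][o]: log P(observation = o | state = s)
     */
    static int[] decode(int[] obs, double[] logStart, double[][] logTrans, double[][] logEmit) {
        int n = obs.length;
        int k = logStart.length;
        if (n == 0) {
            return new int[0];
        }
        double[][] score = new double[n][k]; // best log probability of a path ending in state s at time t
        int[][] back = new int[n][k];        // predecessor state on that best path
        for (int s = 0; s < k; s++) {
            score[0][s] = logStart[s] + logEmit[s][obs[0]];
        }
        for (int t = 1; t < n; t++) {
            for (int s = 0; s < k; s++) {
                double best = Double.NEGATIVE_INFINITY;
                int bestPrev = 0;
                for (int p = 0; p < k; p++) {
                    double candidate = score[t - 1][p] + logTrans[p][s]; // sum of logs = product of probabilities
                    if (candidate > best) {
                        best = candidate;
                        bestPrev = p;
                    }
                }
                score[t][s] = best + logEmit[s][obs[t]];
                back[t][s] = bestPrev;
            }
        }
        int last = 0;
        for (int s = 1; s < k; s++) {
            if (score[n - 1][s] > score[n - 1][last]) {
                last = s;
            }
        }
        int[] path = new int[n];
        path[n - 1] = last;
        for (int t = n - 1; t > 0; t--) {
            path[t - 1] = back[t][path[t]]; // follow back-pointers from the end
        }
        return path;
    }

    public static void main(String[] args) {
        double[] start = {Math.log(0.6), Math.log(0.4)};
        double[][] trans = {{Math.log(0.7), Math.log(0.3)}, {Math.log(0.4), Math.log(0.6)}};
        double[][] emit = {{Math.log(0.5), Math.log(0.4), Math.log(0.1)},
                           {Math.log(0.1), Math.log(0.3), Math.log(0.6)}};
        System.out.println(java.util.Arrays.toString(decode(new int[]{0, 1, 2}, start, trans, emit)));
    }
}
```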
- compute(List<Vertex>, TransformMatrix) - Static method in class com.hankcs.hanlp.algorithm.Viterbi
-
Specialized version of the HMM solver
- compute(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
Processes the corpus and prepares the dictionary
- compute() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Runs the computation once all input data has been added
- compute(List<Term>) - Static method in class com.hankcs.hanlp.dependency.MaxEntDependencyParser
-
Deprecated.
Parses the dependency syntax of a sentence
- compute(String) - Static method in class com.hankcs.hanlp.dependency.MaxEntDependencyParser
-
Deprecated.
Parses the dependency syntax of a sentence
- compute(List<Term>) - Static method in class com.hankcs.hanlp.dependency.nnparser.NeuralNetworkDependencyParser
-
Parses the dependency syntax of a sentence
- compute(String) - Static method in class com.hankcs.hanlp.dependency.nnparser.NeuralNetworkDependencyParser
-
Parses the dependency syntax of a sentence
- compute(List<Term>) - Static method in class com.hankcs.hanlp.dependency.WordNatureDependencyParser
-
Parses the dependency syntax of a sentence
- compute(String) - Static method in class com.hankcs.hanlp.dependency.WordNatureDependencyParser
-
Parses the dependency syntax of a sentence
- compute() - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- ComputeAccuracy - Class in com.hankcs.hanlp.mining.word2vec
-
- ComputeAccuracy() - Constructor for class com.hankcs.hanlp.mining.word2vec.ComputeAccuracy
-
- computeAverageDistance(Long[], Long[]) - Static method in class com.hankcs.hanlp.algorithm.ArrayDistance
-
- computeEnum(List<EnumItem<E>>, TransformMatrixDictionary<E>) - Static method in class com.hankcs.hanlp.algorithm.Viterbi
-
Standard Viterbi algorithm; higher precision, slightly lower efficiency
- computeEnumSimply(List<EnumItem<E>>, TransformMatrixDictionary<E>) - Static method in class com.hankcs.hanlp.algorithm.Viterbi
-
A "Viterbi" algorithm that uses only the transition matrix
- computeLeftEntropy(PairFrequency) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Computes the left entropy
- computeMinimumDistance(TreeSet<Long>, TreeSet<Long>) - Static method in class com.hankcs.hanlp.algorithm.ArrayDistance
-
- computeMinimumDistance(Long[], Long[]) - Static method in class com.hankcs.hanlp.algorithm.ArrayDistance
-
- computeMutualInformation(String, String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- computeMutualInformation(PairFrequency) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- computeRightEntropy(PairFrequency) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Computes the right entropy
- computeScore(LinkedList<double[]>, int) - Static method in class com.hankcs.hanlp.model.crf.CRFModel
-
Scores a series of feature functions in combination with a tag
- computeScore(String) - Method in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
- computeScore(String) - Method in interface com.hankcs.hanlp.suggest.scorer.IScorer
-
Computes the score
- computeScoreList(Table, int) - Method in class com.hankcs.hanlp.model.crf.CRFModel
-
Computes the output from the feature functions
- Config() - Constructor for class com.hankcs.hanlp.HanLP.Config
-
- Config - Class in com.hankcs.hanlp.mining.word2vec
-
- Config() - Constructor for class com.hankcs.hanlp.mining.word2vec.Config
-
- config - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- Config - Class in com.hankcs.hanlp.seg
-
Segmenter configuration options
- Config() - Constructor for class com.hankcs.hanlp.seg.Config
-
- config - Variable in class com.hankcs.hanlp.seg.Segment
-
Segmenter configuration
- ConfigOption - Class in com.hankcs.hanlp.dependency.nnparser.option
-
- ConfigOption() - Constructor for class com.hankcs.hanlp.dependency.nnparser.option.ConfigOption
-
- Configuration - Class in com.hankcs.hanlp.dependency.perceptron.transition.configuration
-
Consists of a stack, a buffer, and arcs; additionally records the action history that led to this state and its score
- Configuration(Sentence, boolean) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- Configuration(Sentence) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- confirmNature(Nature) - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Locks the part of speech of the attribute to nature
- confirmNature(Nature, boolean) - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Locks the part of speech of the attribute to nature; this overload reduces performance
- CoNLLFixer - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
Fixes dependency corpus entries that do not have 10 columns
- CoNLLFixer() - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLFixer
-
- CoNllLine - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
A line in a CoNLL corpus
- CoNllLine(String...) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNllLine
-
- CoNLLLoader - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
Loader for dependency corpora in CoNLL format
- CoNLLLoader() - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLLoader
-
- CoNLLReader - Class in com.hankcs.hanlp.dependency.perceptron.accessories
-
- CoNLLReader(String) - Constructor for class com.hankcs.hanlp.dependency.perceptron.accessories.CoNLLReader
-
Initializes the file reader
- CoNLLSentence - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
A sentence in CoNLL format
- CoNLLSentence(List<CoNllLine>) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
Constructs a sentence
- CoNLLSentence(CoNLLWord[]) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
- CoNLLWord - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
- CoNLLWord(int, String, String) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
- CoNLLWord(int, String, String, String) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
- CoNLLWord(CoNllLine) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
- CoNLLWord(CoNllLine[], int) - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
- connect(int, int, double) - Method in class com.hankcs.hanlp.seg.common.Graph
-
Connects two nodes
- ConsoleLogger - Class in com.hankcs.hanlp.classification.utilities.io
-
A logging system that writes to stdout and stderr
- ConsoleLogger() - Constructor for class com.hankcs.hanlp.classification.utilities.io.ConsoleLogger
-
- constructWithCopy(double[][]) - Static method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct a matrix from a copy of a 2-D array.
- contains(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Determines whether a String is present in the MDAG.
- contains(Object) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- contains(String) - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary
-
- contains(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
Whether the key is present
- contains(String) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Whether the word is contained
- contains(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Whether the dictionary contains the word
- contains(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Whether the dictionary contains the word
- contains(String) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
- contains(Object) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- contains(String) - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- contains(Object) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- contains(Object) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- contains(String) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Whether the lexicon includes the word (queries the core dictionary and the user dictionary)
- containsAll(Collection<?>) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- containsAll(Collection<?>) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- containsAll(Collection<?>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- containsAll(Collection<?>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- containsKey(Object) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- containsKey(String) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
Whether the key is contained
- containsKey(String) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- containsKey(Object) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- containsKey(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- containsKey(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- containsKey(String) - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- containsKey(String) - Static method in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
Whether the key is contained
- containsKey(String, int) - Static method in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
Whether the key is contained and the key is at least length long
- containsKey(String) - Static method in class com.hankcs.hanlp.dictionary.nr.TranslatedPersonDictionary
-
Whether the key is contained
- containsKey(String, int) - Static method in class com.hankcs.hanlp.dictionary.nr.TranslatedPersonDictionary
-
Whether the key is contained and the key is at least length long
- containsLabel(E) - Method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- containsLabel(String) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- containsValue(Object) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- containsValue(Object) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- containsWordWithLabel(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Whether the sentence contains a word whose part of speech is label
- Context - Class in com.hankcs.hanlp.dependency.nnparser
-
Context
- Context() - Constructor for class com.hankcs.hanlp.dependency.nnparser.Context
-
- Context - Class in com.hankcs.hanlp.model.maxent
-
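A class that associates parameters with features; it stores the maximum-entropy parameters, and also the model and the empirical distribution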
- Context(int[], double[]) - Constructor for class com.hankcs.hanlp.model.maxent.Context
-
Constructs a new context
- ContinuousDistributions - Class in com.hankcs.hanlp.classification.statistics
-
Computes the probability density function and cumulative distribution function of common continuous distributions
- ContinuousDistributions() - Constructor for class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
- convert(String, String) - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- convert(String, String) - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Converts a document in text form into the internal generic document, using this data set's vocabulary and category table
- convert(List<Term>, boolean) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionaryEx
-
Converts a segmentation result into a synonym list
- CONVERT - Static variable in class com.hankcs.hanlp.dictionary.other.CharTable
-
The mapping table used for normalization
- convert(char) - Static method in class com.hankcs.hanlp.dictionary.other.CharTable
-
Normalizes a single character
- convert(char[]) - Static method in class com.hankcs.hanlp.dictionary.other.CharTable
-
- convert(String) - Static method in class com.hankcs.hanlp.dictionary.other.CharTable
-
- convert(String, char[]) - Static method in class com.hankcs.hanlp.dictionary.other.CharTable
-
- convert(String) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts text to pinyin
- convert(String, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts text to pinyin
- convert(String[]) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- convert(String) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- convert(String, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- convert(String) - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
- convert - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- convert(String, String) - Method in class com.hankcs.hanlp.model.crf.crfpp.DecoderFeatureIndex
-
- convert(String, String) - Method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- convert(String, int) - Static method in class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
- convert(List<Vertex>, boolean) - Static method in class com.hankcs.hanlp.seg.Segment
-
Converts a path into the final result
- convert(List<Vertex>) - Static method in class com.hankcs.hanlp.seg.WordBasedSegment
-
Converts a path into the final result
- convert2Array(String, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
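Converts pinyin text into complete pinyin; supports messy text that mixes Chinese and English. Note that if full pinyin is mixed with input-method initials, accuracy will not be high and the tones will not be accurate either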
- convert2CompatibleList(List<List<Word>>) - Static method in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
Converts a list of Word into a compatible list of IWord
- convert2Document(File) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- convert2DocumentList(String) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- convert2DocumentList(String, boolean) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
Reads all People's Daily-format corpus files in a directory
- convert2Pair(String, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts mixed text to pinyin
- convert2SentenceList(String) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- convert2SimpleSentenceList(String) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- convert2Tone5(Pinyin) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts every tone of the pinyin to tone 5 or to the highest tone
- convert_to_text - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- convertCorpus(Sentence, BufferedWriter) - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- convertCorpus(Sentence, BufferedWriter) - Method in class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- convertCorpus(Sentence, BufferedWriter) - Method in class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- convertCorpus(Sentence, BufferedWriter) - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
- convertCorpus(String, String) - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
- convertFromToneNumber(String) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- convertFromToneNumber(String[]) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- convertId2String(long) - Static method in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- convertId2StringWithIndex(long) - Static method in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- convertIntToTwoChar(int) - Static method in class com.hankcs.hanlp.utility.ByteUtil
-
- convertList2Array(List<Pinyin>) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinUtil
-
Converts a list to an array
- convertPinyinList2TonePinyinList(List<Pinyin>) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinUtil
-
Converts the List<Pinyin> pinyinList to a List<String>, where each String is in tone-mark form
- convertPKUtoCWS(String, String, int, int) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
Converts a People's Daily-format segmentation corpus into a space-delimited corpus
- convertPKUtoPOS(String, String, int, int) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
Converts a People's Daily-format segmentation corpus into a space-delimited corpus
- convertSentenceToNER(Sentence, NERTagSet) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
Converts a sentence into (word, part of speech, NER tag) triples
- convertSingle(String) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts a single syllable to pinyin
- convertString2Id(String) - Static method in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- convertString2IdWithIndex(String, long) - Static method in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- convertString2IdWithIndex(String, int) - Static method in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- convertStringToNature(String, LinkedHashSet<Nature>) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Converts a part of speech given as a string into its Enum form
- convertStringToNature(String) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Converts a part of speech given as a string into its Enum form
- convertToHongKongTraditionalChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalToHongKongChineseDictionary
-
- convertToHongKongTraditionalChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalToHongKongChineseDictionary
-
- convertToneNumber2ToneMark(String) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinUtil
-
Convert tone numbers to tone marks using Unicode
Algorithm for determining location of tone mark
A simple algorithm for determining the vowel on which the tone mark
appears is as follows:
First, look for an "a" or an "e".
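The description above is cut off after its first step. The sketch below illustrates one commonly cited version of the placement rule: an "a" or "e" takes the mark; otherwise the "o" of "ou" takes it; otherwise the last vowel does. Only the first step comes from the text above. The remaining steps, the class name `ToneMarkSketch`, and its method names are assumptions made for illustration and may differ from what `PinyinUtil` actually does. The marked vowels are written as Unicode escapes so that the source stays ASCII-only.

```java
// Hypothetical helper, not part of HanLP: converts "hao3" style syllables to tone-mark form.
final class ToneMarkSketch {
    // Plain vowels in row order: a, e, i, o, u, u-with-diaeresis.
    private static final String VOWELS = "aeiou\u00fc";
    // One row per vowel above; columns are tones 1 to 4.
    private static final String[] MARKED = {
        "\u0101\u00e1\u01ce\u00e0",
        "\u0113\u00e9\u011b\u00e8",
        "\u012b\u00ed\u01d0\u00ec",
        "\u014d\u00f3\u01d2\u00f2",
        "\u016b\u00fa\u01d4\u00f9",
        "\u01d6\u01d8\u01da\u01dc"
    };

    /** Returns the index of the vowel that carries the tone mark, or -1 if there is none. */
    static int markPosition(String syllable) {
        int a = syllable.indexOf('a');
        if (a >= 0) return a;                 // step 1: "a" wins
        int e = syllable.indexOf('e');
        if (e >= 0) return e;                 // step 1: otherwise "e" wins
        int ou = syllable.indexOf("ou");
        if (ou >= 0) return ou;               // assumed step 2: the "o" of "ou"
        for (int i = syllable.length() - 1; i >= 0; i--) {
            if (VOWELS.indexOf(syllable.charAt(i)) >= 0) return i; // assumed step 3: last vowel
        }
        return -1;
    }

    /** Converts a syllable ending in a tone digit 1-5; other input is returned unchanged. */
    static String toToneMark(String numbered) {
        if (numbered.isEmpty()) return numbered;
        char last = numbered.charAt(numbered.length() - 1);
        if (last < '1' || last > '5') return numbered;
        String syllable = numbered.substring(0, numbered.length() - 1);
        int tone = last - '0';
        if (tone == 5) return syllable;       // neutral tone carries no mark
        int pos = markPosition(syllable);
        if (pos < 0) return syllable;
        int row = VOWELS.indexOf(syllable.charAt(pos));
        char marked = MARKED[row].charAt(tone - 1);
        return syllable.substring(0, pos) + marked + syllable.substring(pos + 1);
    }
}
```

Under these assumptions, `toToneMark("hao3")` marks the "a", `toToneMark("gou3")` marks the "o", and `toToneMark("liu2")` marks the final "u".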
- convertToPinyin(String) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
Converts to pinyin
- convertToPinyin(String, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
- convertToPinyinArray(String) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
Converts to pinyin
- convertToPinyinFirstCharString(String, String, boolean) - Static method in class com.hankcs.hanlp.HanLP
-
Converts to pinyin (initial letters)
- convertToPinyinList(String) - Static method in class com.hankcs.hanlp.HanLP
-
Converts to pinyin
- convertToPinyinString(String, String, boolean) - Static method in class com.hankcs.hanlp.HanLP
-
Converts to pinyin
- convertToSequence(Sentence) - Method in class com.hankcs.hanlp.model.hmm.HMMNERecognizer
-
- convertToSequence(Sentence) - Method in class com.hankcs.hanlp.model.hmm.HMMPOSTagger
-
- convertToSequence(Sentence) - Method in class com.hankcs.hanlp.model.hmm.HMMSegmenter
-
- convertToSequence(Sentence) - Method in class com.hankcs.hanlp.model.hmm.HMMTrainer
-
- convertToSimplifiedChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToSimplifiedChineseDictionary
-
- convertToSimplifiedChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToSimplifiedChineseDictionary
-
- convertToSimplifiedChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToSimplifiedChineseDictionary
-
- convertToSimplifiedChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToSimplifiedChineseDictionary
-
- convertToSimplifiedChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalChineseDictionary
-
- convertToSimplifiedChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalChineseDictionary
-
- convertToSimplifiedChinese(String) - Static method in class com.hankcs.hanlp.HanLP
-
Converts Traditional Chinese to Simplified Chinese
- convertToTaiwanChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalToTaiwanChineseDictionary
-
- convertToTaiwanChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TraditionalToTaiwanChineseDictionary
-
- convertToTraditionalChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToTraditionalChineseDictionary
-
- convertToTraditionalChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToTraditionalChineseDictionary
-
- convertToTraditionalChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedChineseDictionary
-
- convertToTraditionalChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedChineseDictionary
-
- convertToTraditionalChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToTraditionalChineseDictionary
-
- convertToTraditionalChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToTraditionalChineseDictionary
-
- convertToTraditionalChinese(String) - Static method in class com.hankcs.hanlp.HanLP
-
Converts Simplified Chinese to Traditional Chinese
- convertToTraditionalHongKongChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedToHongKongChineseDictionary
-
- convertToTraditionalHongKongChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedToHongKongChineseDictionary
-
- convertToTraditionalHongKongChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToHongKongChineseDictionary
-
- convertToTraditionalHongKongChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.TaiwanToHongKongChineseDictionary
-
- convertToTraditionalTaiwanChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToTaiwanChineseDictionary
-
- convertToTraditionalTaiwanChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.HongKongToTaiwanChineseDictionary
-
- convertToTraditionalTaiwanChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedToTaiwanChineseDictionary
-
- convertToTraditionalTaiwanChinese(char[]) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedToTaiwanChineseDictionary
-
- convertTwoCharToInt(char, char) - Static method in class com.hankcs.hanlp.utility.ByteUtil
-
- copy() - Method in class com.hankcs.hanlp.collection.dartsclone.details.DoubleArrayBuilder
-
- copy() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Make a deep copy of a matrix
- copy() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Copies itself
- CoreBiGramTableDictionary - Class in com.hankcs.hanlp.dictionary
-
Bigram continuation dictionary of the core dictionary; stored as integers for high performance
- CoreBiGramTableDictionary() - Constructor for class com.hankcs.hanlp.dictionary.CoreBiGramTableDictionary
-
- CoreDictionary - Class in com.hankcs.hanlp.dictionary
-
Core dictionary implemented with DoubleArrayTrie
- CoreDictionary() - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionary
-
- CoreDictionary.Attribute - Class in com.hankcs.hanlp.dictionary
-
Word attributes in the core dictionary
- CoreDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Core dictionary path
- CoreDictionaryTransformMatrixDictionary - Class in com.hankcs.hanlp.dictionary
-
Part-of-speech transition matrix of the core dictionary
- CoreDictionaryTransformMatrixDictionary() - Constructor for class com.hankcs.hanlp.dictionary.CoreDictionaryTransformMatrixDictionary
-
- CoreDictionaryTransformMatrixDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Path to the part-of-speech transition matrix of the core dictionary
- CoreStopWordDictionary - Class in com.hankcs.hanlp.dictionary.stopword
-
Core stop-word dictionary
- CoreStopWordDictionary() - Constructor for class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
- CoreStopWordDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Stop-word dictionary path
- CoreSynonymDictionary - Class in com.hankcs.hanlp.dictionary
-
Core synonym dictionary
- CoreSynonymDictionary() - Constructor for class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
- CoreSynonymDictionaryDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Synonym dictionary path
- CoreSynonymDictionaryEx - Class in com.hankcs.hanlp.dictionary
-
Core synonym dictionary (uses the semantic id as the value)
- CoreSynonymDictionaryEx() - Constructor for class com.hankcs.hanlp.dictionary.CoreSynonymDictionaryEx
-
- Corpus - Class in com.hankcs.hanlp.mining.word2vec
-
- Corpus(Config) - Constructor for class com.hankcs.hanlp.mining.word2vec.Corpus
-
- Corpus(Corpus) - Constructor for class com.hankcs.hanlp.mining.word2vec.Corpus
-
- corpusLoaded(int, int, int) - Method in interface com.hankcs.hanlp.mining.word2vec.TrainingCallback
-
The corpus has finished loading
- CorpusLoader - Class in com.hankcs.hanlp.corpus.document
-
- CorpusLoader() - Constructor for class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- CorpusLoader.Handler - Interface in com.hankcs.hanlp.corpus.document
-
- CorpusLoader.HandlerThread - Class in com.hankcs.hanlp.corpus.document
-
Multithreaded task
- corpusLoading(float) - Method in interface com.hankcs.hanlp.mining.word2vec.TrainingCallback
-
The corpus is loading
- CorpusUtil - Class in com.hankcs.hanlp.corpus.util
-
- CorpusUtil() - Constructor for class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- cosine(Vector) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
Cosine of the angle between the vectors
- cosineForUnitVector(Vector) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
Cosine of the angle between the vectors
Assumes that both this and other are unit vectors, so the method does not divide by their norms.
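The difference between the two cosine methods can be sketched as follows. This is a minimal illustration on plain `double[]` arrays. The class `CosineSketch` and its helper names are hypothetical and do not reflect the internals of the `Vector` class.

```java
// Hypothetical helper, not part of HanLP: cosine similarity with and without normalization.
final class CosineSketch {
    /** Dot product of two vectors of equal length. */
    static double dot(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    /** Euclidean norm. */
    static double norm(double[] a) {
        return Math.sqrt(dot(a, a));
    }

    /** General case: divide the dot product by both norms. */
    static double cosine(double[] a, double[] b) {
        return dot(a, b) / (norm(a) * norm(b));
    }

    /** Unit-vector case: both norms are 1, so the dot product is already the cosine. */
    static double cosineForUnitVector(double[] a, double[] b) {
        return dot(a, b);
    }

    /** Returns a copy of the vector scaled to unit length. */
    static double[] normalize(double[] a) {
        double n = norm(a);
        double[] unit = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            unit[i] = a[i] / n;
        }
        return unit;
    }
}
```

When vectors are normalized once up front, each comparison skips two square roots and a division, which matters when ranking many candidates against one query vector. The shortcut gives a wrong result if either input is not actually of unit length.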
- cost - Variable in class com.hankcs.hanlp.dependency.common.Edge
-
- cost - Variable in class com.hankcs.hanlp.dependency.common.State
-
- cost - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- cost - Variable in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- cost - Variable in class com.hankcs.hanlp.model.crf.crfpp.Path
-
- cost - Variable in class com.hankcs.hanlp.seg.Dijkstra.Path.State
-
Path cost
- costFactor() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- costFactor_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- count(String, String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Counts the occurrences of keyword in srcText
- CPOSTAG - Variable in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
Part of speech of the current word (coarse-grained)
- CQueue - Class in com.hankcs.hanlp.seg.NShort.Path
-
A priority queue (min-heap) that keeps track of the last accessed position
- CQueue() - Constructor for class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- create(String[]) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.Item
-
- create(String[]) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.Item
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- create(String[]) - Static method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.document.Document
-
- create(File) - Static method in class com.hankcs.hanlp.corpus.document.Document
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Creates a structured sentence from a string in the People's Daily 2014 corpus format
- create(String) - Static method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
Constructs a word from the argument
- create(String) - Static method in class com.hankcs.hanlp.corpus.document.sentence.word.WordFactory
-
Produces the corresponding word from the argument string
- create(String) - Method in class com.hankcs.hanlp.corpus.io.FileIOAdapter
-
- create(String) - Method in interface com.hankcs.hanlp.corpus.io.IIOAdapter
-
Creates a new file for output
- create(String) - Method in class com.hankcs.hanlp.corpus.io.ResourceIOAdapter
-
- create(String, char, String) - Static method in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
Constructs a PairFrequency
- create(String, char, String, String) - Static method in class com.hankcs.hanlp.corpus.occurrence.TriaFrequency
-
Constructs a third-order continuation, forward
- create(String, String, char, String) - Static method in class com.hankcs.hanlp.corpus.occurrence.TriaFrequency
-
Constructs a third-order continuation, backward
- create(String) - Static method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
Constructs a series of synonyms from a string such as "Bh06A32= 番茄 西红柿"
- create(String[]) - Static method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- create(String) - Static method in class com.hankcs.hanlp.corpus.tag.Nature
-
Creates a custom part of speech; if the corresponding part of speech already exists, returns the existing one
- create(int, E) - Static method in class com.hankcs.hanlp.dependency.nnparser.util.std
-
- create(InputStream) - Static method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary
-
- create(InputStream) - Static method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionaryEx
-
- create(String) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- create(ByteArray, Nature[]) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
Loads from a byte stream
- create(String) - Static method in class com.hankcs.hanlp.model.crf.FeatureTemplate
-
- create(String) - Static method in class com.hankcs.hanlp.model.maxent.MaxEntModel
-
Loads from a file and caches the result as a binary file
- create(ByteArray) - Static method in class com.hankcs.hanlp.model.maxent.MaxEntModel
-
Loads quickly from a byte stream
- create(Sentence, FeatureMap) - Static method in class com.hankcs.hanlp.model.perceptron.instance.CWSInstance
-
- create(String, FeatureMap) - Static method in class com.hankcs.hanlp.model.perceptron.instance.POSInstance
-
- create(Sentence, FeatureMap) - Static method in class com.hankcs.hanlp.model.perceptron.instance.POSInstance
-
- createByteArray(String) - Static method in class com.hankcs.hanlp.corpus.io.ByteArray
-
Reads a byte array from a file
- createByteArrayFileStream(String) - Static method in class com.hankcs.hanlp.corpus.io.ByteArrayFileStream
-
- createByteArrayFileStream(FileInputStream) - Static method in class com.hankcs.hanlp.corpus.io.ByteArrayFileStream
-
- createByteArrayOtherStream(String) - Static method in class com.hankcs.hanlp.corpus.io.ByteArrayOtherStream
-
- createByteArrayOtherStream(InputStream) - Static method in class com.hankcs.hanlp.corpus.io.ByteArrayOtherStream
-
- createByteArrayStream(String) - Static method in class com.hankcs.hanlp.corpus.io.ByteArrayStream
-
- createIndices(String, boolean, boolean, String) - Static method in class com.hankcs.hanlp.dependency.perceptron.accessories.CoNLLReader
-
Reads a CoNLL file and creates indices
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.CWSTrainer
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.InstanceConsumer
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.NERTrainer
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronNERecognizer
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronPOSTagger
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- createInstance(Sentence, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.POSTrainer
-
- createSynonymList(List<Term>, boolean) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
Converts a segmentation result into a synonym list
- createTagger() - Method in class com.hankcs.hanlp.model.crf.crfpp.Model
-
- createTagger() - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- createTagSet() - Method in class com.hankcs.hanlp.model.perceptron.CWSTrainer
-
- createTagSet() - Method in class com.hankcs.hanlp.model.perceptron.NERTrainer
-
Override this method to support arbitrary custom NER types, for example:
NERTagSet tagSet = new NERTagSet();
tagSet.nerLabels.add("nr");
tagSet.nerLabels.add("ns");
tagSet.nerLabels.add("nt");
return tagSet;
- createTagSet() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer
-
Creates the tag set
- createTagSet() - Method in class com.hankcs.hanlp.model.perceptron.POSTrainer
-
- createUndefined(String) - Static method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
Creates an out-of-dictionary entry of type @
- createValue(String[]) - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
Creates a value from one line of a dictionary entry
- createValue(String[]) - Method in class com.hankcs.hanlp.dictionary.common.EnumItemDictionary
-
- createValue(Class<?>, String) - Method in interface com.hankcs.hanlp.model.perceptron.cli.Args.ValueCreator
-
Creates a value object of the given type using the given string value representation;
- crf_learn - Class in com.hankcs.hanlp.model.crf.crfpp
-
Corresponds to crf_learn
- crf_learn() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.crf_learn
-
- crf_learn.Option - Class in com.hankcs.hanlp.model.crf.crfpp
-
- crf_test - Class in com.hankcs.hanlp.model.crf.crfpp
-
Corresponds to crf_test
- crf_test() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.crf_test
-
- CRFCWSModelPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
CRF segmentation model
- CRFEncoderThread - Class in com.hankcs.hanlp.model.crf.crfpp
-
- CRFEncoderThread(int) - Constructor for class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- CRFLexicalAnalyzer - Class in com.hankcs.hanlp.model.crf
-
CRF lexical analyzer (Chinese word segmentation, part-of-speech tagging, and named entity recognition)
- CRFLexicalAnalyzer(CRFSegmenter) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer(CRFSegmenter, CRFPOSTagger) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer(CRFSegmenter, CRFPOSTagger, CRFNERecognizer) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer(String) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer(String, String) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer(String, String, String) - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Constructs a CRF lexical analyzer
- CRFLexicalAnalyzer() - Constructor for class com.hankcs.hanlp.model.crf.CRFLexicalAnalyzer
-
Loads the model specified in the configuration file
- CRFModel - Class in com.hankcs.hanlp.model.crf
-
- CRFModel() - Constructor for class com.hankcs.hanlp.model.crf.CRFModel
-
- CRFModel(ITrie<FeatureFunction>) - Constructor for class com.hankcs.hanlp.model.crf.CRFModel
-
Stores the internal feature functions in the specified trie structure
- CRFNERecognizer - Class in com.hankcs.hanlp.model.crf
-
- CRFNERecognizer() - Constructor for class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- CRFNERecognizer(String) - Constructor for class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- CRFNERecognizer(String, String[]) - Constructor for class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- CRFNERModelPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
CRF named entity recognition model
- CRFPOSModelPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
CRF part-of-speech tagging model
- CRFPOSTagger - Class in com.hankcs.hanlp.model.crf
-
- CRFPOSTagger() - Constructor for class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- CRFPOSTagger(String) - Constructor for class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- CRFSegment - Class in com.hankcs.hanlp.seg.CRF
-
- CRFSegment(CRFSegmentModel) - Constructor for class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- CRFSegment(String) - Constructor for class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- CRFSegment() - Constructor for class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- CRFSegmenter - Class in com.hankcs.hanlp.model.crf
-
- CRFSegmenter() - Constructor for class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- CRFSegmenter(String) - Constructor for class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- CRFSegmentModel - Class in com.hankcs.hanlp.model
-
Static CRF segmentation model
- CRFSegmentModel(ITrie<FeatureFunction>) - Constructor for class com.hankcs.hanlp.model.CRFSegmentModel
-
Stores the internal feature functions in the specified trie structure
- CRFSegmentModelPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
- CRFTagger - Class in com.hankcs.hanlp.model.crf
-
- CRFTagger() - Constructor for class com.hankcs.hanlp.model.crf.CRFTagger
-
- CRFTagger(String) - Constructor for class com.hankcs.hanlp.model.crf.CRFTagger
-
- CT_CHINESE - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Chinese character
- CT_CNUM - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Chinese numeral
- CT_DELIMITER - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Delimiter "!,.?()[]{}+=
- CT_INDEX - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Index number
- CT_LETTER - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Letter
- CT_NUM - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Digit
- CT_OTHER - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Other
- CT_SINGLE - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
Single-byte
- cube() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Returns the cube of the matrix (as an array)
- CustomDictionary - Class in com.hankcs.hanlp.dictionary
-
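User-defined custom dictionary
Note that dynamic additions, deletions, and modifications to the custom dictionary are not thread-safe.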
- CustomDictionary() - Constructor for class com.hankcs.hanlp.dictionary.CustomDictionary
-
- customDictionary - Variable in class com.hankcs.hanlp.seg.Segment
-
Dictionary dedicated to this segmenter; by default it shares CustomDictionary.DEFAULT
- CustomDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
User-defined custom dictionary path
- CWSEvaluator - Class in com.hankcs.hanlp.seg.common
-
Evaluation tool for Chinese word segmentation
- CWSEvaluator() - Constructor for class com.hankcs.hanlp.seg.common.CWSEvaluator
-
- CWSEvaluator(Set<String>) - Constructor for class com.hankcs.hanlp.seg.common.CWSEvaluator
-
- CWSEvaluator(String) - Constructor for class com.hankcs.hanlp.seg.common.CWSEvaluator
-
- CWSEvaluator.Result - Class in com.hankcs.hanlp.seg.common
-
- CWSInstance - Class in com.hankcs.hanlp.model.perceptron.instance
-
- CWSInstance(String[], FeatureMap) - Constructor for class com.hankcs.hanlp.model.perceptron.instance.CWSInstance
-
Creates a word segmentation instance
- CWSInstance(String, FeatureMap) - Constructor for class com.hankcs.hanlp.model.perceptron.instance.CWSInstance
-
- CWSTagSet - Class in com.hankcs.hanlp.model.perceptron.tagset
-
- CWSTagSet(int, int, int, int) - Constructor for class com.hankcs.hanlp.model.perceptron.tagset.CWSTagSet
-
- CWSTagSet() - Constructor for class com.hankcs.hanlp.model.perceptron.tagset.CWSTagSet
-
- CWSTrainer - Class in com.hankcs.hanlp.model.perceptron
-
Training tool for the perceptron segmenter
- CWSTrainer() - Constructor for class com.hankcs.hanlp.model.perceptron.CWSTrainer
-
- e - Variable in class com.hankcs.hanlp.collection.sequence.SString
-
End position, exclusive
- e(CoNLLSentence, CoNLLSentence) - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- e - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Interjection
- E - Variable in class com.hankcs.hanlp.model.perceptron.tagset.CWSTagSet
-
- E_TAG_PREFIX - Variable in class com.hankcs.hanlp.model.perceptron.tagset.NERTagSet
-
- EasyDictionary - Class in com.hankcs.hanlp.corpus.dictionary
-
A general-purpose double-array dictionary that follows a specific format
- EasyDictionary() - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary
-
- EasyDictionary.Attribute - Class in com.hankcs.hanlp.corpus.dictionary
-
Word attribute in the general-purpose dictionary
- EasyDictionary.Searcher - Class in com.hankcs.hanlp.corpus.dictionary
-
- EasyReader - Class in com.hankcs.hanlp.corpus.io
-
Text reading utility
- EasyReader(String) - Constructor for class com.hankcs.hanlp.corpus.io.EasyReader
-
Constructor
- EasyReader(String, boolean) - Constructor for class com.hankcs.hanlp.corpus.io.EasyReader
-
Constructor
- ed(String, String) - Static method in class com.hankcs.hanlp.algorithm.EditDistance
-
Edit distance
- Edge - Class in com.hankcs.hanlp.dependency.common
-
An edge
- Edge(int, int, String, float) - Constructor for class com.hankcs.hanlp.dependency.common.Edge
-
- edge - Variable in class com.hankcs.hanlp.dependency.common.State
-
- Edge - Class in com.hankcs.hanlp.dependency.perceptron.accessories
-
An edge in a dependency parse tree
- Edge(int, int) - Constructor for class com.hankcs.hanlp.dependency.perceptron.accessories.Edge
-
- Edge - Class in com.hankcs.hanlp.seg.common
-
Base edge; construction is not allowed
- Edge(double, String) - Constructor for class com.hankcs.hanlp.seg.common.Edge
-
- EdgeFrom - Class in com.hankcs.hanlp.seg.common
-
An edge that records its start point
- EdgeFrom(int, double, String) - Constructor for class com.hankcs.hanlp.seg.common.EdgeFrom
-
- edgesTo - Variable in class com.hankcs.hanlp.seg.common.Graph
-
Edges arriving at index i
- EditDistance - Class in com.hankcs.hanlp.algorithm
-
Edit distance implementation based on semantic distance
- EditDistance() - Constructor for class com.hankcs.hanlp.algorithm.EditDistance
-
- EditDistanceScorer - Class in com.hankcs.hanlp.suggest.scorer.editdistance
-
Edit distance scorer
- EditDistanceScorer() - Constructor for class com.hankcs.hanlp.suggest.scorer.editdistance.EditDistanceScorer
-
- effectiveLaSize() - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- effectiveRaSize() - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- embedding_size - Variable in class com.hankcs.hanlp.dependency.nnparser.option.LearnOption
-
- emission_probability - Variable in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Emission (observation) probability matrix
- emissionCost(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- emissionCost(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- emissionVector(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- emissionVector(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- Emit - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
A pattern match result
- Emit(int, int, String) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.Emit
-
Constructs a pattern match result
- emit() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Gets the pattern string(s) represented by this node
- emit() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Gets the pattern string(s) represented by this node
- empty() - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- empty() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- emptyFlag - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
This is the additional information for the case of parsing with tree constraint
For more information see:
Joakim Nivre and Daniel Fernández-González.
- enableAllNamedEntityRecognize(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
- enableAllNamedEntityRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable all named entity recognition
- enableCustomDictionary(boolean) - Method in class com.hankcs.hanlp.seg.DictionaryBasedSegment
-
- enableCustomDictionary(boolean) - Method in class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- enableCustomDictionary(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable the user dictionary
- enableCustomDictionary(DynamicCustomDictionary) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables a new user dictionary
- enableCustomDictionaryForcing(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
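Whether to force use of the user dictionary as far as possible (giving the user dictionary the highest possible priority)
Warning: the actual behavior is decided by each subclass and may break the segmenter's statistical properties (for example, if the user dictionary
contains “和服”, the segmentation of “商品和服务” may be affected by the user dictionary's high priority).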
- enableDebug() - Static method in class com.hankcs.hanlp.HanLP.Config
-
Enables debug mode (degrades performance)
- enableDebug(boolean) - Static method in class com.hankcs.hanlp.HanLP.Config
-
Enables debug mode (degrades performance)
- enableDeprelTranslator(boolean) - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- enableDeprelTranslator(boolean) - Method in interface com.hankcs.hanlp.dependency.IDependencyParser
-
Switch for automatic translation of dependency relations
- enableFilter(boolean) - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
Enables or disables the stop word filter
- enableIndexMode(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Sets index mode
- enableIndexMode(int) - Method in class com.hankcs.hanlp.seg.Segment
-
Minimum segmentation granularity in index mode (set to 1 to split down to single characters)
- enableJapaneseNameRecognize(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Whether to enable Japanese person name recognition
- enableJapaneseNameRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable Japanese person name recognition
- enableMultithreading(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables multithreading
- enableMultithreading(int) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables multithreading
- enableNameRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables person name recognition
- enableNumberQuantifierRecognize(boolean) - Method in class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- enableNumberQuantifierRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable recognition of numerals and numeral-quantifier compounds,
i.e. [二, 十, 一] => [二十一], [十, 九, 元] => [十九元]
- enableOffset(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Whether to enable offset computation (Term.offset is computed only when this is enabled)
- enableOffset(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable offset computation (Term.offset is computed only when this is enabled)
- enableOrganizationRecognize(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Enables organization name recognition
- enableOrganizationRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables organization name recognition
- enablePartOfSpeechTagging(boolean) - Method in class com.hankcs.hanlp.seg.DictionaryBasedSegment
-
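Enables numeral and English recognition (this differs from part-of-speech tagging in the usual sense; it merely borrows this configuration method and does not actually enable part-of-speech tagging.
Users of dictionary-based segmentation are unlikely to be NLP specialists and do not need high tagging accuracy, so part-of-speech tagging is simply not implemented for dictionary-based segmentation.)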
- enablePartOfSpeechTagging(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Enables part-of-speech tagging
- enablePartOfSpeechTagging(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables part-of-speech tagging
- enablePlaceRecognize(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Enables place name recognition
- enablePlaceRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Enables place name recognition
- enableProbability(boolean) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- enableProbability(boolean) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Whether to normalize scores into probabilities
- enableRuleBasedSegment - Variable in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
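Whether to run rule-based segmentation (rule-based preprocessing of English, digits, punctuation, and so on). Rules are always ugly, so this is off by default.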
- enableRuleBasedSegment(boolean) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
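Whether to run rule-based segmentation (rule-based preprocessing of English, digits, punctuation, and so on). Rules are always ugly, so this is off by default.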
- enableTranslatedNameRecognize(boolean) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
Whether to enable transliterated person name recognition
- enableTranslatedNameRecognize(boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Whether to enable transliterated person name recognition
- Encoder - Class in com.hankcs.hanlp.model.crf.crfpp
-
Training entry point
- Encoder() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.Encoder
-
- Encoder.Algorithm - Enum in com.hankcs.hanlp.model.crf.crfpp
-
- EncoderFeatureIndex - Class in com.hankcs.hanlp.model.crf.crfpp
-
- EncoderFeatureIndex(int) - Constructor for class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- encoding - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- ENCODING - Variable in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- end - Variable in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie.Hit
-
End position of the pattern in the source text
- end - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Used only for 终##终 (the end marker); never appears in segmentation results
- endOfCorpus() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- endsWith(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
Whether the word ends with some entry in this dictionary
- enQueue(QueueElement) - Method in class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
Inserts the QueueElement into the queue in ascending order of eWeight
- ensureAvailableBytes(int) - Method in class com.hankcs.hanlp.corpus.io.ByteArrayFileStream
-
Ensures that the buffer array has size bytes remaining
- ensureAvailableBytes(int) - Method in class com.hankcs.hanlp.corpus.io.ByteArrayOtherStream
-
- ensureAvailableBytes(int) - Method in class com.hankcs.hanlp.corpus.io.ByteArrayStream
-
- entropy - Variable in class com.hankcs.hanlp.mining.word.WordInfo
-
Information entropy
- entry - Variable in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
Key of the entry
- entrySet() - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- entrySet() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
- entrySet() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Gets the set of key-value pairs
- entrySet() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- entrySet() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- entrySet() - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Browses all entries
- entrySet() - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
Gets the set of key-value pairs
- entrySet() - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
Key-value pairs
- entrySet() - Method in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
- entrySet() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureDatMap
-
- entrySet() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMap
-
- entrySet() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- entrySet() - Method in class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- ENUM_CREATOR - Static variable in class com.hankcs.hanlp.model.perceptron.cli.Args
-
- EnumItem<E extends Enum<E>> - Class in com.hankcs.hanlp.corpus.dictionary.item
-
Wrapper for label-frequency pairs
- EnumItem() - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- EnumItem(E, Integer) - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
Creates an item with a single label
- EnumItem(E...) - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
Creates an item whose labels, given by the arguments, each have a frequency of 1
- EnumItemDictionary<E extends Enum<E>> - Class in com.hankcs.hanlp.dictionary.common
-
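Generic dictionary of enum items (similar to C++ templates; Java has no template mechanism, however, so a few methods have to be spelled out by subclasses)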
- EnumItemDictionary() - Constructor for class com.hankcs.hanlp.dictionary.common.EnumItemDictionary
-
- eoc - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- EOS - Static variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- eps - Static variable in class com.hankcs.hanlp.model.crf.crfpp.Mcsrch
-
- equals(Object) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- equals(Object) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Whether two states are equivalent; they are equal only if their transition functions are identical
Evaluates the equality of this node with another object.
- equals(Object) - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- equals(Object) - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
- equals(Object) - Method in class com.hankcs.hanlp.dependency.nnparser.action.AbstractInexactAction
-
- equals(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Pair
-
- equals(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- equals(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.BeamElement
-
- equals(Object) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- equals(Object) - Method in class com.hankcs.hanlp.mining.cluster.Document
-
- equals(Object) - Method in class com.hankcs.hanlp.model.crf.crfpp.Pair
-
Test this Pair for equality with another Object.
- equals(Object) - Method in class com.hankcs.hanlp.seg.common.Term
-
Determines whether two Terms are equal
- equals(Object) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- equals(Object) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- equivalenceClassMDAGNodeHashMap - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Set of equivalence classes, corresponding to the register in the paper
- err(String, Object...) - Method in class com.hankcs.hanlp.classification.utilities.io.ConsoleLogger
-
- err(String, Object...) - Method in interface com.hankcs.hanlp.classification.utilities.io.ILogger
-
- err - Variable in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- ERROR_LOG(String, Object...) - Static method in class com.hankcs.hanlp.dependency.nnparser.util.Log
-
- estimateEmissionProbability(Collection<int[][]>, int, int) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Estimates the state emission probabilities
- estimateStartProbability(Collection<int[][]>, int) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Estimates the initial state probability vector
- estimateTransitionProbability(Collection<int[][]>, int) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Estimates the transition probabilities by maximum likelihood
- estimateTransitionProbability(Collection<int[][]>, int) - Method in class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- eta - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- eval() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- eval(String[]) - Method in class com.hankcs.hanlp.model.maxent.MaxEntModel
-
Predicts the distribution
- eval(String[], double[]) - Method in class com.hankcs.hanlp.model.maxent.MaxEntModel
-
Predicts the distribution
- eval(int[], double[], EvalParameters) - Static method in class com.hankcs.hanlp.model.maxent.MaxEntModel
-
Predicts
- EvalParameters - Class in com.hankcs.hanlp.model.maxent
-
Encapsulates the parameters the model uses to compute probabilities
- EvalParameters(Context[], double, double, int) - Constructor for class com.hankcs.hanlp.model.maxent.EvalParameters
-
Creates a parameter set that can be used for prediction
- EvalParameters(Context[], int) - Constructor for class com.hankcs.hanlp.model.maxent.EvalParameters
-
- evaluate(IClassifier, IDataSet) - Static method in class com.hankcs.hanlp.classification.statistics.evaluations.Evaluator
-
- evaluate(IClassifier, Map<String, String[]>) - Static method in class com.hankcs.hanlp.classification.statistics.evaluations.Evaluator
-
- evaluate(POSTagger, String) - Static method in class com.hankcs.hanlp.dependency.nnparser.util.PosTagUtil
-
Evaluates the accuracy of a part-of-speech tagger
- evaluate(String, String, HashSet<String>) - Static method in class com.hankcs.hanlp.dependency.perceptron.accessories.Evaluator
-
- evaluate - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- evaluate(String) - Method in class com.hankcs.hanlp.dependency.perceptron.parser.KBeamArcEagerDependencyParser
-
Standardized evaluation
- evaluate(String, String) - Static method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Trains a model
- evaluate(String, LinearModel) - Method in class com.hankcs.hanlp.model.perceptron.CWSTrainer
-
- evaluate(String, String) - Method in class com.hankcs.hanlp.model.perceptron.InstanceConsumer
-
- evaluate(String, LinearModel) - Method in class com.hankcs.hanlp.model.perceptron.InstanceConsumer
-
- evaluate(String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Evaluates
- evaluate(PerceptronClassifier.Instance[]) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Evaluates
- evaluate(String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- evaluate(String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTagger
-
Performance test
- evaluate(Instance[], LinearModel) - Static method in class com.hankcs.hanlp.model.perceptron.utility.IOUtility
-
- evaluate(Instance, LinearModel, int[]) - Static method in class com.hankcs.hanlp.model.perceptron.utility.IOUtility
-
- evaluate(String, String) - Static method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Runs the evaluation on the gold standard and the segmentation output
- evaluate(Segment, String, String, String) - Static method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Standardized evaluation of a segmenter
- evaluate(Segment, String, String, String, String) - Static method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Standardized evaluation of a segmenter
- evaluate(String, String, String) - Static method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Runs the evaluation on the gold standard and the segmentation output
- evaluateCWS(String, PerceptronSegmenter) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- evaluateNER(NERecognizer, String) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- Evaluator - Class in com.hankcs.hanlp.classification.statistics.evaluations
-
Classifier performance evaluation
- Evaluator - Class in com.hankcs.hanlp.corpus.dependency.CoNll
-
Testing tool
- Evaluator() - Constructor for class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- Evaluator - Class in com.hankcs.hanlp.dependency.perceptron.accessories
-
- Evaluator() - Constructor for class com.hankcs.hanlp.dependency.perceptron.accessories.Evaluator
-
- exactMatchSearch(String) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Exact match
- exactMatchSearch(String) - Method in class com.hankcs.hanlp.collection.dartsclone.DoubleArray
-
Returns the corresponding value if the key is found.
- exactMatchSearch(byte[]) - Method in class com.hankcs.hanlp.collection.dartsclone.DoubleArray
-
Returns the corresponding value if the key is found.
- exactMatchSearch(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Exact match
- exactMatchSearch(String, int, int, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- exactMatchSearch(char[], int, int, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Exact lookup
- exactMatchSearch(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- exactMatchSearch(String, int, int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- exceptionToString(Exception) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Converts an exception to a string
- execute() - Method in class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors
-
- exists(String) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- expected - Variable in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- extend(int) - Method in class com.hankcs.hanlp.dictionary.TransformMatrix
-
Extends the internal matrix; used only as a compatibility measure after enum instances have been added via reflection
- extract(String, int) - Static method in class com.hankcs.hanlp.mining.phrase.MutualInformationEntropyPhraseExtractor
-
One-line extraction
- extractAllParseFeatures(Configuration, int) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.features.FeatureExtractor
-
Given a list of templates, extracts all features for the given state
- extractBasicFeatureData(IDataSet) - Static method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
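Creates a FeatureStats object containing all the words in a category, the number of categories, and the number of instances. These statistics
are used by the feature selection algorithm.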
- extractFeature(String, FeatureMap, int) - Method in class com.hankcs.hanlp.model.perceptron.instance.CWSInstance
-
- extractFeature(String[], String[], FeatureMap, int) - Method in class com.hankcs.hanlp.model.perceptron.instance.NERInstance
-
Extracts features; override this method to extend it with your own feature templates
- extractFeature(String[], FeatureMap, int) - Method in class com.hankcs.hanlp.model.perceptron.instance.POSInstance
-
- extractFeature(String, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Feature extraction
- extractFeature(String, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronNameGenderClassifier
-
- extractGivenName(String) - Static method in class com.hankcs.hanlp.model.perceptron.PerceptronNameGenderClassifier
-
Strips the surname and returns the given name from a Chinese person name
- extractKeyword(String, int) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts keywords
- extractKeywords(String) - Static method in class com.hankcs.hanlp.classification.utilities.TextProcessUtility
-
Extracts keywords; in real-world applications, phrases should also be taken into account
- extractPhrase(String, int) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts phrases
- extractPhrase(String, int) - Method in interface com.hankcs.hanlp.mining.phrase.IPhraseExtractor
-
Extracts phrases
- extractPhrase(String, int) - Method in class com.hankcs.hanlp.mining.phrase.MutualInformationEntropyPhraseExtractor
-
- extractSuffix(int, int, boolean) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonSuffixExtractor
-
Extracts common suffixes
- extractSuffixByWords(int, int, boolean) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonSuffixExtractor
-
This method assumes that a suffix is always a whole word, so length is measured in words
- extractSuffixExtended(int, int) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonSuffixExtractor
-
- extractSummary(String, int) - Static method in class com.hankcs.hanlp.HanLP
-
Automatic summarization
The default sentence delimiters used to split the target document are ,,。::“”??!!;;
- extractSummary(String, int, String) - Static method in class com.hankcs.hanlp.HanLP
-
Automatic summarization
- extractWords(String, int) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts words
- extractWords(BufferedReader, int) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts words
- extractWords(String, int, boolean) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts words (new word discovery)
- extractWords(BufferedReader, int, boolean) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts words (new word discovery)
- extractWords(BufferedReader, int, boolean, int, float, float, float) - Static method in class com.hankcs.hanlp.HanLP
-
Extracts words (new word discovery)
- g - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Academic term
- GammaCdf(double, double) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
Gamma function
- GaussCdf(double) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
Given the z-value of a Gaussian, returns the p-value (the value of the cumulative distribution function)
http://jamesmccaffrey.wordpress.com/2010/11/05/programmatically-computing-the-area-under-the-normal-curve/
- gb - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Biology-related term
- gbc - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Biological category
- gc - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Chemistry-related term
- Gcf(double, double) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
Internal function used by GammaCdf
- generate(int) - Method in class com.hankcs.hanlp.model.hmm.FirstOrderHiddenMarkovModel
-
- generate(int) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Generates a sample sequence
- generate(int, int, int) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Generates a sample sequence
- generate(int) - Method in class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- generateBiGraph(WordNet) - Static method in class com.hankcs.hanlp.seg.WordBasedSegment
-
Generates the bigram word graph
- generateKey(String) - Method in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
Generates a key that represents this sentence
- generateKey(String) - Method in class com.hankcs.hanlp.suggest.scorer.editdistance.EditDistanceScorer
-
- generateKey(String) - Method in class com.hankcs.hanlp.suggest.scorer.lexeme.IdVectorScorer
-
- generateKey(String) - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinScorer
-
- generateParameter(Table, int) - Method in class com.hankcs.hanlp.model.crf.FeatureTemplate
-
- generateSingleWordContext(CoNLLWord[], int, String) - Static method in class com.hankcs.hanlp.corpus.dependency.model.MaxEntDependencyModelMaker
-
- generateUniContext(CoNLLWord[], int, int) - Static method in class com.hankcs.hanlp.corpus.dependency.model.MaxEntDependencyModelMaker
-
- generateWord(List<Vertex>, WordNet) - Static method in class com.hankcs.hanlp.seg.WordBasedSegment
-
Applies rule-based merging, splitting, and so on to the coarse segmentation result, and builds a new word net at the same time
- generateWordNet(WordNet) - Method in class com.hankcs.hanlp.seg.WordBasedSegment
-
Generates the unigram word net
- get(String) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Gets the value
- get(int) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Retrieves the value at position index in the value array
Note that, for efficiency, no argument validation is performed here
- get(Object) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- get(char[]) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- get(String) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- get(Object) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
- get(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
- get(String) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- get(char[]) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- get(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- get(Object) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- get(String, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Exact lookup
- get(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Exact lookup
- get(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Exact lookup
- get(char[]) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- get(int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Retrieves the value at position index in the value array
Note that, for efficiency, no argument validation is performed here
- get(char[]) - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- get(String) - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- get(String) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- get(IWord) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- get(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
Looks up a word
- get(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
Checks whether the suffix exists
- get(int, int) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get a single element.
- get(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
Looks up a word
- get(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary
-
- get(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionaryEx
-
- get(String) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Gets the entry
- get(int) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Gets the entry
- get(String) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
- get(String) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionaryEx
-
- get(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Looks up a word
- get(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Looks up a word
- get(String) - Static method in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
- get(char) - Static method in class com.hankcs.hanlp.dictionary.other.CharType
-
Gets the type of a character
- get(String) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
- get(Object) - Method in class com.hankcs.hanlp.mining.cluster.SparseVector
-
- get(String) - Static method in class com.hankcs.hanlp.model.bigram.BigramDependencyModel
-
- get(String, String, String, String) - Static method in class com.hankcs.hanlp.model.bigram.BigramDependencyModel
-
Gets the most likely dependency relation between one word and another
- get(String) - Method in class com.hankcs.hanlp.model.bigram.WordNatureDependencyModel
-
- get(String, String) - Method in class com.hankcs.hanlp.model.bigram.WordNatureDependencyModel
-
- get(int, int) - Method in class com.hankcs.hanlp.model.crf.Table
-
Gets an element of the table
- get(char[]...) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- get(char...) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- get(int) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Gets all nodes in a row
- get(int, int) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Gets the node in a row whose length equals length
- get(int) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- get(int) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- get(Object) - Static method in class com.hankcs.hanlp.utility.GlobalObjectPool
-
Gets the object
- getAccuracy() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer.Result
-
Gets the accuracy
- getALevel() - Method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
- getAllPossibleOptions(Options) - Static method in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- getAllStrings() - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Retrieves all keys
Retrieves all the valid Strings that have been inserted in to the MDAG.
- getAlpha() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getAlpha_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getAlphaFloat_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getAnalyzer() - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
Gets the delegated lexical analyzer
- getAnswer_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getArray() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Access the internal two-dimensional array.
- getArray() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.CompactArray
-
- getArrayCopy() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Copy the internal two-dimensional array.
- getAttribute() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Gets the attribute of the word
- getAttribute(String) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Retrieves the attribute of a word from HanLP's lexicons (including the core dictionary and the user dictionary)
- getAttribute(Term) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Retrieves the attribute of a word from HanLP's lexicons (including the core dictionary and the user dictionary)
- getBase() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Gets a reference to the base array; do not modify base
- getBase() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getBaseArraySize() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- getBestPath() - Method in class com.hankcs.hanlp.seg.NShort.Path.NShortPath
-
Gets a single shortest path, although there may be more than one shortest path
- getBiFrequency(String, String) - Static method in class com.hankcs.hanlp.dictionary.CoreBiGramTableDictionary
-
Gets the co-occurrence frequency
- getBiFrequency(int, int) - Static method in class com.hankcs.hanlp.dictionary.CoreBiGramTableDictionary
-
Gets the co-occurrence frequency
- getBiGram() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Gets the bigram co-occurrences
- getBigramTempls_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getBrownCluster4thPrefix() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- getBrownCluster6thPrefix() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- getBrownClusterFullString() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- getBufferItem(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- getBytes() - Method in class com.hankcs.hanlp.corpus.io.ByteArray
-
Gets all bytes.
- getBytes(InputStream) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- getCallback() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getCatalog() - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- getCatalog() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Gets the category table (catalog).
- getCategories() - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- getCategory(int) - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- getChar() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
- getChar() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- getCharCount(String, String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Counts how many times characters from the character set occur in the string.
- getCharsetSize() - Method in interface com.hankcs.hanlp.collection.trie.datrie.CharacterMapping
-
- getCharsetSize() - Method in class com.hankcs.hanlp.collection.trie.datrie.Utf8CharacterMapping
-
- getCheck() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Gets a reference to the check array; do not modify check.
- getCheck() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getCheckArraySize() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- getChild(char) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Gets the child node.
- getChild(char) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- getChild(char) - Method in class com.hankcs.hanlp.collection.trie.bintrie.Node
-
- getChisquareCriticalValue() - Method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
Gets the chi-square critical value.
- getColumnDimension() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get column dimension.
- getColumnPackedCopy() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Make a one-dimensional column packed copy of the internal array.
- getComplexSentenceList() - Method in class com.hankcs.hanlp.corpus.document.Document
-
Gets the list of complex sentences; each word in a sentence may be either a compound word or a simple word.
- getConstantInverse() - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- getCorrectionConstant() - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- getCorrectionParam() - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- getCost_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getCostFactor_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getDA() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- getDat() - Method in class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- getDefaultFeatureTemplate() - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- getDefaultFeatureTemplate() - Method in class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- getDefaultFeatureTemplate() - Method in class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- getDefaultFeatureTemplate() - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
- getDefaultInitialLearningRate() - Method in enum com.hankcs.hanlp.mining.word2vec.NeuralNetworkType
-
- getDependent(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- getDeprelTranslator() - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- getDeprelTranslator() - Method in interface com.hankcs.hanlp.dependency.IDependencyParser
-
Gets the dependency relation mapping table.
- getDepth() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Gets the depth of the node.
- getDepth() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Gets the depth of the node.
- getEdge(Node, Node) - Method in class com.hankcs.hanlp.model.bigram.WordNatureDependencyModel
-
Scores the edge.
- getEdgeArray() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
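Gets the edge array; edge[i][j] holds the dependency relation of the edge between the word with id i and the word with id j, or null if no such edge exists.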
- getEdgeListTo(int) - Method in class com.hankcs.hanlp.seg.common.Graph
-
Gets the list of edges arriving at vertex to.
- getEdgesTo() - Method in class com.hankcs.hanlp.seg.common.Graph
-
- getElementArray() - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- getEmit() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.FragmentToken
-
- getEmit() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.MatchToken
-
- getEmit() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Token
-
- getEmptySize() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Gets the number of free array elements.
- getEnd() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- getEnd() - Method in interface com.hankcs.hanlp.algorithm.ahocorasick.interval.Intervalable
-
End point.
- getEntrySet() - Method in class com.hankcs.hanlp.corpus.occurrence.TermOccurrence
-
- getExponentialExpandFactor() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- getFeature_id_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getFeature_index_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getFeatureAt(int) - Method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
- getFeatureCache_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getFeatureIndex_() - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- getFeatureTemplateArray() - Method in class com.hankcs.hanlp.model.crf.LogLinearModel
-
- getFirst() - Method in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- getFirst(int) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Gets the first node of a given row.
- GetFirst() - Method in class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
Reads the first element without performing a DeQueue operation.
- getFirstChar() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the first letter.
- getFirstCharArray() - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinKey
-
- getFragment() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Token
-
- getFreeSize() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- getFrequency(E) - Method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- getFrequency(String) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- getFrequency(String) - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
Gets the frequency.
- getFrequency() - Method in class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
- getFrequency(String, String) - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
Gets the transition frequency.
- getFrequency(E, E) - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
Gets the transition frequency.
- getFrequency() - Method in class com.hankcs.hanlp.seg.common.Term
-
Gets the frequency of this word in HanLP's lexicon.
- getFrequency(String) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Gets the frequency of a word.
- getFrom() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
- getGoldDependencies() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- getHead(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- getHead() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the input-method head.
- getHeadString() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the input-method head.
- getId(String) - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- getId(String) - Method in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- getId() - Method in interface com.hankcs.hanlp.corpus.synonym.ISynonym
-
Gets the ID.
- getId() - Method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- getID(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.DecoderFeatureIndex
-
- getID(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- getID(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getIdString() - Method in interface com.hankcs.hanlp.corpus.synonym.ISynonym
-
Gets the ID as a string.
- getIdString() - Method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- getIncomingTransitionCount() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Retrieves this node's incoming transition count.
- getIndex() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- getIndex(String[], int, TaggerImpl) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getInitSize() - Method in interface com.hankcs.hanlp.collection.trie.datrie.CharacterMapping
-
- getInitSize() - Method in class com.hankcs.hanlp.collection.trie.datrie.Utf8CharacterMapping
-
- getInputFile() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getIter() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getKey() - Method in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- getKey() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- getKey() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getKey() - Method in class com.hankcs.hanlp.model.crf.crfpp.Pair
-
Gets the key for this pair.
- getKeyword() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Emit
-
Gets the corresponding pattern string.
- getKeyword(String) - Method in class com.hankcs.hanlp.summary.TextRankKeyword
-
- getKeywordCounts(String[]) - Static method in class com.hankcs.hanlp.classification.utilities.TextProcessUtility
-
Counts the frequency of each word.
- getKeywordList(String, int) - Static method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
Extracts keywords (thread-safe).
- getKeywordList(String, int) - Static method in class com.hankcs.hanlp.summary.TextRankKeyword
-
Extracts keywords.
- getKeywords(List<Term>, int) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
Extracts keywords (not thread-safe).
- getKeywords(List<Term>, int) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getKeywords(String, int) - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
Extracts keywords.
- getKeywords(String) - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
Extracts keywords (top 10).
- getKeywords(List<Term>, int) - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
- getKeywords(List<Term>, int) - Method in class com.hankcs.hanlp.summary.TextRankKeyword
-
- getKeywordsOf(Object) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getKeywordsOf(Object, int) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getKeywordsWithTfIdf(String, int) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getKeywordsWithTfIdf(List<Term>, int) - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getLA() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- getLabel() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- getLabel() - Method in interface com.hankcs.hanlp.corpus.document.sentence.word.IWord
-
Gets the label.
- getLabel() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
- getLabels() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
Dependency relations.
- getLargestValueId() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Gets the largest value.
- getLast() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- getLayer1Size() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getLength() - Method in class com.hankcs.hanlp.corpus.io.ByteArray
-
- getLength() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getLetter() - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Retrieves the character representing the transition label leading up to this node.
- getLexemeArray(List<CommonSynonymDictionary.SynonymItem>) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
Gets the semantic labels.
- getLexemeArray(List<CommonSynonymDictionary.SynonymItem>) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionaryEx
-
Gets the semantic labels.
- getLexicon() - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- getLexicon() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Gets the lexicon.
- getLinearExpandFactor() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- getLongestSearcher(String, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getLongestSearcher(char[], int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getLongestSuffixLength(String) - Method in class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
Gets the longest suffix.
- getMatrix(int, int, int, int) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get a submatrix.
- getMatrix(int[], int[]) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get a submatrix.
- getMatrix(int, int, int[]) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get a submatrix.
- getMatrix(int[], int, int) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get a submatrix.
- getMatrixElement(int, int) - Method in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- getMax_xsize_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getMaxid_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getMaximumValue() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
The maximum value that can be set.
- getMaxSynonymItemIdDistance() - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary
-
Gets the maximum id.
- getMinCount() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getMode_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getModel() - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Gets the trained model, which can be serialized for storage or used for prediction.
- getModel() - Method in class com.hankcs.hanlp.classification.classifiers.NaiveBayesClassifier
-
- getModel() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
- getModel() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTagger
-
- getModel() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer.Result
-
Gets the model.
- getMostLikelyLabel() - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- getNaiveBayesModel() - Method in class com.hankcs.hanlp.classification.classifiers.NaiveBayesClassifier
-
Gets the training result.
- getNature() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Gets the part of speech of this node; returns null if it has not been determined yet.
- getNature() - Method in class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
The part of speech of the atom.
- getNatureFrequency(String) - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- getNatureFrequency(Nature) - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
Gets the frequency of a part of speech.
- getNatureFrequency(String) - Method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- getNatureFrequency(Nature) - Method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
Gets the frequency of a part of speech.
- getNbest_() - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- getNbest_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getNegative() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getNERTagSet() - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- getNERTagSet() - Method in class com.hankcs.hanlp.model.hmm.HMMNERecognizer
-
- getNERTagSet() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronNERecognizer
-
- getNERTagSet() - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- getNERTagSet() - Method in interface com.hankcs.hanlp.tokenizer.lexical.NERecognizer
-
- getNERTagSet() - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- GetNext() - Method in class com.hankcs.hanlp.seg.NShort.Path.CQueue
-
Reads the element after the one last read, without performing a DeQueue operation.
- getNode_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getNodeList_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getNonzeroSize() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getNonzeroSize() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getNPaths(int) - Method in class com.hankcs.hanlp.seg.NShort.Path.NShortPath
-
Gets at most n paths, ordered from shortest to longest.
- getNPaths() - Method in class com.hankcs.hanlp.seg.NShort.Path.NShortPath
-
Gets the 10 shortest paths.
- getNumOutcomes() - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- getNumThreads() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getNumWords() - Method in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- getOffset() - Method in class com.hankcs.hanlp.corpus.io.ByteArray
-
- getOffset() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.CompactArray
-
- getOffset() - Method in class com.hankcs.hanlp.dictionary.BaseSearcher
-
Gets the current offset.
- getOutcomes() - Method in class com.hankcs.hanlp.model.maxent.Context
-
Gets the outcomes.
- getOutgoingTransitionCount() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Retrieves this node's outgoing transition count.
- getOutgoingTransitions() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Retrieves this node's outgoing transitions.
- getOutgoingTransitionSetSize() - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Retrieves the size of this node's outgoing transition set.
- getOutputFile() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getPairFrequency(String, String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- getParameters() - Method in class com.hankcs.hanlp.model.maxent.Context
-
Gets the parameters.
- getParams() - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- getPathList_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getPaths(int) - Method in class com.hankcs.hanlp.seg.NShort.Path.NShortPath
-
Gets the paths ranked (index+1)-th shortest.
- getPerceptionNERecognizer() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronLexicalAnalyzer
-
Gets the named entity recognizer.
- getPerceptronPOSTagger() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronLexicalAnalyzer
-
Gets the part-of-speech tagger.
- getPerceptronSegmenter() - Method in class com.hankcs.hanlp.model.perceptron.PerceptronLexicalAnalyzer
-
Gets the segmenter.
- getPhraseByLe() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- getPhraseByMi() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- getPhraseByRe() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- getPhraseByScore() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- getPinyin(int) - Static method in class com.hankcs.hanlp.dictionary.py.Integer2PinyinConverter
-
- getPinyinWithoutTone() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the pinyin in plain letters, without tone.
- getPinyinWithToneMark() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the pinyin with tone marks.
- getRealWord() - Method in interface com.hankcs.hanlp.corpus.synonym.ISynonym
-
Gets the original word.
- getRealWord() - Method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- getRealWord() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Gets the real word.
- getResourceAsStream(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
A wrapper around class.getResourceAsStream that throws an IOException when the resource does not exist.
- getResult(boolean) - Method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Gets the PRF (precision, recall, and F-score).
- getResult() - Method in class com.hankcs.hanlp.seg.common.CWSEvaluator
-
Gets the PRF (precision, recall, and F-score).
- getResult_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getReversedDependencies() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- getRowDimension() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Get row dimension.
- getRowPackedCopy() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Make a one-dimensional row packed copy of the internal array.
- getSample() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getScore(boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
Returns the current score of the configuration
- getSearcher(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getSearcher(String, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getSearcher(char[], int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getSearcher(String) - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary
-
- getSearcher(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
- getSearcher(char[]) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
获取一个BinTrie的查询工具
- getSearcher(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
- getSearcher(char[]) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
获取一个BinTrie的查询工具
- getSearcher(char[]) - Static method in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
- getSearcher(char[], DoubleArrayTrie<Pinyin[]>) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
- getSearcher(char[], DoubleArrayTrie<String>) - Static method in class com.hankcs.hanlp.dictionary.ts.BaseChineseDictionary
-
- getSecond() - Method in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- getSegment() - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- getSegment() - Method in interface com.hankcs.hanlp.dependency.IDependencyParser
-
Gets the segmenter used by the parser.
- getSegment() - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
- getSegment() - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
- getSentence() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- getShengmu() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the initial (shengmu).
- getSimpleMDAGArray() - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Gets the simplified state array.
Returns the array of SimpleMDAGNodes collectively containing the
data of this MDAG, or null if it hasn't been simplified yet.
- getSimpleSentenceList() - Method in class com.hankcs.hanlp.corpus.document.Document
-
Gets the list of simple sentences, in which compound words are split into simple words.
- getSimpleSentenceList(boolean) - Method in class com.hankcs.hanlp.corpus.document.Document
-
Gets the list of simple sentences.
- getSimpleSentenceList(Set<String>) - Method in class com.hankcs.hanlp.corpus.document.Document
-
Gets the list of simple sentences; compound words whose label is in the given set are split into simple words.
- getSimpleWordList() - Method in class com.hankcs.hanlp.corpus.document.Document
-
- getSize() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getSize() - Method in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- getSize() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getStack() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- getStart() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- getStart() - Method in interface com.hankcs.hanlp.algorithm.ahocorasick.interval.Intervalable
-
Start point.
- getStates() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
- getStates() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- getStatus() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Gets the node's word-forming status.
- getStringsEndingWith(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Suffix query.
Retrieves all the Strings in the MDAG that end with a given String.
- getStringsStartingWith(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Prefix query.
Retrieves all the Strings in the MDAG that begin with a given String.
- getStringsWithSubstring(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Returns the keys that contain the substring.
Retrieves all the Strings in the MDAG that contain a given String.
- getSuccess() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Gets the goto table.
- getSuffix(String, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Gets the suffix following the last delimiter.
- getsum() - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- getSummary(String, int) - Static method in class com.hankcs.hanlp.HanLP
-
Automatic summarization.
The default sentence delimiters used when splitting the target document are ,,。::“”??!!;;
- getSummary(String, int, String) - Static method in class com.hankcs.hanlp.HanLP
-
Automatic summarization.
- getSummary(String, int) - Static method in class com.hankcs.hanlp.summary.TextRankSentence
-
Single-call convenience interface.
- getSummary(String, int, String) - Static method in class com.hankcs.hanlp.summary.TextRankSentence
-
Single-call convenience interface.
- getTag2id() - Method in class com.hankcs.hanlp.model.crf.CRFModel
-
- getTagId(String) - Method in class com.hankcs.hanlp.model.crf.CRFModel
-
Gets the ID of a tag.
- getTags() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- getTagSet() - Method in class com.hankcs.hanlp.model.hmm.HMMNERecognizer
-
- getTagSet() - Method in class com.hankcs.hanlp.model.hmm.HMMPOSTagger
-
- getTagSet() - Method in class com.hankcs.hanlp.model.hmm.HMMSegmenter
-
- getTagSet() - Method in class com.hankcs.hanlp.model.hmm.HMMTrainer
-
- getTargetVector() - Method in class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors
-
- getTargetVector() - Method in class com.hankcs.hanlp.mining.word2vec.Distance
-
- getTargetVector() - Method in class com.hankcs.hanlp.mining.word2vec.WordAnalogy
-
- getTemplate() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getTemplate() - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- getTemplate() - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
Gets the feature template.
- getTemplate() - Method in class com.hankcs.hanlp.model.crf.FeatureTemplate
-
- getTerm() - Method in class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
- getTermAndRank(String) - Method in class com.hankcs.hanlp.summary.TextRankKeyword
-
Returns all segmented terms and their corresponding ranks.
- getTermAndRank(String, int) - Method in class com.hankcs.hanlp.summary.TextRankKeyword
-
Returns the top size highest-scoring segmented terms and their corresponding ranks.
- getTermAndRank(List<Term>) - Method in class com.hankcs.hanlp.summary.TextRankKeyword
-
Computes ranks from already segmented terms.
- getTermFrequency(String) - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Gets the term frequency.
- getTermFrequency(String) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Gets the term frequency.
- getTfMap() - Method in class com.hankcs.hanlp.classification.corpus.BagOfWordsDocument
-
- getTfMap() - Method in interface com.hankcs.hanlp.classification.corpus.ITermFrequencyHolder
-
- getTfMap() - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- getThread_id_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getThreadNum_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getTokenizer() - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- getTokenizer() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Gets the tokenizer.
- getTone() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the tone.
- getTopSentence(int) - Method in class com.hankcs.hanlp.summary.TextRankSentence
-
Gets the top few key sentences.
- getTopSentenceList(String, int) - Static method in class com.hankcs.hanlp.summary.TextRankSentence
-
Single-call convenience interface.
- getTopSentenceList(String, int, String) - Static method in class com.hankcs.hanlp.summary.TextRankSentence
-
Single-call convenience interface.
- getTotalFrequency() - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
Gets the total frequency.
- getTotalFrequency(int) - Method in class com.hankcs.hanlp.dictionary.TransformMatrix
-
- getTotalFrequency(E) - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
Gets the total frequency of e.
- getTotalFrequency() - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
Gets the total frequency of all labels.
- getTotalSize() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getTotalSize() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getTraditionalChinese(String) - Static method in class com.hankcs.hanlp.dictionary.ts.SimplifiedChineseDictionary
-
- getTrainWords() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- getTransitionPathNodes(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Gets the nodes traversed along a string's path.
Retrieves the nodes in the transition path starting
from this node corresponding to a given String.
- getTransitions() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
- getTransitions() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- getTransitionSetBeginIndex() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Retrieves the index in a simplified MDAG data array that the SimpleMDAGNode
representation of this node's outgoing transition set begins at.
- getTransitionSetBeginIndex() - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Retrieves the index in this node's containing array that its transition set begins at.
- getTrie() - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
- getTrie() - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
- getTrie() - Method in class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- getTriGram() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Gets the third-order co-occurrence.
- getUA() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- getUniGram() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Gets the first-order co-occurrence, which is simply the term frequency count.
- getUnigramTempls_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getUnitSize() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getUnitSize() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getUnsigned(byte) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Gets the unsigned integer corresponding to a byte.
- getValue() - Method in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- getValue() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Gets the value associated with the node.
- getValue() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- getValue() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- getValue() - Method in interface com.hankcs.hanlp.corpus.document.sentence.word.IWord
-
Gets the word.
- getValue() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
- getValue() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- getValue() - Method in class com.hankcs.hanlp.model.crf.crfpp.Pair
-
Gets the value for this pair.
- getValueArray(V[]) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- getValueArray(V[]) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- getValueArray(V[]) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- getValueArray(V[]) - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- getValueAt(int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Gets the value at index.
- getVertexes() - Method in class com.hankcs.hanlp.seg.common.Graph
-
- getVertexes() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Gets the internal vertex table; handle with care!
- getVlevel_() - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- getVlevel_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getVocab() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- getVocabIndexMap() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- getVocabSize() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- getWindow() - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- getWord(int) - Method in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- getWord(int) - Method in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- getWordArray() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
- getWordArrayWithRoot() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
Gets the word array, including the root node.
- getWordId() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
- getWordID(String) - Static method in class com.hankcs.hanlp.dictionary.CoreBiGramTableDictionary
-
Gets the ID of a word.
- getWordID(String) - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Gets the ID of a word.
- getWordIdArray() - Method in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- GetWordInfo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary
-
- getWordList() - Method in class com.hankcs.hanlp.corpus.document.Document
-
Gets the word sequence.
- getWords() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- getX_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getXsize_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getY_() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- getYsize_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- getYunmu() - Method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Gets the final (yunmu) of the syllable.
- getZ_() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- gg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Geography and geology terms.
- gi - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Computer-related terms.
- GlobalObjectPool - Class in com.hankcs.hanlp.utility
-
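Global object cache pool.
Stores large models so that a model that has already been loaded does not need to be loaded again. If the JVM runs low on memory and no strong references remain,
the garbage collector is allowed to reclaim these models.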
- GlobalObjectPool() - Constructor for class com.hankcs.hanlp.utility.GlobalObjectPool
-
- gm - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Mathematics terms.
- goldDependencies - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.CompactTree
-
- goldDependencies - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
dependent -> head
- goldFile - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- gp - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Physics terms.
- gradient(double[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
Computes the gradient.
- Graph - Class in com.hankcs.hanlp.seg.common
-
- Graph(Vertex[]) - Constructor for class com.hankcs.hanlp.seg.common.Graph
-
Converts a word net into a word graph.
- Gser(double, double) - Static method in class com.hankcs.hanlp.classification.statistics.ContinuousDistributions
-
Internal function used by GammaCdf
- guessAttribute(Term) - Static method in class com.hankcs.hanlp.seg.CharacterBasedSegment
-
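Looks up or guesses the attribute of a word:
first consults the dictionary, then classifies letter and digit strings, and finally guesses out-of-vocabulary words.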
- guessNature() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
Guesses the most likely part of speech, i.e. the most frequent one among this vertex's parts of speech.
- i - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Idiom (chengyu).
- IAction - Interface in com.hankcs.hanlp.dependency.perceptron.transition.parser
-
- ICacheAble - Interface in com.hankcs.hanlp.corpus.io
-
Can be written to or read from binary form.
- IClassifier - Interface in com.hankcs.hanlp.classification.classifiers
-
Text classifier interface.
- id - Variable in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNllLine
-
The first value, converted to an id.
- ID - Variable in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
ID: index of the current word in the sentence, starting from 1.
- id - Variable in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- id - Variable in class com.hankcs.hanlp.dependency.common.Node
-
- id - Variable in class com.hankcs.hanlp.dependency.common.State
-
- id(String) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
- id - Variable in class com.hankcs.hanlp.model.perceptron.feature.FeatureSortItem
-
- id2tag - Variable in class com.hankcs.hanlp.model.crf.CRFModel
-
Maps id to tag.
- idArrayList - Variable in class com.hankcs.hanlp.suggest.scorer.lexeme.IdVector
-
- IDataSet - Interface in com.hankcs.hanlp.classification.corpus
-
Text classification data set interface.
- identity(int, int) - Static method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Generate identity matrix
- IDependencyParser - Interface in com.hankcs.hanlp.dependency
-
Dependency parser interface.
- idf(Iterable<Iterable<TERM>>, boolean, boolean) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Inverse document frequency of a collection of documents.
- idf(Iterable<Iterable<TERM>>) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Smoothed inverse document frequency of a collection of documents.
- idfFromTfs(Iterable<Map<TERM, Double>>, boolean, boolean) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Builds the inverse document frequency from a collection of term frequencies.
- idfFromTfs(Iterable<Map<TERM, Double>>) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
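Builds the inverse document frequency from a collection of term frequencies (by default with smoothed term frequency and add-one smoothing of tf-idf).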
- idOf(char[]) - Method in class com.hankcs.hanlp.dependency.nnparser.Alphabet
-
Maps label to id.
- idOf(String) - Method in class com.hankcs.hanlp.dependency.nnparser.Alphabet
-
Maps label to id.
- idOf(String) - Method in class com.hankcs.hanlp.model.hmm.Vocabulary
-
- idOf(String) - Method in interface com.hankcs.hanlp.model.perceptron.common.IStringIdMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureDatMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.feature.LockableFeatureMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- idOf(String) - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- IdVector - Class in com.hankcs.hanlp.suggest.scorer.lexeme
-
A synonym has multiple ids; this class wraps multiple synonyms for use as a key.
- IdVector(String) - Constructor for class com.hankcs.hanlp.suggest.scorer.lexeme.IdVector
-
- IdVector(List<Long[]>) - Constructor for class com.hankcs.hanlp.suggest.scorer.lexeme.IdVector
-
- IdVectorScorer - Class in com.hankcs.hanlp.suggest.scorer.lexeme
-
Word semantic vector scorer.
- IdVectorScorer() - Constructor for class com.hankcs.hanlp.suggest.scorer.lexeme.IdVectorScorer
-
- idWord - Variable in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- idWord - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
uid to word
- idWord(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.TransitionBasedParser
-
- IFeatureWeighter - Interface in com.hankcs.hanlp.classification.features
-
Term weight calculation.
- IIdStringMap - Interface in com.hankcs.hanlp.model.perceptron.common
-
Mapping from id to label.
- IIOAdapter - Interface in com.hankcs.hanlp.corpus.io
-
IO adapter interface.
Implement this interface to port HanLP to different platforms.
- ILogger - Interface in com.hankcs.hanlp.classification.utilities.io
-
A simple logging interface.
- ImmutableFeatureDatMap - Class in com.hankcs.hanlp.model.perceptron.feature
-
- ImmutableFeatureDatMap(TreeMap<String, Integer>, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureDatMap
-
- ImmutableFeatureMap - Class in com.hankcs.hanlp.model.perceptron.feature
-
- ImmutableFeatureMap(Map<String, Integer>, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMap
-
- ImmutableFeatureMap(Set<Map.Entry<String, Integer>>, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMap
-
- ImmutableFeatureMDatMap - Class in com.hankcs.hanlp.model.perceptron.feature
-
An ImmutableFeatureMap implemented with MutableDoubleArrayTrie.
- ImmutableFeatureMDatMap() - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- ImmutableFeatureMDatMap(TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- ImmutableFeatureMDatMap(MutableDoubleArrayTrieInteger, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- ImmutableFeatureMDatMap(Map<String, Integer>, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- ImmutableFeatureMDatMap(Set<Map.Entry<String, Integer>>, TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- increase(int) - Method in class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
Increases the frequency by the given amount.
- increase() - Method in class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
Increases the frequency by one.
- incrementBufferHead() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- incrementIteration() - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
Increments the iteration count.
- index - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie.LongestSearcher
-
Lexicographic index of the key.
- index - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie.Searcher
-
Lexicographic index of the key.
- index - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.BeamElement
-
- index - Variable in class com.hankcs.hanlp.seg.common.Vertex
-
Index in the one-dimensional vertex array; can be regarded as this vertex's id.
- index - Variable in class com.hankcs.hanlp.seg.NShort.Path.PathNode
-
Index of the node in the vertex array.
- index - Variable in class com.hankcs.hanlp.seg.NShort.Path.QueueElement
-
Index of the edge's end point in the vertex array.
- IndexMaps - Class in com.hankcs.hanlp.dependency.perceptron.structures
-
A structure that pools all strings together and assigns them ids.
- IndexMaps(HashMap<String, Integer>, HashMap<Integer, Integer>, String, HashMap<Integer, Integer>, HashMap<Integer, Integer>, HashMap<String, Integer>) - Constructor for class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
- indexMode - Variable in class com.hankcs.hanlp.seg.Config
-
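Whether index segmentation is used (reasonable minimal splitting); indexMode is the minimum length (inclusive) of the words that are fully segmented.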
- indexOf(Object) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- indexOf(Object) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- IndexTokenizer - Class in com.hankcs.hanlp.tokenizer
-
Index tokenizer.
- IndexTokenizer() - Constructor for class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
- INFO_LOG(String, Object...) - Static method in class com.hankcs.hanlp.dependency.nnparser.util.Log
-
- init(int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.LbfgsOptimizer
-
- init_range - Variable in class com.hankcs.hanlp.dependency.nnparser.option.LearnOption
-
- initFeatureMatrix(String, FeatureMap) - Method in class com.hankcs.hanlp.model.perceptron.instance.CWSInstance
-
- initNbest() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- innerList - Variable in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
The words this compound word is composed of.
- inputFile - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- insert(String, int, boolean) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Inserts an entry.
- insert(String, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Adds an entry in overwrite mode.
- insert(String, String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
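Inserts a new word into the custom dictionary (overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.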
- insert(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Adds a new word in overwrite mode.
Dynamic additions and deletions are not persisted to the dictionary file.
- insert(String, String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
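Inserts a new word into the custom dictionary (overwrite mode).
Dynamic additions and deletions are not persisted to the dictionary file.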
- insert(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Adds a new word in overwrite mode.
Dynamic additions and deletions are not persisted to the dictionary file.
- insert(int, Vertex, WordNet) - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Adds a vertex and checks whether it is dangling; if it is, the gap is filled in automatically.
- Instance - Class in com.hankcs.hanlp.dependency.nnparser
-
- Instance() - Constructor for class com.hankcs.hanlp.dependency.nnparser.Instance
-
- Instance - Class in com.hankcs.hanlp.dependency.perceptron.transition.configuration
-
Training instance.
- Instance(Sentence, HashMap<Integer, Edge>) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- Instance - Class in com.hankcs.hanlp.model.perceptron.instance
-
- Instance() - Constructor for class com.hankcs.hanlp.model.perceptron.instance.Instance
-
- InstanceConsumer - Class in com.hankcs.hanlp.model.perceptron
-
A consumer that needs to process instances.
- InstanceConsumer() - Constructor for class com.hankcs.hanlp.model.perceptron.InstanceConsumer
-
- InstanceHandler - Interface in com.hankcs.hanlp.model.perceptron.instance
-
- IntArrayList - Class in com.hankcs.hanlp.collection.trie.datrie
-
Dynamic array.
- IntArrayList() - Constructor for class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- IntArrayList(int) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- IntArrayList(int, int) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- Integer2PinyinConverter - Class in com.hankcs.hanlp.dictionary.py
-
Converts an integer to pinyin.
- Integer2PinyinConverter() - Constructor for class com.hankcs.hanlp.dictionary.py.Integer2PinyinConverter
-
- Interval - Class in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
Interval.
- Interval(int, int) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
Constructs an interval.
- Intervalable - Interface in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
Interval interface.
- IntervalableComparatorByPosition - Class in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
Compares intervals by start position.
- IntervalableComparatorByPosition() - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalableComparatorByPosition
-
- IntervalableComparatorBySize - Class in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
Compares intervals by length; if the lengths are equal, compares by start position.
- IntervalableComparatorBySize() - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalableComparatorBySize
-
- IntervalNode - Class in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
A node of the interval tree; in effect a set of intervals, with two child nodes maintained around the midpoint.
- IntervalNode(List<Intervalable>) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalNode
-
Constructs a node.
- IntervalTree - Class in com.hankcs.hanlp.algorithm.ahocorasick.interval
-
Interval tree, used to check for overlapping intervals.
- IntervalTree(List<Intervalable>) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalTree
-
Constructs the interval tree.
- intToBytes(int) - Static method in class com.hankcs.hanlp.utility.ByteUtil
-
Converts an integer to a byte array (4 bytes); b[0] stores the high-order byte (big-endian).
- IOAdapter - Static variable in class com.hankcs.hanlp.HanLP.Config
-
IO adapter (defaults to ResourceIOAdapter, which reads from the jar). Implement the com.hankcs.hanlp.corpus.io.IIOAdapter interface
to run HanLP on different platforms (Hadoop, Redis, etc.).
- IOUtil - Class in com.hankcs.hanlp.corpus.io
-
Some commonly used IO operations.
- IOUtil() - Constructor for class com.hankcs.hanlp.corpus.io.IOUtil
-
- IOUtil.LineIterator - Class in com.hankcs.hanlp.corpus.io
-
Convenient for reading large files line by line.
- IOUtility - Class in com.hankcs.hanlp.model.perceptron.utility
-
- IOUtility() - Constructor for class com.hankcs.hanlp.model.perceptron.utility.IOUtility
-
- IPhraseExtractor - Interface in com.hankcs.hanlp.mining.phrase
-
Automatically identifies the most likely phrases in an article.
- is_left_arc(Action, int[]) - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionUtils
-
- is_right_arc(Action, int[]) - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionUtils
-
- is_shift(Action) - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionUtils
-
- isAcceptable() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Whether this is a terminal (accepting) state.
- isAcceptNode() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Retrieves the accept state status of this node.
- isAcceptNode() - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Retrieves the accept state status of this node.
- isAllChinese(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string is entirely Chinese.
- isAllChineseNum(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string consists entirely of Chinese numerals.
- isAllDelimiter(byte[]) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the text consists entirely of delimiters.
- isAllIndex(byte[]) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the text consists entirely of sequence numbers (indices).
- isAllLetter(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string consists entirely of English letters.
- isAllLetterOrNum(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string consists entirely of English letters or digits.
- isAllNonChinese(byte[]) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether none of the text is Chinese.
- isAllNum(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string consists entirely of digits.
- isAllowOverlaps() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.TrieConfig
-
Whether overlaps are allowed.
- isAllSingleByte(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Whether the string consists entirely of single-byte characters.
- isArabicNumber(char) - Static method in class com.hankcs.hanlp.utility.CharacterHelper
-
- ISaveAble - Interface in com.hankcs.hanlp.corpus.dictionary
-
- isBadCase(String) - Static method in class com.hankcs.hanlp.recognition.nr.JapanesePersonRecognition
-
Whether this is a bad case.
- isBlank(CharSequence) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether the string is blank (null or whitespace).
- isChinese(char) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether a character is a Chinese character.
- isCJKCharacter(char) - Static method in class com.hankcs.hanlp.utility.CharacterHelper
-
- isConfluenceNode() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Determines if this node is a confluence node
(defined as a node with two or more incoming transitions).
- IScorer - Interface in com.hankcs.hanlp.suggest.scorer
-
Scoring interface.
- isDBCCase(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether the string consists of half-width characters.
- isDelimiter(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether the string is a hyphen (delimiter).
- isDicNeedUpdate(String, String[]) - Static method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Gets the update status of the local dictionary.
- isEmpty() - Method in class com.hankcs.hanlp.classification.corpus.BagOfWordsDocument
-
Whether the document is empty (contains no words).
- isEmpty() - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- isEmpty() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- isEmpty() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- isEmpty(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- isEmpty() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- isEmpty() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- isEmpty() - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- isEmpty() - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- isEmptyFlag() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- isEnglishLetter(char) - Static method in class com.hankcs.hanlp.utility.CharacterHelper
-
- ISentenceKey<T> - Interface in com.hankcs.hanlp.suggest.scorer
-
A key that uniquely represents a sentence, distinguishing it from other sentences.
- isExponentialExpanding() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- isFileExisted(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Determines whether the file exists.
- isFilterEnabled() - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
Whether the stop-word filter is enabled.
- isInAggregate(String, String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether all characters of one string are contained in another string (treated as a character set).
- isIndexMode() - Method in class com.hankcs.hanlp.seg.Config
-
Whether index mode is enabled.
- isMatch() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.FragmentToken
-
- isMatch() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.MatchToken
-
- isMatch() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Token
-
- isNonprojective() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
Shows whether the tree to train is projective or not
- isPartial(boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- isResource(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Determines whether the resource is located inside the jar.
- isRight() - Method in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
Whether this co-occurrence was counted in left-to-right order.
- isSBCCase(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether the string consists of full-width characters.
- isSpaceLetter(char) - Static method in class com.hankcs.hanlp.utility.CharacterHelper
-
- isTerminal(ArrayList<Configuration>) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
Returns true if all of the configurations in the beam are in the terminal state.
- isTerminalState() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- isTestingDataSet() - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- isTestingDataSet() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Whether this is a testing data set.
- IStringIdMap - Interface in com.hankcs.hanlp.model.perceptron.common
-
- ISuggester - Interface in com.hankcs.hanlp.suggest
-
- isUnknownWord(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
- isYearTime(String) - Static method in class com.hankcs.hanlp.utility.TextUtility
-
Determines whether the string is a year.
- ISynonym - Interface in com.hankcs.hanlp.corpus.synonym
-
Synonym interface.
- Item - Class in com.hankcs.hanlp.corpus.dictionary.item
-
An entry in the dictionary, e.g. “希望 v 7685 vn 616”.
- Item(String, String) - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.Item
-
- Item(String) - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.Item
-
- iter - Variable in class com.hankcs.hanlp.mining.word2vec.Config
-
- iteration - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- iterator() - Method in class com.hankcs.hanlp.algorithm.MaxHeap
-
- iterator() - Method in class com.hankcs.hanlp.classification.corpus.FileDataSet
-
- iterator() - Method in class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- iterator() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- iterator() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- iterator() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- iterator() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
- iterator() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
- iterator() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- iterator() - Method in class com.hankcs.hanlp.corpus.io.IOUtil.LineIterator
-
- iterator() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- iterator() - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- iterator() - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- iterator() - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- ITermFrequencyHolder - Interface in com.hankcs.hanlp.classification.corpus
-
- ITokenizer - Interface in com.hankcs.hanlp.classification.tokenizers
-
- ITrie<V> - Interface in com.hankcs.hanlp.collection.trie
-
Trie interface.
- IV_R - Variable in class com.hankcs.hanlp.seg.common.CWSEvaluator.Result
-
- IWord - Interface in com.hankcs.hanlp.corpus.document.sentence.word
-
Word interface.
- m - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Numeral.
- M - Static variable in class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary
-
Given name.
- M - Variable in class com.hankcs.hanlp.model.perceptron.tagset.CWSTagSet
-
- M_TAG_PREFIX - Variable in class com.hankcs.hanlp.model.perceptron.tagset.NERTagSet
-
- M_WORD_ID - Static variable in class com.hankcs.hanlp.dictionary.CoreDictionary
-
- Main - Class in com.hankcs.hanlp.dependency.perceptron.parser
-
- Main() - Constructor for class com.hankcs.hanlp.dependency.perceptron.parser.Main
-
- main(String[]) - Static method in class com.hankcs.hanlp.dependency.perceptron.parser.Main
-
- main(String[]) - Static method in class com.hankcs.hanlp.mining.word2vec.ComputeAccuracy
-
- main(String[]) - Static method in class com.hankcs.hanlp.mining.word2vec.Distance
-
- main(String[]) - Static method in class com.hankcs.hanlp.mining.word2vec.Train
-
- main(String[]) - Static method in class com.hankcs.hanlp.mining.word2vec.WordAnalogy
-
- main(String[]) - Static method in class com.hankcs.hanlp.mining.word2vec.WordCluster
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.crf_learn
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.crf_test
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.DecoderFeatureIndex
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.Encoder
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- Main - Class in com.hankcs.hanlp.model.perceptron
-
- Main() - Constructor for class com.hankcs.hanlp.model.perceptron.Main
-
- main(String[]) - Static method in class com.hankcs.hanlp.model.perceptron.Main
-
- make_left_arc(int) - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionFactory
-
Creates a dependency in which the right focus word depends on the left focus word.
- make_right_arc(int) - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionFactory
-
Creates a dependency in which the left focus word depends on the right focus word.
- make_shift() - Static method in class com.hankcs.hanlp.dependency.nnparser.action.ActionFactory
-
Creates no dependency and only shifts the parsing focus: the new left focus word is the former right focus word, and so on.
- makeEdge(Node[], int, int) - Method in class com.hankcs.hanlp.dependency.MaxEntDependencyParser
-
Deprecated.
- makeEdge(Node[], int, int) - Method in class com.hankcs.hanlp.dependency.MinimumSpanningTreeParser
-
- makeEdge(Node[], int, int) - Method in class com.hankcs.hanlp.dependency.WordNatureDependencyParser
-
- makeModel(String, String) - Static method in class com.hankcs.hanlp.corpus.dependency.model.MaxEntDependencyModelMaker
-
- makeModel(String, String) - Static method in class com.hankcs.hanlp.corpus.dependency.model.WordNatureWeightModelMaker
-
- makeSentence(String[], String[], boolean, boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
Converts the strings in a sentence to ids.
- makeTempls(List<String>, List<String>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- makeToneToTheSame(List<Pinyin>) - Static method in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Converts all tones to tone 1.
- maps - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.ParserModel
-
- maps - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.parser.TransitionBasedParser
-
- MatchToken - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
A matched fragment.
- MatchToken(String, Emit) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.MatchToken
-
- math - Class in com.hankcs.hanlp.dependency.nnparser.util
-
- math() - Constructor for class com.hankcs.hanlp.dependency.nnparser.util.math
-
- MathUtility - Class in com.hankcs.hanlp.utility
-
Some small math utilities.
- MathUtility() - Constructor for class com.hankcs.hanlp.utility.MathUtility
-
- Matrix - Class in com.hankcs.hanlp.dependency.nnparser
-
Jama = Java Matrix class.
- Matrix(int, int) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct an m-by-n matrix of zeros.
- Matrix(int, int, double) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct an m-by-n constant matrix.
- Matrix(double[][]) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct a matrix from a 2-D array.
- Matrix(double[][], int, int) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct a matrix quickly without checking arguments.
- Matrix(double[], int) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Construct a matrix from a one-dimensional packed array
- Matrix() - Constructor for class com.hankcs.hanlp.dependency.nnparser.Matrix
-
- matrix - Variable in class com.hankcs.hanlp.model.crf.CRFModel
-
Bigram transition matrix of tags, used for BiGram features.
- max(Map<String, Double>) - Static method in class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
- MAX_CODE_LENGTH - Static variable in class com.hankcs.hanlp.mining.word2vec.VocabWord
-
- MAX_FREQUENCY - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- MAX_INDEX_LENGTH - Static variable in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
Length of the trailing digits, which represent the index of the atomic word.
- max_iter - Static variable in class com.hankcs.hanlp.summary.TextRankKeyword
-
Maximum number of iterations.
- MAX_SEGMENT_NUM - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- MAX_WORDS - Static variable in class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
The maximum number of words that can be encoded on a single line.
- max_xsize_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- MaxEntDependencyModelMaker - Class in com.hankcs.hanlp.corpus.dependency.model
-
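Maximum entropy model builder. Training does not use its own code for now and relies on opennlp instead; this maker only generates the training file.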
- MaxEntDependencyModelMaker() - Constructor for class com.hankcs.hanlp.corpus.dependency.model.MaxEntDependencyModelMaker
-
- MaxEntDependencyParser - Class in com.hankcs.hanlp.dependency
-
- MaxEntDependencyParser(MaxEntModel) - Constructor for class com.hankcs.hanlp.dependency.MaxEntDependencyParser
-
Deprecated.
- MaxEntDependencyParser() - Constructor for class com.hankcs.hanlp.dependency.MaxEntDependencyParser
-
Deprecated.
- MaxEntModel - Class in com.hankcs.hanlp.model.maxent
-
Maximum entropy model, accelerated with a double-array trie; well worth having.
- MaxEntModel() - Constructor for class com.hankcs.hanlp.model.maxent.MaxEntModel
-
- MaxEntModelPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
- MaxHeap<E> - Class in com.hankcs.hanlp.algorithm
-
A max heap simulated with a fixed-capacity priority queue, used to find the top N largest elements.
- MaxHeap(int, Comparator<E>) - Constructor for class com.hankcs.hanlp.algorithm.MaxHeap
-
Constructs the max heap.
- maxid_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- maxiter - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- maxSentenceSize - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- maxSize - Variable in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
- Mcsrch - Class in com.hankcs.hanlp.model.crf.crfpp
-
- Mcsrch() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.Mcsrch
-
- mcstep(double[], double[], double[], double[], double[], double[], double[], double, double, boolean[], double, double, int[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.Mcsrch
-
- MDAG - Class in com.hankcs.hanlp.collection.MDAG
-
Minimal directed acyclic graph (MDAG).
A minimalistic directed acyclical graph suitable for storing a set of Strings.
- MDAG(String) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAG
-
Constructs from a file path.
- MDAG(File) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAG
-
Builds an MDAG from a file.
Creates an MDAG from a newline delimited file containing the data of interest.
- MDAG(BufferedReader) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAG
-
Constructs from an open BufferedReader.
- MDAG(Collection<String>) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAG
-
Creates an MDAG from a collection of Strings.
- MDAG() - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAG
-
An empty graph.
- mdagDataArray - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Populated after simplify() is called.
- MDAGMap<V> - Class in com.hankcs.hanlp.collection.MDAG
-
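It is best not to use an MDAG as a map: the current implementation appends an int to each key, which makes all right languages distinct, so the structure degenerates into a bintrie.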
- MDAGMap() - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
- MDAGNode - Class in com.hankcs.hanlp.collection.MDAG
-
A node in an MDAG.
The class which represents a node in a MDAG.
- MDAGNode(boolean) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Creates a node.
Constructs an MDAGNode.
- MDAGSet - Class in com.hankcs.hanlp.collection.MDAG
-
A String set based on MDAG (also known as DAWG, Minimal Acyclic Finite-State Automata).
- MDAGSet(File) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- MDAGSet(Collection<String>) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- MDAGSet() - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- MDAGSet(String) - Constructor for class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- meet(String, State) - Method in interface com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie.IWalker
-
Called when a node is encountered.
- MemoryDataSet - Class in com.hankcs.hanlp.classification.corpus
-
- MemoryDataSet() - Constructor for class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- MemoryDataSet(AbstractModel) - Constructor for class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- mergeCompoundWords() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
- mergeContinuousNsIntoOne() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Merges consecutive ns nodes into one.
- mergeNumberQuantifier(List<Vertex>, WordNet, Config) - Method in class com.hankcs.hanlp.seg.Segment
-
Merges numbers.
- mg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Numeral morpheme.
- Mg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Numerals such as 甲, 乙, 丙, 丁 (the Heavenly Stems used as ordinals).
- mi - Variable in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
Mutual information value.
- MIN_PROBABILITY - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- minCount - Variable in class com.hankcs.hanlp.mining.word2vec.Config
-
- MinimumSpanningTreeParser - Class in com.hankcs.hanlp.dependency
-
- MinimumSpanningTreeParser() - Constructor for class com.hankcs.hanlp.dependency.MinimumSpanningTreeParser
-
- minus(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
C = A - B
- minus(Vector) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- MINUS_LOG_EPSILON - Static variable in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- minusEquals(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
A = A - B
- Model - Class in com.hankcs.hanlp.model.crf.crfpp
-
- Model() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.Model
-
- model - Variable in class com.hankcs.hanlp.model.crf.CRFTagger
-
- model - Variable in class com.hankcs.hanlp.model.perceptron.PerceptronTagger
-
Uses StructurePerceptron to implement online learning.
- MODEL_VERSION - Static variable in class com.hankcs.hanlp.model.crf.crfpp.Encoder
-
- modelFile - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- ModelImpl - Class in com.hankcs.hanlp.model.crf.crfpp
-
- ModelImpl() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- mq - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Numeral-quantifier phrase (numeral plus measure word).
- mutable - Variable in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
Whether new features may be added.
- MutableDoubleArrayTrie<V> - Class in com.hankcs.hanlp.collection.trie.datrie
-
Generic mutable double-array trie.
- MutableDoubleArrayTrie() - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- MutableDoubleArrayTrie(Map<String, V>) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- MutableDoubleArrayTrieInteger - Class in com.hankcs.hanlp.collection.trie.datrie
-
Mutable double-array trie, refactored from https://github.com/fancyerii/DoubleArrayTrie
- MutableDoubleArrayTrieInteger(Map<String, Integer>) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- MutableDoubleArrayTrieInteger(Set<Map.Entry<String, Integer>>) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- MutableDoubleArrayTrieInteger() - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- MutableDoubleArrayTrieInteger(CharacterMapping) - Constructor for class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- MutableDoubleArrayTrieInteger.KeyValuePair - Class in com.hankcs.hanlp.collection.trie.datrie
-
- MutableFeatureMap - Class in com.hankcs.hanlp.model.perceptron.feature
-
- MutableFeatureMap(TagSet) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- MutableFeatureMap(TagSet, Map<String, Integer>) - Constructor for class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- MutualInformationEntropyPhraseExtractor - Class in com.hankcs.hanlp.mining.phrase
-
Phrase extractor based on mutual information and left/right entropy.
- MutualInformationEntropyPhraseExtractor() - Constructor for class com.hankcs.hanlp.mining.phrase.MutualInformationEntropyPhraseExtractor
-
- r - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Pronoun.
- R - Variable in class com.hankcs.hanlp.seg.common.CWSEvaluator.Result
-
- random(int, int) - Static method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Generates a matrix with random elements.
- randomSynonym(Synonym.Type, String) - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
Picks a random synonym.
- randomSynonym() - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
- raSize() - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- re - Variable in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
Right information entropy.
- read(LineHandler, int) - Method in class com.hankcs.hanlp.corpus.io.EasyReader
-
Reads.
- read(LineHandler) - Method in class com.hankcs.hanlp.corpus.io.EasyReader
-
Reads.
- read(BufferedReader) - Static method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Read a matrix from a stream.
- read(BufferedReader) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- readBytes(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads the entire file into a byte array.
- readBytesFromOtherInputStream(InputStream) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads all data from an InputStream that is not a FileInputStream into a byte array.
- readBytesFromOtherInputStream(InputStream, byte[]) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads a specified number of bytes from an InputStream.
- readBytesFromResource(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads a resource into a byte array.
- readCsv(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- readData(int, boolean, boolean, boolean, boolean, IndexMaps) - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.CoNLLReader
-
Reads sentences.
- readExternal(ObjectInput) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- readLine(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- readLineList(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- readLineListWithLessMemory(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads a large file in a memory-efficient way.
- readLineToArray(String) - Static method in class com.hankcs.hanlp.model.perceptron.utility.IOUtility
-
- readObjectFrom(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Deserializes an object.
- readStringData() - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.CoNLLReader
-
- readTxt(File, String) - Static method in class com.hankcs.hanlp.classification.utilities.TextProcessUtility
-
- readTxt(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Reads plain text in one pass.
- readTxt(String, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- readVectorFile() - Method in class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- readWordIndex() - Method in class com.hankcs.hanlp.mining.word2vec.CacheCorpus
-
- readWordIndex() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- realWord - Variable in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- realWord - Variable in class com.hankcs.hanlp.seg.common.Vertex
-
The real word for this node; never contains ##.
- reassignOutgoingTransition(char, MDAGNode, MDAGNode) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Reassigns the target node of one of this node's outgoing transitions.
- rebuildFeatures(TaggerImpl) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- recall - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Classification recall.
- recognition(List<Vertex>, WordNet, WordNet) - Static method in class com.hankcs.hanlp.recognition.nr.JapanesePersonRecognition
-
Performs recognition.
- recognition(List<Vertex>, WordNet, WordNet) - Static method in class com.hankcs.hanlp.recognition.nr.PersonRecognition
-
- recognition(List<Vertex>, WordNet, WordNet) - Static method in class com.hankcs.hanlp.recognition.nr.TranslatedPersonRecognition
-
Performs recognition.
- recognition(List<Vertex>, WordNet, WordNet) - Static method in class com.hankcs.hanlp.recognition.ns.PlaceRecognition
-
- recognition(List<Vertex>, WordNet, WordNet) - Static method in class com.hankcs.hanlp.recognition.nt.OrganizationRecognition
-
- recognize(String[], String[]) - Method in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- recognize(String[], String[]) - Method in class com.hankcs.hanlp.model.hmm.HMMNERecognizer
-
- recognize(String[], String[]) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronNERecognizer
-
- recognize(NERInstance) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronNERecognizer
-
- recognize(String[], String[]) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- recognize(String[], String[]) - Method in interface com.hankcs.hanlp.tokenizer.lexical.NERecognizer
-
Named entity recognition.
- recognize(String[], String[]) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- recoverKeyValue() - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- reduce(State) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
- reduceFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- reduceFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.ParserModel
-
- reduceFeatureWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- reduceScore(Object[], boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- RegexRecognizePipe - Class in com.hankcs.hanlp.tokenizer.pipe
-
Regular-expression matching pipe.
- RegexRecognizePipe(Pattern, String) - Constructor for class com.hankcs.hanlp.tokenizer.pipe.RegexRecognizePipe
-
- registerValueCreator(Args.ValueCreator) - Static method in class com.hankcs.hanlp.model.perceptron.cli.Args
-
Allows external extension of the value creators.
- regularize(char) - Static method in class com.hankcs.hanlp.utility.CharacterHelper
-
Normalizes a character (full-width to half-width, uppercase to lowercase).
- rel() - Method in class com.hankcs.hanlp.dependency.nnparser.action.AbstractInexactAction
-
- relation(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- relationId - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Edge
-
label
- reload() - Static method in class com.hankcs.hanlp.dictionary.CoreBiGramTableDictionary
-
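Hot-reloads the bigram transition dictionary.
In a cluster environment (or with another IOAdapter), the cache file must be deleted manually.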
- reload() - Static method in class com.hankcs.hanlp.dictionary.CoreDictionary
-
Hot-reloads the core dictionary.
In a cluster environment (or with another IOAdapter), the cache file must be deleted manually.
- reload() - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Hot update (reload).
In a cluster environment (or with another IOAdapter), the cache file must be deleted manually (path = HanLP.Config.CustomDictionaryPath[0] + Predefine.BIN_EXT).
- reload() - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Hot update (reload).
In a cluster environment (or with another IOAdapter), the cache file must be deleted manually (path = HanLP.Config.CustomDictionaryPath[0] + Predefine.BIN_EXT).
- reload() - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Reloads the stop word dictionary specified by HanLP.Config#CoreStopWordDictionaryPath and generates a new cache.
- remainLongest() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
Keeps only the longest match.
- remainLongest - Variable in class com.hankcs.hanlp.algorithm.ahocorasick.trie.TrieConfig
-
Keep only the longest match.
- remove(Object) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- remove(Object) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- remove(String) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Removes a word.
- remove(Object) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- remove() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- remove(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Removes a key.
- remove(String) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Removes an entry.
- remove(Map.Entry<String, V>) - Method in interface com.hankcs.hanlp.corpus.dictionary.SimpleDictionary.Filter
-
- remove(SimpleDictionary.Filter) - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
Filters out some entries.
- remove() - Method in class com.hankcs.hanlp.corpus.io.IOUtil.LineIterator
-
- remove(String) - Static method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
Removes a word.
Dynamic additions and removals are not persisted to the dictionary file.
- remove(String) - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Removes a word.
Dynamic additions and removals are not persisted to the dictionary file.
- remove(String) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Removes a stop word from the stop word dictionary.
- remove(Object) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- remove(K) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
Removes an element.
- remove(Object) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- remove(int) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- remove(Object) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- remove(int) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- removeAll(Collection<?>) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- removeAll(Collection<?>) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- removeAll(Collection<?>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- removeAll(Collection<?>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- removeAllSentences() - Method in interface com.hankcs.hanlp.suggest.ISuggester
-
Removes all sentences from this suggester.
- removeAllSentences() - Method in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
- removeAllSentences() - Method in interface com.hankcs.hanlp.suggest.scorer.IScorer
-
Removes all sentences from this suggester.
- removeAllSentences() - Method in class com.hankcs.hanlp.suggest.Suggester
-
- removeLabel(String) - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
Removes a label.
- removeLast() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- removeOutgoingTransition(char) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Removes a transition labeled with a given char.
- removeOverlaps(List<Intervalable>) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.IntervalTree
-
Removes overlapping intervals from a list of intervals.
- removeOverlaps() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
- removeString(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Removes a String from the MDAG.
- removeTone(Pinyin) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinUtil
-
- removeUTF8BOM(String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Removes the UTF-8 BOM from the first line of a file.
This is a Java bug that will not be fixed officially. See https://stackoverflow.com/questions/4897876/reading-utf-8-bom-marker
- repeatedBisection(int) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Repeated bisection clustering.
- repeatedBisection(double) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Repeated bisection clustering.
- repeatedBisection(int, double) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
Repeated bisection clustering.
- reset(BufferedReader) - Method in class com.hankcs.hanlp.seg.common.wrapper.SegmentWrapper
-
Resets the segmenter.
- resetValueCreators() - Static method in class com.hankcs.hanlp.model.perceptron.cli.Args
-
Cleanup of registered ValueCreators (mainly for tests)
- reshapeNER(List<String[]>) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- reshapeNER(List<String[]>, String[], String[], String[]) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- ResourceIOAdapter - Class in com.hankcs.hanlp.corpus.io
-
Adapter that reads files from resources in a jar.
- ResourceIOAdapter() - Constructor for class com.hankcs.hanlp.corpus.io.ResourceIOAdapter
-
- Result(float[], int[]) - Constructor for class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors.Result
-
- result(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- result(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- Result(LinearModel, double[]) - Constructor for class com.hankcs.hanlp.model.perceptron.PerceptronTrainer.Result
-
- Result(float, float, float, float, float) - Constructor for class com.hankcs.hanlp.seg.common.CWSEvaluator.Result
-
- ResultTerm<V> - Class in com.hankcs.hanlp.seg.common
-
A general-purpose Term.
- ResultTerm(String, V, int) - Constructor for class com.hankcs.hanlp.seg.common.ResultTerm
-
- retainAll(Collection<?>) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- retainAll(Collection<?>) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- retainAll(Collection<?>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- retainAll(Collection<?>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- reverse() - Method in class com.hankcs.hanlp.corpus.dictionary.StringDictionary
-
Reverses this dictionary and returns the result.
- REVERSE_EXT - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Reverse file extension.
- reversedDependencies - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
head -> dependents
- rewind(int, int) - Method in class com.hankcs.hanlp.mining.word2vec.CacheCorpus
-
- rewind(int, int) - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
Resets the current corpus to its initial state.
- rewind(int, int) - Method in class com.hankcs.hanlp.mining.word2vec.TextFileCorpus
-
- rewrite(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary
-
- rewrite(String) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
- rewriteQuickly(String) - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary
-
- rewriteQuickly(String) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
Converts directly without segmentation.
- rg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Pronominal morpheme.
- Rg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Classical Chinese pronominal morpheme.
- RIGHT - Static variable in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
Forward connector between two words, e.g. 中国 RIGHT 人民.
- rightArc(State, int) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
- rightArcFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- rightArcFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.ParserModel
-
- rightArcFeatureWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- rightArcScores(Object[], boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- rightDependentLabels(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rightDepLabels - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rightMostArcs - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rightMostModifier(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rightValency - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rightValency(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rnode - Variable in class com.hankcs.hanlp.model.crf.crfpp.Path
-
- roleObserve(List<Vertex>) - Static method in class com.hankcs.hanlp.recognition.nr.PersonRecognition
-
Role observation (loads all roles for every word from the model; some rule-based supplements are allowed).
- roleTag(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
Role tagging. Subclasses that need to adjust labels or add new head/tail markers can do so here.
- roleTag(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NatureDictionaryMaker
-
- roleTag(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NRDictionaryMaker
-
- roleTag(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NSDictionaryMaker
-
- roleTag(List<List<IWord>>) - Method in class com.hankcs.hanlp.corpus.dictionary.NTDictionaryMaker
-
- roleTag(List<Vertex>, WordNet) - Static method in class com.hankcs.hanlp.recognition.ns.PlaceRecognition
-
- roleTag(List<Vertex>, WordNet) - Static method in class com.hankcs.hanlp.recognition.nt.OrganizationRecognition
-
- ROOT - Static variable in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
Root node.
- ROOT - Static variable in class com.hankcs.hanlp.dependency.nnparser.option.SpecialOption
-
- rootFirst - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- rootIndex - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- rootString - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.IndexMaps
-
ROOT
- roughSegSentence(char[]) - Method in class com.hankcs.hanlp.seg.CharacterBasedSegment
-
Pure segmentation models implement this method and output only words.
- roughSegSentence(char[]) - Method in class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- roughSegSentence(char[]) - Method in class com.hankcs.hanlp.seg.HMM.HMMSegment
-
- roughSegSentence(char[]) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- row(int) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Returns the i-th row as a row vector.
- rows() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
- rpath - Variable in class com.hankcs.hanlp.model.crf.crfpp.Node
-
- rr - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Personal pronoun.
- run() - Method in class com.hankcs.hanlp.corpus.document.CorpusLoader.HandlerThread
-
- run(String) - Static method in class com.hankcs.hanlp.model.crf.crfpp.crf_learn
-
- run(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.crf_learn
-
- run(String[]) - Static method in class com.hankcs.hanlp.model.crf.crfpp.crf_test
-
- runMIRA(List<TaggerImpl>, EncoderFeatureIndex, double[], int, double, double, int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Encoder
-
- ry - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Interrogative pronoun.
- rys - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Locative interrogative pronoun.
- ryt - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Temporal interrogative pronoun.
- ryv - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Predicative interrogative pronoun.
- rz - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Demonstrative pronoun.
- rzs - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Locative demonstrative pronoun.
- rzt - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Temporal demonstrative pronoun.
- rzv - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Predicative demonstrative pronoun.
- s - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Locative word.
- S - Variable in class com.hankcs.hanlp.model.perceptron.tagset.CWSTagSet
-
- s2hk(String) - Static method in class com.hankcs.hanlp.HanLP
-
Simplified Chinese to Hong Kong Traditional Chinese.
- s2t(String) - Static method in class com.hankcs.hanlp.HanLP
-
- s2tw(String) - Static method in class com.hankcs.hanlp.HanLP
-
Simplified Chinese to Taiwan Traditional Chinese.
- S_TAG - Variable in class com.hankcs.hanlp.model.perceptron.tagset.NERTagSet
-
- S_TAG_CHAR - Variable in class com.hankcs.hanlp.model.perceptron.tagset.NERTagSet
-
- Sample - Class in com.hankcs.hanlp.dependency.nnparser
-
- Sample() - Constructor for class com.hankcs.hanlp.dependency.nnparser.Sample
-
- Sample(List<Integer>, List<Double>) - Constructor for class com.hankcs.hanlp.dependency.nnparser.Sample
-
- sample - Variable in class com.hankcs.hanlp.mining.word2vec.Config
-
- samples() - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Persists the trie.
- save(ObjectOutputStream) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Persists the trie.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- save(OutputStream) - Method in class com.hankcs.hanlp.collection.dartsclone.DoubleArray
-
Saves the trie data into a stream.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
- save(String) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Saves to a binary output stream.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- save(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Saves the base and check arrays.
- save(ObjectOutputStream) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- save(DataOutputStream) - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- save(String) - Method in class com.hankcs.hanlp.corpus.dictionary.StringDictionary
-
Saves the dictionary.
- save(DataOutputStream) - Method in interface com.hankcs.hanlp.corpus.io.ICacheAble
-
Writes.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.dependency.nnparser.Alphabet
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.dependency.nnparser.NeuralNetworkParser
-
Saves to disk.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.crf.CRFModel
-
- save(String) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- save(String, boolean) - Method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.crf.FeatureFunction
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.crf.FeatureTemplate
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- save(String) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Saves to a path.
- save(String, double) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Compresses and saves.
- save(String, Set<Map.Entry<String, Integer>>, double) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
- save(String, Set<Map.Entry<String, Integer>>, double, boolean) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Saves.
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.trigram.CharacterBasedGenerativeModel
-
- save(DataOutputStream) - Method in class com.hankcs.hanlp.model.trigram.frequency.Probability
-
- saveCollectionToTxt(Collection<T>, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- saveCorpus(List<Document>, String) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- saveDat(String, List<V>) - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
Saves the dat to a path.
- saveEntrySetToTxt(Set<Map.Entry<Object, Object>>, String, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- saveKeyTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
Saves only the values to a file.
- saveMapToTxt(Map<Object, Object>, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- saveMapToTxt(Map<Object, Object>, String, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- saveModel(String) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.ParserModel
-
- saveNGramToTxt(String) - Method in class com.hankcs.hanlp.corpus.dictionary.NGramDictionaryMaker
-
Saves the NGram dictionary.
- saveObjectTo(Object, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Serializes an object.
- saveSentenceList(List<List<IWord>>, String) - Static method in class com.hankcs.hanlp.corpus.document.CorpusLoader
-
- saveTransformMatrixToTxt(String) - Method in class com.hankcs.hanlp.corpus.dictionary.NGramDictionaryMaker
-
Saves the transition matrix.
- saveTxt(String, String) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
Quick save.
- saveTxt(String, StringBuilder) - Static method in class com.hankcs.hanlp.corpus.io.IOUtil
-
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- saveTxtTo(String, DictionaryMaker.Filter) - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
Allows some adjustments before saving.
- saveTxtTo(String) - Method in interface com.hankcs.hanlp.corpus.dictionary.ISaveAble
-
Saves itself to disk as a text document.
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.NGramDictionaryMaker
-
Saves the NGram dictionary and the transition matrix.
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.NTDictionaryMaker
-
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
- saveTxtTo(String) - Method in class com.hankcs.hanlp.corpus.dictionary.TMDictionaryMaker
-
- saveValue(V, DataOutputStream) - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
Saves a single value to the stream.
- saveValue(EnumItem<E>, DataOutputStream) - Method in class com.hankcs.hanlp.dictionary.common.EnumItemDictionary
-
- scanner - Variable in class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors
-
- score - Variable in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
Score.
- score - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.BeamElement
-
- score - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- score(int[], int) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Computes the score from the feature functions that were hit.
- scorePath - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- Searcher(int, char[]) - Constructor for class com.hankcs.hanlp.collection.trie.DoubleArrayTrie.Searcher
-
Constructs a double-array search utility.
- Searcher(char[]) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Searcher
-
- Searcher(String) - Constructor for class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Searcher
-
- Searcher(char[], DoubleArrayTrie<Character>) - Constructor for class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary.Searcher
-
- Searcher(String, DoubleArrayTrie<Character>) - Constructor for class com.hankcs.hanlp.dictionary.nr.JapanesePersonDictionary.Searcher
-
- Searcher(char[], DoubleArrayTrie<Pinyin[]>) - Constructor for class com.hankcs.hanlp.dictionary.py.PinyinDictionary.Searcher
-
- Searcher(String, DoubleArrayTrie<Pinyin[]>) - Constructor for class com.hankcs.hanlp.dictionary.py.PinyinDictionary.Searcher
-
- Searcher(char[], DoubleArrayTrie<String>) - Constructor for class com.hankcs.hanlp.dictionary.ts.BaseChineseDictionary.Searcher
-
- Searcher(String, DoubleArrayTrie<String>) - Constructor for class com.hankcs.hanlp.dictionary.ts.BaseChineseDictionary.Searcher
-
- second - Variable in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- second - Variable in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
- second - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Pair
-
- SecondOrderHiddenMarkovModel - Class in com.hankcs.hanlp.model.hmm
-
- SecondOrderHiddenMarkovModel(float[], float[][], float[][], float[][][]) - Constructor for class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- SecondOrderHiddenMarkovModel() - Constructor for class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- seg(String) - Method in class com.hankcs.hanlp.seg.Segment
-
Segments text into words.
This method is thread-safe.
- seg(char[]) - Method in class com.hankcs.hanlp.seg.Segment
-
Segments text into words.
- seg(String) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- seg2sentence(String) - Method in class com.hankcs.hanlp.seg.Segment
-
Segments text and splits it into sentences; output is in sentence form.
- seg2sentence(String, boolean) - Method in class com.hankcs.hanlp.seg.Segment
-
Segments text and splits it into sentences; output is in sentence form.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.BasicTokenizer
-
Segments into sentence form.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.BasicTokenizer
-
Segments text and splits it into sentences; output is in sentence form.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- seg2sentence(String, Filter...) - Static method in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- seg2sentence(String) - Static method in class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
Segments text and returns the result grouped by sentence.
- seg2sentence(String, boolean) - Static method in class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
Segments text into words and splits it into sentences, returning the result grouped by sentence.
- segLongest(char[], AhoCorasickDoubleArrayTrie<Pinyin[]>) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
Matches pinyin using the longest-match segmentation algorithm.
- segLongest(char[], AhoCorasickDoubleArrayTrie<Pinyin[]>, boolean) - Static method in class com.hankcs.hanlp.dictionary.py.PinyinDictionary
-
- segLongest(char[], DoubleArrayTrie<String>) - Static method in class com.hankcs.hanlp.dictionary.ts.BaseChineseDictionary
-
- segLongest(char[], AhoCorasickDoubleArrayTrie<String>) - Static method in class com.hankcs.hanlp.dictionary.ts.BaseChineseDictionary
-
- segment(String) - Method in class com.hankcs.hanlp.classification.tokenizers.BigramTokenizer
-
- segment(String) - Method in class com.hankcs.hanlp.classification.tokenizers.BlankTokenizer
-
- segment(String) - Method in class com.hankcs.hanlp.classification.tokenizers.HanLPTokenizer
-
- segment(String) - Method in interface com.hankcs.hanlp.classification.tokenizers.ITokenizer
-
- segment(String) - Static method in class com.hankcs.hanlp.HanLP
-
Segments text into words.
- segment - Variable in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
- segment(String) - Method in class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- segment(String, String, List<String>) - Method in class com.hankcs.hanlp.model.crf.CRFSegmenter
-
- segment(String) - Method in class com.hankcs.hanlp.model.hmm.HMMSegmenter
-
- segment(String, String, List<String>) - Method in class com.hankcs.hanlp.model.hmm.HMMSegmenter
-
- segment(String, List<String>) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronLexicalAnalyzer
-
Chinese word segmentation.
- segment(String, List<String>) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- segment(String, String, List<String>) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- segment(String, Instance, List<String>) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- segment(String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronSegmenter
-
- segment(String) - Method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickDoubleArrayTrieSegment
-
- segment(char[]) - Method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickDoubleArrayTrieSegment
-
- segment(String, AhoCorasickDoubleArrayTrie<V>) - Static method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickSegmentUtil
-
Longest-match segmentation; merges unknown morphemes.
- segment(char[], AhoCorasickDoubleArrayTrie<V>) - Static method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickSegmentUtil
-
Longest-match segmentation; merges unknown morphemes.
- Segment - Class in com.hankcs.hanlp.seg
-
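Segmenter (word segmentation service).
The abstract base class of all segmenters.
A segmenter's segmentation methods are thread-safe, but its configuration methods are not guaranteed to be.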
- Segment() - Constructor for class com.hankcs.hanlp.seg.Segment
-
Constructs a segmenter.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.BasicTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.BasicTokenizer
-
Segments text into words.
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.BasicTokenizer
-
Segments text into words.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.IndexTokenizer
-
Segments text into words.
- segment(String, String, List<String>, List<CoreDictionary.Attribute>) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
Segments text into words.
- segment(String, String, List<String>) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- segment(String) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
Chinese word segmentation.
- segment(String, String) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
This method consults the user dictionary.
- segment(String) - Method in interface com.hankcs.hanlp.tokenizer.lexical.Segmenter
-
Chinese word segmentation.
- segment(String, String, List<String>) - Method in interface com.hankcs.hanlp.tokenizer.lexical.Segmenter
-
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.NLPTokenizer
-
Segments text into words.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.NotionalTokenizer
-
Segments text into words.
- segment(String, String, List<String>) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- segment(String) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
Segments text into words.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
Segments text into words.
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
Segments text into words.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
- segment(char[]) - Static method in class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
Segments text into words.
- SEGMENT - Static variable in class com.hankcs.hanlp.tokenizer.URLTokenizer
-
The preconfigured segmenter.
- segment(String) - Static method in class com.hankcs.hanlp.tokenizer.URLTokenizer
-
Segments text into words.
- segmentAfterRule(String, String, List<String>) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
An ugly rule system.
- segmenter - Variable in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- Segmenter - Interface in com.hankcs.hanlp.tokenizer.lexical
-
Segmenter interface.
- SegmentPipeline - Class in com.hankcs.hanlp.seg
-
- SegmentPipeline(Segment) - Constructor for class com.hankcs.hanlp.seg.SegmentPipeline
-
- segmentReverseOrder(String, AhoCorasickDoubleArrayTrie<V>) - Static method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickSegmentUtil
-
Reverse longest-match segmentation; merges unknown morphemes.
- segmentReverseOrder(char[], AhoCorasickDoubleArrayTrie<V>) - Static method in class com.hankcs.hanlp.seg.Other.CommonAhoCorasickSegmentUtil
-
Reverse longest-match segmentation; merges unknown morphemes.
- SegmentWrapper - Class in com.hankcs.hanlp.seg.common.wrapper
-
A class that wraps a BufferedReader.
- SegmentWrapper(BufferedReader, Segment) - Constructor for class com.hankcs.hanlp.seg.common.wrapper.SegmentWrapper
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.CharacterBasedSegment
-
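The following methods are for pure segmentation models.
Joint segmentation and part-of-speech tagging models override segSentence directly.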
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.Dijkstra.DijkstraSegment
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.NShort.NShortSegment
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.Other.DoubleArrayTrieSegment
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.Segment
-
Segments a single sentence into words.
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- segSentence(char[]) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- selectFeatures(IDataSet) - Method in class com.hankcs.hanlp.classification.classifiers.NaiveBayesClassifier
-
Collects feature statistics and performs feature selection, returning a FeatureStats object used to compute the model's probabilities.
- Sentence - Class in com.hankcs.hanlp.corpus.document.sentence
-
A sentence, meaning text that ends with punctuation such as 。 or !.
- Sentence(List<IWord>) - Constructor for class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
- Sentence - Class in com.hankcs.hanlp.dependency.perceptron.structures
-
A sentence in CoNLL format.
- Sentence(ArrayList<Integer>, ArrayList<Integer>, ArrayList<Integer>, ArrayList<Integer>, ArrayList<Integer>) - Constructor for class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- sentence - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- sentence - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Instance
-
- sentence - Variable in class com.hankcs.hanlp.seg.common.WordNet
-
- sentenceList - Variable in class com.hankcs.hanlp.corpus.document.Document
-
- SentencesUtil - Class in com.hankcs.hanlp.utility
-
Splits text into sentences.
- SentencesUtil() - Constructor for class com.hankcs.hanlp.utility.SentencesUtil
-
- separator - Variable in class com.hankcs.hanlp.corpus.dictionary.StringDictionary
-
The separator between key and value.
- separator - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- SEPERATOR_C_SENTENCE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- SEPERATOR_C_SUB_SENTENCE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- SEPERATOR_E_SENTENCE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- SEPERATOR_E_SUB_SENTENCE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- SEPERATOR_LINK - Static variable in class com.hankcs.hanlp.utility.Predefine
-
- serializeTo(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Serializes itself to
- set(String, V) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Updates the value associated with a key.
- set(String, V) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
Sets a key-value pair; the key is inserted automatically if it does not exist.
- set(int, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- set(String, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Sets the value for a key (same as put).
- set(String, V) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Updates the value associated with a key.
- set(int, int, double) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Set a single element.
- set(int, float) - Method in class com.hankcs.hanlp.dependency.perceptron.structures.CompactArray
-
Sets the element at index to value.
- set(char, byte) - Static method in class com.hankcs.hanlp.dictionary.other.CharType
-
Sets the character type.
- set(int, Pipe<List<IWord>, List<IWord>>) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- set(int, Pipe<M, M>) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- set_node(Node, int, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setAcceptStateStatus(boolean) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Sets this node's accept state status.
- setALevel(double) - Method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
- setAllowOverlaps(boolean) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.TrieConfig
-
Sets whether overlaps are allowed.
- setAlpha(float) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setAlpha_(double[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setAlphaFloat_(float[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setAnswer_(List<Integer>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setAttribute(String, CoreDictionary.Attribute) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Sets the attribute of a word.
- setAttribute(String, Nature...) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Sets the attribute of a word.
- setAttribute(String, String...) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Sets the attribute of a word.
- setAttribute(String, String) - Static method in class com.hankcs.hanlp.utility.LexiconUtility
-
Sets the attribute of a word.
- setBase(int[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setBigramTempls_(List<String>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setBoost(double) - Method in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
Sets the weight.
- setBufferHead(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- setCallback(TrainingCallback) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setCallback(TrainingCallback) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
Sets the training callback.
- setCheck(int[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setChisquareCriticalValue(double) - Method in class com.hankcs.hanlp.classification.features.ChiSquareFeatureExtractor
-
Sets the chi-square critical value.
- setCn(int) - Method in class com.hankcs.hanlp.mining.word2vec.VocabWord
-
- setConfig(String[], Config) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractTrainer
-
- setCorrectionParam(double) - Method in class com.hankcs.hanlp.model.maxent.EvalParameters
-
- setCost_(double) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setCostFactor_(double) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setDat(DoubleArrayTrie<CoreDictionary.Attribute>) - Method in class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- setDeprelTranslater(String) - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
Sets the mapping table.
- setDeprelTranslator(Map<String, String>) - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- setDeprelTranslator(Map<String, String>) - Method in interface com.hankcs.hanlp.dependency.IDependencyParser
-
Sets the dependency relation mapping table.
- setDownSamplingRate(float) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
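Sets the down-sampling rate for high-frequency words (a word whose frequency exceeds this rate is randomly skipped during training). When no stop-word dictionary is used, stop words meet this high-frequency criterion.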
- setElementArray(float[]) - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- setEmptyFlag(boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- setExponentialExpandFactor(double) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- setExponentialExpandFactor(double) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
The base of the exponential expansion.
- setExponentialExpanding(boolean) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- setExponentialExpanding(boolean) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Enables exponential expansion.
- setFailure(State) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Sets the failure state.
- setFailure(State, int[]) - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
Sets the failure state.
- setFeature_id_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setFeature_index_(FeatureIndex) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setFeatureCache_(List<List<Integer>>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setFeatureIndex_(DecoderFeatureIndex) - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- setFirst(T1) - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Pair
-
- setFrom(Vertex) - Method in class com.hankcs.hanlp.seg.common.Vertex
-
- setIndex(int) - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- setInitialLearningRate(float) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
Sets the initial learning rate.
- setInputFile(String) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setIter(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setKey(List<String>) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setLabel(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- setLabel(String) - Method in interface com.hankcs.hanlp.corpus.document.sentence.word.IWord
-
Sets the label.
- setLabel(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
- setLabels(String[]) - Method in class com.hankcs.hanlp.model.maxent.UniformPrior
-
Initializes the prior.
- setLast(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- setLast(int, String) - Method in class com.hankcs.hanlp.model.crf.Table
-
- setLayer1Size(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setLayerSize(int) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
The dimensionality of the word vectors (equal to the size of the neural network's hidden layer).
- setLength(int[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setLinearExpandFactor(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- setLinearExpandFactor(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Sets the linear expansion factor.
- setMatrix(int, int, int, int, Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Set a submatrix.
- setMatrix(int[], int[], Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Set a submatrix.
- setMatrix(int[], int, int, Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Set a submatrix.
- setMatrix(int, int, int[], Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Set a submatrix.
- setMax_xsize_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setMaxid_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setMaxSentenceSize(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- setMinCount(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setMinVocabFrequency(int) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
The minimum word frequency; words that occur less often are filtered out.
- setMode_(TaggerImpl.Mode) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setModel(ModelImpl) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setNbest_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- setNbest_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setNegative(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setNode_(List<List<Node>>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setNodeList_(List<List<Node>>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setNumIterations(int) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
Sets the number of iterations.
- setNumThreads(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setOutputFile(String) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setPathList_(List<List<Path>>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setPenalty(int, int, double) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- setPenalty(int, int, double) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setRealWord(String) - Method in class com.hankcs.hanlp.seg.common.Vertex
-
- setResult_(List<Integer>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setSample(float) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setScore(float) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- setSegment(Segment) - Method in class com.hankcs.hanlp.dependency.AbstractDependencyParser
-
- setSegment(Segment) - Method in interface com.hankcs.hanlp.dependency.IDependencyParser
-
Sets the segmenter used by the parser.
- setSegment(Segment) - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
- setSegment(Segment) - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
Sets the segmenter used by the keyword extractor.
- setSize(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setThread_id_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setThreadNum_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setTokenizer(ITokenizer) - Method in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- setTokenizer(ITokenizer) - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
Sets the tokenizer.
- setTransitionSetBeginIndex(int) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
The start index of the transition set in the array.
Records the index that this node's _transition set starts at
in an array containing this node's containing MDAG data (simplified MDAG).
- setTransitionSetBeginIndex(int) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Records the index in this node's containing array that its _transition set begins at.
- setTrie(AhoCorasickDoubleArrayTrie<CoreDictionary.Attribute>) - Method in class com.hankcs.hanlp.seg.Other.AhoCorasickDoubleArrayTrieSegment
-
- setUnigramTempls_(List<String>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setUseContinuousBagOfWords(boolean) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setUseHierarchicalSoftmax(boolean) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setValue(V[]) - Method in class com.hankcs.hanlp.collection.trie.bintrie._ValueArray
-
- setValue(V) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Sets the value associated with the node.
- setValue(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- setValue(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- setValue(String) - Method in interface com.hankcs.hanlp.corpus.document.sentence.word.IWord
-
Sets the word.
- setValue(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
- setValue(int[]) - Method in class com.hankcs.hanlp.model.crf.crfpp.DoubleArrayTrieInteger
-
- setVlevel_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.ModelImpl
-
- setVlevel_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setWindow(int) - Method in class com.hankcs.hanlp.mining.word2vec.Config
-
- setWindowSize(int) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
The window size.
- setWord(String) - Method in class com.hankcs.hanlp.seg.common.Vertex
-
- setX_(List<List<String>>) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setXsize_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setY_(List<String>) - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- setYsize_(int) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setZ_(double) - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- setZero() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
- Shengmu - Enum in com.hankcs.hanlp.dictionary.py
-
Pinyin initial (shengmu).
- shift(State) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ArcEager
-
- shiftFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
This is the main part of the extension to the original perceptron algorithm, namely the averaging over the entire history.
- shiftFeatureAveragedWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.structures.ParserModel
-
- shiftFeatureWeights - Variable in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
The weights for all features.
- shiftScore(Object[], boolean) - Method in class com.hankcs.hanlp.dependency.perceptron.learning.AveragedPerceptron
-
- shouldInclude(Term) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Whether this term should be included in the computation.
- shouldInclude(Term) - Method in interface com.hankcs.hanlp.dictionary.stopword.Filter
-
Whether this term should be included in the computation.
- shouldInclude(Term) - Method in class com.hankcs.hanlp.dictionary.stopword.StopWordDictionary
-
- shouldInclude(Term) - Method in class com.hankcs.hanlp.summary.KeywordExtractor
-
Whether this term should be included in the computation; its part of speech must be a noun, verb, adverb, or adjective.
- shouldRemove(Term) - Static method in class com.hankcs.hanlp.dictionary.stopword.CoreStopWordDictionary
-
Whether this word should be removed.
- showHelp - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- showHelp() - Static method in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- ShowTermNature - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Whether segmentation results show part-of-speech tags.
- shrink(int[]) - Method in class com.hankcs.hanlp.classification.corpus.FileDataSet
-
- shrink(int[]) - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
- shrink(int[]) - Method in class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- shrink(int, List<TaggerImpl>) - Method in class com.hankcs.hanlp.model.crf.crfpp.EncoderFeatureIndex
-
- shrink() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- shrinking_size - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- shuffleArray(int[]) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
Fisher–Yates shuffle
- shuffleArray(T[]) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- shutdown() - Method in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- shutdown() - Method in class com.hankcs.hanlp.mining.word2vec.TextFileCorpus
-
- shutDownLiveThreads() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.KBeamArcEagerParser
-
- sigma(double) - Static method in class com.hankcs.hanlp.model.crf.crfpp.Mcsrch
-
- sigMoid(float) - Static method in class com.hankcs.hanlp.summary.TextRankKeyword
-
The sigmoid function.
- sim(List<String>, int) - Method in class com.hankcs.hanlp.summary.BM25
-
Computes the BM25 similarity between a sentence and a document.
- simAll(List<String>) - Method in class com.hankcs.hanlp.summary.BM25
-
- similar(HiddenMarkovModel) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
- similar(float[], float[]) - Static method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
- similar(HiddenMarkovModel) - Method in class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- similarity(String, String) - Static method in class com.hankcs.hanlp.dictionary.CoreSynonymDictionary
-
Computes the similarity between two words; 0 means not similar and 1 means identical.
- similarity(K, K) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
Cosine similarity.
- similarity(String, String) - Method in class com.hankcs.hanlp.mining.word2vec.DocVectorModel
-
Computes document similarity.
- similarity(CharArray) - Method in class com.hankcs.hanlp.suggest.scorer.editdistance.CharArray
-
- similarity(T) - Method in interface com.hankcs.hanlp.suggest.scorer.ISentenceKey
-
- similarity(IdVector) - Method in class com.hankcs.hanlp.suggest.scorer.lexeme.IdVector
-
- similarity(PinyinKey) - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinKey
-
- simpleAtomSegment(char[], int, int) - Static method in class com.hankcs.hanlp.seg.Segment
-
Simple atomic segmentation: groups all characters together as a single word.
- SimpleDictionary<V> - Class in com.hankcs.hanlp.corpus.dictionary
-
A resizable dictionary.
- SimpleDictionary() - Constructor for class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
- SimpleDictionary.Filter<V> - Interface in com.hankcs.hanlp.corpus.dictionary
-
- SimpleItem - Class in com.hankcs.hanlp.corpus.dictionary.item
-
- SimpleItem() - Constructor for class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- SimpleMDAGNode - Class in com.hankcs.hanlp.collection.MDAG
-
The class capable of representing an MDAG node, its _transition set, and one of its incoming transitions;
objects of this class are used to represent an MDAG after it has been simplified in order to save space.
- SimpleMDAGNode(char, boolean, int) - Constructor for class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Constructs a SimpleMDAGNode.
- SimpleMDAGNode() - Constructor for class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
- SimplifiedChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Simplified-Traditional Chinese dictionary.
- SimplifiedChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.SimplifiedChineseDictionary
-
- simplifiedSourceNode - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
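The root node after simplification (simplification here means flattening into an array, not minimization; this structure is always minimal).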
- SimplifiedToHongKongChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Simplified Chinese to Hong Kong Traditional Chinese.
- SimplifiedToHongKongChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.SimplifiedToHongKongChineseDictionary
-
- SimplifiedToTaiwanChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Simplified Chinese to Taiwan Traditional Chinese.
- SimplifiedToTaiwanChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.SimplifiedToTaiwanChineseDictionary
-
- simplify() - Method in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Freezes this MDAG into a fixed form.
Creates a space-saving version of the MDAG in the form of an array.
- simplify() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGMap
-
Further reduces memory usage and speeds up queries.
The side effect is that subsequent insertions become slower.
- size() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- size() - Method in interface com.hankcs.hanlp.algorithm.ahocorasick.interval.Intervalable
-
The length.
- size() - Method in class com.hankcs.hanlp.algorithm.MaxHeap
-
- size() - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- size() - Method in class com.hankcs.hanlp.classification.corpus.FileDataSet
-
- size() - Method in interface com.hankcs.hanlp.classification.corpus.IDataSet
-
The number of samples in the data set.
- size() - Method in class com.hankcs.hanlp.classification.corpus.Lexicon
-
- size() - Method in class com.hankcs.hanlp.classification.corpus.MemoryDataSet
-
- size - Variable in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
The size of the base and check arrays.
- size() - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
The size, that is, the number of patterns contained.
- size() - Method in class com.hankcs.hanlp.collection.dartsclone.DoubleArray
-
The size.
- size() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- size() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BinTrie
-
- size() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- size() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- size() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
The number of key-value pairs.
- size - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
The size of the base and check arrays.
- size() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
The number of leaf nodes in the trie.
- size() - Method in interface com.hankcs.hanlp.collection.trie.ITrie
-
- size() - Method in class com.hankcs.hanlp.corpus.dictionary.SimpleDictionary
-
- size() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
The number of words (compound or simple) in the sentence.
- size() - Method in class com.hankcs.hanlp.dependency.nnparser.Alphabet
-
The alphabet size.
- size() - Method in class com.hankcs.hanlp.dependency.perceptron.structures.Sentence
-
- size() - Method in class com.hankcs.hanlp.dictionary.common.CommonDictionary
-
The dictionary size.
- size() - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
The number of documents added to the cluster analyzer so far.
- size() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- size() - Method in class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
The total number of word vectors in the model (the vocabulary size).
- size() - Method in class com.hankcs.hanlp.mining.word2vec.Vector
-
- size - Variable in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- size() - Method in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- size() - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- size() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- size() - Method in class com.hankcs.hanlp.model.crf.Table
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureDatMap
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMap
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.feature.ImmutableFeatureMDatMap
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.feature.MutableFeatureMap
-
- size() - Method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
Size of the instance (the number of elements to predict)
- size() - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- size() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
- size() - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- size() - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinKey
-
Number of pinyin syllables
- size() - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- sizeIncludingBos() - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- sortBeam(ArrayList<Configuration>, TreeSet<BeamElement>, Boolean, Instance, int, boolean, int, AveragedPerceptron, Collection<Integer>) - Static method in class com.hankcs.hanlp.dependency.perceptron.transition.parser.ParseThread
-
- sortDictionary(String) - Static method in class com.hankcs.hanlp.corpus.util.DictionaryUtil
-
Sorts a dictionary
- sortedAllTf() - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- sortedAllTfInt() - Method in class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- sortMapByValue(Map<K, V>, boolean) - Static method in class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
- sortMapByValue(Map<K, V>) - Static method in class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
- sourceNode - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Root node
- SparseVector - Class in com.hankcs.hanlp.mining.cluster
-
- SparseVector() - Constructor for class com.hankcs.hanlp.mining.cluster.SparseVector
-
- SpecialOption - Class in com.hankcs.hanlp.dependency.nnparser.option
-
- SpecialOption() - Constructor for class com.hankcs.hanlp.dependency.nnparser.option.SpecialOption
-
- speechTagging - Variable in class com.hankcs.hanlp.seg.Config
-
Part-of-speech tagging
- speechTagging(List<Vertex>) - Static method in class com.hankcs.hanlp.seg.WordBasedSegment
-
Part-of-speech tagging
- speed - Variable in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
Speed
- SpeedTokenizer - Class in com.hankcs.hanlp.tokenizer
-
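Ultra-fast tokenizer: dictionary-based segmentation built on a Double Array Trie, suited to scenarios that need high throughput and can accept moderate accuracy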
- SpeedTokenizer() - Constructor for class com.hankcs.hanlp.tokenizer.SpeedTokenizer
-
- spilt(List<IWord>) - Static method in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- spiltArray(String[], double) - Static method in class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
Splits an array into two arrays
- splitMap(Map<String, String[]>, double) - Static method in class com.hankcs.hanlp.classification.utilities.CollectionUtility
-
Splits a Map; the original map is modified in place
- SString - Class in com.hankcs.hanlp.collection.sequence
-
(SimpleString) A string wrapper. The char[] inside String cannot be accessed, yet many tasks operate on char[] directly, so this structure wraps one.
- SString(char[], int, int) - Constructor for class com.hankcs.hanlp.collection.sequence.SString
-
Creates a string
- SString(String) - Constructor for class com.hankcs.hanlp.collection.sequence.SString
-
- stack - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- stackEmpty() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- stackSize() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- stackTop() - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
View top element of stack
- StandardTokenizer - Class in com.hankcs.hanlp.tokenizer
-
Standard tokenizer
- StandardTokenizer() - Constructor for class com.hankcs.hanlp.tokenizer.StandardTokenizer
-
- start(String, Object...) - Method in class com.hankcs.hanlp.classification.utilities.io.ConsoleLogger
-
- start(String, Object...) - Method in interface com.hankcs.hanlp.classification.utilities.io.ILogger
-
- start_i - Variable in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- start_probability - Variable in class com.hankcs.hanlp.dictionary.TransformMatrix
-
Initial probabilities
- start_probability - Variable in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Initial state probability vector
- startsWith(String) - Method in class com.hankcs.hanlp.corpus.tag.Nature
-
Whether the part-of-speech tag starts with the given prefix
The first few letters of a tag indicate its broad category
- startsWith(char) - Method in class com.hankcs.hanlp.corpus.tag.Nature
-
Whether the part-of-speech tag starts with the given prefix
The first few letters of a tag indicate its broad category
- State - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
A state provides the following functions
- State() - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Constructs a node of depth 0
- State(int) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
Constructs a node whose depth is depth
- State - Class in com.hankcs.hanlp.collection.AhoCorasick
-
A state provides the following functions
- State() - Constructor for class com.hankcs.hanlp.collection.AhoCorasick.State
-
Constructs a node of depth 0
- State(int) - Constructor for class com.hankcs.hanlp.collection.AhoCorasick.State
-
Constructs a node whose depth is depth
- State - Class in com.hankcs.hanlp.dependency.common
-
- State(float, int, Edge) - Constructor for class com.hankcs.hanlp.dependency.common.State
-
- State - Class in com.hankcs.hanlp.dependency.nnparser
-
- State() - Constructor for class com.hankcs.hanlp.dependency.nnparser.State
-
- State(Dependency) - Constructor for class com.hankcs.hanlp.dependency.nnparser.State
-
- state - Variable in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.Configuration
-
- State - Class in com.hankcs.hanlp.dependency.perceptron.transition.configuration
-
A state made up of the buffer, stack, and arcs
- State(int) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- State(int, boolean) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- State - Class in com.hankcs.hanlp.seg.Dijkstra.Path
-
- State(double, int) - Constructor for class com.hankcs.hanlp.seg.Dijkstra.Path.State
-
- states - Variable in class com.hankcs.hanlp.dictionary.TransformMatrix
-
Hidden states
- stateValue(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
- status - Variable in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Node status
- std - Class in com.hankcs.hanlp.dependency.nnparser.util
-
- std() - Constructor for class com.hankcs.hanlp.dependency.nnparser.util.std
-
- StopWordDictionary - Class in com.hankcs.hanlp.dictionary.stopword
-
- StopWordDictionary(File) - Constructor for class com.hankcs.hanlp.dictionary.stopword.StopWordDictionary
-
- StopWordDictionary(Collection<String>) - Constructor for class com.hankcs.hanlp.dictionary.stopword.StopWordDictionary
-
- StopWordDictionary() - Constructor for class com.hankcs.hanlp.dictionary.stopword.StopWordDictionary
-
- StopWordDictionary(String) - Constructor for class com.hankcs.hanlp.dictionary.stopword.StopWordDictionary
-
- storage - Variable in class com.hankcs.hanlp.suggest.scorer.BaseScorer
-
Storage
- String2PinyinConverter - Class in com.hankcs.hanlp.dictionary.py
-
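Converts Chinese characters to pinyin; also converts pinyin strings to Pinyin, and supports messy text that mixes Chinese and English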
- String2PinyinConverter() - Constructor for class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
- StringDictionary - Class in com.hankcs.hanlp.corpus.dictionary
-
A dictionary in key=value format, where the "=" separator can be customized
- StringDictionary(String) - Constructor for class com.hankcs.hanlp.corpus.dictionary.StringDictionary
-
- StringDictionary() - Constructor for class com.hankcs.hanlp.corpus.dictionary.StringDictionary
-
- StringDictionaryMaker - Class in com.hankcs.hanlp.corpus.dictionary
-
A convenience factory class
- StringDictionaryMaker() - Constructor for class com.hankcs.hanlp.corpus.dictionary.StringDictionaryMaker
-
- stringOf(int) - Method in interface com.hankcs.hanlp.model.perceptron.common.IIdStringMap
-
- stringOf(int) - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- StructuredPerceptron - Class in com.hankcs.hanlp.model.perceptron.model
-
Linear model trained with the structured perceptron algorithm
- StructuredPerceptron(FeatureMap, float[]) - Constructor for class com.hankcs.hanlp.model.perceptron.model.StructuredPerceptron
-
- StructuredPerceptron(FeatureMap) - Constructor for class com.hankcs.hanlp.model.perceptron.model.StructuredPerceptron
-
- subList(int, int) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- subList(int, int) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- subMap(String, String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- subSequence(int, int) - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- SuffixDictionary - Class in com.hankcs.hanlp.corpus.dictionary
-
Suffix tree dictionary
- SuffixDictionary() - Constructor for class com.hankcs.hanlp.corpus.dictionary.SuffixDictionary
-
- suggest(String, int) - Method in interface com.hankcs.hanlp.suggest.ISuggester
-
Suggests similar sentences for an input sentence
- suggest(String, int) - Method in class com.hankcs.hanlp.suggest.Suggester
-
- Suggester - Class in com.hankcs.hanlp.suggest
-
Text suggester
- Suggester() - Constructor for class com.hankcs.hanlp.suggest.Suggester
-
- Suggester(List<BaseScorer>) - Constructor for class com.hankcs.hanlp.suggest.Suggester
-
- Suggester(BaseScorer...) - Constructor for class com.hankcs.hanlp.suggest.Suggester
-
Constructs a suggester
- sum(int...) - Static method in class com.hankcs.hanlp.utility.MathUtility
-
- sum(float...) - Static method in class com.hankcs.hanlp.utility.MathUtility
-
- sWord - Variable in class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
- Synonym - Class in com.hankcs.hanlp.corpus.synonym
-
Synonym
- Synonym(String, String) - Constructor for class com.hankcs.hanlp.corpus.synonym.Synonym
-
Deprecated.
- Synonym(String, long) - Constructor for class com.hankcs.hanlp.corpus.synonym.Synonym
-
Deprecated.
- Synonym(String, long, Synonym.Type) - Constructor for class com.hankcs.hanlp.corpus.synonym.Synonym
-
- Synonym.Type - Enum in com.hankcs.hanlp.corpus.synonym
-
- SynonymHelper - Class in com.hankcs.hanlp.corpus.synonym
-
A utility class
- SynonymHelper() - Constructor for class com.hankcs.hanlp.corpus.synonym.SynonymHelper
-
- SynonymItem(Synonym, List<Synonym>, Synonym.Type) - Constructor for class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
- SynonymItem(Synonym, List<Synonym>, char) - Constructor for class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
- SynonymItem(Synonym, Map<String, Synonym>) - Constructor for class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionaryEx.SynonymItem
-
- synonymList - Variable in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
The entry's value: the list of synonyms of the key
- synonymMap - Variable in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionaryEx.SynonymItem
-
The entry's value: the list of synonyms and near-synonyms of the key
- t - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Time word
- t2hk(String) - Static method in class com.hankcs.hanlp.HanLP
-
Traditional Chinese to Hong Kong Traditional Chinese
- t2s(String) - Static method in class com.hankcs.hanlp.HanLP
-
- t2tw(String) - Static method in class com.hankcs.hanlp.HanLP
-
Traditional Chinese to Taiwan Traditional Chinese
- T_WORD_ID - Static variable in class com.hankcs.hanlp.dictionary.CoreDictionary
-
- table - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- Table - Class in com.hankcs.hanlp.model.crf
-
Builds a table of elements for an instance
- Table() - Constructor for class com.hankcs.hanlp.model.crf.Table
-
- tag(Table) - Method in class com.hankcs.hanlp.model.crf.CRFModel
-
Tags using the Viterbi backward algorithm
- tag(List<String>) - Method in class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- tag(String...) - Method in class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- tag(Table) - Method in class com.hankcs.hanlp.model.CRFSegmentModel
-
- tag(String...) - Method in class com.hankcs.hanlp.model.hmm.HMMPOSTagger
-
- tag(List<String>) - Method in class com.hankcs.hanlp.model.hmm.HMMPOSTagger
-
- tag(String...) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronPOSTagger
-
Tagging
- tag(POSInstance) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronPOSTagger
-
- tag(List<String>) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronPOSTagger
-
Tagging
- tag(char[]) - Method in class com.hankcs.hanlp.model.trigram.CharacterBasedGenerativeModel
-
Sequence labeling
- tag(String...) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- tag(List<String>) - Method in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
- tag(String...) - Method in interface com.hankcs.hanlp.tokenizer.lexical.POSTagger
-
Part-of-speech tagging
- tag(List<String>) - Method in interface com.hankcs.hanlp.tokenizer.lexical.POSTagger
-
Part-of-speech tagging
- tag(String...) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- tag(List<String>) - Method in class com.hankcs.hanlp.tokenizer.pipe.LexicalAnalyzerPipeline
-
- TAG_BIGIN - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_BIGIN - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Start of sentence (begin)
- TAG_CLUSTER - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_CLUSTER - Static variable in class com.hankcs.hanlp.utility.Predefine
-
String (x)
- TAG_END - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_END - Static variable in class com.hankcs.hanlp.utility.Predefine
-
End (end)
- TAG_GROUP - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_GROUP - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Organization name (nt)
- TAG_NUMBER - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_NUMBER - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Numeral (m)
- TAG_OTHER - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_OTHER - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Other
- TAG_PEOPLE - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_PEOPLE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Person name (nr)
- TAG_PLACE - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_PLACE - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Location (ns)
- TAG_PROPER - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_PROPER - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Proper noun (nx)
- TAG_QUANTIFIER - Static variable in class com.hankcs.hanlp.utility.Predefine
-
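Numeral-classifier compound (mq) (now thought to deserve the same treatment as numerals; for example, 一个人 and 一人, both meaning "one person", are equally valid)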
- TAG_TIME - Static variable in class com.hankcs.hanlp.corpus.util.CorpusUtil
-
- TAG_TIME - Static variable in class com.hankcs.hanlp.utility.Predefine
-
Time (t)
- tagArray - Variable in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
- Tagger - Class in com.hankcs.hanlp.model.crf.crfpp
-
- Tagger() - Constructor for class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- TaggerImpl - Class in com.hankcs.hanlp.model.crf.crfpp
-
- TaggerImpl(TaggerImpl.Mode) - Constructor for class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- TaggerImpl.Mode - Enum in com.hankcs.hanlp.model.crf.crfpp
-
- TaggerImpl.ReadStatus - Enum in com.hankcs.hanlp.model.crf.crfpp
-
- tags(TagSet) - Method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
Restores the string form of the tags according to the tag set
- tags() - Method in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- tagSet - Variable in class com.hankcs.hanlp.model.crf.CRFNERecognizer
-
- tagSet - Variable in class com.hankcs.hanlp.model.perceptron.feature.FeatureMap
-
- tagSet() - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
- tagSet - Variable in class com.hankcs.hanlp.model.perceptron.NERTrainer
-
Supports arbitrary custom NER types, for example:
tagSet.nerLabels.clear();
tagSet.nerLabels.add("nr");
tagSet.nerLabels.add("ns");
tagSet.nerLabels.add("nt");
- TagSet - Class in com.hankcs.hanlp.model.perceptron.tagset
-
- TagSet(TaskType) - Constructor for class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- tailMap(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- TaiwanToHongKongChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Taiwan Traditional Chinese to Hong Kong Traditional Chinese
- TaiwanToHongKongChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TaiwanToHongKongChineseDictionary
-
- TaiwanToSimplifiedChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Taiwan Traditional Chinese to Simplified Chinese
- TaiwanToSimplifiedChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TaiwanToSimplifiedChineseDictionary
-
- TaiwanToTraditionalChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Taiwan Traditional Chinese to Traditional Chinese
- TaiwanToTraditionalChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TaiwanToTraditionalChineseDictionary
-
- TaskType - Enum in com.hankcs.hanlp.model.perceptron.common
-
- taskType() - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
- tcDictionaryRoot - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Root directory of the Simplified/Traditional conversion dictionaries
- templs_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- Term - Class in com.hankcs.hanlp.seg.common
-
A word; users can access all of its attributes directly
- Term(String, Nature) - Constructor for class com.hankcs.hanlp.seg.common.Term
-
Constructs a word
- TermFrequency - Class in com.hankcs.hanlp.corpus.occurrence
-
A simple wrapper around a word and its frequency
- TermFrequency(String, Integer) - Constructor for class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
- TermFrequency(String) - Constructor for class com.hankcs.hanlp.corpus.occurrence.TermFrequency
-
- TermFrequencyCounter - Class in com.hankcs.hanlp.mining.word
-
Term frequency counting tool
- TermFrequencyCounter(Segment, boolean) - Constructor for class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
Constructor
- TermFrequencyCounter() - Constructor for class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- TermOccurrence - Class in com.hankcs.hanlp.corpus.occurrence
-
Term frequency statistics
- TermOccurrence() - Constructor for class com.hankcs.hanlp.corpus.occurrence.TermOccurrence
-
- testingDataSet - Variable in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
Whether this belongs to the test set
- TestOption - Class in com.hankcs.hanlp.dependency.nnparser.option
-
- TestOption() - Constructor for class com.hankcs.hanlp.dependency.nnparser.option.TestOption
-
- text() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Raw text form (no annotations)
- text(String) - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Raw text form (no annotations)
- text - Variable in class com.hankcs.hanlp.mining.word.WordInfo
-
Word
- TextFileCorpus - Class in com.hankcs.hanlp.mining.word2vec
-
- TextFileCorpus(Config) - Constructor for class com.hankcs.hanlp.mining.word2vec.TextFileCorpus
-
- textmodel - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- TextProcessUtility - Class in com.hankcs.hanlp.classification.utilities
-
File preprocessing utility
- TextProcessUtility() - Constructor for class com.hankcs.hanlp.classification.utilities.TextProcessUtility
-
- TextRankKeyword - Class in com.hankcs.hanlp.summary
-
Keyword extraction based on the TextRank algorithm, suited to single documents
- TextRankKeyword(Segment) - Constructor for class com.hankcs.hanlp.summary.TextRankKeyword
-
- TextRankKeyword() - Constructor for class com.hankcs.hanlp.summary.TextRankKeyword
-
- TextRankSentence - Class in com.hankcs.hanlp.summary
-
TextRank automatic summarization
- TextRankSentence(List<List<String>>) - Constructor for class com.hankcs.hanlp.summary.TextRankSentence
-
- TextUtility - Class in com.hankcs.hanlp.utility
-
Text utility class
- TextUtility() - Constructor for class com.hankcs.hanlp.utility.TextUtility
-
- tf(Collection<TERM>, TfIdf.TfType) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Term frequencies for a single document
- tf(Collection<TERM>) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Term frequencies for a single document
- TFDictionary - Class in com.hankcs.hanlp.corpus.dictionary
-
Term frequency dictionary
- TFDictionary(String) - Constructor for class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
- TFDictionary() - Constructor for class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
- TfIdf - Class in com.hankcs.hanlp.mining.word
-
Term frequency-inverse document frequency (TF-IDF) statistics
- TfIdf() - Constructor for class com.hankcs.hanlp.mining.word.TfIdf
-
- tfIdf(Map<TERM, Double>, Map<TERM, Double>, TfIdf.Normalization) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Computes the tf-idf of a document
- tfIdf(Map<TERM, Double>, Map<TERM, Double>) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Computes the tf-idf of a document (without normalization)
- TfIdf.Normalization - Enum in com.hankcs.hanlp.mining.word
-
Normalization algorithm for tf-idf vectors
- TfIdf.TfType - Enum in com.hankcs.hanlp.mining.word
-
Term frequency counting method
- TfIdfCounter - Class in com.hankcs.hanlp.mining.word
-
TF-IDF statistics tool that doubles as a keyword extractor
- TfIdfCounter() - Constructor for class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- TfIdfCounter(boolean) - Constructor for class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- TfIdfCounter(Segment, boolean) - Constructor for class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- TfIdfCounter(Segment) - Constructor for class com.hankcs.hanlp.mining.word.TfIdfCounter
-
- TfIdfFeatureWeighter - Class in com.hankcs.hanlp.classification.features
-
TF-IDF weight computation
- TfIdfFeatureWeighter(int, int[]) - Constructor for class com.hankcs.hanlp.classification.features.TfIdfFeatureWeighter
-
- tfMap - Variable in class com.hankcs.hanlp.classification.corpus.BagOfWordsDocument
-
- TfOnlyFeatureWeighter - Class in com.hankcs.hanlp.classification.features
-
Weighting scheme that uses TF only
- TfOnlyFeatureWeighter() - Constructor for class com.hankcs.hanlp.classification.features.TfOnlyFeatureWeighter
-
- tfs(Iterable<Collection<TERM>>, TfIdf.TfType) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Term frequencies for multiple documents
- tfs(Iterable<Collection<TERM>>) - Static method in class com.hankcs.hanlp.mining.word.TfIdf
-
Term frequencies for multiple documents
- tg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Time-word morpheme
- third - Variable in class com.hankcs.hanlp.corpus.occurrence.TriaFrequency
-
- thread - Variable in class com.hankcs.hanlp.model.crf.crfpp.crf_learn.Option
-
- threadNum - Variable in class com.hankcs.hanlp.model.crf.crfpp.CRFEncoderThread
-
- threadNum_ - Variable in class com.hankcs.hanlp.model.crf.crfpp.FeatureIndex
-
- threadNumber - Variable in class com.hankcs.hanlp.seg.Config
-
Number of threads for parallel segmentation
- times(double) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Multiply a matrix by a scalar, C = s*A
- times(Matrix) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Linear algebraic matrix multiplication, A * B
- timesEquals(double) - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Multiply a matrix by a scalar in place, A = s*A
- TMDictionaryMaker - Class in com.hankcs.hanlp.corpus.dictionary
-
Tool for building transition matrix dictionaries
- TMDictionaryMaker() - Constructor for class com.hankcs.hanlp.corpus.dictionary.TMDictionaryMaker
-
- to - Variable in class com.hankcs.hanlp.dependency.common.Edge
-
- to863(List<Term>) - Static method in class com.hankcs.hanlp.dependency.nnparser.util.PosTagUtil
-
Converts to the 863 tag set
The 863 part-of-speech tag set; the meaning of each tag is given in the following table:
- toArray() - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- toArray() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- toArray(T[]) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGSet
-
- toArray() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- toArray(T[]) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- toArray() - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- toArray(T[]) - Method in class com.hankcs.hanlp.seg.SegmentPipeline
-
- toArray() - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- toArray(T[]) - Method in class com.hankcs.hanlp.tokenizer.pipe.Pipeline
-
- toCharArray() - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- toDefaultNature(String) - Static method in class com.hankcs.hanlp.seg.CRF.CRFSegment
-
Deprecated.
- toFeatureArray(List<Integer>) - Static method in class com.hankcs.hanlp.model.perceptron.instance.Instance
-
- toGraph() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
Converts the word net to a word graph
- toIdList(String) - Method in interface com.hankcs.hanlp.collection.trie.datrie.CharacterMapping
-
- toIdList(int) - Method in interface com.hankcs.hanlp.collection.trie.datrie.CharacterMapping
-
- toIdList(String) - Method in class com.hankcs.hanlp.collection.trie.datrie.Utf8CharacterMapping
-
- toIdList(int) - Method in class com.hankcs.hanlp.collection.trie.datrie.Utf8CharacterMapping
-
codes ported from iconv lib in utf8.h utf8_codepointtomb
- Token - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
A fragment
- Token(String) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.Token
-
- tokenize(String) - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
A longest-match tokenizer
- tokenizer - Variable in class com.hankcs.hanlp.classification.corpus.AbstractDataSet
-
- tokenizer - Variable in class com.hankcs.hanlp.classification.models.AbstractModel
-
Tokenizer
- toList() - Method in class com.hankcs.hanlp.algorithm.MaxHeap
-
Converts to a sorted list; a destructive operation
- toLog() - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
- toLog() - Method in class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
- tone2tone5 - Static variable in class com.hankcs.hanlp.dictionary.py.String2PinyinConverter
-
Maps every tone to the neutral tone: the index is the pinyin's ordinal, and the value is the tone-5 form or the maximum value
- TonePinyinString2PinyinConverter - Class in com.hankcs.hanlp.dictionary.py
-
Converter that turns words such as āiyā into pinyin
- TonePinyinString2PinyinConverter() - Constructor for class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
- top(int) - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
Returns the top N most frequent words
- toSegment() - Method in class com.hankcs.hanlp.model.hmm.HMMSegmenter
-
Returns an object compatible with the legacy Segment interface
- toSentenceList(String) - Static method in class com.hankcs.hanlp.utility.SentencesUtil
-
Splits text into the smallest possible sentences (commas also count as delimiters)
- toSentenceList(String, boolean) - Static method in class com.hankcs.hanlp.utility.SentencesUtil
-
Splits text into sentences
- toSentenceList(char[]) - Static method in class com.hankcs.hanlp.utility.SentencesUtil
-
- toSentenceList(char[], boolean) - Static method in class com.hankcs.hanlp.utility.SentencesUtil
-
- toSimpleWordList() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Converts to a list of simple words
- toStandoff() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
brat standoff format
http://brat.nlplab.org/standoff.html
- toStandoff(boolean) - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
brat standoff format
http://brat.nlplab.org/standoff.html
- toString() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.interval.Interval
-
- toString() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Emit
-
- toString() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.State
-
- toString() - Method in class com.hankcs.hanlp.algorithm.ahocorasick.trie.Token
-
- toString() - Method in class com.hankcs.hanlp.classification.corpus.Catalog
-
- toString() - Method in class com.hankcs.hanlp.classification.statistics.evaluations.FMeasure
-
- toString() - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie.Hit
-
- toString() - Method in class com.hankcs.hanlp.collection.AhoCorasick.State
-
- toString() - Method in class com.hankcs.hanlp.collection.dartsclone.Pair
-
- toString() - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
- toString() - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
- toString() - Method in class com.hankcs.hanlp.collection.sequence.SString
-
- toString() - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
- toString(int[]) - Method in interface com.hankcs.hanlp.collection.trie.datrie.CharacterMapping
-
- toString() - Method in class com.hankcs.hanlp.collection.trie.datrie.IntArrayList
-
- toString() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- toString() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- toString(int[]) - Method in class com.hankcs.hanlp.collection.trie.datrie.Utf8CharacterMapping
-
- toString() - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- toString() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNllLine
-
- toString() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLSentence
-
- toString() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNLLWord
-
- toString() - Method in class com.hankcs.hanlp.corpus.dependency.CoNll.Evaluator
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.DictionaryMaker
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.item.EnumItem
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.item.Item
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.item.SimpleItem
-
- toString() - Method in class com.hankcs.hanlp.corpus.dictionary.TMDictionaryMaker
-
- toString() - Method in class com.hankcs.hanlp.corpus.document.Document
-
- toString() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
- toString() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
- toString() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
- toString() - Method in class com.hankcs.hanlp.corpus.occurrence.Occurrence
-
- toString() - Method in class com.hankcs.hanlp.corpus.occurrence.PairFrequency
-
- toString() - Method in class com.hankcs.hanlp.corpus.occurrence.TriaFrequency
-
- toString() - Method in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- toString() - Method in class com.hankcs.hanlp.corpus.tag.Nature
-
- toString() - Method in class com.hankcs.hanlp.dependency.common.Node
-
- toString() - Method in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- toString() - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
- toString() - Method in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionaryEx.SynonymItem
-
- toString() - Method in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- toString() - Method in class com.hankcs.hanlp.dictionary.CustomDictionary
-
- toString() - Method in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
- toString() - Method in class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
- toString() - Method in class com.hankcs.hanlp.mining.word.TermFrequencyCounter
-
- toString() - Method in class com.hankcs.hanlp.mining.word.WordInfo
-
- toString() - Method in class com.hankcs.hanlp.mining.word2vec.VocabWord
-
- toString() - Method in class com.hankcs.hanlp.model.crf.crfpp.Pair
-
String representation of this Pair.
- toString() - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- toString(String, int) - Method in class com.hankcs.hanlp.model.crf.crfpp.Tagger
-
- toString() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- toString() - Method in class com.hankcs.hanlp.model.crf.FeatureTemplate
-
- toString() - Method in class com.hankcs.hanlp.model.crf.Table
-
- toString() - Method in class com.hankcs.hanlp.seg.common.CWSEvaluator.Result
-
- toString() - Method in class com.hankcs.hanlp.seg.common.EdgeFrom
-
- toString() - Method in class com.hankcs.hanlp.seg.common.Graph
-
- toString() - Method in class com.hankcs.hanlp.seg.common.ResultTerm
-
- toString() - Method in class com.hankcs.hanlp.seg.common.Term
-
- toString() - Method in class com.hankcs.hanlp.seg.common.Vertex
-
- toString() - Method in class com.hankcs.hanlp.seg.common.WordNet
-
- toString() - Method in class com.hankcs.hanlp.seg.NShort.Path.AtomNode
-
- toString() - Method in class com.hankcs.hanlp.seg.NShort.Path.PathNode
-
- toString() - Method in class com.hankcs.hanlp.seg.Viterbi.Path.Node
-
- toString() - Method in class com.hankcs.hanlp.suggest.scorer.pinyin.PinyinKey
-
- toStringWithoutLabels() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Converts to a space-separated String without labels.
- total - Variable in class com.hankcs.hanlp.model.perceptron.feature.FeatureSortItem
-
- totalFrequency - Variable in class com.hankcs.hanlp.corpus.dictionary.EasyDictionary.Attribute
-
- totalFrequency - Variable in class com.hankcs.hanlp.dictionary.CoreDictionary.Attribute
-
- totalFrequency - Static variable in class com.hankcs.hanlp.dictionary.CoreDictionary
-
- totalFrequency - Variable in class com.hankcs.hanlp.model.perceptron.common.FrequencyMap
-
- toVector(List<String>) - Method in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
- toVertexList(List<Term>, boolean) - Method in class com.hankcs.hanlp.seg.CharacterBasedSegment
-
Converts intermediate results into word-net vertices,
so that features built on Vertex, such as part-of-speech tagging and NER, can be used.
- toWord() - Method in class com.hankcs.hanlp.corpus.document.sentence.word.CompoundWord
-
Converts to a simple word.
- toWordArray() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Gets an array of all the words.
- toWordArray(List<Word>) - Static method in class com.hankcs.hanlp.model.perceptron.utility.Utility
-
- toWordTagArray() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
word pos
- toWordTagNerArray(NERTagSet) - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
word pos ner
- trace() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Matrix trace.
- TraditionalChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Traditional-Simplified Chinese dictionary; provides conversion between Simplified and Traditional Chinese.
- TraditionalChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TraditionalChineseDictionary
-
- TraditionalChineseTokenizer - Class in com.hankcs.hanlp.tokenizer
-
Traditional Chinese tokenizer.
- TraditionalChineseTokenizer() - Constructor for class com.hankcs.hanlp.tokenizer.TraditionalChineseTokenizer
-
- TraditionalToHongKongChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Traditional Chinese to Hong Kong Traditional Chinese.
- TraditionalToHongKongChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TraditionalToHongKongChineseDictionary
-
- TraditionalToTaiwanChineseDictionary - Class in com.hankcs.hanlp.dictionary.ts
-
Traditional Chinese to Taiwan Traditional Chinese.
- TraditionalToTaiwanChineseDictionary() - Constructor for class com.hankcs.hanlp.dictionary.ts.TraditionalToTaiwanChineseDictionary
-
- train(String, String) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- train(Map<String, String[]>) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- train(String) - Method in class com.hankcs.hanlp.classification.classifiers.AbstractClassifier
-
- train(Map<String, String[]>) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Trains the model.
- train(String, String) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Trains the model.
- train(String) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Trains the model on a UTF-8 encoded corpus.
- train(IDataSet) - Method in interface com.hankcs.hanlp.classification.classifiers.IClassifier
-
Trains the model.
- train(IDataSet) - Method in class com.hankcs.hanlp.classification.classifiers.NaiveBayesClassifier
-
- train(String) - Method in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
Trains.
- train - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- train(String, String, String, String) - Static method in class com.hankcs.hanlp.dependency.perceptron.parser.KBeamArcEagerDependencyParser
-
Trains the dependency parser.
- train(Options) - Static method in class com.hankcs.hanlp.dependency.perceptron.parser.Main
-
- train(ArrayList<Instance>, String, int, String, boolean, HashSet<String>, int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.trainer.ArcEagerBeamTrainer
-
- Train - Class in com.hankcs.hanlp.mining.word2vec
-
- Train() - Constructor for class com.hankcs.hanlp.mining.word2vec.Train
-
- train(String, String) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
Runs training.
- train(String, String) - Method in class com.hankcs.hanlp.model.crf.CRFPOSTagger
-
- train(String, String, String, int, int, double, double, int, int, Encoder.Algorithm) - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
Trains.
- train(String, String) - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
- train(String, String, int, int, double, double, int, int, Encoder.Algorithm) - Method in class com.hankcs.hanlp.model.crf.CRFTagger
-
- train(Collection<int[][]>) - Method in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
Trains.
- train(String) - Method in class com.hankcs.hanlp.model.hmm.HMMTrainer
-
- train(String, int) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Trains.
- train(String, int, boolean) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronClassifier
-
Trains.
- train(String, String, String, double, int, int) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer
-
Trains.
- train(String, String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer
-
- train(String, String, String) - Method in class com.hankcs.hanlp.model.perceptron.PerceptronTrainer
-
- train(String, String, String) - Method in class com.hankcs.hanlp.model.perceptron.POSTrainer
-
- train() - Method in class com.hankcs.hanlp.model.trigram.CharacterBasedGenerativeModel
-
Observation is finished; starts training.
- training(float, float) - Method in interface com.hankcs.hanlp.mining.word2vec.TrainingCallback
-
Callback invoked during training.
- TrainingCallback - Interface in com.hankcs.hanlp.mining.word2vec
-
- trainingIter - Variable in class com.hankcs.hanlp.dependency.perceptron.accessories.Options
-
- trainWords - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- transfer(int, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Transitions the state.
- transferValues(int, int) - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger
-
Transitions the state and outputs the value.
- TransformMatrix - Class in com.hankcs.hanlp.dictionary
-
- TransformMatrix() - Constructor for class com.hankcs.hanlp.dictionary.TransformMatrix
-
- transformMatrixDictionary - Static variable in class com.hankcs.hanlp.dictionary.CoreDictionaryTransformMatrixDictionary
-
- transformMatrixDictionary - Static variable in class com.hankcs.hanlp.dictionary.nr.PersonDictionary
-
Transition matrix dictionary.
- transformMatrixDictionary - Static variable in class com.hankcs.hanlp.dictionary.ns.PlaceDictionary
-
Transition matrix dictionary.
- transformMatrixDictionary - Static variable in class com.hankcs.hanlp.dictionary.nt.OrganizationDictionary
-
Transition matrix dictionary.
- TransformMatrixDictionary<E extends Enum<E>> - Class in com.hankcs.hanlp.dictionary
-
Transition matrix dictionary.
- TransformMatrixDictionary(Class<E>) - Constructor for class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
- TransformMatrixDictionary() - Constructor for class com.hankcs.hanlp.dictionary.TransformMatrixDictionary
-
- transition(int, char) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Transitions the state.
- transition(char) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Follows an outgoing transition of this node labeled with a given char.
- transition(String) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
Follows a transition path starting from this node.
- transition(char[]) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
- transition(char[], int) - Method in class com.hankcs.hanlp.collection.MDAG.MDAGNode
-
- transition(SimpleMDAGNode[], char) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Follows an outgoing transition from this node.
- transition(SimpleMDAGNode[], String) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Follows a transition path starting from this node.
- transition(SimpleMDAGNode[], char[]) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
- transition(SimpleMDAGNode[], char[], int) - Method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
- transition(String, int) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
- transition(char[], int) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
- transition(char) - Method in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Transitions the state.
- transition(String) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Transitions the state along a path.
- transition(char[]) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Transitions the state along the nodes.
- transition(String, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Transitions the state along a path.
- transition(char, int) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Transitions the state.
- transition(int, char) - Method in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
Transitions the state.
- transition_probability - Variable in class com.hankcs.hanlp.model.hmm.HiddenMarkovModel
-
State transition probability matrix.
- transition_probability2 - Variable in class com.hankcs.hanlp.model.hmm.SecondOrderHiddenMarkovModel
-
State transition probability matrix.
- TransitionBasedParser - Class in com.hankcs.hanlp.dependency.perceptron.transition.parser
-
This class only serves to connect the different types of transition-based parsers.
- TransitionBasedParser(AveragedPerceptron, ArrayList<Integer>, int, IndexMaps) - Constructor for class com.hankcs.hanlp.dependency.perceptron.transition.parser.TransitionBasedParser
-
- transitionCount - Variable in class com.hankcs.hanlp.collection.MDAG.MDAG
-
Total number of edges.
- TransitionSystem - Class in com.hankcs.hanlp.dependency.nnparser
-
Action transition system.
- TransitionSystem() - Constructor for class com.hankcs.hanlp.dependency.nnparser.TransitionSystem
-
- transitionWithRoot(int, char) - Method in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Transitions on c; if this is the root node, returns itself.
- transititon_probability - Variable in class com.hankcs.hanlp.dictionary.TransformMatrix
-
Transition probability.
- translate(String) - Static method in class com.hankcs.hanlp.dictionary.other.PartOfSpeechTagDictionary
-
Translates a part-of-speech tag.
- translateCompoundWordLabels() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Translates the part-of-speech tags of compound words according to the mapping table specified by PartOfSpeechTagDictionary.
- translatedNameRecognize - Variable in class com.hankcs.hanlp.seg.Config
-
Whether to recognize transliterated person names.
- TranslatedPersonDictionary - Class in com.hankcs.hanlp.dictionary.nr
-
Translated person name dictionary; stores and recognizes translated person names.
- TranslatedPersonDictionary() - Constructor for class com.hankcs.hanlp.dictionary.nr.TranslatedPersonDictionary
-
- TranslatedPersonDictionaryPath - Static variable in class com.hankcs.hanlp.HanLP.Config
-
Transliterated person name dictionary.
- TranslatedPersonRecognition - Class in com.hankcs.hanlp.recognition.nr
-
Transliterated person name recognition.
- TranslatedPersonRecognition() - Constructor for class com.hankcs.hanlp.recognition.nr.TranslatedPersonRecognition
-
- translateLabels() - Method in class com.hankcs.hanlp.corpus.document.sentence.Sentence
-
Translates the part-of-speech tags of words according to the mapping table specified by PartOfSpeechTagDictionary.
- translator - Static variable in class com.hankcs.hanlp.dictionary.other.PartOfSpeechTagDictionary
-
Part-of-speech mapping table.
- transpose() - Method in class com.hankcs.hanlp.dependency.nnparser.Matrix
-
Matrix transpose.
- traverseMDAG(SimpleMDAGNode[], SimpleMDAGNode, String) - Static method in class com.hankcs.hanlp.collection.MDAG.SimpleMDAGNode
-
Follows a transition path starting from the source node of an MDAG.
- TriaFrequency - Class in com.hankcs.hanlp.corpus.occurrence
-
Frequency of a bigram word string.
- Trie - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
Based on the Aho-Corasick white paper, Bell Labs: ftp://163.13.200.222/assistant/bearhero/prog/%A8%E4%A5%A6/ac_bm.pdf
- Trie(TrieConfig) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
Constructs a trie.
- Trie() - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
- Trie(Collection<String>) - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.Trie
-
- trie - Static variable in class com.hankcs.hanlp.dictionary.CoreDictionary
-
- trie - Variable in class com.hankcs.hanlp.dictionary.DynamicCustomDictionary
-
Binary trie used to store entries that the user inserts dynamically.
- trie - Static variable in class com.hankcs.hanlp.dictionary.nr.PersonDictionary
-
Trie used by the Aho-Corasick (AC) algorithm.
- trie - Static variable in class com.hankcs.hanlp.dictionary.ns.PlaceDictionary
-
Trie used by the Aho-Corasick (AC) algorithm.
- trie - Static variable in class com.hankcs.hanlp.dictionary.nt.OrganizationDictionary
-
Trie used by the Aho-Corasick (AC) algorithm.
- trie - Static variable in class com.hankcs.hanlp.dictionary.ts.TraditionalChineseDictionary
-
Traditional Chinese = Simplified Chinese.
- trie - Variable in class com.hankcs.hanlp.seg.Other.DoubleArrayTrieSegment
-
Trie used for segmentation; it can be assigned directly to your own trie (the assignment is not guaranteed to be thread-safe).
- Trie.IWalker - Interface in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
- TRIE_EXT - Static variable in class com.hankcs.hanlp.utility.Predefine
-
File extension for trie files.
- TrieConfig - Class in com.hankcs.hanlp.algorithm.ahocorasick.trie
-
Configuration.
- TrieConfig() - Constructor for class com.hankcs.hanlp.algorithm.ahocorasick.trie.TrieConfig
-
- TrieEntry(String, V) - Constructor for class com.hankcs.hanlp.collection.trie.bintrie.BaseNode.TrieEntry
-
- tw2hk(String) - Static method in class com.hankcs.hanlp.HanLP
-
Taiwan Traditional Chinese to Hong Kong Traditional Chinese.
- tw2s(String) - Static method in class com.hankcs.hanlp.HanLP
-
Taiwan Traditional Chinese to Simplified Chinese.
- tw2t(String) - Static method in class com.hankcs.hanlp.HanLP
-
Taiwan Traditional Chinese to Traditional Chinese.
- type - Variable in class com.hankcs.hanlp.corpus.synonym.Synonym
-
- type - Variable in class com.hankcs.hanlp.dictionary.common.CommonSynonymDictionary.SynonymItem
-
The type of this entry: synonym, same-category word, or closed word.
- type - Static variable in class com.hankcs.hanlp.dictionary.other.CharType
-
- type(NeuralNetworkType) - Method in class com.hankcs.hanlp.mining.word2vec.Word2VecTrainer
-
Neural network type.
- type - Variable in class com.hankcs.hanlp.model.perceptron.tagset.TagSet
-
- typeTable - Static variable in class com.hankcs.hanlp.tokenizer.lexical.AbstractLexicalAnalyzer
-
Character type table.
- v - Variable in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie
-
Stores the values.
- v - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie
-
- v - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Verb.
- v - Variable in class com.hankcs.hanlp.model.crf.Table
-
The actual value; do not read it directly.
- valence(int) - Method in class com.hankcs.hanlp.dependency.perceptron.transition.configuration.State
-
- valid(String) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
Whether this pinyin is valid.
- valid(String[]) - Static method in class com.hankcs.hanlp.dictionary.py.TonePinyinString2PinyinConverter
-
Whether all of these pinyin are valid.
- value - Variable in class com.hankcs.hanlp.collection.AhoCorasick.AhoCorasickDoubleArrayTrie.Hit
-
Value corresponding to the pattern string.
- value - Variable in class com.hankcs.hanlp.collection.sequence.SString
-
- value - Variable in class com.hankcs.hanlp.collection.trie.bintrie.BaseNode
-
Value represented by the node.
- value() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrieInteger.KeyValuePair
-
- value - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie.LongestSearcher
-
Value corresponding to the key.
- value - Variable in class com.hankcs.hanlp.collection.trie.DoubleArrayTrie.Searcher
-
Value corresponding to the key.
- value - Variable in class com.hankcs.hanlp.corpus.dependency.CoNll.CoNllLine
-
The ten values.
- value - Variable in class com.hankcs.hanlp.corpus.document.sentence.word.Word
-
The actual value of the word, e.g. “程序” ("program").
- VALUE_EXT - Static variable in class com.hankcs.hanlp.utility.Predefine
-
File extension for value files.
- valueOf(char) - Static method in class com.hankcs.hanlp.collection.sequence.SString
-
- valueOf(String) - Static method in enum com.hankcs.hanlp.collection.trie.bintrie.BaseNode.Status
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.corpus.synonym.Synonym.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.corpus.tag.NR
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.corpus.tag.NS
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.corpus.tag.NT
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.dependency.perceptron.transition.parser.Action
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Method in class com.hankcs.hanlp.dictionary.common.EnumItemDictionary
-
Delegates to E.valueOf.
- valueOf(String) - Method in class com.hankcs.hanlp.dictionary.nr.NRDictionary
-
- valueOf(String) - Static method in enum com.hankcs.hanlp.dictionary.nr.NRPattern
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Method in class com.hankcs.hanlp.dictionary.ns.NSDictionary
-
- valueOf(String) - Method in class com.hankcs.hanlp.dictionary.nt.NTDictionary
-
- valueOf(String) - Static method in enum com.hankcs.hanlp.dictionary.py.Head
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.dictionary.py.Shengmu
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.dictionary.py.Yunmu
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.mining.word.TfIdf.Normalization
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.mining.word.TfIdf.TfType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.mining.word2vec.NeuralNetworkType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.model.crf.crfpp.Encoder.Algorithm
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.model.crf.crfpp.TaggerImpl.Mode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.model.crf.crfpp.TaggerImpl.ReadStatus
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.hankcs.hanlp.model.perceptron.common.TaskType
-
Returns the enum constant of this type with the specified name.
- values() - Method in class com.hankcs.hanlp.collection.dartsclone.DartMap
-
- values() - Static method in enum com.hankcs.hanlp.collection.trie.bintrie.BaseNode.Status
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Method in class com.hankcs.hanlp.collection.trie.datrie.MutableDoubleArrayTrie
-
- values() - Method in class com.hankcs.hanlp.corpus.dictionary.TFDictionary
-
Entries sorted by frequency from highest to lowest.
- values() - Static method in enum com.hankcs.hanlp.corpus.synonym.Synonym.Type
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in class com.hankcs.hanlp.corpus.tag.Nature
-
- values() - Static method in enum com.hankcs.hanlp.corpus.tag.NR
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.corpus.tag.NS
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.corpus.tag.NT
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.dependency.perceptron.transition.parser.Action
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Method in class com.hankcs.hanlp.dictionary.common.EnumItemDictionary
-
Delegates to E.values.
- values() - Method in class com.hankcs.hanlp.dictionary.nr.NRDictionary
-
- values() - Static method in enum com.hankcs.hanlp.dictionary.nr.NRPattern
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Method in class com.hankcs.hanlp.dictionary.ns.NSDictionary
-
- values() - Method in class com.hankcs.hanlp.dictionary.nt.NTDictionary
-
- values() - Static method in enum com.hankcs.hanlp.dictionary.py.Head
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.dictionary.py.Pinyin
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.dictionary.py.Shengmu
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.dictionary.py.Yunmu
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.mining.word.TfIdf.Normalization
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.mining.word.TfIdf.TfType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.mining.word2vec.NeuralNetworkType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.model.crf.crfpp.Encoder.Algorithm
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.model.crf.crfpp.TaggerImpl.Mode
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.model.crf.crfpp.TaggerImpl.ReadStatus
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum com.hankcs.hanlp.model.perceptron.common.TaskType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- vd - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Adverbial verb.
- vector(K) - Method in class com.hankcs.hanlp.mining.word2vec.AbstractVectorModel
-
Gets the vector for a key (the key is not preprocessed).
- Vector - Class in com.hankcs.hanlp.mining.word2vec
-
- Vector(float[]) - Constructor for class com.hankcs.hanlp.mining.word2vec.Vector
-
- Vector(int) - Constructor for class com.hankcs.hanlp.mining.word2vec.Vector
-
- vectorsReader - Variable in class com.hankcs.hanlp.mining.word2vec.AbstractClosestVectors
-
- VectorsReader - Class in com.hankcs.hanlp.mining.word2vec
-
- VectorsReader(String) - Constructor for class com.hankcs.hanlp.mining.word2vec.VectorsReader
-
- verbose - Variable in class com.hankcs.hanlp.corpus.dictionary.CommonDictionaryMaker
-
- Vertex - Class in com.hankcs.hanlp.seg.common
-
Vertex.
- Vertex(String, String, CoreDictionary.Attribute) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
The most elaborate constructor.
- Vertex(String, String, CoreDictionary.Attribute, int) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
- Vertex(String, CoreDictionary.Attribute) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
Constructor for when the real word and the compiled word are the same.
- Vertex(String, CoreDictionary.Attribute, int) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
- Vertex(Map.Entry<String, CoreDictionary.Attribute>) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
Conveniently constructs a node from a key-value pair.
- Vertex(String) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
Automatically constructs a reasonable vertex.
- Vertex(char, CoreDictionary.Attribute) - Constructor for class com.hankcs.hanlp.seg.common.Vertex
-
- vertex - Variable in class com.hankcs.hanlp.seg.Dijkstra.Path.State
-
Current position.
- vertexes - Variable in class com.hankcs.hanlp.seg.common.Graph
-
Vertices.
- vf - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Directional verb.
- vg - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Verbal morpheme.
- vi - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Intransitive verb (inner verb).
- Viterbi - Class in com.hankcs.hanlp.algorithm
-
Viterbi algorithm.
- Viterbi() - Constructor for class com.hankcs.hanlp.algorithm.Viterbi
-
- viterbi() - Method in class com.hankcs.hanlp.model.crf.crfpp.TaggerImpl
-
- viterbiCompute(List<EnumItem<NR>>) - Static method in class com.hankcs.hanlp.recognition.nr.PersonRecognition
-
Solves for the optimal tags using the Viterbi algorithm.
- viterbiCompute(List<EnumItem<NS>>) - Static method in class com.hankcs.hanlp.recognition.ns.PlaceRecognition
-
Solves for the optimal tags using the Viterbi algorithm.
- viterbiCompute(List<EnumItem<NT>>) - Static method in class com.hankcs.hanlp.recognition.nt.OrganizationRecognition
-
Solves for the optimal tags using the Viterbi algorithm.
- viterbiComputeSimply(List<EnumItem<NR>>) - Static method in class com.hankcs.hanlp.recognition.nr.PersonRecognition
-
Solves for the optimal tags using a simplified "Viterbi algorithm".
- viterbiDecode(Instance) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Viterbi decoding.
- viterbiDecode(Instance, int[]) - Method in class com.hankcs.hanlp.model.perceptron.model.LinearModel
-
Viterbi decoding.
- ViterbiSegment - Class in com.hankcs.hanlp.seg.Viterbi
-
Viterbi segmenter.
Also a shortest-path segmenter; the shortest path is solved with the Viterbi algorithm.
- ViterbiSegment() - Constructor for class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- ViterbiSegment(String) - Constructor for class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- ViterbiSegment(String, boolean) - Constructor for class com.hankcs.hanlp.seg.Viterbi.ViterbiSegment
-
- vl - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Verbal idiom.
- vn - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Nominal verb.
- vocab - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- vocabIndexMap - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- vocabMaxSize - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- vocabSize - Variable in class com.hankcs.hanlp.mining.word2vec.Corpus
-
- vocabulary - Variable in class com.hankcs.hanlp.mining.cluster.ClusterAnalyzer
-
- Vocabulary - Class in com.hankcs.hanlp.model.hmm
-
- Vocabulary(BinTrie<Integer>, boolean) - Constructor for class com.hankcs.hanlp.model.hmm.Vocabulary
-
- Vocabulary() - Constructor for class com.hankcs.hanlp.model.hmm.Vocabulary
-
- VocabWord - Class in com.hankcs.hanlp.mining.word2vec
-
- VocabWord(String) - Constructor for class com.hankcs.hanlp.mining.word2vec.VocabWord
-
- vshi - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
The verb “是” ("to be").
- vx - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
Formal (dummy) verb.
- vyou - Static variable in class com.hankcs.hanlp.corpus.tag.Nature
-
The verb “有” ("to have").