public abstract class AbstractDataSet extends Object implements IDataSet
| Modifier and Type | Field and Description |
|---|---|
| protected Catalog | catalog |
| protected Lexicon | lexicon |
| protected boolean | testingDataSet<br>Whether this data set is a testing set |
| protected ITokenizer | tokenizer |
| Constructor and Description |
|---|
| AbstractDataSet() |
| AbstractDataSet(AbstractModel model)<br>Constructs a testing data set |
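| Modifier and Type | Method and Description |
|---|---|
| IDataSet | add(Map<String,String[]> testingDataSet) |
| Document | convert(String category, String text)<br>Converts a document in text form into the internal general-purpose document, using this data set's lexicon and catalog |
| Catalog | getCatalog()<br>Gets the catalog (the table of categories) |
| Lexicon | getLexicon()<br>Gets the lexicon (the table of words) |
| ITokenizer | getTokenizer()<br>Gets the tokenizer |
| boolean | isTestingDataSet()<br>Whether this data set is a testing set |
| IDataSet | load(String folderPath)<br>Loads a data set |
| IDataSet | load(String folderPath, double rate) |
| IDataSet | load(String folderPath, String charsetName)<br>Loads a data set |
| IDataSet | load(String folderPath, String charsetName, double percentage) |
| IDataSet | setTokenizer(ITokenizer tokenizer)<br>Sets the tokenizer |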
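The summary above shows that `add`, `load`, and `setTokenizer` all return `IDataSet`, which suggests the calls are meant to be chained. The sketch below illustrates only that calling style. `SketchTokenizer` and `SketchDataSet` are hypothetical stand-ins written for this example, not the library's `ITokenizer` or `AbstractDataSet`. The reading of the `Map<String,String[]>` argument as category name to document texts is likewise an assumption.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class FluentDataSetSketch {

    /** Hypothetical stand-in for ITokenizer; the real interface is not shown on this page. */
    interface SketchTokenizer {
        String[] segment(String text);
    }

    /** Hypothetical stand-in that copies only the "return the data set" shape of the documented methods. */
    static class SketchDataSet {
        // Default tokenizer for the sketch: split on whitespace.
        private SketchTokenizer tokenizer = text -> text.split("\\s+");
        private final List<String[]> tokenized = new ArrayList<>();

        SketchDataSet setTokenizer(SketchTokenizer tokenizer) {
            this.tokenizer = tokenizer;
            return this; // returning the data set is what makes chaining possible
        }

        // Assumption: keys are category names, values are raw document texts.
        SketchDataSet add(Map<String, String[]> data) {
            for (String[] texts : data.values()) {
                for (String text : texts) {
                    tokenized.add(tokenizer.segment(text));
                }
            }
            return this;
        }

        int size() {
            return tokenized.size();
        }

        String[] tokens(int index) {
            return tokenized.get(index);
        }
    }

    /** Builds a small data set with one chained expression. */
    static SketchDataSet build() {
        Map<String, String[]> data = new LinkedHashMap<>();
        data.put("sports", new String[] {"Goal,Match"});
        data.put("finance", new String[] {"Stock,Bond", "Fund"});
        return new SketchDataSet()
                .setTokenizer(text -> text.toLowerCase().split(","))
                .add(data);
    }

    public static void main(String[] args) {
        SketchDataSet dataSet = build();
        System.out.println(dataSet.size() + " documents");
    }
}
```

Because the tokenizer is set before `add` is called, every document in the sketch is segmented with the replacement tokenizer. Whether the real class applies the same ordering rule is not stated on this page.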
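Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface java.lang.Iterable: forEach, iterator, spliterator

protected ITokenizer tokenizer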
protected Catalog catalog
protected Lexicon lexicon
protected boolean testingDataSet

public AbstractDataSet(AbstractModel model)
Constructs a testing data set.
Parameters: model - the model to be tested

public AbstractDataSet()
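
public IDataSet setTokenizer(ITokenizer tokenizer)
Description copied from interface: IDataSet
Sets the tokenizer.
Specified by: setTokenizer in interface IDataSet

public Document convert(String category, String text)
Description copied from interface: IDataSet
Converts a document in text form into the internal general-purpose document, using this data set's lexicon and catalog.

public ITokenizer getTokenizer()
Description copied from interface: IDataSet
Gets the tokenizer.
Specified by: getTokenizer in interface IDataSet

public Catalog getCatalog()
Description copied from interface: IDataSet
Gets the catalog (the table of categories).
Specified by: getCatalog in interface IDataSet

public Lexicon getLexicon()
Description copied from interface: IDataSet
Gets the lexicon (the table of words).
Specified by: getLexicon in interface IDataSet

public IDataSet load(String folderPath, String charsetName) throws IllegalArgumentException, IOException
Description copied from interface: IDataSet
Loads a data set.
Specified by: load in interface IDataSet
Parameters: folderPath - the root directory of the classification corpus. The directory must satisfy the following structure:
Parameters: charsetName - the file encoding
Throws: IllegalArgumentException
Throws: IOException

public IDataSet load(String folderPath) throws IllegalArgumentException, IOException
Description copied from interface: IDataSet
Loads a data set.
Specified by: load in interface IDataSet
Parameters: folderPath - the root directory of the classification corpus. The directory must satisfy the following structure:
Throws: IllegalArgumentException
Throws: IOException

public boolean isTestingDataSet()
Description copied from interface: IDataSet
Whether this data set is a testing set.
Specified by: isTestingDataSet in interface IDataSet

public IDataSet load(String folderPath, String charsetName, double percentage) throws IllegalArgumentException, IOException
Specified by: load in interface IDataSet
Throws: IllegalArgumentException
Throws: IOException

public IDataSet load(String folderPath, double rate) throws IllegalArgumentException, IOException
Specified by: load in interface IDataSet
Throws: IllegalArgumentException
Throws: IOException

Copyright © 2014–2018 码农场. All rights reserved.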