public class ChiSquareFeatureExtractor extends Object
| Modifier and Type | Field and Description |
|---|---|
protected double |
chisquareCriticalValue
在P值(拒真错误概率)为0.001时的卡方临界值,用于特征选择算法
|
protected int |
maxSize |
| Constructor and Description |
|---|
ChiSquareFeatureExtractor() |
| Modifier and Type | Method and Description |
|---|---|
Map<Integer,Double> |
chi_square(BaseFeatureData stats)
使用卡方非参数校验来执行特征选择
https://nlp.stanford.edu/IR-book/html/htmledition/feature-selectionchi2-feature-selection-1.html |
static BaseFeatureData |
extractBasicFeatureData(IDataSet dataSet)
生成一个FeatureStats对象,包含一个分类中的所有词语,分类数,实例数。这些统计数据
将用于特征选择算法。
|
double |
getALevel() |
double |
getChisquareCriticalValue()
获取卡方临界值
|
ChiSquareFeatureExtractor |
setALevel(double aLevel) |
void |
setChisquareCriticalValue(double chisquareCriticalValue)
设置卡方临界值
|
protected double chisquareCriticalValue
protected int maxSize
public static BaseFeatureData extractBasicFeatureData(IDataSet dataSet)
dataSet - public Map<Integer,Double> chi_square(BaseFeatureData stats)
stats - public double getChisquareCriticalValue()
public void setChisquareCriticalValue(double chisquareCriticalValue)
chisquareCriticalValue - public ChiSquareFeatureExtractor setALevel(double aLevel)
public double getALevel()
Copyright © 2014–2021 码农场. All rights reserved.