public class NaiveBayesMultinomial extends AbstractClassifier implements WeightedInstancesHandler, TechnicalInformationHandler
@inproceedings{Mccallum1998,
author = {Andrew Mccallum and Kamal Nigam},
booktitle = {AAAI-98 Workshop on 'Learning for Text Categorization'},
title = {A Comparison of Event Models for Naive Bayes Text Classification},
year = {1998}
}
Valid options are:
-output-debug-info
-do-not-check-capabilities
If set, classifier capabilities are not checked before classifier is built (use with caution).
-num-decimal-laces
The number of decimal places for the output of numbers in the model.
-batch-size
The desired batch size for batch prediction.
| Modifier and Type | Field and Description |
|---|---|
protected Instances |
m_headerInfo
copy of header information for use in toString method
|
protected int |
m_numAttributes
number of unique words
|
protected int |
m_numClasses
number of class values
|
protected double[] |
m_probOfClass
the probability of a class (i.e.
|
protected double[][] |
m_probOfWordGivenClass
probability that a word (w) exists in a class (H) (i.e.
|
BATCH_SIZE_DEFAULT, m_BatchSize, m_Debug, m_DoNotCheckCapabilities, m_numDecimalPlaces, NUM_DECIMAL_PLACES_DEFAULT| Constructor and Description |
|---|
NaiveBayesMultinomial() |
| Modifier and Type | Method and Description |
|---|---|
void |
buildClassifier(Instances instances)
Generates the classifier.
|
double[] |
distributionForInstance(Instance instance)
Calculates the class membership probabilities for the given test instance.
|
Capabilities |
getCapabilities()
Returns default capabilities of the classifier.
|
java.lang.String |
getRevision()
Returns the revision string.
|
TechnicalInformation |
getTechnicalInformation()
Returns an instance of a TechnicalInformation object, containing detailed information about the
technical background of this class, e.g., paper reference or book this class is based on.
|
java.lang.String |
globalInfo()
Returns a string describing this classifier
|
protected void |
initializeClassifier(Instances instances)
Sets up the classifier before any actual instances are processed.
|
static void |
main(java.lang.String[] argv)
Main method for testing this class.
|
protected double |
probOfDocGivenClass(Instance inst,
int classIndex)
log(N!)
|
java.lang.String |
toString()
Returns a string representation of the classifier.
|
batchSizeTipText, classifyInstance, debugTipText, distributionsForInstances, doNotCheckCapabilitiesTipText, forName, getBatchSize, getDebug, getDoNotCheckCapabilities, getNumDecimalPlaces, getOptions, implementsMoreEfficientBatchPrediction, listOptions, makeCopies, makeCopy, numDecimalPlacesTipText, postExecution, preExecution, run, runClassifier, setBatchSize, setDebug, setDoNotCheckCapabilities, setNumDecimalPlaces, setOptionsprotected double[][] m_probOfWordGivenClass
protected double[] m_probOfClass
protected int m_numAttributes
protected int m_numClasses
protected Instances m_headerInfo
public java.lang.String globalInfo()
public TechnicalInformation getTechnicalInformation()
getTechnicalInformation in interface TechnicalInformationHandlerpublic Capabilities getCapabilities()
getCapabilities in interface ClassifiergetCapabilities in interface CapabilitiesHandlergetCapabilities in class AbstractClassifierCapabilitiesprotected void initializeClassifier(Instances instances) throws java.lang.Exception
java.lang.Exceptionpublic void buildClassifier(Instances instances) throws java.lang.Exception
buildClassifier in interface Classifierinstances - set of instances serving as training datajava.lang.Exception - if the classifier has not been generated successfullypublic double[] distributionForInstance(Instance instance) throws java.lang.Exception
distributionForInstance in interface ClassifierdistributionForInstance in class AbstractClassifierinstance - the instance to be classifiedjava.lang.Exception - if there is a problem generating the predictionprotected double probOfDocGivenClass(Instance inst, int classIndex)
inst - The instance to be classifiedclassIndex - The index of the class we are calculating the probability with respect topublic java.lang.String toString()
toString in class java.lang.Objectpublic java.lang.String getRevision()
getRevision in interface RevisionHandlergetRevision in class AbstractClassifierpublic static void main(java.lang.String[] argv)
argv - the options