emolib.classifier.machinelearning
Class BernoulliNB

java.lang.Object
  extended by emolib.util.proc.TextDataProcessor
      extended by emolib.classifier.Classifier
          extended by emolib.classifier.machinelearning.BernoulliNB
All Implemented Interfaces:
Configurable, DataProcessor

public class BernoulliNB
extends Classifier

The BernoulliNB class is a Bernoulli Naive Bayes classifier.

The BernoulliNB follows the implementation described in (Manning, et al., 2008). The model essentially assumes conditional independence among the presence of terms in a given text. The same term weighting schemes as the ones used in the ARN-R are considered.

--
(Manning, et al., 2008) Manning, C. D., Raghavan, P. and Schutze, H., "An Introduction to Information Retrieval", 2008.

Author:
Alexandre Trilla (atrilla@salle.url.edu)
See Also:
ARNReduced

Constructor Summary
BernoulliNB()
          Main constructor of this exponential regression classifier.
 
Method Summary
 java.lang.String getCategory(FeatureBox inputFeatures)
          The function that decides the most appropriate emotional category.
 void load(java.lang.String path)
          Generic function to load a previously saved classifier.
 void resetExamples()
          Method to reset the classifier and flush the training examples.
 void save(java.lang.String path)
          Generic method to save the fully fledged classifier into a given file path.
 void setCOF(boolean cof)
          Method to consider bigram frequencies.
 void setEmotionDims(boolean emodims)
          Method to consider emotion dimensions.
 void setNegation(boolean neg)
          Method to consider negations.
 void setPOS(boolean pos)
          Method to consider POS tags.
 void setStemming(boolean stems)
          Method to consider stems.
 void setSynonyms(boolean syns)
          Method to consider synonyms.
 void trainingProcedure()
          Training method based on the algorithm in (Manning, et al., 2008).
 
Methods inherited from class emolib.classifier.Classifier
applyClassification, getData, getListOfExampleCategories, getListOfExampleFeatures, initialize, inputTrainingExample, newProperties, register, train
 
Methods inherited from class emolib.util.proc.TextDataProcessor
flush, getName, getPredecessor, setPredecessor, toString
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

BernoulliNB

public BernoulliNB()
Main constructor of this exponential regression classifier.

Method Detail

getCategory

public java.lang.String getCategory(FeatureBox inputFeatures)
Description copied from class: Classifier
The function that decides the most appropriate emotional category. This is required for any classifier. The classifier in question has to previously run any training algorithm in order to provide the required prediction.

Specified by:
getCategory in class Classifier
Parameters:
inputFeatures - The input emotional features.
Returns:
The most appropriate emotional category.

trainingProcedure

public void trainingProcedure()
Training method based on the algorithm in (Manning, et al., 2008). Nevertheless, doc counts are approximated by the sum of term freqs. (non-Javadoc)

Specified by:
trainingProcedure in class Classifier
See Also:
Classifier.trainingProcedure()

setCOF

public void setCOF(boolean cof)
Method to consider bigram frequencies.

Parameters:
cof - The COF flag.

setPOS

public void setPOS(boolean pos)
Method to consider POS tags.

Parameters:
pos - The POS flag.

setStemming

public void setStemming(boolean stems)
Method to consider stems.

Parameters:
stems - The stemming flag.

setSynonyms

public void setSynonyms(boolean syns)
Method to consider synonyms.

Parameters:
syns - The synonyms flag.

setEmotionDims

public void setEmotionDims(boolean emodims)
Method to consider emotion dimensions.

Parameters:
emodims - The emotion dimensions flag.

setNegation

public void setNegation(boolean neg)
Method to consider negations.

Parameters:
neg - The negation flag.

save

public void save(java.lang.String path)
Description copied from class: Classifier
Generic method to save the fully fledged classifier into a given file path. It is recommended to use a plain text file (such as XML) to save the classifier's configuration since it's readable directly.

Specified by:
save in class Classifier
Parameters:
path - The file path to save the classifier.

load

public void load(java.lang.String path)
Description copied from class: Classifier
Generic function to load a previously saved classifier. This function should be consistent with the design followed in the saving procedure.

Specified by:
load in class Classifier
Parameters:
path - The path of the file which contains the previously saved classifier.

resetExamples

public void resetExamples()
Description copied from class: Classifier
Method to reset the classifier and flush the training examples. This method only makes sense if the classifier in question is trainable and already has some training examples.

Overrides:
resetExamples in class Classifier