java.lang.Object
emolib.util.proc.TextDataProcessor
emolib.stemmer.Stemmer
emolib.stemmer.snowball.GenericSnowballStemmer
public class GenericSnowballStemmer
The GenericSnowballStemmer class performs the stemming process using the Snowball library.
This class accepts two parameters through the Configuration Manager: "language", which selects the stemming algorithm and therefore the language in use, and "iterations", which sets the number of stemming passes performed on each incoming word. Suffixes are removed one at a time, starting at the end of the word and working towards the beginning.
Only words that may carry affective content are stemmed. This choice follows from the indexing goal that stemming pursues in Information Retrieval (IR). For background, see the article "Snowball: A language for stemming algorithms".
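The effect of the "iterations" parameter can be pictured with a small, self-contained sketch. It is an illustration only and does not use EmoLib or the Snowball library: the class `IterativeStemmingSketch`, the suffix list, and the helpers `stripOneSuffix` and `stem` are hypothetical names, and the toy suffix rule merely stands in for a real Snowball algorithm. The sketch assumes that each pass removes at most one suffix and that passes stop early once the word no longer changes.

```java
import java.util.function.UnaryOperator;

// Hypothetical illustration; not part of EmoLib or Snowball.
class IterativeStemmingSketch {

    // Toy suffix list (an assumption for this sketch, not a Snowball rule set).
    static final String[] SUFFIXES = {"ness", "ing", "ful", "ly", "s"};

    // One stemming pass: strip at most one suffix from the end of the word,
    // keeping a stem of at least three characters.
    static String stripOneSuffix(String word) {
        for (String suffix : SUFFIXES) {
            int stemLength = word.length() - suffix.length();
            if (word.endsWith(suffix) && stemLength >= 3) {
                return word.substring(0, stemLength);
            }
        }
        return word; // no suffix matched, word is left unchanged
    }

    // Apply the single-pass stemmer up to `iterations` times,
    // stopping early when a pass leaves the word unchanged.
    static String stem(String word, int iterations, UnaryOperator<String> singlePass) {
        String current = word;
        for (int i = 0; i < iterations; i++) {
            String next = singlePass.apply(current);
            if (next.equals(current)) {
                break;
            }
            current = next;
        }
        return current;
    }

    public static void main(String[] args) {
        String word = "hopefulness";
        for (int n = 0; n <= 3; n++) {
            System.out.println(n + " -> "
                    + stem(word, n, IterativeStemmingSketch::stripOneSuffix));
        }
    }
}
```

With this toy rule set, one pass turns "hopefulness" into "hopeful" and a second pass yields "hope"; further passes change nothing. A real Snowball stemmer may already remove several suffixes in a single pass, so the number of iterations needed in practice depends on the selected language algorithm.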
| Field Summary | |
|---|---|
| static java.lang.String | PROP_ITERATIONS | 
| static java.lang.String | PROP_LANGUAGE: The name of the property indicating the language of this Stemmer. | 
| Constructor Summary | |
|---|---|
| GenericSnowballStemmer(): Main constructor of the GenericSnowballStemmer. | |
| Method Summary | |
|---|---|
| void | applyStemming(TextData inputTextDataObject): Method to perform the stemming process. | 
| void | initialize(): Method to initialize the GenericSnowballStemmer. | 
| void | newProperties(PropertySheet ps): This method is called when this configurable component has new data. | 
| void | register(java.lang.String name, Registry registry): Register my properties. | 
| Methods inherited from class emolib.stemmer.Stemmer | 
|---|
| getData | 
| Methods inherited from class emolib.util.proc.TextDataProcessor | 
|---|
| flush, getName, getPredecessor, setPredecessor, toString | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait | 
| Field Detail | 
|---|
public static final java.lang.String PROP_LANGUAGE
public static final java.lang.String PROP_ITERATIONS
| Constructor Detail | 
|---|
public GenericSnowballStemmer()
| Method Detail | 
|---|
public void register(java.lang.String name,
                     Registry registry)
              throws PropertyException
Description copied from interface: Configurable
register in interface Configurable
register in class Stemmer
Parameters:
name - the name of the component
registry - the registry for this component
Throws:
PropertyException
public void newProperties(PropertySheet ps)
                   throws PropertyException
Description copied from interface: Configurable
newProperties in interface Configurable
newProperties in class Stemmer
Parameters:
ps - a property sheet holding the new data
Throws:
PropertyException - if there is a problem with the properties.
public void initialize()
initialize in interface DataProcessor
initialize in class Stemmer
public void applyStemming(TextData inputTextDataObject)
applyStemming in class Stemmer
Parameters:
inputTextDataObject - The TextData object to process.