| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectemolib.util.proc.TextDataProcessor
emolib.pos.POSTagger
emolib.pos.qtag.SpanishQTag
public class SpanishQTag
The SpanishQTag class performs the Part-Of-Speech (POS) tagging process in Spanish using the QTag library.
In order to obtain a Spanish version of QTag, the guidelines posted on the blog "Pythonner Zone!" Building a Spanish Part-of-Speech Tagger for Java in 5 Easy Steps.... Basically, the steps have been the following:
The necessary files to generate the tagger in Spanish using QTag are available in the dat/dataset/conll02task folder,
This POS tagger makes mistakes. QTag is a probabilistic POS tagger, so it may be inaccurate. The training Spanish corpus also has incoherences. But if used for what it is meant to be (the disambiguation of the function of nouns, verbs and adjectives in a sentence) this tool does its job successfully.
| Field Summary | |
|---|---|
| static java.lang.String | PROP_RESOURCES_PATHThe name of the property indicating the location of the lexicon and matrix Spanish files. | 
| Constructor Summary | |
|---|---|
| SpanishQTag()Main constructor of the SpanishQTag. | |
| Method Summary | |
|---|---|
|  void | applyPOSTagging(TextData inputTextDataObject)Method to perform the POS tagging process. | 
|  void | initialize()Method to initialize the SpanishQTag. | 
|  void | newProperties(PropertySheet ps)This method is called when this configurable component has new data. | 
|  void | register(java.lang.String name,
         Registry registry)Register my properties. | 
| Methods inherited from class emolib.pos.POSTagger | 
|---|
| getData | 
| Methods inherited from class emolib.util.proc.TextDataProcessor | 
|---|
| flush, getName, getPredecessor, setPredecessor, toString | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait | 
| Field Detail | 
|---|
public static final java.lang.String PROP_RESOURCES_PATH
| Constructor Detail | 
|---|
public SpanishQTag()
| Method Detail | 
|---|
public void register(java.lang.String name,
                     Registry registry)
              throws PropertyException
Configurable
register in interface Configurableregister in class POSTaggername - the name of the componentregistry - the registry for this component
PropertyException
public void newProperties(PropertySheet ps)
                   throws PropertyException
Configurable
newProperties in interface ConfigurablenewProperties in class POSTaggerps - a property sheet holding the new data
PropertyException - if there is a problem with the properties.public void initialize()
initialize in interface DataProcessorinitialize in class POSTaggerpublic void applyPOSTagging(TextData inputTextDataObject)
applyPOSTagging in class POSTaggerinputTextDataObject - The TextData object to process.| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||