Package opennlp.tools.util.featuregen
Class WordClusterFeatureGenerator
java.lang.Object
opennlp.tools.util.featuregen.WordClusterFeatureGenerator
- All Implemented Interfaces:
- AdaptiveFeatureGenerator
An 
AdaptiveFeatureGenerator implementation of a word cluster feature generator.
 It is based on a pre-defined WordClusterDictionary.- 
Constructor SummaryConstructorsConstructorDescriptionWordClusterFeatureGenerator(WordClusterDictionary dict, String dictResourceKey, boolean lowerCaseDictionary) Instantiates aWordClusterFeatureGeneratorvia a specifiedWordClusterDictionary.
- 
Method SummaryModifier and TypeMethodDescriptionvoidcreateFeatures(List<String> features, String[] tokens, int index, String[] previousOutcomes) Adds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.Methods inherited from class java.lang.Objectequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface opennlp.tools.util.featuregen.AdaptiveFeatureGeneratorclearAdaptiveData, updateAdaptiveData
- 
Constructor Details- 
WordClusterFeatureGeneratorpublic WordClusterFeatureGenerator(WordClusterDictionary dict, String dictResourceKey, boolean lowerCaseDictionary) Instantiates aWordClusterFeatureGeneratorvia a specifiedWordClusterDictionary.- Parameters:
- dict- The token- dictionaryto use.
- dictResourceKey- The prefix to use for detected features. Typically, the value for this prefix should be- "dict".
- lowerCaseDictionary-- trueif tokens will be lower-cased during dictionary lookup,- falseotherwise.
 
 
- 
- 
Method Details- 
createFeaturespublic void createFeatures(List<String> features, String[] tokens, int index, String[] previousOutcomes) Description copied from interface:AdaptiveFeatureGeneratorAdds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.- Specified by:
- createFeaturesin interface- AdaptiveFeatureGenerator
- Parameters:
- features- The list of features to be added to.
- tokens- The tokens of the sentence or other text unit being processed.
- index- The index of the token which is currently being processed.
- previousOutcomes- The outcomes for the tokens prior to the specified index.
 
 
-