Package opennlp.tools.util.featuregen
Class CharacterNgramFeatureGenerator
java.lang.Object
opennlp.tools.util.featuregen.CharacterNgramFeatureGenerator
- All Implemented Interfaces:
- AdaptiveFeatureGenerator
The 
CharacterNgramFeatureGenerator uses character ngrams to
 generate features about each token.- See Also:
- 
Constructor SummaryConstructorsConstructorDescriptionInitializes aCharacterNgramFeatureGeneratorwith default values for ngrams.CharacterNgramFeatureGenerator(int minLength, int maxLength) Initializes aCharacterNgramFeatureGeneratorwith the specified parameters.
- 
Method SummaryModifier and TypeMethodDescriptionvoidcreateFeatures(List<String> features, String[] tokens, int index, String[] preds) Adds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.Methods inherited from class java.lang.Objectequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface opennlp.tools.util.featuregen.AdaptiveFeatureGeneratorclearAdaptiveData, updateAdaptiveData
- 
Constructor Details- 
CharacterNgramFeatureGeneratorpublic CharacterNgramFeatureGenerator(int minLength, int maxLength) Initializes aCharacterNgramFeatureGeneratorwith the specified parameters.- Parameters:
- minLength- The minimum length to use. Must not be negative.
- maxLength- The maximum length to use. Must not be negative.
 
- 
CharacterNgramFeatureGeneratorpublic CharacterNgramFeatureGenerator()Initializes aCharacterNgramFeatureGeneratorwith default values for ngrams. The minimal length is set to2and maximum length to a value of5.
 
- 
- 
Method Details- 
createFeaturesDescription copied from interface:AdaptiveFeatureGeneratorAdds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.- Specified by:
- createFeaturesin interface- AdaptiveFeatureGenerator
- Parameters:
- features- The list of features to be added to.
- tokens- The tokens of the sentence or other text unit being processed.
- index- The index of the token which is currently being processed.
- preds- The outcomes for the tokens prior to the specified index.
 
 
-