Class CharacterNgramFeatureGenerator

java.lang.Object
opennlp.tools.util.featuregen.CharacterNgramFeatureGenerator
All Implemented Interfaces:
AdaptiveFeatureGenerator

public class CharacterNgramFeatureGenerator extends Object implements AdaptiveFeatureGenerator
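The CharacterNgramFeatureGenerator generates features for each token from the token's character ngrams, that is, the substrings of the token whose lengths fall between a minimum and a maximum length.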
  • Constructor Details

    • CharacterNgramFeatureGenerator

      public CharacterNgramFeatureGenerator(int minLength, int maxLength)
      Parameters:
      minLength - The minimum ngram length to use. Must not be negative.
      maxLength - The maximum ngram length to use. Must not be negative.
    • CharacterNgramFeatureGenerator

      public CharacterNgramFeatureGenerator()
      Initializes a CharacterNgramFeatureGenerator with a minimum ngram length of 2 and a maximum ngram length of 5.
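To illustrate how the two length bounds select ngrams, here is a minimal, self-contained sketch. It is not the OpenNLP implementation: the class CharNgramSketch and the helper charNgrams are hypothetical names. It also assumes that both bounds are inclusive and that a minimum of 0 is treated as 1, so that no empty ngrams are produced.

```java
import java.util.ArrayList;
import java.util.List;

class CharNgramSketch {

    // Hypothetical helper, not part of OpenNLP: returns every substring of
    // token whose length lies between minLength and maxLength (assumed inclusive).
    static List<String> charNgrams(String token, int minLength, int maxLength) {
        if (minLength < 0 || maxLength < 0) {
            throw new IllegalArgumentException("lengths must not be negative");
        }
        List<String> ngrams = new ArrayList<>();
        // Assumption: start at length 1 at the earliest, to skip empty ngrams.
        for (int n = Math.max(minLength, 1); n <= maxLength; n++) {
            for (int start = 0; start + n <= token.length(); start++) {
                ngrams.add(token.substring(start, start + n));
            }
        }
        return ngrams;
    }

    public static void main(String[] args) {
        // Uses the default bounds named above (2 and 5).
        System.out.println(charNgrams("Paris", 2, 5));
        // prints [Pa, ar, ri, is, Par, ari, ris, Pari, aris, Paris]
    }
}
```

A token shorter than the minimum length yields no ngrams under this sketch, so short tokens contribute no features of this kind.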
  • Method Details

    • createFeatures

      public void createFeatures(List<String> features, String[] tokens, int index, String[] preds)
      Description copied from interface: AdaptiveFeatureGenerator
      Adds the appropriate features for the token at the specified index to the specified list of features, taking into account the specified array of previous outcomes (preds).
      Specified by:
      createFeatures in interface AdaptiveFeatureGenerator
      Parameters:
      features - The list of features to be added to.
      tokens - The tokens of the sentence or other text unit being processed.
      index - The index of the token which is currently being processed.
      preds - The outcomes for the tokens prior to the specified index.
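The createFeatures contract (append to the caller's list, look only at tokens[index]) can be sketched as follows. To stay runnable without OpenNLP on the classpath, this uses a hypothetical stand-in interface, FeatureGenerator, that mirrors the documented signature. The "ng=" feature prefix, the lower-casing of the token, and the fact that preds is ignored are all assumptions made for illustration, not confirmed behaviour of the library.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class CreateFeaturesSketch {

    // Stand-in mirroring the createFeatures signature documented above;
    // the real interface is opennlp.tools.util.featuregen.AdaptiveFeatureGenerator.
    interface FeatureGenerator {
        void createFeatures(List<String> features, String[] tokens, int index, String[] preds);
    }

    // Hypothetical generator: appends one feature per character ngram of
    // tokens[index]. The "ng=" prefix and lower-casing are assumptions.
    static FeatureGenerator ngramGenerator(int minLength, int maxLength) {
        return (features, tokens, index, preds) -> {
            String token = tokens[index].toLowerCase(Locale.ROOT);
            for (int n = Math.max(minLength, 1); n <= maxLength; n++) {
                for (int start = 0; start + n <= token.length(); start++) {
                    features.add("ng=" + token.substring(start, start + n));
                }
            }
        };
    }

    // Convenience wrapper: runs the generator on one token and returns its features.
    static List<String> featuresFor(String[] tokens, int index, int minLength, int maxLength) {
        List<String> features = new ArrayList<>();
        // preds is passed through but ignored by this sketch.
        ngramGenerator(minLength, maxLength).createFeatures(features, tokens, index, new String[0]);
        return features;
    }

    public static void main(String[] args) {
        String[] tokens = {"in", "Rome"};
        System.out.println(featuresFor(tokens, 1, 2, 3));
        // prints [ng=ro, ng=om, ng=me, ng=rom, ng=ome]
    }
}
```

Because the method appends to the list it is given rather than returning a new one, several generators can be run in sequence over the same features list.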