Interface FeatureGenerator

All Known Implementing Classes:
BagOfWordsFeatureGenerator, NGramFeatureGenerator

public interface FeatureGenerator
Interface for generating features for document categorization.
  • Method Details

    • extractFeatures

      Collection<String> extractFeatures(String[] text, Map<String,Object> extraInformation)
      Extracts features from the given text fragments.
      Parameters:
      text - The text fragments to extract features from
      extraInformation - Optional extra information to be used by the FeatureGenerator.
      Returns:
      A collection of features.
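
The contract above can be illustrated with a minimal sketch of a custom implementation. To keep it compilable without the library on the classpath, the sketch declares a local stand-in interface that copies the documented signature; in real use you would implement the library's own FeatureGenerator instead. The class names LowercaseTokenFeatureGenerator and FeatureGeneratorSketch, and the "lc=" feature prefix, are hypothetical and chosen only for illustration. The sketch also assumes each element of text is a single token, which this page does not state.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Local stand-in mirroring the signature documented above (an assumption made
// so the sketch is self-contained; it is not the library's own type).
interface FeatureGenerator {
    Collection<String> extractFeatures(String[] text, Map<String, Object> extraInformation);
}

// Hypothetical implementation: emits one feature per non-empty fragment,
// lowercased and prefixed with "lc=".
class LowercaseTokenFeatureGenerator implements FeatureGenerator {
    @Override
    public Collection<String> extractFeatures(String[] text, Map<String, Object> extraInformation) {
        List<String> features = new ArrayList<>(text.length);
        for (String token : text) {
            // Skip null or empty fragments so they do not become features.
            if (token != null && !token.isEmpty()) {
                features.add("lc=" + token.toLowerCase(Locale.ROOT));
            }
        }
        return features;
    }
}

class FeatureGeneratorSketch {
    public static void main(String[] args) {
        FeatureGenerator generator = new LowercaseTokenFeatureGenerator();
        String[] tokens = {"Breaking", "News", "", "today"};
        // This implementation ignores extraInformation, so an empty map is passed.
        System.out.println(generator.extractFeatures(tokens, Collections.emptyMap()));
        // prints [lc=breaking, lc=news, lc=today]
    }
}
```

Returning a List keeps the features in input order and preserves duplicates. Whether a caller relies on either property is not specified on this page, so treat that as a design choice of the sketch.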