Package opennlp.tools.doccat
Interface FeatureGenerator
-
- All Known Implementing Classes:
BagOfWordsFeatureGenerator
,NGramFeatureGenerator
public interface FeatureGenerator
Interface for generating features for document categorization.
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description Collection<String>
extractFeatures(String[] text, Map<String,Object> extraInformation)
Extract features from given text fragments
-
-
-
Method Detail
-
extractFeatures
Collection<String> extractFeatures(String[] text, Map<String,Object> extraInformation)
Extract features from given text fragments- Parameters:
text
- the text fragments to extract features fromextraInformation
- optional extra information to be used by the feature generator- Returns:
- a collection of features
-
-