Package opennlp.tools.doccat
Package for classifying a document into a category.

Interface Summary

Interface                       Description
DoccatEvaluationMonitor         A marker interface for evaluating doccat.
DocumentCategorizer             Interface for classes which categorize documents.
FeatureGenerator                Interface for generating features for document categorization.

Class Summary

Class                           Description
BagOfWordsFeatureGenerator      Generates a feature for each word in a document.
DoccatCrossValidator            Cross validator for DocumentCategorizer.
DoccatFactory                   The factory that provides Doccat default implementations and resources.
DoccatModel                     A model for document categorization.
DocumentCategorizerEvaluator    The DocumentCategorizerEvaluator measures the performance of the given DocumentCategorizer with the provided reference samples.
DocumentCategorizerEventStream  Iterator-like class for modeling document classification events.
DocumentCategorizerME           A Max-Ent based implementation of DocumentCategorizer.
DocumentSample                  Class which holds a classified document and its category.
DocumentSampleStream            Reads in string encoded training samples, parses them and outputs DocumentSample objects.
NGramFeatureGenerator           Generates ngram features for a document.
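
The two feature generators listed above are described only in one line each, so a small standalone sketch may help show what "a feature for each word" and "ngram features" can look like in practice. The code below does not use or reproduce the OpenNLP classes. The class name DoccatFeatureSketch, the method names, the "bow=" and "ng=" prefixes, and the ":" separator are all assumptions chosen for illustration; the library's actual feature strings and signatures may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; this is not the OpenNLP implementation.
final class DoccatFeatureSketch {

    // Emits one feature per token. The "bow=" prefix is an assumed naming scheme.
    static List<String> bagOfWords(String[] tokens) {
        List<String> features = new ArrayList<>();
        for (String token : tokens) {
            features.add("bow=" + token);
        }
        return features;
    }

    // Emits one feature per run of n consecutive tokens, for minGram <= n <= maxGram.
    // The "ng=" prefix and ":" separator are assumed for readability.
    static List<String> nGrams(String[] tokens, int minGram, int maxGram) {
        if (minGram < 1 || maxGram < minGram) {
            throw new IllegalArgumentException("require 1 <= minGram <= maxGram");
        }
        List<String> features = new ArrayList<>();
        for (int start = 0; start < tokens.length; start++) {
            // Stop growing the n-gram once it would run past the end of the document.
            for (int n = minGram; n <= maxGram && start + n <= tokens.length; n++) {
                String[] window = Arrays.copyOfRange(tokens, start, start + n);
                features.add("ng=" + String.join(":", window));
            }
        }
        return features;
    }

    public static void main(String[] args) {
        String[] tokens = {"stocks", "fell", "sharply"};
        System.out.println(bagOfWords(tokens));
        System.out.println(nGrams(tokens, 2, 3));
    }
}
```

In this sketch the document is assumed to be already tokenized into a String array. Feature strings of this kind would then be handed to a classifier, which is the role the summary assigns to DocumentCategorizerME.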