Package opennlp.tools.doccat
Package for classifying a document into a category.
-
Interface Summary Interface Description DoccatEvaluationMonitor A marker interface for evaluatingdoccat
.DocumentCategorizer Interface for classes which categorize documents.FeatureGenerator Interface for generating features for document categorization. -
Class Summary Class Description BagOfWordsFeatureGenerator Generates a feature for each word in a document.DoccatCrossValidator Cross validator forDocumentCategorizer
.DoccatFactory The factory that provides Doccat default implementations and resources.DoccatModel A model for document categorizationDocumentCategorizerEvaluator TheDocumentCategorizerEvaluator
measures the performance of the givenDocumentCategorizer
with the provided referencesamples
.DocumentCategorizerEventStream Iterator-like class for modeling document classification events.DocumentCategorizerME A Max-Ent based implementation ofDocumentCategorizer
.DocumentSample Class which holds a classified document and its category.DocumentSampleStream Reads in string encoded training samples, parses them and outputsDocumentSample
objects.NGramFeatureGenerator Generates ngram features for a document.