Package opennlp.tools.doccat
Package for classifying a document into a category.
-
Interface Summary Interface Description DoccatEvaluationMonitor EvaluationMonitor
for doccat.DocumentCategorizer Interface for classes which categorize documents.FeatureGenerator Interface for generating features for document categorization. -
Class Summary Class Description BagOfWordsFeatureGenerator Generates a feature for each word in a document.DoccatCrossValidator Cross validator for document categorizationDoccatFactory The factory that provides Doccat default implementations and resourcesDoccatModel A model for document categorizationDocumentCategorizerEvaluator TheDocumentCategorizerEvaluator
measures the performance of the givenDocumentCategorizer
with the provided referenceDocumentSample
s.DocumentCategorizerEventStream Iterator-like class for modeling document classification events.DocumentCategorizerME Maxent implementation ofDocumentCategorizer
.DocumentSample Class which holds a classified document and its category.DocumentSampleStream This class reads in string encoded training samples, parses them and outputsDocumentSample
objects.NGramFeatureGenerator Generates ngram features for a document.