Package opennlp.tools.doccat
package opennlp.tools.doccat
Package for classifying a document into a category.
-
ClassDescriptionGenerates a feature for each word in a document.Cross validator for
DocumentCategorizer
.A marker interface for evaluatingdoccat
.The factory that provides Doccat default implementations and resources.A model for document categorizationInterface for classes which categorize documents.TheDocumentCategorizerEvaluator
measures the performance of the givenDocumentCategorizer
with the provided referencesamples
.Iterator-like class for modeling document classification events.A Max-Ent based implementation ofDocumentCategorizer
.Class which holds a classified document and its category.Reads in string encoded training samples, parses them and outputsDocumentSample
objects.Interface for generating features for document categorization.Generates ngram features for a document.