Class DocumentCategorizerEvaluator

java.lang.Object
opennlp.tools.util.eval.Evaluator<opennlp.tools.doccat.DocumentSample>
opennlp.tools.doccat.DocumentCategorizerEvaluator

public class DocumentCategorizerEvaluator extends Evaluator<opennlp.tools.doccat.DocumentSample>
The DocumentCategorizerEvaluator measures the performance of the given DocumentCategorizer with the provided reference samples.
See Also:
  • DocumentCategorizer
  • DocumentSample
  • Evaluator

  • Constructor Details

    • DocumentCategorizerEvaluator

      public DocumentCategorizerEvaluator(opennlp.tools.doccat.DocumentCategorizer categorizer, opennlp.tools.doccat.DoccatEvaluationMonitor... listeners)
      Initializes a DocumentCategorizerEvaluator instance.
      Parameters:
      categorizer - the DocumentCategorizer instance.
      listeners - the evaluation listeners.
  • Method Details

    • processSample

      public opennlp.tools.doccat.DocumentSample processSample(opennlp.tools.doccat.DocumentSample sample)
      Evaluates the given reference sample.

      This is done by categorizing the document from the provided DocumentSample. The detected category is then compared with the sample's reference category, and the result is used to update the accuracy score.

      Parameters:
      sample - The reference DocumentSample.
      Returns:
      The processed DocumentSample.
    • getAccuracy

      public double getAccuracy()
      Retrieves the accuracy of the provided DocumentCategorizer.

      accuracy = correctly categorized documents / total documents

      Returns:
      The accuracy of the provided DocumentCategorizer.
    • getDocumentCount

      public long getDocumentCount()
      Returns:
      The number of documents evaluated so far.
    • toString

      public String toString()
      Returns a human-readable String representation of this object.
      Overrides:
      toString in class Object
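
Example (illustrative sketch only):

The bookkeeping described under processSample and getAccuracy can be sketched in plain Java. This is not the OpenNLP source: the class AccuracySketch, its processSample signature, and the use of a Function as a stand-in for a DocumentCategorizer are all hypothetical names chosen for illustration. Returning 0.0 when no documents have been processed is also an assumption of this sketch.

```java
import java.util.function.Function;

// Hypothetical stand-in for the evaluator's bookkeeping; not the OpenNLP implementation.
final class AccuracySketch {
    private long correct; // documents whose predicted category matched the reference
    private long total;   // all documents processed so far

    // Mirrors the idea of processSample: categorize, compare with the reference, update the score.
    // The Function parameter is an assumed substitute for a real DocumentCategorizer.
    void processSample(String referenceCategory, String[] tokens,
                       Function<String[], String> categorizer) {
        String predicted = categorizer.apply(tokens);
        if (referenceCategory.equals(predicted)) {
            correct++;
        }
        total++;
    }

    // accuracy = correctly categorized documents / total documents
    double getAccuracy() {
        return total == 0 ? 0.0 : (double) correct / total; // 0.0 for "no data" is an assumption
    }

    long getDocumentCount() {
        return total;
    }
}

final class AccuracySketchDemo {
    public static void main(String[] args) {
        // Toy categorizer: more than two tokens means "long", otherwise "short".
        Function<String[], String> toy = t -> t.length > 2 ? "long" : "short";

        AccuracySketch eval = new AccuracySketch();
        eval.processSample("short", new String[] {"hi"}, toy);               // correct
        eval.processSample("long", new String[] {"a", "b", "c"}, toy);       // correct
        eval.processSample("long", new String[] {"oops"}, toy);              // wrong
        eval.processSample("short", new String[] {"two", "tokens"}, toy);    // correct

        System.out.println("documents: " + eval.getDocumentCount());
        System.out.println("accuracy:  " + eval.getAccuracy());
    }
}
```

With the real class, the reference samples would typically be passed through the evaluate or evaluateSample methods inherited from Evaluator, which in turn call processSample; getAccuracy and getDocumentCount are then read once the samples have been consumed.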