Package opennlp.tools.sentdetect
package opennlp.tools.sentdetect
Package related to identifying sentence boundaries.
-
ClassDescriptionDefault implementation of the
EndOfSentenceScanner
.Generate event contexts for maxent decisions for sentence detection.ObjectStream
to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.The NewlineSentenceDetector
assumes that sentences are line delimited and recognizes one sentence per non-empty line.Interface forSentenceDetectorME
context generators.A cross validator forsentence detectors
.The interface for sentence detectors, which find the sentence boundaries in a text.TheSentenceDetectorEvaluator
measures the performance of the givenSentenceDetector
with the provided referenceSentenceSample
s.The factory that providesSentenceDetector
default implementations and resourcesA sentence detector for splitting up raw text into sentences.TheSentenceModel
is the model used by a learnableSentenceDetector
.ASentenceSample
contains a document with begin indexes of the individual sentences.This class is a stream filter which reads a sentence by line samples from anObjectStream
and converts them intoSentenceSample
objects.