Package opennlp.tools.sentdetect
Package related to identifying sentence boundaries.
-
Interface Summary Interface Description EndOfSentenceScanner SDContextGenerator Interface forSentenceDetectorMEcontext generators.SentenceDetector The interface for sentence detectors, which find the sentence boundaries in a text.SentenceDetectorEvaluationMonitor -
Class Summary Class Description DefaultEndOfSentenceScanner Default implementation of theEndOfSentenceScanner.DefaultSDContextGenerator Generate event contexts for maxent decisions for sentence detection.EmptyLinePreprocessorStream ObjectStreamto clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.NewlineSentenceDetector The NewlineSentenceDetectorassumes that sentences are line delimited and recognizes one sentence per non-empty line.SDCrossValidator A cross validator forsentence detectors.SDEventStream SentenceDetectorEvaluator TheSentenceDetectorEvaluatormeasures the performance of the givenSentenceDetectorwith the provided referenceSentenceSamples.SentenceDetectorFactory The factory that providesSentenceDetectordefault implementations and resourcesSentenceDetectorME A sentence detector for splitting up raw text into sentences.SentenceModel TheSentenceModelis the model used by a learnableSentenceDetector.SentenceSample ASentenceSamplecontains a document with begin indexes of the individual sentences.SentenceSampleStream This class is a stream filter which reads a sentence by line samples from anObjectStreamand converts them intoSentenceSampleobjects.