Scans Strings, StringBuffers, and char arrays for the offsets of sentence ending characters.
The interface for sentence detectors, which find the sentence boundaries in a text.
use DefaultEndOfSentenceScanner instead
Default implementation of the
Generate event contexts for maxent decisions for sentence detection.
Stream to to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.
The Newline Sentence Detector assumes that sentences are line delimited and recognizes one sentence per non-empty line.
A cross validator for the sentence detector.
The factory that provides SentenceDetecor default implementations and resources
A sentence detector for splitting up raw text into sentences.
This class is a stream filter which reads a sentence by line samples from a
Copyright © 2017 The Apache Software Foundation. All rights reserved.