Package opennlp.tools.sentdetect
Class EmptyLinePreprocessorStream
- java.lang.Object
-
- opennlp.tools.util.FilterObjectStream<String,String>
-
- opennlp.tools.sentdetect.EmptyLinePreprocessorStream
-
- All Implemented Interfaces:
AutoCloseable
,ObjectStream<String>
@Internal public class EmptyLinePreprocessorStream extends FilterObjectStream<String,String>
ObjectStream
to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.Note: This class is not thread safe.
Note: Do not use this class, internal use only!
-
-
Constructor Summary
Constructors Constructor Description EmptyLinePreprocessorStream(ObjectStream<String> in)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description String
read()
Returns the nextObjectStream
object.-
Methods inherited from class opennlp.tools.util.FilterObjectStream
close, reset
-
-
-
-
Constructor Detail
-
EmptyLinePreprocessorStream
public EmptyLinePreprocessorStream(ObjectStream<String> in)
-
-
Method Detail
-
read
public String read() throws IOException
Description copied from interface:ObjectStream
Returns the nextObjectStream
object. Calling this method repeatedly until it returnsnull
will return each object from the underlying source exactly once.- Returns:
- The next object or
null
to signal that the stream is exhausted. - Throws:
IOException
- Thrown if there is an error during reading.
-
-