Package opennlp.tools.doccat
Class DocumentSampleStream
java.lang.Object
opennlp.tools.util.FilterObjectStream<String,DocumentSample>
opennlp.tools.doccat.DocumentSampleStream
- All Implemented Interfaces:
AutoCloseable, ObjectStream<DocumentSample>
Reads in string-encoded training samples, parses them, and outputs DocumentSample objects.
Format:
Each line contains one sample document.
The category is the first string in the line, followed by a tab and the whitespace-separated document tokens.
Sample line:
category-string tab-char whitespace-separated-tokens line-break-char(s)
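The line format above can be illustrated with a small standalone sketch. It uses only the Java standard library and is not the OpenNLP implementation: the class DoccatLineFormat, the holder ParsedLine, and the choice to split on the first tab character are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.List;

public class DoccatLineFormat {

    /** Hypothetical holder for one parsed line; not an OpenNLP type. */
    public static final class ParsedLine {
        public final String category;
        public final List<String> tokens;

        ParsedLine(String category, List<String> tokens) {
            this.category = category;
            this.tokens = tokens;
        }
    }

    /**
     * Splits one line into its category (text before the first tab)
     * and its whitespace-separated document tokens (text after it).
     */
    public static ParsedLine parse(String line) {
        int tab = line.indexOf('\t');
        if (tab <= 0) {
            throw new IllegalArgumentException("Missing category or tab: " + line);
        }
        String category = line.substring(0, tab);
        String rest = line.substring(tab + 1).trim();
        if (rest.isEmpty()) {
            throw new IllegalArgumentException("No document tokens: " + line);
        }
        return new ParsedLine(category, Arrays.asList(rest.split("\\s+")));
    }

    public static void main(String[] args) {
        ParsedLine p = parse("sports\tthe team won the match");
        // prints: sports -> [the, team, won, the, match]
        System.out.println(p.category + " -> " + p.tokens);
    }
}
```

In this sketch a line without a tab, or with nothing after the category, is rejected rather than skipped; how DocumentSampleStream itself treats such lines is not stated here.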
Method Summary
Methods inherited from class opennlp.tools.util.FilterObjectStream
close, reset
Constructor Details
DocumentSampleStream
Initializes a DocumentSampleStream instance.
- Parameters:
samples - A plain text line stream.
Method Details
read
Description copied from interface: ObjectStream
Returns the next ObjectStream object. Calling this method repeatedly until it returns null will return each object from the underlying source exactly once.
- Returns:
The next object or null to signal that the stream is exhausted.
- Throws:
IOException - Thrown if there is an error during reading.
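As a rough usage sketch, the read contract described above (keep calling read until it returns null) might look like the following. It assumes the OpenNLP library is on the classpath and relies on names that do not appear on this page: PlainTextByLineStream and InputStreamFactory from opennlp.tools.util as the plain text line stream, and DocumentSample.getCategory(). The class ReadAllSamples and its method are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

import opennlp.tools.doccat.DocumentSample;
import opennlp.tools.doccat.DocumentSampleStream;
import opennlp.tools.util.InputStreamFactory;
import opennlp.tools.util.ObjectStream;
import opennlp.tools.util.PlainTextByLineStream;

public class ReadAllSamples {

    /** Reads every sample from in-memory training data and collects the categories. */
    public static List<String> readCategories(String data) throws IOException {
        // Assumption: an in-memory source stands in for a training file.
        InputStreamFactory in =
            () -> new ByteArrayInputStream(data.getBytes(StandardCharsets.UTF_8));

        List<String> categories = new ArrayList<>();
        // Closing the sample stream also closes the wrapped line stream.
        try (ObjectStream<DocumentSample> samples = new DocumentSampleStream(
                new PlainTextByLineStream(in, StandardCharsets.UTF_8))) {
            DocumentSample sample;
            // read() returns null once the underlying source is exhausted.
            while ((sample = samples.read()) != null) {
                categories.add(sample.getCategory());
            }
        }
        return categories;
    }

    public static void main(String[] args) throws IOException {
        String data = "sports\tthe team won the match\n"
                    + "politics\tthe vote passed today\n";
        System.out.println(readCategories(data));
    }
}
```

The try-with-resources block relies on ObjectStream being AutoCloseable, as listed under the implemented interfaces.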