Package opennlp.tools.tokenize
Class TokenSampleStream
- java.lang.Object
- 
- opennlp.tools.util.FilterObjectStream<String,TokenSample>
- 
- opennlp.tools.tokenize.TokenSampleStream
 
 
- 
- All Implemented Interfaces:
- AutoCloseable,- ObjectStream<TokenSample>
 
 public class TokenSampleStream extends FilterObjectStream<String,TokenSample> This class is a stream filter which reads in string encoded samples and createsTokenSamples out of them. The input string sample is tokenized if a whitespace or the special separator chars occur.Sample: 
 "token1 token2 token3<SPLIT>token4"
 The tokens token1 and token2 are separated by a whitespace, token3 and token3 are separated by the special character sequence, in this case the default split sequence.The sequence must be unique in the input string and is not escaped. 
- 
- 
Constructor SummaryConstructors Constructor Description TokenSampleStream(ObjectStream<String> sentences)TokenSampleStream(ObjectStream<String> sampleStrings, String separatorChars)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description TokenSampleread()Returns the next object.- 
Methods inherited from class opennlp.tools.util.FilterObjectStreamclose, reset
 
- 
 
- 
- 
- 
Constructor Detail- 
TokenSampleStreampublic TokenSampleStream(ObjectStream<String> sampleStrings, String separatorChars) 
 - 
TokenSampleStreampublic TokenSampleStream(ObjectStream<String> sentences) 
 
- 
 - 
Method Detail- 
readpublic TokenSample read() throws IOException Description copied from interface:ObjectStreamReturns the next object. Calling this method repeatedly until it returns null will return each object from the underlying source exactly once.- Returns:
- the next object or null to signal that the stream is exhausted
- Throws:
- IOException- if there is an error during reading
 
 
- 
 
-