Package opennlp.tools.tokenize.lang.en
Class TokenSampleStream
java.lang.Object
opennlp.tools.tokenize.lang.en.TokenSampleStream
- All Implemented Interfaces:
Iterator<TokenSample>
Class which produces an Iterator<TokenSample> from a file of space delimited token.
This class uses a number of English-specific heuristics to un-separate tokens which
are typically found together in text.
-
Constructor Details
-
TokenSampleStream
- Throws:
IOException
-
-
Method Details
-
hasNext
public boolean hasNext()- Specified by:
hasNext
in interfaceIterator<TokenSample>
-
next
- Specified by:
next
in interfaceIterator<TokenSample>
-
remove
public void remove()- Specified by:
remove
in interfaceIterator<TokenSample>
-