Class TokenSampleStream

java.lang.Object
opennlp.tools.tokenize.lang.en.TokenSampleStream
All Implemented Interfaces:
Iterator<opennlp.tools.tokenize.TokenSample>

public class TokenSampleStream extends Object implements Iterator<opennlp.tools.tokenize.TokenSample>
Class which produces an Iterator<TokenSample> from a file of space delimited token. This class uses a number of English-specific heuristics to un-separate tokens which are typically found together in text.
  • Constructor Details

  • Method Details

    • hasNext

      public boolean hasNext()
      Specified by:
      hasNext in interface Iterator<opennlp.tools.tokenize.TokenSample>
    • next

      public opennlp.tools.tokenize.TokenSample next()
      Specified by:
      next in interface Iterator<opennlp.tools.tokenize.TokenSample>
    • remove

      public void remove()
      Specified by:
      remove in interface Iterator<opennlp.tools.tokenize.TokenSample>