Class TokSpanEventStream

java.lang.Object
opennlp.tools.util.AbstractEventStream<opennlp.tools.tokenize.TokenSample>
opennlp.tools.tokenize.TokSpanEventStream
All Implemented Interfaces:
AutoCloseable, opennlp.tools.util.ObjectStream<opennlp.tools.ml.model.Event>

public class TokSpanEventStream extends AbstractEventStream<opennlp.tools.tokenize.TokenSample>
This class reads the samples via an Iterator and converts the samples into events which can be used by the maxent library for training.
  • Constructor Summary

    Constructors
    Constructor
    Description
    TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics)
    Initializes a new event stream based on the data stream using a TokenContextGenerator that relies on a DefaultTokenContextGenerator.
    TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics, Pattern alphaNumeric, opennlp.tools.tokenize.TokenContextGenerator cg)
    Initializes a new event stream based on the data stream using a TokenContextGenerator.
    TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics, opennlp.tools.tokenize.TokenContextGenerator cg)
    Initializes a new event stream based on the data stream using a TokenContextGenerator.
  • Method Summary

    Methods inherited from class AbstractEventStream

    close, read, reset

    Methods inherited from class Object

    equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • TokSpanEventStream

      public TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics, Pattern alphaNumeric, opennlp.tools.tokenize.TokenContextGenerator cg)
      Initializes a new event stream based on the data stream using a TokenContextGenerator.
      Parameters:
      tokenSamples - The data stream for this event stream.
      skipAlphaNumerics - Whether alphanumerics are skipped, or not.
      alphaNumeric - A custom alphanumeric Pattern or null. Default is: "^[A-Za-z0-9]+$", provided by Factory.DEFAULT_ALPHANUMERIC.
      cg - A TokenContextGenerator which should be used for the event stream d.
    • TokSpanEventStream

      public TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics, opennlp.tools.tokenize.TokenContextGenerator cg)
      Initializes a new event stream based on the data stream using a TokenContextGenerator.
      Parameters:
      tokenSamples - The data stream for this event stream.
      skipAlphaNumerics - Whether alphanumerics are skipped, or not.
      cg - A TokenContextGenerator which should be used for the event stream d.
    • TokSpanEventStream

      public TokSpanEventStream(opennlp.tools.util.ObjectStream<opennlp.tools.tokenize.TokenSample> tokenSamples, boolean skipAlphaNumerics)
      Initializes a new event stream based on the data stream using a TokenContextGenerator that relies on a DefaultTokenContextGenerator.
      Parameters:
      tokenSamples - The data stream for this event stream.
      skipAlphaNumerics - Whether alphanumerics are skipped, or not.