Class MascToken

java.lang.Object
opennlp.tools.util.Span
opennlp.tools.formats.masc.MascToken
All Implemented Interfaces:
Serializable, Comparable<Span>

public class MascToken extends Span
A specialized Span to express tokens in documents.
See Also:
  • Constructor Details

    • MascToken

      public MascToken(int s, int e, int pennId, String pos, String base, MascWord[] quarks)
      Initializes a MascToken which may combine multiple quarks.
      Parameters:
      s - The start of the token in the corpus file. Must be equal to or greater than 0.
      e - The end of the token in the corpus file. Must be equal to or greater than 0 and be greater than s.
      pennId - The ID of the token as assigned by the Penn stand-off annotation.
      pos - The POS-tag.
      base - The base form.
      quarks - The array of Quarks contained in the token.
      Throws:
      IllegalArgumentException - Thrown if one of the parameters are invalid.
  • Method Details

    • getTokenId

      public int getTokenId()
      Returns:
      Retrieves the ID of the token.
    • getBase

      public String getBase()
      Returns:
      Retrieves the base form.
    • getPos

      public String getPos()
      Returns:
      Retrieves the POS tag.
    • getQuarks

      public MascWord[] getQuarks()
      Returns:
      Retrieves quarks of the token.