Class AncoraSpanishHeadRules

  • All Implemented Interfaces:
    GapLabeler, HeadRules, SerializableArtifact

    public class AncoraSpanishHeadRules
    extends Object
    implements HeadRules, GapLabeler, SerializableArtifact
    Class for storing the Ancora Spanish head rules associated with parsing. In this class headrules for noun phrases are specified. The rest of the rules are in opennlp-tools/lang/es/parser/es-head-rules NOTE: This class has been adapted from The main change is the constituents search direction in the first for loop. Note also the change in the return of the getHead() method: In the lang.en.HeadRules class: return constituents[ci].getHead(); Now: return constituents[ci]; Other changes include removal of deprecated methods.
    • Constructor Detail

      • AncoraSpanishHeadRules

        public AncoraSpanishHeadRules​(Reader rulesReader)
                               throws IOException
        Creates a new set of head rules based on the specified reader.
        rulesReader - the head rules reader.
        IOException - if the head rules reader can not be read.
    • Method Detail

      • getPunctuationTags

        public Set<String> getPunctuationTags()
        Description copied from interface: HeadRules
        Returns the set of punctuation tags. Attachment decisions for these tags will not be modeled.
        Specified by:
        getPunctuationTags in interface HeadRules
        the set of punctuation tags.
      • getHead

        public Parse getHead​(Parse[] constituents,
                             String type)
        Description copied from interface: HeadRules
        Returns the head constituent for the specified constituents of the specified type.
        Specified by:
        getHead in interface HeadRules
        constituents - The constituents which make up a constituent of the specified type.
        type - The type of a constituent which is made up of the specified constituents.
        The constituent which is the head.
      • labelGaps

        public void labelGaps​(Stack<Constituent> stack)
        Description copied from interface: GapLabeler
        Labels the constituents found in the stack with gap labels if appropriate.
        Specified by:
        labelGaps in interface GapLabeler
        stack - The stack of un-completed constituents.
      • serialize

        public void serialize​(Writer writer)
                       throws IOException
        Writes the head rules to the writer in a format suitable for loading the head rules again with the constructor. The encoding must be taken into account while working with the writer and reader.

        After the entries have been written, the writer is flushed. The writer remains open after this method returns.

        writer -
      • hashCode

        public int hashCode()
        hashCode in class Object
      • getArtifactSerializerClass

        public Class<?> getArtifactSerializerClass()
        Description copied from interface: SerializableArtifact
        Retrieves the class which can serialize and recreate this artifact.
        Note: The serializer class must have a public zero argument constructor or an exception is thrown during model serialization/loading.
        Specified by:
        getArtifactSerializerClass in interface SerializableArtifact
        the corresponding ArtifactSerializer class.