Class BrownCluster

  • All Implemented Interfaces:

    public class BrownCluster
    extends Object
    implements SerializableArtifact
    Class to load a Brown cluster document: word\tword_class\tprob The file containing the clustering lexicon has to be passed as the value of the dict attribute of each BrownCluster feature generator.
    • Constructor Detail

      • BrownCluster

        public BrownCluster​(InputStream in)
                     throws IOException
        Generates the token to cluster map from Brown cluster input file. NOTE: we only add those tokens with frequency bigger than 5.
        in - the inputstream
        IOException - the io exception
    • Method Detail

      • lookupToken

        public String lookupToken​(String string)
        Check if a token is in the Brown:paths, token map.
        string - the token to look-up
        the brown class if such token is in the brown cluster map
      • getArtifactSerializerClass

        public Class<?> getArtifactSerializerClass()
        Description copied from interface: SerializableArtifact
        Retrieves the class which can serialize and recreate this artifact.
        Note: The serializer class must have a public zero argument constructor or an exception is thrown during model serialization/loading.
        Specified by:
        getArtifactSerializerClass in interface SerializableArtifact
        the corresponding ArtifactSerializer class.