Class BrownCluster

java.lang.Object
opennlp.tools.util.featuregen.BrownCluster
All Implemented Interfaces:
opennlp.tools.util.model.SerializableArtifact

public class BrownCluster extends Object implements opennlp.tools.util.model.SerializableArtifact
Class to load a Brown cluster document in the format: word\tword_class\tprob.

Originally available at: http://metaoptimize.com/projects/wordreprs/.

Further details can be found in the related research paper.

The file containing the clustering lexicon has to be passed as the value of the dict attribute of each BrownCluster feature generator.

  • Constructor Details

  • Method Details

    • lookupToken

      public String lookupToken(String string)
      Check if a token is in the Brown:paths, token map.
      Parameters:
      string - the token to look-up
      Returns:
      the brown class if such token is in the brown cluster map.
    • serialize

      public void serialize(OutputStream out) throws IOException
      Throws:
      IOException
    • getArtifactSerializerClass

      public Class<?> getArtifactSerializerClass()
      Specified by:
      getArtifactSerializerClass in interface opennlp.tools.util.model.SerializableArtifact