Package opennlp.tools.namefind
Class TokenNameFinderModel
- java.lang.Object
-
- opennlp.tools.util.model.BaseModel
-
- opennlp.tools.namefind.TokenNameFinderModel
-
- All Implemented Interfaces:
Serializable,ArtifactProvider
public class TokenNameFinderModel extends BaseModel
TheTokenNameFinderModelis the model used by a learnableTokenNameFinder.- See Also:
NameFinderME, Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classTokenNameFinderModel.FeatureGeneratorCreationError
-
Field Summary
-
Fields inherited from class opennlp.tools.util.model.BaseModel
TRAINING_CUTOFF_PROPERTY, TRAINING_EVENTHASH_PROPERTY, TRAINING_ITERATIONS_PROPERTY
-
-
Constructor Summary
Constructors Constructor Description TokenNameFinderModel(File modelFile)Initializes aTokenNameFinderModelinstance via a validFile.TokenNameFinderModel(InputStream in)Initializes aTokenNameFinderModelinstance via a validInputStream.TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries)Initializes aTokenNameFinderModelinstance via given parameters.TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, int beamSize, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries)Initializes aTokenNameFinderModelinstance via given parameters.TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, int beamSize, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries, SequenceCodec<String> seqCodec, TokenNameFinderFactory factory)Initializes aTokenNameFinderModelinstance via given parameters.TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, Map<String,Object> resources, Map<String,String> manifestInfoEntries)Initializes aTokenNameFinderModelinstance via given parameters.TokenNameFinderModel(String languageCode, SequenceClassificationModel<String> nameFinderModel, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries, SequenceCodec<String> seqCodec, TokenNameFinderFactory factory)Initializes aTokenNameFinderModelinstance via given parameters.TokenNameFinderModel(URL modelURL)Initializes aTokenNameFinderModelinstance via a validURL.TokenNameFinderModel(Path modelPath)Initializes aTokenNameFinderModelinstance via a validPath.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static Map<String,ArtifactSerializer<?>>createArtifactSerializers()Create theserializers.TokenNameFinderFactorygetFactory()SequenceClassificationModel<String>getNameFinderSequenceModel()SequenceCodec<String>getSequenceCodec()-
Methods inherited from class opennlp.tools.util.model.BaseModel
getArtifact, getLanguage, getManifestProperty, getVersion, isLoadedFromSerialized, serialize, serialize, serialize
-
-
-
-
Constructor Detail
-
TokenNameFinderModel
public TokenNameFinderModel(String languageCode, SequenceClassificationModel<String> nameFinderModel, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries, SequenceCodec<String> seqCodec, TokenNameFinderFactory factory)
Initializes aTokenNameFinderModelinstance via given parameters.- Parameters:
languageCode- The ISO conform language code.nameFinderModel- A validMaxentModel.generatorDescriptor- Thebyte[]representing the feature generator descriptor.resources- Additional resources in a mapping.manifestInfoEntries- Additional information kept in the manifest.seqCodec- TheSequenceCodecto use.factory- TheTokenNameFinderFactoryfor creating related objects.- Throws:
IllegalArgumentException- Thrown if thenamFinderModelincompatible withseqCodec.
-
TokenNameFinderModel
public TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, int beamSize, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries, SequenceCodec<String> seqCodec, TokenNameFinderFactory factory)
Initializes aTokenNameFinderModelinstance via given parameters.- Parameters:
languageCode- The ISO conform language code.nameFinderModel- A validMaxentModel.beamSize- The beam size. Must be greater than0.generatorDescriptor- Thebyte[]representing the feature generator descriptor.resources- Additional resources in a mapping.manifestInfoEntries- Additional information kept in the manifest.seqCodec- TheSequenceCodecto use.factory- TheTokenNameFinderFactoryfor creating related objects.- Throws:
IllegalArgumentException- Thrown if thenamFinderModelincompatible withseqCodec.
-
TokenNameFinderModel
public TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries)
Initializes aTokenNameFinderModelinstance via given parameters.- Parameters:
languageCode- The ISO conform language code.nameFinderModel- A validMaxentModel.generatorDescriptor- Thebyte[]representing the feature generator descriptor.resources- Additional resources in a mapping.manifestInfoEntries- Additional information kept in the manifest.- Throws:
IllegalArgumentException- Thrown if thenamFinderModelincompatible withseqCodec.
-
TokenNameFinderModel
public TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, int beamSize, byte[] generatorDescriptor, Map<String,Object> resources, Map<String,String> manifestInfoEntries)
Initializes aTokenNameFinderModelinstance via given parameters.- Parameters:
languageCode- The ISO conform language code.nameFinderModel- A validMaxentModel.beamSize- The beam size. Must be greater than0.generatorDescriptor- Thebyte[]representing the feature generator descriptor.resources- Additional resources in a mapping.manifestInfoEntries- Additional information kept in the manifest.- Throws:
IllegalArgumentException- Thrown if thenamFinderModelincompatible withseqCodec.
-
TokenNameFinderModel
public TokenNameFinderModel(String languageCode, MaxentModel nameFinderModel, Map<String,Object> resources, Map<String,String> manifestInfoEntries)
Initializes aTokenNameFinderModelinstance via given parameters.- Parameters:
languageCode- The ISO conform language code.nameFinderModel- A validMaxentModel.resources- Additional resources in a mapping.manifestInfoEntries- Additional information kept in the manifest.- Throws:
IllegalArgumentException- Thrown if thenameFinderModelis incompatible withseqCodec.
-
TokenNameFinderModel
public TokenNameFinderModel(InputStream in) throws IOException
Initializes aTokenNameFinderModelinstance via a validInputStream.- Parameters:
in- TheInputStreamused for loading the model.- Throws:
IOException- Thrown if IO errors occurred during initialization.
-
TokenNameFinderModel
public TokenNameFinderModel(File modelFile) throws IOException
Initializes aTokenNameFinderModelinstance via a validFile.- Parameters:
modelFile- TheFileused for loading the model.- Throws:
IOException- Thrown if IO errors occurred during initialization.
-
TokenNameFinderModel
public TokenNameFinderModel(Path modelPath) throws IOException
Initializes aTokenNameFinderModelinstance via a validPath.- Parameters:
modelPath- ThePathused for loading the model.- Throws:
IOException- Thrown if IO errors occurred during initialization.
-
TokenNameFinderModel
public TokenNameFinderModel(URL modelURL) throws IOException
Initializes aTokenNameFinderModelinstance via a validURL.- Parameters:
modelURL- TheURLused for loading the model.- Throws:
IOException- Thrown if IO errors occurred during initialization.
-
-
Method Detail
-
getNameFinderSequenceModel
public SequenceClassificationModel<String> getNameFinderSequenceModel()
- Returns:
- Retrieves a valid
SequenceClassificationModelornullif no matching one could be found.
-
getSequenceCodec
public SequenceCodec<String> getSequenceCodec()
- Returns:
- Retrieves the
SequenceCodecin use.
-
getFactory
public TokenNameFinderFactory getFactory()
- Returns:
- Retrieves the
TokenNameFinderFactoryin use.
-
createArtifactSerializers
public static Map<String,ArtifactSerializer<?>> createArtifactSerializers()
Create theserializers. Currently, for serializers related to features that require external resources, such asW2VClassesDictionaryobjects, the convention is to add its element tag name as key of the serializer map.For example, the element tag name for the
WordClusterFeatureGeneratorwhich usesW2VClassesDictionaryobjects serialized by theW2VClassesDictionarySerializeris 'wordcluster', which is the key used to add the serializer to the map.- Returns:
- A
Mapcontaining the addedserializers.
-
-