Package opennlp.tools.tokenize
Class TokenizerModel
java.lang.Object
  opennlp.tools.util.model.BaseModel
    opennlp.tools.tokenize.TokenizerModel
All Implemented Interfaces:
Serializable, ArtifactProvider
public final class TokenizerModel extends BaseModel
The TokenizerModel is the model used by a learnable Tokenizer.
See Also:
TokenizerME, TokenizerFactory, Serialized Form
Field Summary
Fields inherited from class opennlp.tools.util.model.BaseModel
TRAINING_CUTOFF_PROPERTY, TRAINING_EVENTHASH_PROPERTY, TRAINING_ITERATIONS_PROPERTY
Constructor Summary
Constructor - Description
TokenizerModel(File modelFile) - Initializes a TokenizerModel instance via a valid File.
TokenizerModel(InputStream in) - Initializes a TokenizerModel instance via a valid InputStream.
TokenizerModel(URL modelURL) - Initializes a TokenizerModel instance via a valid URL.
TokenizerModel(Path modelPath) - Initializes a TokenizerModel instance via a valid Path.
TokenizerModel(MaxentModel tokenizerModel, Map<String,String> manifestInfoEntries, TokenizerFactory tokenizerFactory) - Initializes a TokenizerModel instance via a MaxentModel and related resources.
Method Summary
Modifier and Type, Method
boolean equals(Object obj)
Dictionary getAbbreviations()
TokenizerFactory getFactory()
MaxentModel getMaxentModel()
int hashCode()
boolean useAlphaNumericOptimization()
Methods inherited from class opennlp.tools.util.model.BaseModel
getArtifact, getLanguage, getManifestProperty, getVersion, isLoadedFromSerialized, serialize, serialize, serialize
Constructor Detail
TokenizerModel
public TokenizerModel(MaxentModel tokenizerModel, Map<String,String> manifestInfoEntries, TokenizerFactory tokenizerFactory)
Initializes a TokenizerModel instance via a MaxentModel and related resources.
Parameters:
tokenizerModel - The model to be used.
manifestInfoEntries - Additional information kept in the manifest.
tokenizerFactory - The TokenizerFactory to be used internally.
TokenizerModel
public TokenizerModel(InputStream in) throws IOException
Initializes a TokenizerModel instance via a valid InputStream.
Parameters:
in - The InputStream used for loading the model.
Throws:
IOException - Thrown if IO errors occurred during initialization.
TokenizerModel
public TokenizerModel(File modelFile) throws IOException
Initializes a TokenizerModel instance via a valid File.
Parameters:
modelFile - The File used for loading the model.
Throws:
IOException - Thrown if IO errors occurred during initialization.
TokenizerModel
public TokenizerModel(Path modelPath) throws IOException
Initializes a TokenizerModel instance via a valid Path.
Parameters:
modelPath - The Path used for loading the model.
Throws:
IOException - Thrown if IO errors occurred during initialization.
TokenizerModel
public TokenizerModel(URL modelURL) throws IOException
Initializes a TokenizerModel instance via a valid URL.
Parameters:
modelURL - The URL used for loading the model.
Throws:
IOException - Thrown if IO errors occurred during initialization.
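The loading constructors above all declare IOException, so callers need to decide how to handle a missing or unreadable model. The following is a minimal sketch, not an official usage pattern: it assumes the OpenNLP tools library is on the classpath, and the class name TokenizerModelLoading, the helper tryLoad, and the file name en-token.bin are hypothetical names chosen for illustration.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;

import opennlp.tools.tokenize.TokenizerME;
import opennlp.tools.tokenize.TokenizerModel;

class TokenizerModelLoading {

    /**
     * Hypothetical helper: loads a model via the Path constructor.
     * Returns an empty Optional if the constructor throws an IOException,
     * for example because the file does not exist.
     */
    static Optional<TokenizerModel> tryLoad(Path modelPath) {
        try {
            return Optional.of(new TokenizerModel(modelPath));
        } catch (IOException e) {
            System.err.println("Could not load tokenizer model: " + e.getMessage());
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        // Hypothetical default location; pass a real model file as the first argument.
        Path modelPath = Paths.get(args.length > 0 ? args[0] : "en-token.bin");

        tryLoad(modelPath).ifPresent(model -> {
            // TokenizerME is the learnable tokenizer that consumes the model.
            TokenizerME tokenizer = new TokenizerME(model);
            for (String token : tokenizer.tokenize("Mr. Smith arrived at 10 a.m. today.")) {
                System.out.println(token);
            }
        });
    }
}
```

The File, URL, and InputStream constructors can be swapped in the same way; only the argument type changes.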
Method Detail
-
getFactory
public TokenizerFactory getFactory()
Returns:
The active TokenizerFactory.
getMaxentModel
public MaxentModel getMaxentModel()
Returns:
The model as a MaxentModel instance.
getAbbreviations
public Dictionary getAbbreviations()
Returns:
The active abbreviation Dictionary.
useAlphaNumericOptimization
public boolean useAlphaNumericOptimization()
Returns:
true if alphanumeric optimization is active, false otherwise.
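As an illustration of the getters documented above, the sketch below prints a one-line summary of a loaded model. It is a hedged example rather than a recommended pattern: the class name TokenizerModelInspection, the helpers format and describe, and the file name en-token.bin are hypothetical. It also assumes that getAbbreviations() may return null when the model carries no abbreviation dictionary, and it relies on getLanguage() inherited from BaseModel and on Dictionary.asStringSet().

```java
import java.io.IOException;
import java.nio.file.Paths;

import opennlp.tools.dictionary.Dictionary;
import opennlp.tools.tokenize.TokenizerModel;

class TokenizerModelInspection {

    /** Hypothetical helper: formats the values read from a model into one line. */
    static String format(String language, boolean alphaNumericOptimization, int abbreviationCount) {
        return "language=" + language
            + ", alphaNumericOptimization=" + alphaNumericOptimization
            + ", abbreviations=" + abbreviationCount;
    }

    /** Hypothetical helper: reads the settings exposed by the TokenizerModel getters. */
    static String describe(TokenizerModel model) {
        Dictionary abbreviations = model.getAbbreviations();
        // Assumption: a model without an abbreviation dictionary returns null here.
        int count = (abbreviations == null) ? 0 : abbreviations.asStringSet().size();
        return format(model.getLanguage(), model.useAlphaNumericOptimization(), count);
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical model location; pass a real model file as the first argument.
        TokenizerModel model =
            new TokenizerModel(Paths.get(args.length > 0 ? args[0] : "en-token.bin"));
        System.out.println(describe(model));
        System.out.println("factory=" + model.getFactory());
    }
}
```

Keeping the string formatting separate from the model access makes the summary logic checkable without a model file.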