Package | Description |
---|---|
opennlp.tools.dictionary |
Package related to parsing and storing dictionaries.
|
opennlp.tools.namefind |
Package related to finding proper names and numeric amounts.
|
opennlp.tools.ngram |
Package related to computing and storing n-gram frequencies.
|
opennlp.tools.parser |
Package containing common code for performing full syntactic parsing.
|
opennlp.tools.parser.chunking |
Package containing code for performing full syntactic parsing using shift/reduce-style decisions.
|
opennlp.tools.parser.treeinsert |
Package containing experimental code for performing full syntactic
parsing using attachment decisions.
|
opennlp.tools.postag |
Package related to part-of-speech tagging.
|
opennlp.tools.sentdetect |
Package related to identifying sentece boundries.
|
opennlp.tools.tokenize |
Contains classes related to finding token or words in a string.
|
opennlp.tools.util.featuregen |
This package contains classes for generating sequence features.
|
opennlp.tools.util.model |
Modifier and Type | Method and Description |
---|---|
static Dictionary |
Dictionary.parseOneEntryPerLine(Reader in)
Reads a dictionary which has one entry per line.
|
Constructor and Description |
---|
DictionaryNameFinder(Dictionary dictionary)
Initializes the current instance with the provided dictionary.
|
DictionaryNameFinder(Dictionary dictionary,
String type)
Initialized the current instance with he provided dictionary
and a type.
|
Modifier and Type | Method and Description |
---|---|
Dictionary |
NGramModel.toDictionary()
Creates a dictionary which contain all
StringList which
are in the current NGramModel . |
Dictionary |
NGramModel.toDictionary(boolean caseSensitive)
Creates a dictionary which contains all
StringList s which
are in the current NGramModel . |
Modifier and Type | Field and Description |
---|---|
protected Dictionary |
AbstractParserEventStream.dict |
Modifier and Type | Method and Description |
---|---|
static Dictionary |
AbstractBottomUpParser.buildDictionary(ObjectStream<Parse> data,
HeadRules rules,
int cutoff)
Creates a n-gram dictionary from the specified data stream using the specified
head rule and specified cut-off.
|
static Dictionary |
AbstractBottomUpParser.buildDictionary(ObjectStream<Parse> data,
HeadRules rules,
TrainingParameters params)
Creates a n-gram dictionary from the specified data stream using the specified
head rule and specified cut-off.
|
Constructor and Description |
---|
AbstractParserEventStream(ObjectStream<Parse> d,
HeadRules rules,
ParserEventTypeEnum etype,
Dictionary dict) |
Constructor and Description |
---|
BuildContextGenerator(Dictionary dict) |
ParserEventStream(ObjectStream<Parse> d,
HeadRules rules,
ParserEventTypeEnum etype,
Dictionary dict)
Create an event stream based on the specified data stream of the specified type using
the specified head rules.
|
Constructor and Description |
---|
ParserEventStream(ObjectStream<Parse> d,
HeadRules rules,
ParserEventTypeEnum etype,
Dictionary dict) |
Modifier and Type | Field and Description |
---|---|
protected Dictionary |
POSTaggerME.ngramDictionary |
protected Dictionary |
POSTaggerFactory.ngramDictionary |
Modifier and Type | Method and Description |
---|---|
static Dictionary |
POSTaggerME.buildNGramDictionary(ObjectStream<POSSample> samples,
int cutoff) |
Dictionary |
POSTaggerFactory.getDictionary()
Deprecated.
this will be reduced in visibility and later removed
|
Dictionary |
POSModel.getNgramDictionary()
Retrieves the ngram dictionary.
|
Modifier and Type | Method and Description |
---|---|
static POSTaggerFactory |
POSTaggerFactory.create(String subclassName,
Dictionary ngramDictionary,
TagDictionary posDictionary)
Deprecated.
|
protected void |
POSTaggerFactory.init(Dictionary ngramDictionary,
TagDictionary posDictionary)
Deprecated.
|
void |
POSTaggerFactory.setDictionary(Dictionary ngramDict)
Deprecated.
|
Constructor and Description |
---|
DefaultPOSContextGenerator(Dictionary dict)
Initializes the current instance.
|
DefaultPOSContextGenerator(int cacheSize,
Dictionary dict)
Initializes the current instance.
|
POSTaggerFactory(Dictionary ngramDictionary,
TagDictionary posDictionary)
Deprecated.
this constructor is here for backward compatibility and
is not functional anymore in the training of 1.8.x series models
|
Modifier and Type | Method and Description |
---|---|
Dictionary |
SentenceDetectorFactory.getAbbreviationDictionary() |
Dictionary |
SentenceModel.getAbbreviations() |
Modifier and Type | Method and Description |
---|---|
static SentenceDetectorFactory |
SentenceDetectorFactory.create(String subclassName,
String languageCode,
boolean useTokenEnd,
Dictionary abbreviationDictionary,
char[] eosCharacters) |
protected void |
SentenceDetectorFactory.init(String languageCode,
boolean useTokenEnd,
Dictionary abbreviationDictionary,
char[] eosCharacters) |
static SentenceModel |
SentenceDetectorME.train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations)
Deprecated.
|
static SentenceModel |
SentenceDetectorME.train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
TrainingParameters mlParams)
Deprecated.
|
Constructor and Description |
---|
SentenceDetectorFactory(String languageCode,
boolean useTokenEnd,
Dictionary abbreviationDictionary,
char[] eosCharacters)
Creates a
SentenceDetectorFactory . |
SentenceModel(String languageCode,
MaxentModel sentModel,
boolean useTokenEnd,
Dictionary abbreviations) |
SentenceModel(String languageCode,
MaxentModel sentModel,
boolean useTokenEnd,
Dictionary abbreviations,
char[] eosCharacters)
Deprecated.
Use
SentenceModel.SentenceModel(String, MaxentModel, Map, SentenceDetectorFactory)
instead and pass in a SentenceDetectorFactory |
SentenceModel(String languageCode,
MaxentModel sentModel,
boolean useTokenEnd,
Dictionary abbreviations,
char[] eosCharacters,
Map<String,String> manifestInfoEntries)
Deprecated.
Use
SentenceModel.SentenceModel(String, MaxentModel, Map, SentenceDetectorFactory)
instead and pass in a SentenceDetectorFactory |
SentenceModel(String languageCode,
MaxentModel sentModel,
boolean useTokenEnd,
Dictionary abbreviations,
Map<String,String> manifestInfoEntries) |
Modifier and Type | Method and Description |
---|---|
Dictionary |
TokenizerFactory.getAbbreviationDictionary()
Gets the abbreviation dictionary
|
Dictionary |
TokenizerModel.getAbbreviations() |
Modifier and Type | Method and Description |
---|---|
static TokenizerFactory |
TokenizerFactory.create(String subclassName,
String languageCode,
Dictionary abbreviationDictionary,
boolean useAlphaNumericOptimization,
Pattern alphaNumericPattern)
Factory method the framework uses create a new
TokenizerFactory . |
protected void |
TokenizerFactory.init(String languageCode,
Dictionary abbreviationDictionary,
boolean useAlphaNumericOptimization,
Pattern alphaNumericPattern) |
Constructor and Description |
---|
TokenizerFactory(String languageCode,
Dictionary abbreviationDictionary,
boolean useAlphaNumericOptimization,
Pattern alphaNumericPattern)
Creates a
TokenizerFactory . |
Modifier and Type | Method and Description |
---|---|
void |
DictionaryFeatureGenerator.setDictionary(Dictionary dict) |
void |
DictionaryFeatureGenerator.setDictionary(String name,
Dictionary dict) |
Constructor and Description |
---|
DictionaryFeatureGenerator(Dictionary dict) |
DictionaryFeatureGenerator(String prefix,
Dictionary dict) |
Modifier and Type | Method and Description |
---|---|
Dictionary |
DictionarySerializer.create(InputStream in) |
Modifier and Type | Method and Description |
---|---|
void |
DictionarySerializer.serialize(Dictionary dictionary,
OutputStream out) |
Copyright © 2017 The Apache Software Foundation. All rights reserved.