Package opennlp.tools.postag
Class POSTaggerME
java.lang.Object
opennlp.tools.postag.POSTaggerME
- All Implemented Interfaces:
POSTagger
A part-of-speech tagger that uses maximum entropy.
Tries to predict whether words are nouns, verbs, or any of 70 other POS tags depending on their surrounding context.
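To make "maximum entropy" concrete: a classifier of this kind scores each candidate tag from weighted context features and then normalizes the scores into a probability distribution. The sketch below shows only that normalization step in plain Java. The class name MaxentSketch, the three-tag set, and the scores are made up for illustration; this is not OpenNLP's implementation.

```java
/** Illustrative only: not part of OpenNLP. */
class MaxentSketch {

    /** Normalizes raw per-tag scores into probabilities that sum to 1 (softmax). */
    static double[] softmax(double[] scores) {
        double max = Double.NEGATIVE_INFINITY;
        for (double s : scores) {
            max = Math.max(max, s);
        }
        double[] probs = new double[scores.length];
        double sum = 0.0;
        for (int i = 0; i < scores.length; i++) {
            // Subtracting the maximum keeps exp() from overflowing.
            probs[i] = Math.exp(scores[i] - max);
            sum += probs[i];
        }
        for (int i = 0; i < probs.length; i++) {
            probs[i] /= sum;
        }
        return probs;
    }

    /** Returns the index of the largest value. */
    static int argmax(double[] values) {
        int best = 0;
        for (int i = 1; i < values.length; i++) {
            if (values[i] > values[best]) {
                best = i;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        String[] tags = {"NOUN", "VERB", "ADJ"};  // hypothetical tag set
        double[] scores = {2.0, 0.5, -1.0};       // hypothetical summed feature weights
        double[] probs = softmax(scores);
        for (int i = 0; i < tags.length; i++) {
            System.out.printf("%s %.3f%n", tags[i], probs[i]);
        }
        System.out.println("predicted: " + tags[argmax(probs)]);
    }
}
```

In a real tagger the scores would come from trained feature weights evaluated on the surrounding words and previously assigned tags, rather than from a fixed array.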
-
Field Summary
Fields
static final int DEFAULT_BEAM_SIZE
-
Constructor Summary
Constructors
Constructor and Description
POSTaggerME(String language)
    Initializes a POSTaggerME by downloading a default model for a given language.
POSTaggerME(String language, POSTagFormat format)
    Initializes a POSTaggerME by downloading a default model for a given language.
POSTaggerME(POSModel model)
    Initializes a POSTaggerME with the provided model.
POSTaggerME(POSModel model, POSTagFormat format)
    Initializes a POSTaggerME with the provided model.
-
Method Summary
Modifier and Type, Method and Description
static Dictionary buildNGramDictionary(ObjectStream<POSSample> samples, int cutoff)
    Constructs an nGram dictionary from an ObjectStream of samples.
String[] getAllPosTags()
String[] getOrderedTags(List<String> words, List<String> tags, int index)
String[] getOrderedTags(List<String> words, List<String> tags, int index, double[] tprobs)
static void populatePOSDictionary(ObjectStream<POSSample> samples, MutableTagDictionary dict, int cutoff)
    Populates a POSDictionary from an ObjectStream of samples.
double[] probs()
void probs(double[] probs)
    Populates the specified array with the probabilities for each tag of the last tagged sentence.
String[][] tag(int numTaggings, String[] sentence)
    Returns at most the specified numTaggings for the specified sentence.
String[] tag(String[] sentence)
    Assigns POS tags to the tokens of the sentence.
String[] tag(String[] sentence, Object[] additionalContext)
    Assigns POS tags to the tokens of the sentence.
Sequence[] topKSequences(String[] sentence)
    Assigns the top-k sequences to the sentence.
Sequence[] topKSequences(String[] sentence, Object[] additionalContext)
    Assigns the top-k sequences to the sentence.
static POSModel train(String languageCode, ObjectStream<POSSample> samples, TrainingParameters trainParams, POSTaggerFactory posFactory)
-
Field Details
-
DEFAULT_BEAM_SIZE
public static final int DEFAULT_BEAM_SIZE
-
-
Constructor Details
-
POSTaggerME
public POSTaggerME(String language) throws IOException
Initializes a POSTaggerME by downloading a default model for a given language.
- Parameters:
language - An ISO-conformant language code.
- Throws:
IOException - Thrown if the model could not be downloaded or saved.
-
POSTaggerME
public POSTaggerME(String language, POSTagFormat format) throws IOException
Initializes a POSTaggerME by downloading a default model for a given language.
- Parameters:
language - An ISO-conformant language code.
format - A valid POSTagFormat.
- Throws:
IOException - Thrown if the model could not be downloaded or saved.
-
POSTaggerME
public POSTaggerME(POSModel model)
Initializes a POSTaggerME with the provided model.
- Parameters:
model - A valid POSModel.
-
POSTaggerME
public POSTaggerME(POSModel model, POSTagFormat format)
Initializes a POSTaggerME with the provided model.
- Parameters:
model - A valid POSModel.
format - A valid POSTagFormat.
-
-
Method Details
-
getAllPosTags
public String[] getAllPosTags()
- Returns:
An array of all possible part-of-speech tags from the tagger.
-
tag
public String[] tag(String[] sentence)
Description copied from interface: POSTagger
Assigns POS tags to the tokens of the sentence.
-
tag
public String[] tag(String[] sentence, Object[] additionalContext)
Description copied from interface: POSTagger
Assigns POS tags to the tokens of the sentence.
-
tag
public String[][] tag(int numTaggings, String[] sentence)
Returns at most the specified numTaggings for the specified sentence.
- Parameters:
numTaggings - The number of taggings to be returned.
sentence - An array of tokens which make up a sentence.
- Returns:
At most the specified number of taggings for the specified sentence.
-
topKSequences
public Sequence[] topKSequences(String[] sentence)
Description copied from interface: POSTagger
Assigns the top-k sequences to the sentence.
- Specified by:
topKSequences in interface POSTagger
- Parameters:
sentence - The sentence of tokens to be tagged.
- Returns:
An array of sequences for each token provided in sentence.
-
topKSequences
public Sequence[] topKSequences(String[] sentence, Object[] additionalContext)
Description copied from interface: POSTagger
Assigns the top-k sequences to the sentence.
- Specified by:
topKSequences in interface POSTagger
- Parameters:
sentence - The sentence of tokens to be tagged.
additionalContext - The context to provide additional information with.
- Returns:
An array of sequences for each token provided in sentence.
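The DEFAULT_BEAM_SIZE field and the top-k return value suggest a beam search over tag sequences. The following is a generic beam search sketch in plain Java, written under that assumption; it is not OpenNLP's code. The names BeamSearchSketch, Candidate, Scorer, and toyScorer, as well as the two-tag set and its probabilities, are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: a generic beam search, not OpenNLP's implementation. */
class BeamSearchSketch {

    /** A partial or complete tag sequence with its accumulated log-probability. */
    static final class Candidate {
        final List<String> tags;
        final double logProb;

        Candidate(List<String> tags, double logProb) {
            this.tags = tags;
            this.logProb = logProb;
        }
    }

    /** Hypothetical scoring hook: one log-probability per tag for token i, given earlier tags. */
    interface Scorer {
        double[] score(String[] sentence, int i, List<String> previousTags);
    }

    /** Returns up to beamSize complete tag sequences, best first. */
    static List<Candidate> topK(String[] sentence, String[] tagSet, Scorer scorer, int beamSize) {
        List<Candidate> beam = new ArrayList<>();
        beam.add(new Candidate(new ArrayList<>(), 0.0));
        for (int i = 0; i < sentence.length; i++) {
            List<Candidate> expanded = new ArrayList<>();
            for (Candidate c : beam) {
                double[] logProbs = scorer.score(sentence, i, c.tags);
                for (int t = 0; t < tagSet.length; t++) {
                    List<String> tags = new ArrayList<>(c.tags);
                    tags.add(tagSet[t]);
                    expanded.add(new Candidate(tags, c.logProb + logProbs[t]));
                }
            }
            // Keep only the beamSize best candidates before moving to the next token.
            expanded.sort((a, b) -> Double.compare(b.logProb, a.logProb));
            beam = new ArrayList<>(expanded.subList(0, Math.min(beamSize, expanded.size())));
        }
        return beam;
    }

    /** Toy scorer over the tag set {NOUN, VERB}; the probabilities are invented. */
    static Scorer toyScorer() {
        return (sentence, i, prev) -> {
            if (i == 0) {
                return new double[] {Math.log(0.7), Math.log(0.3)};
            }
            boolean afterNoun = prev.get(prev.size() - 1).equals("NOUN");
            return afterNoun
                    ? new double[] {Math.log(0.4), Math.log(0.6)}
                    : new double[] {Math.log(0.8), Math.log(0.2)};
        };
    }

    public static void main(String[] args) {
        String[] sentence = {"time", "flies"};
        String[] tagSet = {"NOUN", "VERB"};
        for (Candidate c : topK(sentence, tagSet, toyScorer(), 2)) {
            System.out.println(c.tags + " " + Math.exp(c.logProb));
        }
    }
}
```

A larger beam explores more alternatives at a higher cost; a beam of size 1 degenerates to greedy tagging.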
-
probs
public void probs(double[] probs)
Populates the specified array with the probabilities for each tag of the last tagged sentence.
- Parameters:
probs - An array to put the probabilities into.
-
probs
public double[] probs()
- Returns:
An array with the probabilities for each tag of the last tagged sentence.
-
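getOrderedTags
public String[] getOrderedTags(List<String> words, List<String> tags, int index)
-
getOrderedTags
public String[] getOrderedTags(List<String> words, List<String> tags, int index, double[] tprobs)
-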
train
public static POSModel train(String languageCode, ObjectStream<POSSample> samples, TrainingParameters trainParams, POSTaggerFactory posFactory) throws IOException
- Throws:
IOException
-
buildNGramDictionary
public static Dictionary buildNGramDictionary(ObjectStream<POSSample> samples, int cutoff) throws IOException
Constructs an nGram dictionary from an ObjectStream of samples.
- Parameters:
samples - The ObjectStream to process.
cutoff - A non-negative cut-off value.
- Returns:
A valid Dictionary instance holding nGrams.
- Throws:
IOException - Thrown if IO errors occurred during dictionary construction.
-
populatePOSDictionary
public static void populatePOSDictionary(ObjectStream<POSSample> samples, MutableTagDictionary dict, int cutoff) throws IOException
Populates a POSDictionary from an ObjectStream of samples.
- Parameters:
samples - The ObjectStream to process.
dict - The MutableTagDictionary to use during population.
cutoff - A non-negative cut-off value.
- Throws:
IOException - Thrown if IO errors occurred during dictionary construction.
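One plausible reading of the cut-off is a minimum count: a tag is recorded for a word only if that word/tag pair occurs at least cutoff times in the samples. The plain-Java sketch below illustrates that idea only. Whether the library's threshold is inclusive, and any extra filtering it applies, are assumptions not confirmed here; TagDictionarySketch, build, and demo are hypothetical names and the sample data is invented.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

/** Illustrative only: not OpenNLP's implementation. */
class TagDictionarySketch {

    /** Maps each word to the tags seen with it at least cutoff times (assumed inclusive). */
    static Map<String, Set<String>> build(List<String[]> sentences, List<String[]> tags, int cutoff) {
        // Count every (word, tag) pair across all sentences.
        Map<String, Map<String, Integer>> counts = new TreeMap<>();
        for (int s = 0; s < sentences.size(); s++) {
            String[] words = sentences.get(s);
            String[] wordTags = tags.get(s);
            for (int i = 0; i < words.length; i++) {
                counts.computeIfAbsent(words[i], k -> new TreeMap<>())
                      .merge(wordTags[i], 1, Integer::sum);
            }
        }
        // Keep only the pairs that reach the cut-off.
        Map<String, Set<String>> dict = new TreeMap<>();
        for (Map.Entry<String, Map<String, Integer>> word : counts.entrySet()) {
            for (Map.Entry<String, Integer> tag : word.getValue().entrySet()) {
                if (tag.getValue() >= cutoff) {
                    dict.computeIfAbsent(word.getKey(), k -> new TreeSet<>()).add(tag.getKey());
                }
            }
        }
        return dict;
    }

    /** Builds a dictionary from three invented two-token sentences with cut-off 2. */
    static Map<String, Set<String>> demo() {
        List<String[]> sentences = Arrays.asList(
                new String[] {"the", "run"},
                new String[] {"I", "run"},
                new String[] {"they", "run"});
        List<String[]> tags = Arrays.asList(
                new String[] {"DET", "NOUN"},
                new String[] {"PRON", "VERB"},
                new String[] {"PRON", "VERB"});
        return build(sentences, tags, 2);
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

With a cut-off of 2, only "run" tagged as VERB survives in this toy data, because every other word/tag pair occurs once. A higher cut-off trades coverage for robustness against annotation noise.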
-