Package opennlp.tools.lemmatizer
Class DefaultLemmatizerContextGenerator
java.lang.Object
opennlp.tools.lemmatizer.DefaultLemmatizerContextGenerator
- All Implemented Interfaces:
LemmatizerContextGenerator, BeamSearchContextGenerator<String>
Simple feature generator for learning statistical lemmatizers.
Features based on Grzegorz Chrupała. 2008. Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. PhD dissertation, Dublin City University.
Constructor Summary
DefaultLemmatizerContextGenerator()
Method Summary
Modifier and Type, Method, Description:
- String[] getContext(int index, String[] sequence, String[] priorDecisions, Object[] additionalContext)
Returns the context for the specified position in the specified sequence (list).
- String[] getContext(int index, String[] toks, String[] tags, String[] preds)
Returns the contexts for lemmatizing the token at the specified index.
Constructor Details
DefaultLemmatizerContextGenerator
public DefaultLemmatizerContextGenerator()
Method Details
getContext
public String[] getContext(int index, String[] sequence, String[] priorDecisions, Object[] additionalContext)
Description copied from interface: BeamSearchContextGenerator
Returns the context for the specified position in the specified sequence (list).
- Specified by:
getContext in interface BeamSearchContextGenerator<String>
- Parameters:
index - The index of the sequence.
sequence - The sequence of items over which the beam search is performed.
priorDecisions - The sequence of decisions made prior to the context for which this decision is being made.
additionalContext - Any additional context specific to a class implementing this interface.
- Returns:
The context for the specified index in the specified sequence.
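A class that implements both interfaces can have the generic beam-search overload unpack additionalContext and delegate to the typed overload. The sketch below shows that pattern in isolation, without OpenNLP on the classpath. It is an assumption, not something this page states, that the POS tags arrive as a String[] in additionalContext[0]; the class name DelegationSketch and its three toy features are hypothetical.

```java
// Hypothetical illustration of overload delegation; not the OpenNLP source.
class DelegationSketch {

  // Generic shape: extra inputs arrive untyped in additionalContext.
  static String[] getContext(int index, String[] sequence,
                             String[] priorDecisions, Object[] additionalContext) {
    // Assumption: slot 0 carries the POS tags for `sequence`.
    String[] tags = (String[]) additionalContext[0];
    // All arguments are String[] here, so Java picks the more specific overload.
    return getContext(index, sequence, tags, priorDecisions);
  }

  // Typed shape: tokens, tags and previous decisions are passed explicitly.
  static String[] getContext(int index, String[] toks, String[] tags, String[] preds) {
    // Toy features, for illustration only.
    String previous = index < 1 ? "bos" : preds[index - 1];
    return new String[] {
      "w0=" + toks[index],
      "t0=" + tags[index],
      "p_1=" + previous
    };
  }

  public static void main(String[] args) {
    String[] toks = {"The", "cats"};
    String[] tags = {"DT", "NNS"};
    String[] prior = {"O"}; // hypothetical decision already made for "The"
    String[] ctx = getContext(1, toks, prior, new Object[] {tags});
    System.out.println(String.join(" ", ctx));
  }
}
```

Note the argument order: the generic overload takes priorDecisions third, while the typed overload takes tags third and preds last, so the delegation has to swap them.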
getContext
public String[] getContext(int index, String[] toks, String[] tags, String[] preds)
Description copied from interface: LemmatizerContextGenerator
Returns the contexts for lemmatizing the token at the specified index.
- Specified by:
getContext in interface LemmatizerContextGenerator
- Parameters:
index - The index of the token in the specified toks array for which the context should be constructed.
toks - The tokens of the sentence. The toString() methods of these objects should return the token text.
tags - The POS tags for the specified toks.
preds - The previous decisions made in the tagging of this sequence. Only indices less than index will be examined.
- Returns:
An array of predictive contexts on which a model bases its decisions.
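To make "predictive contexts" concrete, here is a minimal, self-contained sketch of a feature generator with the same four-argument shape. It does not use OpenNLP. The class name ContextSketch, the feature prefixes (w0=, t0=, p_1=, pre=, suf=), the bos marker, and the affix length limit are all assumptions made for illustration, and may differ from what DefaultLemmatizerContextGenerator actually emits.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in; not the OpenNLP implementation.
class ContextSketch {

  // Assumed cap on affix length, chosen for illustration only.
  private static final int MAX_AFFIX = 3;

  static String[] getContext(int index, String[] toks, String[] tags, String[] preds) {
    List<String> features = new ArrayList<>();

    // Current token and its POS tag.
    features.add("w0=" + toks[index]);
    features.add("t0=" + tags[index]);

    // Only the decision immediately before `index` is consulted;
    // at the start of the sentence a placeholder is used instead.
    features.add("p_1=" + (index < 1 ? "bos" : preds[index - 1]));

    // Prefixes and suffixes of the token, up to MAX_AFFIX characters.
    String tok = toks[index];
    int limit = Math.min(MAX_AFFIX, tok.length());
    for (int n = 1; n <= limit; n++) {
      features.add("pre=" + tok.substring(0, n));
      features.add("suf=" + tok.substring(tok.length() - n));
    }
    return features.toArray(new String[0]);
  }

  public static void main(String[] args) {
    String[] toks = {"The", "cats", "sleep"};
    String[] tags = {"DT", "NNS", "VBP"};
    String[] preds = {"O"}; // hypothetical earlier decision for "The"
    for (String feature : getContext(1, toks, tags, preds)) {
      System.out.println(feature);
    }
  }
}
```

Each returned string is one feature for the token at index. Affix features of this kind are a natural fit for lemmatization, where the decision usually depends on how a word begins or ends.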