Package opennlp.tools.chunker
Class ChunkerME
- java.lang.Object
- opennlp.tools.chunker.ChunkerME
Field Summary
Fields (Modifier and Type, Field):
- static int DEFAULT_BEAM_SIZE
-
Constructor Summary
Constructors (Constructor: Description):
- ChunkerME(String language): Initializes the Chunker by downloading a default model.
- ChunkerME(ChunkerModel model): Initializes the current instance with the specified ChunkerModel.
-
Method Summary
Methods (Modifier and Type, Method: Description):
- String[] chunk(String[] toks, String[] tags): Generates chunk tags for the given sequence, returning the result in an array.
- Span[] chunkAsSpans(String[] toks, String[] tags): Generates tagged chunk spans for the given sequence, returning the result in a span array.
- double[] probs(): Returns an array with the probabilities of the last decoded sequence.
- void probs(double[] probs): Populates the specified array with the probabilities of the last decoded sequence.
- Sequence[] topKSequences(String[] sentence, String[] tags): Returns the top k chunk sequences for the specified sentence with the specified pos-tags.
- Sequence[] topKSequences(String[] sentence, String[] tags, double minSequenceScore): Returns the top k chunk sequences for the specified sentence with the specified pos-tags.
- static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, TrainingParameters mlParams, ChunkerFactory factory): Starts training a ChunkerModel with the given parameters.
-
Field Detail
-
DEFAULT_BEAM_SIZE
public static final int DEFAULT_BEAM_SIZE
- See Also:
- Constant Field Values
-
Constructor Detail
-
ChunkerME
public ChunkerME(String language) throws IOException
Initializes the Chunker by downloading a default model.
- Parameters:
language - The language of the model.
- Throws:
IOException - Thrown if the model cannot be downloaded or saved.
-
ChunkerME
public ChunkerME(ChunkerModel model)
Initializes the current instance with the specified ChunkerModel. The DEFAULT_BEAM_SIZE is used.
- Parameters:
model - A valid model instance.
-
Method Detail
-
chunk
public String[] chunk(String[] toks, String[] tags)
Description copied from interface: Chunker
Generates chunk tags for the given sequence, returning the result in an array.
-
chunkAsSpans
public Span[] chunkAsSpans(String[] toks, String[] tags)
Description copied from interface: Chunker
Generates tagged chunk spans for the given sequence, returning the result in a span array.
- Specified by:
chunkAsSpans in interface Chunker
- Parameters:
toks - an array of the tokens or words of the sequence.
tags - an array of the pos tags of the sequence.
- Returns:
an array of spans with chunk tags for each chunk in the sequence.
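The relationship between per-token chunk tags and chunk spans can be illustrated with a minimal sketch. It assumes the tags follow a B-/I-/O scheme (for example B-NP, I-NP, O), which this page does not state, and it assumes spans are half-open token ranges. BioSpanSketch, ChunkSpan, and toSpans are hypothetical names invented for the illustration; the sketch does not call OpenNLP and is not the library's implementation.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative stand-in only; not part of OpenNLP. */
final class BioSpanSketch {

    /** Hypothetical half-open span [start, end) over token indices, with a chunk type. */
    static final class ChunkSpan {
        final int start;
        final int end;
        final String type;

        ChunkSpan(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        @Override
        public String toString() {
            return "[" + start + ".." + end + ") " + type;
        }
    }

    /**
     * Groups B-/I- prefixed tags into spans (assumed tag scheme).
     * An "O" tag, a new "B-" tag, or an "I-" tag of a different type closes the open chunk.
     */
    static List<ChunkSpan> toSpans(String[] chunkTags) {
        List<ChunkSpan> spans = new ArrayList<>();
        int start = -1;      // first token index of the open chunk, or -1 if none is open
        String type = null;  // type of the open chunk, e.g. "NP"

        for (int i = 0; i < chunkTags.length; i++) {
            String tag = chunkTags[i];
            boolean inside = tag.startsWith("I-");
            boolean continues = inside && start >= 0 && tag.substring(2).equals(type);

            // Anything that does not continue the open chunk closes it.
            if (!continues && start >= 0) {
                spans.add(new ChunkSpan(start, i, type));
                start = -1;
                type = null;
            }
            // "B-" opens a chunk; a stray "I-" is treated as an opening tag as well.
            if (tag.startsWith("B-") || (inside && !continues)) {
                start = i;
                type = tag.substring(2);
            }
        }
        // Close a chunk that runs to the end of the sequence.
        if (start >= 0) {
            spans.add(new ChunkSpan(start, chunkTags.length, type));
        }
        return spans;
    }
}
```

Under these assumptions, the tag sequence B-NP, B-VP, B-NP, I-NP, B-VP, I-VP, O yields four spans, and the trailing O token belongs to none of them.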
-
topKSequences
public Sequence[] topKSequences(String[] sentence, String[] tags)
Description copied from interface: Chunker
Returns the top k chunk sequences for the specified sentence with the specified pos-tags.
- Specified by:
topKSequences in interface Chunker
- Parameters:
sentence - The tokens of the sentence.
tags - The pos-tags for the specified sentence.
- Returns:
the top k chunk sequences for the specified sentence.
-
topKSequences
public Sequence[] topKSequences(String[] sentence, String[] tags, double minSequenceScore)
Description copied from interface: Chunker
Returns the top k chunk sequences for the specified sentence with the specified pos-tags.
- Specified by:
topKSequences in interface Chunker
- Parameters:
sentence - The tokens of the sentence.
tags - The pos-tags for the specified sentence.
minSequenceScore - A lower bound on the score of a returned sequence.
- Returns:
the top k chunk sequences for the specified sentence.
-
probs
public void probs(double[] probs)
Populates the specified array with the probabilities of the last decoded sequence. The sequence was determined based on the previous call to chunk. The specified array should be at least as large as the number of tokens in the previous call to chunk.
- Parameters:
probs - An array used to hold the probabilities of the last decoded sequence.
-
probs
public double[] probs()
Returns an array with the probabilities of the last decoded sequence. The sequence was determined based on the previous call to chunk.
- Returns:
An array with the same number of probabilities as tokens when chunk(String[], String[]) was last called.
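The contract shared by the two probs overloads (one fills a caller-supplied array, the other returns a fresh array sized to the last sequence) can be sketched with a small stand-in class. LastSequenceProbsSketch and its decode method are hypothetical and exist only for this illustration. The IllegalArgumentException thrown for an undersized array is a choice made in the sketch; this page does not say how ChunkerME reacts in that case.

```java
import java.util.Arrays;

/** Illustrative stand-in for a decoder that remembers its last result; not OpenNLP code. */
final class LastSequenceProbsSketch {

    // One probability per token of the most recently decoded sequence.
    private double[] lastProbs = new double[0];

    /** Hypothetical stand-in for chunk(...): records one probability per token. */
    void decode(double[] perTokenProbs) {
        lastProbs = Arrays.copyOf(perTokenProbs, perTokenProbs.length);
    }

    /** Fills the caller's array; it must hold at least one slot per token. */
    void probs(double[] target) {
        if (target.length < lastProbs.length) {
            // Assumption of this sketch: reject undersized buffers explicitly.
            throw new IllegalArgumentException(
                "array holds " + target.length + " values, need " + lastProbs.length);
        }
        System.arraycopy(lastProbs, 0, target, 0, lastProbs.length);
    }

    /** Returns a new array with exactly one probability per token. */
    double[] probs() {
        return Arrays.copyOf(lastProbs, lastProbs.length);
    }
}
```

The array-filling variant lets a caller reuse one buffer across many sentences, at the cost of tracking how many leading entries are valid after each call.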
-
train
public static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, TrainingParameters mlParams, ChunkerFactory factory) throws IOException
Starts training a ChunkerModel with the given parameters.
- Parameters:
lang - The ISO-conformant language code.
in - The ObjectStream of ChunkSample used as input for training.
mlParams - The TrainingParameters for the context of the training.
factory - The ChunkerFactory for creating related objects defined via mlParams.
- Returns:
A valid, trained ChunkerModel instance.
- Throws:
IOException - Thrown if IO errors occurred.
-