Package opennlp.tools.chunker
Class ChunkerME
java.lang.Object
opennlp.tools.chunker.ChunkerME
- All Implemented Interfaces:
- Chunker
The class represents a maximum-entropy-based 
Chunker. This chunker can be used to
 find flat structures based on sequence inputs such as noun phrases or named entities.- 
Field SummaryFields
- 
Constructor SummaryConstructorsConstructorDescriptionInitializes aChunkerby downloading a default model.ChunkerME(ChunkerModel model) Initializes aChunkerwith the specifiedChunkerModel.
- 
Method SummaryModifier and TypeMethodDescriptionString[]Generates chunk tags for the given sequence returning the result in an array.Span[]chunkAsSpans(String[] toks, String[] tags) double[]probs()Returns an array with the probabilities of the last decoded sequence.voidprobs(double[] probs) Populates the specified array with the probabilities of the last decoded sequence.Sequence[]topKSequences(String[] sentence, String[] tags) Computes the top k chunksequencesfor the specified sentence with the specified pos-tags.Sequence[]topKSequences(String[] sentence, String[] tags, double minSequenceScore) Computes the top k chunksequencesfor the specified sentence with the specified pos-tags.static ChunkerModeltrain(String lang, ObjectStream<ChunkSample> in, TrainingParameters mlParams, ChunkerFactory factory) Starts a training of aChunkerModelwith the given parameters.
- 
Field Details- 
DEFAULT_BEAM_SIZEpublic static final int DEFAULT_BEAM_SIZE- See Also:
 
 
- 
- 
Constructor Details- 
ChunkerMEInitializes aChunkerby downloading a default model.- Parameters:
- language- The language of the model.
- Throws:
- IOException- Thrown if the model cannot be downloaded or saved.
 
- 
ChunkerME- Parameters:
- model- A valid- modelinstance.
 
 
- 
- 
Method Details- 
chunkDescription copied from interface:ChunkerGenerates chunk tags for the given sequence returning the result in an array.
- 
chunkAsSpansDescription copied from interface:Chunker- Specified by:
- chunkAsSpansin interface- Chunker
- Parameters:
- toks- an array of the tokens or words of the sequence.
- tags- an array of the pos tags of the sequence.
- Returns:
- an array of spanswith chunk tags for each chunk in the sequence.
 
- 
topKSequencesDescription copied from interface:ChunkerComputes the top k chunksequencesfor the specified sentence with the specified pos-tags.- Specified by:
- topKSequencesin interface- Chunker
- Parameters:
- sentence- The tokens of the sentence.
- tags- The pos-tags for the specified sentence.
- Returns:
- the top k chunk sequencesfor the specified sentence.
 
- 
topKSequencesDescription copied from interface:ChunkerComputes the top k chunksequencesfor the specified sentence with the specified pos-tags.- Specified by:
- topKSequencesin interface- Chunker
- Parameters:
- sentence- The tokens of the sentence.
- tags- The pos-tags for the specified sentence.
- minSequenceScore- A lower bound on the score of a returned sequence.
- Returns:
- the top k chunk sequencesfor the specified sentence.
 
- 
probspublic void probs(double[] probs) Populates the specified array with the probabilities of the last decoded sequence. The sequence was determined based on the previous call tochunk. The specified array should be at least as large as the number of tokens in the previous call tochunk.- Parameters:
- probs- An array used to hold the probabilities of the last decoded sequence.
 
- 
probspublic double[] probs()Returns an array with the probabilities of the last decoded sequence. The sequence was determined based on the previous call tochunk(String[], String[]).- Returns:
- An array with the same number of probabilities as tokens when
         chunk(String[], String[])was last called.
 
- 
trainpublic static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, TrainingParameters mlParams, ChunkerFactory factory) throws IOException Starts a training of aChunkerModelwith the given parameters.- Parameters:
- lang- The ISO conform language code.
- in- The- ObjectStreamof- ChunkSampleused as input for training.
- mlParams- The- TrainingParametersfor the context of the training.
- factory- The- ChunkerFactoryfor creating related objects defined via- mlParams.
- Returns:
- A valid, trained ChunkerModelinstance.
- Throws:
- IOException- Thrown if IO errors occurred.
- IllegalArgumentException- Thrown if the specified- TrainerFactory.TrainerTypeis not supported.
 
 
-