Package opennlp.tools.chunker
Class ChunkerME
java.lang.Object
opennlp.tools.chunker.ChunkerME
- All Implemented Interfaces:
opennlp.tools.chunker.Chunker,opennlp.tools.ml.Probabilistic
@ThreadSafe
public class ChunkerME
extends Object
implements opennlp.tools.chunker.Chunker, opennlp.tools.ml.Probabilistic
The class represents a maximum-entropy-based
Chunker. A chunker can be used to
find flat structures based on sequence inputs such as noun phrases or named entities.
A chunker instance is thread-safe. One instance can be shared across multiple threads to save memory.
Note: In container environments with classloader isolation (e.g. Jakarta EE), ensure instances do
not outlive the application's lifecycle, as underlying components use ThreadLocal state that may
pin the classloader.
- See Also:
-
Field Summary
Fields -
Constructor Summary
ConstructorsConstructorDescriptionInitializes aChunkerby downloading a default model.ChunkerME(ChunkerModel model) Initializes aChunkerwith the specifiedChunkerModel. -
Method Summary
Modifier and TypeMethodDescriptionString[]opennlp.tools.util.Span[]chunkAsSpans(String[] toks, String[] tags) voidRemoves thread-local state to prevent classloader leaks in container environments.double[]probs()The sequence was determined based on the previous call tochunk(String[], String[]).voidprobs(double[] probs) Populates the specified array with the probabilities of the last decoded sequence.opennlp.tools.util.Sequence[]topKSequences(String[] sentence, String[] tags) opennlp.tools.util.Sequence[]topKSequences(String[] sentence, String[] tags, double minSequenceScore) static ChunkerModeltrain(String lang, opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample> in, opennlp.tools.util.TrainingParameters mlParams, ChunkerFactory factory) Starts a training of aChunkerModelwith the given parameters.
-
Field Details
-
DEFAULT_BEAM_SIZE
public static final int DEFAULT_BEAM_SIZE- See Also:
-
-
Constructor Details
-
ChunkerME
Initializes aChunkerby downloading a default model.- Parameters:
language- The language of the model.- Throws:
IOException- Thrown if the model cannot be downloaded or saved.
-
ChunkerME
- Parameters:
model- A validmodelinstance.
-
-
Method Details
-
chunk
- Specified by:
chunkin interfaceopennlp.tools.chunker.Chunker
-
chunkAsSpans
- Specified by:
chunkAsSpansin interfaceopennlp.tools.chunker.Chunker
-
topKSequences
- Specified by:
topKSequencesin interfaceopennlp.tools.chunker.Chunker
-
topKSequences
public opennlp.tools.util.Sequence[] topKSequences(String[] sentence, String[] tags, double minSequenceScore) - Specified by:
topKSequencesin interfaceopennlp.tools.chunker.Chunker
-
probs
public void probs(double[] probs) Populates the specified array with the probabilities of the last decoded sequence. The sequence was determined based on the previous call tochunk. The specified array should be at least as large as the number of tokens in the previous call tochunk.- Parameters:
probs- An array used to hold the probabilities of the last decoded sequence.
-
probs
public double[] probs()The sequence was determined based on the previous call tochunk(String[], String[]).- Specified by:
probsin interfaceopennlp.tools.ml.Probabilistic- Returns:
- an array with the same number of probabilities as tokens when
chunk(String[], String[])was last called
-
clearThreadLocalState
public void clearThreadLocalState()Removes thread-local state to prevent classloader leaks in container environments. Call when the thread is returned to a pool or the chunker is no longer needed. -
train
public static ChunkerModel train(String lang, opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample> in, opennlp.tools.util.TrainingParameters mlParams, ChunkerFactory factory) throws IOException Starts a training of aChunkerModelwith the given parameters.- Parameters:
lang- The ISO conform language code.in- TheObjectStreamofChunkSampleused as input for training.mlParams- TheTrainingParametersfor the context of the training.factory- TheChunkerFactoryfor creating related objects defined viamlParams.- Returns:
- A valid, trained
ChunkerModelinstance. - Throws:
IOException- Thrown if IO errors occurred.IllegalArgumentException- Thrown if the specifiedTrainerFactory.TrainerTypeis not supported.
-