public class NameFinderME extends Object implements TokenNameFinder
Modifier and Type | Field and Description |
---|---|
static String |
CONTINUE |
static int |
DEFAULT_BEAM_SIZE |
static String |
OTHER |
static String |
START |
Constructor and Description |
---|
NameFinderME(TokenNameFinderModel model) |
Modifier and Type | Method and Description |
---|---|
void |
clearAdaptiveData()
Forgets all adaptive data which was collected during previous calls to one
of the find methods.
|
static Span[] |
dropOverlappingSpans(Span[] spans)
Removes spans with are intersecting or crossing in anyway.
|
Span[] |
find(String[] tokens)
Generates name tags for the given sequence, typically a sentence,
returning token spans for any identified names.
|
Span[] |
find(String[] tokens,
String[][] additionalContext)
Generates name tags for the given sequence, typically a sentence, returning
token spans for any identified names.
|
double[] |
probs()
Returns an array with the probabilities of the last decoded sequence.
|
void |
probs(double[] probs)
Populates the specified array with the probabilities of the last decoded
sequence.
|
double[] |
probs(Span[] spans)
Returns an array of probabilities for each of the specified spans which is
the arithmetic mean of the probabilities for each of the outcomes which
make up the span.
|
static TokenNameFinderModel |
train(String languageCode,
String type,
ObjectStream<NameSample> samples,
TrainingParameters trainParams,
TokenNameFinderFactory factory) |
public static final int DEFAULT_BEAM_SIZE
public static final String START
public static final String CONTINUE
public static final String OTHER
public NameFinderME(TokenNameFinderModel model)
public Span[] find(String[] tokens)
TokenNameFinder
find
in interface TokenNameFinder
tokens
- an array of the tokens or words of the sequence, typically a sentence.public Span[] find(String[] tokens, String[][] additionalContext)
tokens
- an array of the tokens or words of the sequence, typically a sentence.additionalContext
- features which are based on context outside of the
sentence but which should also be used.public void clearAdaptiveData()
clearAdaptiveData
in interface TokenNameFinder
public void probs(double[] probs)
chunk
. The specified array should be at least as large as the
number of tokens in the previous call to chunk
.probs
- An array used to hold the probabilities of the last decoded
sequence.public double[] probs()
chunk
.chunk
when it was last called.public double[] probs(Span[] spans)
spans
- The spans of the names for which probabilities are desired.public static TokenNameFinderModel train(String languageCode, String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, TokenNameFinderFactory factory) throws IOException
IOException
public static Span[] dropOverlappingSpans(Span[] spans)
The following rules are used to remove the spans:
Identical spans: The first span in the array after sorting it remains
Intersecting spans: The first span after sorting remains
Contained spans: All spans which are contained by another are removed
spans
- Copyright © 2021 The Apache Software Foundation. All rights reserved.