Package opennlp.tools.util.featuregen
package opennlp.tools.util.featuregen
This package contains classes for generating sequence features.
-
ClassDescriptionAn interface for generating features for name entity identification and for updating document level contexts.The
AdditionalContextFeatureGenerator
generates the context from the passed in additional context.TheAggregatedFeatureGenerator
aggregates a set offeature generators
and calls them to generate the features.AGeneratorFactory
that producesAggregatedFeatureGenerator
instances whenAggregatedFeatureGeneratorFactory.create()
is called.Adds bigram features based on tokens and token classes.AGeneratorFactory
that producesBigramNameFeatureGenerator
instances whenBigramNameFeatureGeneratorFactory.create()
is called.Generates Brown cluster features for token bigrams.Class to load a Brown cluster document in the format:word\tword_class\tprob
.AGeneratorFactory
that producesBrownBigramFeatureGenerator
instances whenBrownClusterBigramFeatureGeneratorFactory.create()
is called.AGeneratorFactory
that producesBrownTokenClassFeatureGenerator
instances whenBrownClusterTokenClassFeatureGeneratorFactory.create()
is called.AGeneratorFactory
that producesBrownTokenFeatureGenerator
instances whenBrownClusterTokenFeatureGeneratorFactory.create()
is called.Obtain the paths listed in the pathLengths array from the Brown class.GeneratesBrownCluster
features for current token and token class.GeneratesBrownCluster
features for current token.Caches features of the aggregatedgenerators
.AGeneratorFactory
that producesCachedFeatureGenerator
instances whenCachedFeatureGeneratorFactory.create()
is called.TheCharacterNgramFeatureGenerator
uses character ngrams to generate features about each token.AGeneratorFactory
that producesCharacterNgramFeatureGenerator
instances whenCharacterNgramFeatureGeneratorFactory.create()
is called.AGeneratorFactory
that producesOutcomePriorFeatureGenerator
instances whenDefinitionFeatureGeneratorFactory.create()
is called.TheDictionaryFeatureGenerator
uses aDictionaryNameFinder
to generate features for detected names based on theInSpanGenerator
.AGeneratorFactory
that producesDictionaryFeatureGenerator
instances whenDictionaryFeatureGeneratorFactory.create()
is called.This feature generator creates document begin features.AGeneratorFactory
that producesDocumentBeginFeatureGenerator
instances whenDocumentBeginFeatureGeneratorFactory.create()
is called.TheFeatureGeneratorResourceProvider
provides access to the resources available in the model.This class provide common utilities for feature generation.Creates a set of feature generators based on a provided XML descriptor.Generates features if the tokens are recognized by the providedTokenNameFinder
.The definition feature maps the underlying distribution of outcomes.A POS tagging driven feature generator.AGeneratorFactory
that producesPosTaggerFeatureGenerator
instances whenPosTaggerFeatureGeneratorFactory.create()
is called.Adds the token POS tag as feature.AGeneratorFactory
that producesPOSTaggerNameFeatureGenerator
instances whenPOSTaggerNameFeatureGeneratorFactory.create()
is called.A feature generator implementation that generates prefix-based features.AGeneratorFactory
that producesPrefixFeatureGenerator
instances whenPrefixFeatureGeneratorFactory.create()
is called.ThisAdaptiveFeatureGenerator
generates features indicating the outcome associated with a previously occurring word.AGeneratorFactory
that producesPreviousMapFeatureGenerator
instances whenPreviousMapFeatureGeneratorFactory.create()
is called.ThisAdaptiveFeatureGenerator
generates features indicating the outcome associated with two previously occurring words.This feature generator creates sentence begin and end features.AGeneratorFactory
that producesSentenceFeatureGenerator
instances whenSentenceFeatureGeneratorFactory.create()
is called.Recognizes predefined patterns in strings.A feature generator implementation that generates suffix-based features.AGeneratorFactory
that producesSuffixFeatureGenerator
instances whenSuffixFeatureGeneratorFactory.create()
is called.Generates features for the class of a token.AGeneratorFactory
that producesTokenClassFeatureGenerator
instances whenTokenClassFeatureGeneratorFactory.create()
is called.Generates a feature which contains a token itself.AGeneratorFactory
that producesTokenFeatureGenerator
instances whenTokenFeatureGeneratorFactory.create()
is called.Partitions tokens into sub-tokens based on character classes and generates class features for each of the sub-tokens and combinations of those sub-tokens.AGeneratorFactory
instantiatesTokenPatternFeatureGenerator
instances whenTokenPatternFeatureGeneratorFactory.create()
is called.Adds trigram features based on tokens and token classes.AGeneratorFactory
that producesTrigramNameFeatureGenerator
instances whenTrigramNameFeatureGeneratorFactory.create()
is called.Generates previous (left-sided) and next (right-sided) features for a givenAdaptiveFeatureGenerator
.AGeneratorFactory
that producesWindowFeatureGenerator
instances whenWindowFeatureGeneratorFactory.create()
is called.AnAdaptiveFeatureGenerator
implementation of a word cluster feature generator.Defines a word clusterGeneratorFactory
; it reads an element containing 'w2vwordcluster' as a tag name.