All Classes Interface Summary Class Summary Enum Summary Exception Summary Annotation Types Summary
| Class |
Description |
| AbstractBottomUpParser |
Abstract class which contains code to tag and chunk parses for bottom up parsing and
leaves implementation of advancing parses and completing parses to extend class.
|
| AbstractContextGenerator |
Abstract class containing many of the methods used to generate contexts for parsing.
|
| AbstractDataIndexer |
Abstract DataIndexer implementation for collecting
event and context counts used in training.
|
| AbstractEventModelSequenceTrainer |
|
| AbstractEventStream<T> |
|
| AbstractEventTrainer |
|
| AbstractModel |
|
| AbstractModel.ModelType |
|
| AbstractModelReader |
An abstract, basic implementation of a model reader.
|
| AbstractModelWriter |
An abstract, basic implementation of a model writer.
|
| AbstractObjectStream<T> |
|
| AbstractParserEventStream |
Abstract class extended by parser event streams which perform tagging and chunking.
|
| AbstractSampleStreamFactory<T,P> |
Base class for sample stream factories.
|
| AbstractToSentenceSampleStream<T> |
|
| AbstractTrainer |
|
| AdaptiveFeatureGenerator |
An interface for generating features for name entity identification and for
updating document level contexts.
|
| ADChunkSampleStream |
Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the
Portuguese Chunker training.
|
| ADChunkSampleStreamFactory<P> |
A Factory to create a Arvores Deitadas ChunkStream from the command line
utility.
|
| AdditionalContextFeatureGenerator |
|
| ADNameSampleStream |
Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the
Portuguese NER training.
|
| ADNameSampleStreamFactory<P> |
A Factory to create a Arvores Deitadas NameSampleDataStream from the command line
utility.
|
| ADPOSSampleStream |
Note:
Do not use this class, internal use only!
|
| ADPOSSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ADSentenceSampleStream |
Note:
Do not use this class, internal use only!
|
| ADSentenceSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ADSentenceStream |
Stream filter which merges text lines into sentences, following the Arvores
Deitadas syntax.
|
| ADSentenceStream.Sentence |
|
| ADSentenceStream.SentenceParser |
Parses a sample of AD corpus.
|
| ADSentenceStream.SentenceParser.Leaf |
Represents the AD leaf
|
| ADSentenceStream.SentenceParser.Node |
Represents the AD node
|
| ADSentenceStream.SentenceParser.TreeElement |
Represents a tree element, Node or Leaf
|
| ADTokenSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| AggregateCharSequenceNormalizer |
|
| AggregatedFeatureGenerator |
|
| AggregatedFeatureGeneratorFactory |
|
| AncoraSpanishHeadRules |
Class for storing the Ancora Spanish head rules associated with parsing.
|
| AncoraSpanishHeadRules.HeadRulesSerializer |
|
| AnnotationConfiguration |
|
| AnnotatorNoteAnnotation |
|
| arabicStemmer |
This class was automatically generated by a Snowball to Java compiler
It implements the stemming algorithm defined by a snowball script.
|
| ArrayMath |
Utility class for simple vector arithmetic.
|
| ArrayMath |
Deprecated. |
| ArtifactProvider |
Provides access to model persisted artifacts.
|
| ArtifactSerializer<T> |
|
| ArtifactToSerializerMapper |
Deprecated. |
| AttachContextGenerator |
Generates predictive contexts for deciding how constituents should be attached.
|
| AttributeAnnotation |
|
| Attributes |
|
| BagOfWordsFeatureGenerator |
Generates a feature for each word in a document.
|
| BaseLink |
Represents a minimal tuple of information.
|
| BaseModel |
This is a common base model which can be used by the components' specific
model classes.
|
| BaseToolFactory |
Base class for all tool factories.
|
| BasicContextGenerator |
|
| BasicFormatParams |
Common format parameters.
|
| BasicTrainingParams |
Common training parameters.
|
| BeamSearch<T> |
Performs k-best search over a sequence.
|
| BeamSearchContextGenerator<T> |
Interface for context generators used with a sequence beam search.
|
| BigramNameFeatureGenerator |
|
| BigramNameFeatureGeneratorFactory |
|
| BilouCodec |
The default SequenceCodec implementation according to the BILOU scheme.
|
| BilouNameFinderSequenceValidator |
|
| BinaryFileDataReader |
A DataReader that reads files from a binary format.
|
| BinaryGISModelReader |
|
| BinaryGISModelWriter |
|
| BinaryNaiveBayesModelReader |
|
| BinaryNaiveBayesModelWriter |
|
| BinaryPerceptronModelReader |
|
| BinaryPerceptronModelWriter |
|
| BinaryQNModelReader |
|
| BinaryQNModelWriter |
|
| BioCodec |
The default SequenceCodec implementation according to the BIO scheme:
B: 'beginning' of a NE
I: 'inside', the word is inside a NE
O: 'outside', the word is a regular word outside a NE
See also the paper by Roth D.
|
| BioNLP2004NameSampleStream |
A sample stream for the training files of the
BioNLP/NLPBA 2004 shared task.
|
| BioNLP2004NameSampleStreamFactory<P> |
|
| BratAnnotation |
|
| BratAnnotationStream |
Reads the annotations from the brat .ann annotation file.
|
| BratDocument |
Brat (brat rapid annotation tool) is based on the stav visualiser
which was originally made in order to visualise BioNLP'11 Shared Task data.
|
| BratDocumentParser |
|
| BratDocumentStream |
|
| BratNameSampleStream |
Generates Name Sample objects for a Brat Document object.
|
| BratNameSampleStreamFactory |
|
| BrownBigramFeatureGenerator |
Generates Brown cluster features for token bigrams.
|
| BrownCluster |
Class to load a Brown cluster document: word\tword_class\tprob
|
| BrownCluster.BrownClusterSerializer |
|
| BrownClusterBigramFeatureGeneratorFactory |
Generates Brown clustering features for token bigrams.
|
| BrownClusterTokenClassFeatureGeneratorFactory |
Generates Brown clustering features for token classes.
|
| BrownClusterTokenFeatureGeneratorFactory |
Generates Brown clustering features for current token.
|
| BrownTokenClasses |
Obtain the paths listed in the pathLengths array from the Brown class.
|
| BrownTokenClassFeatureGenerator |
Generates BrownCluster features for current token and token class.
|
| BrownTokenFeatureGenerator |
|
| BuildContextGenerator |
Generates predictive contexts for deciding how constituents should be combined.
|
| BuildContextGenerator |
Creates the features or contexts for the building phase of parsing.
|
| BuildModelUpdaterTool |
|
| ByteArraySerializer |
|
| Cache<K,V> |
Provides fixed size, pre-allocated, least recently used replacement cache.
|
| CachedFeatureGenerator |
|
| CachedFeatureGeneratorFactory |
|
| catalanStemmer |
This class was automatically generated by a Snowball to Java compiler
It implements the stemming algorithm defined by a snowball script.
|
| CensusDictionaryCreatorTool |
This tool helps create a loadable dictionary for the NameFinder,
from data collected from US Census data.
|
| CharacterNgramFeatureGenerator |
|
| CharacterNgramFeatureGeneratorFactory |
|
| CharSequenceNormalizer |
A char sequence normalizer, used to adjusting (prune, substitute, add, etc.)
characters in order to remove noise from text
|
| CheckContextGenerator |
Generates predictive context for deciding when a constituent is complete.
|
| CheckContextGenerator |
Generates predictive context for deciding when a constituent is complete.
|
| CheckModelUpdaterTool |
Trains a new check model.
|
| ChunkContextGenerator |
Creates predictive context for the pre-chunking phases of parsing.
|
| Chunker |
The interface for chunkers which provide chunk tags for a sequence of tokens.
|
| ChunkerContextGenerator |
|
| ChunkerConverterTool |
Tool to convert multiple data formats into native OpenNLP chunker training
format.
|
| ChunkerCrossValidator |
|
| ChunkerCrossValidatorTool |
|
| ChunkerDetailedFMeasureListener |
|
| ChunkerEvaluationMonitor |
A marker interface for evaluating chunkers.
|
| ChunkerEvaluator |
|
| ChunkerEvaluatorTool |
A default ChunkSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| ChunkerEventStream |
Class for creating an event stream out of data files for training a Chunker.
|
| ChunkerFactory |
|
| ChunkerME |
The class represents a maximum-entropy-based Chunker.
|
| ChunkerMETool |
|
| ChunkerModel |
|
| ChunkerModelLoader |
|
| ChunkerModelSerializer |
|
| ChunkerSampleStreamFactory<P> |
|
| ChunkerTrainerTool |
|
| ChunkEvaluationErrorListener |
|
| ChunkSample |
Class for holding chunks for a single unit of text.
|
| ChunkSampleSequenceStream |
|
| ChunkSampleStream |
Parses the conll 2000 shared task shallow parser training data.
|
| ChunkSampleStream |
|
| CollectionObjectStream<E> |
|
| ComparableEvent |
A maxent event representation which we can use to sort based on the
predicates indexes contained in the events.
|
| ComparablePredicate |
A maxent predicate representation which we can use to sort based on the
outcomes.
|
| ConfigurablePOSContextGenerator |
|
| Conll02NameSampleStream |
Parser for the Dutch and Spanish ner training files of the CONLL 2002 shared task.
|
| Conll02NameSampleStream.LANGUAGE |
|
| Conll02NameSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| Conll03NameSampleStream |
An import stream which can parse the CONLL03 data.
|
| Conll03NameSampleStream.LANGUAGE |
|
| Conll03NameSampleStreamFactory<P> |
|
| ConlluLemmaSampleStream |
|
| ConlluLemmaSampleStreamFactory<P> |
Note: Do not use this class, internal use only!
|
| ConlluPOSSampleStream |
|
| ConlluPOSSampleStreamFactory<P> |
Note: Do not use this class, internal use only!
|
| ConlluSentence |
|
| ConlluSentenceSampleStream |
|
| ConlluSentenceSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ConlluStream |
The CoNNL-U Format is specified
here.
|
| ConlluTagset |
|
| ConlluTokenSampleStream |
|
| ConlluTokenSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ConlluWordLine |
|
| ConllXPOSSampleStream |
Parses the data from the CONLL 06 shared task into POS Samples.
|
| ConllXPOSSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ConllXSentenceSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| ConllXTokenSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| Cons |
Holds feature information about a specific Parse node.
|
| ConstitParseSampleStream |
|
| ConstitParseSampleStreamFactory |
Note:
Do not use this class, internal use only!
|
| Constituent |
Holds constituents when reading parses.
|
| Context |
Class which associates a real valued parameter or expected value with a particular contextual
predicate or feature.
|
| ContextGenerator<T> |
Represents a generator of contexts for maxent decisions.
|
| CrossValidationPartitioner<E> |
Provides access to training and test partitions for n-fold cross validation.
|
| CrossValidationPartitioner.TrainingSampleStream<E> |
|
| CustomFeatureGenerator |
Deprecated. |
| CVParams |
Common cross validator parameters.
|
| DataIndexer |
Represents an indexer which compresses events in memory and performs feature selection.
|
| DataIndexerFactory |
|
| DataReader |
|
| DataStream |
An interface for objects which can deliver a stream of training data to be
supplied to an EventStream.
|
| DefaultChunkerContextGenerator |
Features based on chunking model described in Fei Sha and Fernando Pereira.
|
| DefaultChunkerSequenceValidator |
|
| DefaultEndOfSentenceScanner |
|
| DefaultLanguageDetectorContextGenerator |
A context generator for language detector.
|
| DefaultLemmatizerContextGenerator |
Simple feature generator for learning statistical lemmatizers.
|
| DefaultLemmatizerSequenceValidator |
|
| DefaultNameContextGenerator |
A NameContextGenerator implementation for determining contextual features
for a tag-chunk style named-entity recognizer.
|
| DefaultPOSContextGenerator |
|
| DefaultPOSSequenceValidator |
|
| DefaultSDContextGenerator |
Generate event contexts for maxent decisions for sentence detection.
|
| DefaultTokenContextGenerator |
|
| DefinitionFeatureGeneratorFactory |
|
| DetailedFMeasureEvaluatorParams |
EvaluatorParams for Chunker.
|
| DetokenEvaluationErrorListener |
|
| DetokenizationDictionary |
|
| DetokenizationDictionary.Operation |
|
| Detokenizer |
A Detokenizer merges tokens back to their detokenized representation.
|
| Detokenizer.DetokenizationOperation |
This enum contains an operation for every token to merge the
tokens together to their detokenized form.
|
| DetokenizerEvaluator |
|
| DetokenizerParameter |
|
| DetokenizerSampleStreamFactory<T,P> |
|
| DetokenizeSentenceSampleStream |
|
| Dictionary |
An iterable and serializable dictionary implementation.
|
| DictionaryBuilderTool |
|
| DictionaryDetokenizer |
A rule based detokenizer.
|
| DictionaryDetokenizerTool |
|
| DictionaryEntryPersistor |
A persistor used by for reading and writing dictionaries
of all kinds.
|
| DictionaryFeatureGenerator |
|
| DictionaryFeatureGeneratorFactory |
|
| DictionaryLemmatizer |
A Lemmatizer implementation that works by simple dictionary lookup into
a Map built from a file containing, for each line:
|
| DictionaryNameFinder |
|
| DictionarySerializer |
|
| DirectorySampleStream |
The directory sample stream allows for creating an ObjectStream
from a directory listing of files.
|
| DoccatConverterTool |
Tool to convert multiple data formats into native OpenNLP doccat training
format.
|
| DoccatCrossValidator |
|
| DoccatCrossValidatorTool |
|
| DoccatEvaluationErrorListener |
|
| DoccatEvaluationMonitor |
A marker interface for evaluating doccat.
|
| DoccatEvaluatorTool |
A default DocumentSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| DoccatFactory |
The factory that provides Doccat default implementations and resources.
|
| DoccatFineGrainedReportListener |
Generates a detailed report for the POS Tagger.
|
| DoccatModel |
A model for document categorization
|
| DoccatModelLoader |
|
| DoccatTool |
|
| DoccatTrainerTool |
|
| DocumentBeginFeatureGenerator |
|
| DocumentBeginFeatureGeneratorFactory |
|
| DocumentCategorizer |
Interface for classes which categorize documents.
|
| DocumentCategorizerEvaluator |
|
| DocumentCategorizerEventStream |
Iterator-like class for modeling document classification events.
|
| DocumentCategorizerME |
|
| DocumentNameFinder |
Interface for processing an entire document allowing a TokenNameFinder to use context
from the entire document.
|
| DocumentSample |
Class which holds a classified document and its category.
|
| DocumentSampleStream |
Reads in string encoded training samples, parses them and
outputs DocumentSample objects.
|
| DocumentSampleStreamFactory<P> |
|
| DocumentToLineStream |
Reads a plain text file and return each line as a String object.
|
| DownloadUtil |
This class facilitates the downloading of pretrained OpenNLP models.
|
| DownloadUtil.ModelType |
The type of model.
|
| DynamicEvalParameters |
|
| EmojiCharSequenceNormalizer |
|
| EmptyLinePreprocessorStream |
ObjectStream to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.
|
| EncodingParameter |
Encoding parameter.
|
| EndOfSentenceScanner |
|
| EntityLinker<T extends Span> |
EntityLinkers establish connections with external data to enrich extracted
entities.
|
| EntityLinkerFactory |
Generates a EntityLinker instances via a properties file configuration.
|
| EntityLinkerProperties |
|
| EntityLinkerTool |
|
| Entry |
|
| EntryInserter |
|
| EvalitaNameSampleStream |
Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.
|
| EvalitaNameSampleStream.LANGUAGE |
|
| EvalitaNameSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| EvalParameters |
This class encapsulates the variables used in producing probabilities from a model
and facilitates passing these variables to the eval method.
|
| EvaluationMonitor<T> |
|
| Evaluator<T> |
An abstract base class for evaluators.
|
| EvaluatorParams |
Common evaluation parameters.
|
| Event |
The context of a decision point during training.
|
| EventAnnotation |
|
| EventModelSequenceTrainer<T> |
A specialized Trainer that is based on a 'EventModelSequence' approach.
|
| EventTraceStream |
|
| EventTrainer |
|
| Experimental |
Indicates that a certain API feature is not stable
and might change with a new release.
|
| ExtensionLoader |
The ExtensionLoader is responsible to load extensions to the OpenNLP library.
|
| ExtensionNotLoadedException |
Exception indicates that an OpenNLP extension could not be loaded.
|
| ExtensionServiceKeys |
|
| Factory |
|
| Factory |
|
| FeatureGenerator |
Interface for generating features for document categorization.
|
| FeatureGeneratorResourceProvider |
|
| FeatureGeneratorUtil |
This class provide common utilities for feature generation.
|
| FileEventStream |
|
| FileToByteArraySampleStream |
Note:
Do not use this class, internal use only!
|
| FileToStringSampleStream |
Provides the ability to read the contents of files
contained in an object stream of files.
|
| FilterObjectStream<S,T> |
Abstract base class for filtering streams.
|
| FineGrainedEvaluatorParams |
Common evaluation parameters.
|
| FMeasure |
The FMeasure is a utility class for evaluators
which measures precision, recall and the resulting f-measure.
|
| Function |
Interface for a function.
|
| GapLabeler |
Represents a labeler for nodes which contain traces so that these traces can be predicted
by a Parser.
|
| GeneratorFactory |
Creates a set of feature generators based on a provided XML descriptor.
|
| GeneratorFactory.AbstractXmlFeatureGeneratorFactory |
|
| GenericModelReader |
|
| GenericModelSerializer |
|
| GenericModelWriter |
|
| GISModel |
A maximum entropy model which has been trained using the Generalized
Iterative Scaling (GIS) procedure.
|
| GISModelReader |
|
| GISModelWriter |
|
| GISTrainer |
An implementation of Generalized Iterative Scaling (GIS).
|
| Glove |
GloVe is an unsupervised learning algorithm for obtaining vector representations for words.
|
| greekStemmer |
This class was automatically generated by a Snowball to Java compiler
It implements the stemming algorithm defined by a snowball script.
|
| HashSumEventStream |
|
| HeadRules |
Encoder for head rules associated with parsing.
|
| HeadRules |
Class for storing the English HeadRules associated with parsing.
|
| HeadRules.HeadRulesSerializer |
|
| Index |
|
| indonesianStemmer |
This class implements the stemming algorithm defined by a snowball script.
|
| InputStreamFactory |
Allows repeated reads through a stream for certain model building types.
|
| InSpanGenerator |
Generates features if the tokens are recognized by the provided
TokenNameFinder.
|
| InsufficientTrainingDataException |
This exception indicates that the provided training data is
insufficient to train a desired model.
|
| Internal |
Classes, fields, or methods annotated @Internal are for OpenNLP
internal use only.
|
| InvalidFormatException |
This exception indicates that a resource violates the expected data format.
|
| IrishSentenceBankDocument |
A structure to hold an Irish Sentence Bank document, which is a collection
of tokenized sentences.
|
| IrishSentenceBankDocument.IrishSentenceBankFlex |
|
| IrishSentenceBankDocument.IrishSentenceBankSentence |
|
| IrishSentenceBankSentenceStreamFactory<P> |
|
| IrishSentenceBankTokenSampleStreamFactory<P> |
|
| irishStemmer |
This class was automatically generated by a Snowball to Java compiler
It implements the stemming algorithm defined by a snowball script.
|
| Language |
Class for holding the document language and its confidence
|
| LanguageDetector |
|
| LanguageDetectorConfig |
|
| LanguageDetectorContextGenerator |
|
| LanguageDetectorConverterTool |
Tool to convert multiple data formats into native OpenNLP language detection
training format.
|
| LanguageDetectorCrossValidator |
|
| LanguageDetectorCrossValidatorTool |
|
| LanguageDetectorEvaluationErrorListener |
|
| LanguageDetectorEvaluationMonitor |
|
| LanguageDetectorEvaluator |
|
| LanguageDetectorEvaluatorTool |
A default LanguageSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| LanguageDetectorEventStream |
Iterator-like class for modeling an event stream of samples.
|
| LanguageDetectorFactory |
|
| LanguageDetectorFineGrainedReportListener |
Generates a detailed report for the POS Tagger.
|
| LanguageDetectorME |
|
| LanguageDetectorModel |
|
| LanguageDetectorModelLoader |
|
| LanguageDetectorSampleStream |
|
| LanguageDetectorSampleStreamFactory<P> |
|
| LanguageDetectorTool |
|
| LanguageDetectorTrainerTool |
|
| LanguageModel |
A language model can calculate the probability p (between 0 and 1) of a
certain sequence of tokens, given its underlying vocabulary.
|
| LanguageParams |
|
| LanguageSample |
Holds a classified document and its Language.
|
| LanguageSampleStreamFactory<T,P> |
Stream factory for those streams which carry language.
|
| LeipzigLanguageSampleStream |
|
| LeipzigLanguageSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| LemmaEvaluationErrorListener |
|
| LemmaSample |
Represents a lemmatized sentence.
|
| LemmaSampleEventStream |
Class for creating an event stream out of data files for training a probabilistic Lemmatizer.
|
| LemmaSampleSequenceStream |
|
| LemmaSampleStream |
Reads data for training and testing the Lemmatizer.
|
| Lemmatizer |
The common interface for lemmatizers.
|
| LemmatizerContextGenerator |
Interface for the context generator used for probabilistic Lemmatizer.
|
| LemmatizerEvaluationMonitor |
|
| LemmatizerEvaluator |
|
| LemmatizerEvaluatorTool |
A default LemmaSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| LemmatizerFactory |
The factory that provides Lemmatizer default implementation and
resources.
|
| LemmatizerFineGrainedReportListener |
Generates a detailed report for the Lemmatizer.
|
| LemmatizerME |
|
| LemmatizerMETool |
|
| LemmatizerModel |
|
| LemmatizerModelLoader |
|
| LemmatizerSampleStreamFactory<P> |
|
| LemmatizerTrainerTool |
|
| LetsmtDocument |
A structure to hold the letsmt document.
|
| LetsmtDocument.LetsmtDocumentHandler |
|
| LetsmtDocument.LetsmtSentence |
|
| LetsmtSentenceStreamFactory<P> |
|
| LineSearch |
Class that performs line search to find minimum.
|
| LineSearch.LineSearchResult |
Represents a LineSearch result.
|
| LinkedSpan<T extends BaseLink> |
A default, extended Span that holds additional information about a Span.
|
| LogPrintStream |
This class serves as an adapter for a Logger used within a PrintStream.
|
| LogProbabilities<T> |
Class implementing the probability distribution over labels returned by
a classifier as a log of probabilities.
|
| LogProbability<T> |
A class implementing the logarithmic Probability for a label.
|
| MarkableFileInputStreamFactory |
A factory that creates MarkableFileInputStream from a File
|
| MascDocument |
|
| MascDocumentStream |
|
| MascNamedEntityParser |
A class to process the MASC Named entity stand-off annotation file
|
| MascNamedEntitySampleStream |
|
| MascNamedEntitySampleStreamFactory<P> |
|
| MascPennTagParser |
A class for parsing MASC's Penn tagging/tokenization stand-off annotation
|
| MascPOSSampleStream |
|
| MascPOSSampleStreamFactory<P> |
|
| MascSentence |
|
| MascSentenceSampleStream |
|
| MascSentenceSampleStreamFactory<P> |
|
| MascToken |
|
| MascTokenSampleStream |
|
| MascTokenSampleStreamFactory<P> |
|
| MascWord |
|
| MaxentModel |
Interface for maximum entropy models.
|
| Mean |
Calculates the arithmetic mean of values
added with the Mean.add(double) method.
|
| ModelParameterChunker |
A helper class that handles Strings with more than 64k (65535 bytes) in length.
|
| ModelType |
Enumeration of supported model types.
|
| ModelUtil |
Utility class for handling of models.
|
| MosesSentenceSampleStream |
|
| MosesSentenceSampleStreamFactory<P> |
|
| Muc6NameSampleStreamFactory |
|
| MucNameContentHandler |
|
| MucNameSampleStream |
|
| MutableContext |
An extension of Context used to store parameters or expected values
associated with this context which can be updated or assigned.
|
| MutableInt |
This is a non-thread safe mutable int.
|
| MutableTagDictionary |
Interface that allows TagDictionary entries to be added and removed.
|
| NaiveBayesEvalParameters |
Specialized parameters for the evaluation of a naive bayes classifier
|
| NaiveBayesModel |
A MaxentModel implementation of the multinomial Naive Bayes classifier model.
|
| NaiveBayesModelReader |
The base class for readers of models.
|
| NaiveBayesModelWriter |
|
| NaiveBayesTrainer |
Trains models using the combination of EM algorithm
and Naive Bayes classifier which is described in:
|
| NameContextGenerator |
Interface for generating the context for a name finder by
specifying a set of feature generators.
|
| NameEvaluationErrorListener |
|
| NameFinderCensus90NameStream |
This class helps to read the US Census data from the files to build a
StringList for each dictionary entry in the name-finder dictionary.
|
| NameFinderEventStream |
Class for creating an event stream out of data files for training an TokenNameFinder.
|
| NameFinderME |
|
| NameFinderSequenceValidator |
|
| NameSample |
Encapsulates names for a single unit of text.
|
| NameSampleCountersStream |
Counts tokens, sentences and names by type.
|
| NameSampleDataStream |
|
| NameSampleDataStreamFactory<P> |
|
| NameSampleDataStreamFactory.Parameters |
|
| NameSampleSequenceStream |
|
| NameSampleTypeFilter |
|
| NameToSentenceSampleStream |
Note:
Do not use this class, internal use only!
|
| NameToSentenceSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| NameToTokenSampleStream |
Note:
Do not use this class, internal use only!
|
| NameToTokenSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| NegLogLikelihood |
Evaluate negative log-likelihood and its gradient from DataIndexer.
|
| NewlineSentenceDetector |
The Newline SentenceDetector assumes that sentences are line delimited and
recognizes one sentence per non-empty line.
|
| NGramCharModel |
|
| NGramFeatureGenerator |
Generates ngram features for a document.
|
| NGramGenerator |
Generates an nGram, via an optional separator, and returns the grams as a list
of strings
|
| NGramLanguageModel |
|
| NGramLanguageModelTool |
|
| NGramModel |
The NGramModel can be used to crate ngrams and character ngrams.
|
| NGramUtils |
Utility class for ngrams.
|
| NKJPSegmentationDocument |
|
| NKJPSegmentationDocument.Pointer |
|
| NKJPSentenceSampleStream |
|
| NKJPSentenceSampleStreamFactory<P> |
|
| NKJPTextDocument |
The National corpus of Polish (NKJP) format.
|
| NumberCharSequenceNormalizer |
|
| ObjectDataReader |
|
| ObjectStream<T> |
|
| ObjectStreamUtils |
|
| OnePassDataIndexer |
A DataIndexer for maxent model data which handles cutoffs for uncommon
contextual predicates and provides a unique integer index for each of the
predicates.
|
| OnePassRealValueDataIndexer |
A DataIndexer for maxent model data which handles cutoffs for uncommon
contextual predicates and provides a unique integer index for each of the
predicates and maintains event values.
|
| OntoNotesFormatParameters |
|
| OntoNotesNameSampleStream |
Name Sample Stream parser for the OntoNotes 4.0 corpus.
|
| OntoNotesNameSampleStreamFactory |
|
| OntoNotesParseSampleStream |
|
| OntoNotesParseSampleStreamFactory |
|
| OntoNotesPOSSampleStreamFactory |
|
| OutcomePriorFeatureGenerator |
The definition feature maps the underlying distribution of outcomes.
|
| ParagraphStream |
|
| ParallelNegLogLikelihood |
Evaluate negative log-likelihood and its gradient in parallel
|
| Parse |
Data structure for holding parse constituents.
|
| Parser |
A shift reduce style Parser implementation
based on Adwait Ratnaparkhi's 1998 thesis.
|
| Parser |
Defines common methods for full-syntactic parsers.
|
| Parser |
A built-attach Parser implementation.
|
| ParserChunkerFactory |
|
| ParserChunkerSequenceValidator |
|
| ParserConverterTool |
Tool to convert multiple data formats into native OpenNLP parser
format.
|
| ParserCrossValidator |
|
| ParserEvaluationMonitor |
A marker interface for evaluating parsers.
|
| ParserEvaluator |
This implementation of Evaluator behaves like EVALB with no exceptions,
e.g, without removing punctuation tags, or equality between ADVP and PRT, as
in
COLLINS convention.
|
| ParserEvaluatorTool |
A default Parse-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| ParserEventStream |
|
| ParserEventStream |
|
| ParserEventTypeEnum |
Enumeration of event types for a Parser.
|
| ParserFactory |
|
| ParserModel |
|
| ParserModelLoader |
|
| ParserTool |
|
| ParserTrainerTool |
|
| ParserType |
Enumeration of supported Parser types.
|
| ParseSampleStream |
|
| ParseSampleStreamFactory<P> |
|
| ParseSampleStreamFactory.Parameters |
|
| ParseToPOSSampleStream |
Note:
Do not use this class, internal use only!
|
| ParseToPOSSampleStreamFactory |
Note:
Do not use this class, internal use only!
|
| ParseToSentenceSampleStreamFactory |
Note:
Do not use this class, internal use only!
|
| ParseToTokenSampleStreamFactory |
Note:
Do not use this class, internal use only!
|
| PerceptronModel |
A model implementation based one the perceptron algorithm.
|
| PerceptronModelReader |
The base class for readers of models.
|
| PerceptronModelWriter |
|
| PerceptronTrainer |
Trains models using the perceptron algorithm.
|
| PlainTextByLineStream |
Reads a plain text file and returns each line as a String object.
|
| PlainTextFileDataReader |
A generic DataReader implementation for plain text files.
|
| PlainTextNaiveBayesModelReader |
|
| PlainTextNaiveBayesModelWriter |
|
| PorterStemmer |
|
| PortugueseContractionUtility |
Utility class to handle Portuguese contractions.
|
| POSContextGenerator |
|
| POSDictionary |
Provides a means of determining which tags are valid for a particular word
based on a TagDictionary read from a file.
|
| POSEvaluationErrorListener |
|
| POSEvaluator |
|
| POSModel |
|
| POSModelLoader |
Loads a POSModel for the command line tools.
|
| POSModelSerializer |
|
| POSSample |
|
| POSSampleEventStream |
Reads the samples from an Iterator
and converts those samples into events which
can be used by the maxent library for training.
|
| POSSampleSequenceStream |
|
| PosSampleStream |
|
| POSTagger |
The interface for part of speech taggers.
|
| POSTaggerConverterTool |
Tool to convert multiple data formats into native OpenNLP part of speech tagging
training format.
|
| POSTaggerCrossValidator |
|
| POSTaggerCrossValidatorTool |
|
| POSTaggerEvaluationMonitor |
|
| POSTaggerEvaluatorTool |
A default POSSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| POSTaggerFactory |
The factory that provides POSTagger default implementations and resources.
|
| POSTaggerFactory.POSDictionarySerializer |
|
| PosTaggerFeatureGenerator |
|
| PosTaggerFeatureGeneratorFactory |
|
| POSTaggerFineGrainedReportListener |
Generates a detailed report for the POS Tagger.
|
| POSTaggerME |
|
| POSTaggerNameFeatureGenerator |
Adds the token POS Tag as feature.
|
| POSTaggerNameFeatureGeneratorFactory |
|
| POSTaggerTool |
|
| POSTaggerTrainerTool |
|
| POSToSentenceSampleStream |
Note:
Do not use this class, internal use only!
|
| POSToSentenceSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| POSToTokenSampleStream |
Note:
Do not use this class, internal use only!
|
| POSToTokenSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| PrefixFeatureGenerator |
|
| PrefixFeatureGeneratorFactory |
|
| PreviousMapFeatureGenerator |
|
| PreviousMapFeatureGeneratorFactory |
|
| PreviousTwoMapFeatureGenerator |
|
| Prior |
This interface allows one to implement a prior distribution for use in
maximum entropy model training.
|
| Probabilities<T> |
Class implementing the probability distribution over labels returned by a classifier.
|
| Probability<T> |
Class implementing the probability for a label.
|
| ProbingLanguageDetectionResult |
A data container encapsulating language detection results.
|
| QNMinimizer |
Implementation of L-BFGS which supports L1-, L2-regularization
and Elastic Net for solving convex optimization problems.
|
| QNMinimizer.Evaluator |
Evaluate quality of training parameters.
|
| QNMinimizer.L2RegFunction |
|
| QNModel |
A maximum entropy model which has been trained using the Quasi Newton (QN) algorithm.
|
| QNModelReader |
|
| QNModelWriter |
The base class for writers of models.
|
| QNTrainer |
A Maxent model Trainer using L-BFGS algorithm.
|
| RealBasicEventStream |
|
| RealValueFileEventStream |
|
| RegexNameFinder |
|
| RegexNameFinderFactory |
Returns a RegexNameFinder based on a selection of
defaults or a configuration and a selection of defaults.
|
| RegexNameFinderFactory.DEFAULT_REGEX_NAME_FINDER |
Enumeration of typical regex expressions available in OpenNLP.
|
| RegexNameFinderFactory.RegexAble |
|
| RelationAnnotation |
|
| ResetableIterator<E> |
This interface makes an Iterator resettable.
|
| ReverseListIterator<T> |
An iterator for a list which returns values in the opposite order as the typical list iterator.
|
| Sample |
Represents a generic type of processable elements.
|
| SDContextGenerator |
|
| SDCrossValidator |
|
| SDEventStream |
|
| SegmenterObjectStream<S,T> |
|
| SentenceContextGenerator |
Creates contexts/features for end-of-sentence detection in Thai text.
|
| SentenceDetector |
The interface for sentence detectors, which find the sentence boundaries in
a text.
|
| SentenceDetectorConverterTool |
Tool to convert multiple data formats into native OpenNLP sentence detector
training format.
|
| SentenceDetectorCrossValidatorTool |
|
| SentenceDetectorEvaluationMonitor |
|
| SentenceDetectorEvaluator |
|
| SentenceDetectorEvaluatorTool |
A default SentenceSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| SentenceDetectorFactory |
The factory that provides SentenceDetector default implementations and
resources
|
| SentenceDetectorME |
A sentence detector for splitting up raw text into sentences.
|
| SentenceDetectorTool |
A sentence detector which uses a maxent model to predict the sentences.
|
| SentenceDetectorTrainerTool |
|
| SentenceEvaluationErrorListener |
|
| SentenceFeatureGenerator |
This feature generator creates sentence begin and end features.
|
| SentenceFeatureGeneratorFactory |
|
| SentenceModel |
|
| SentenceSample |
A SentenceSample contains a document with
begin indexes of the individual sentences.
|
| SentenceSampleStream |
This class is a stream filter which reads a sentence by line samples from
an ObjectStream and converts them into SentenceSample objects.
|
| SentenceSampleStreamFactory<P> |
|
| Sequence<T> |
Class which models a sequence.
|
| Sequence |
Represents a weighted sequence of outcomes.
|
| SequenceClassificationModel<T> |
A classification model that can label an input Sequence.
|
| SequenceCodec<T> |
|
| SequenceStream<S> |
Interface for streams of sequences used to train sequence models.
|
| SequenceStreamEventStream |
|
| SequenceTrainer |
|
| SequenceValidator<T> |
|
| SerializableArtifact |
A marker interface so that implementing classes can refer to
the corresponding ArtifactSerializer implementation.
|
| SgmlParser |
SAX style SGML parser.
|
| SgmlParser.ContentHandler |
|
| ShrinkCharSequenceNormalizer |
|
| SimplePerceptronSequenceTrainer |
Trains models with sequences using the perceptron algorithm.
|
| SimpleTokenizer |
A basic Tokenizer implementation which performs tokenization
using character classes.
|
| SimpleTokenizerTool |
|
| SnowballStemmer |
|
| SnowballStemmer.ALGORITHM |
|
| Span |
Class for storing start and end integer offsets.
|
| SpanAnnotation |
|
| Stemmer |
The stemmer is reducing a word to its stem.
|
| StringList |
|
| StringPattern |
Recognizes predefined patterns in strings.
|
| StringUtil |
|
| SuffixFeatureGenerator |
|
| SuffixFeatureGeneratorFactory |
|
| TagDictionary |
Interface to determine which tags are valid for a particular word
based on a tag dictionary.
|
| TaggerModelReplacerTool |
|
| ThreadSafe |
Classes, fields, or methods annotated @ThreadSafe are safe to use
in multithreading contexts.
|
| TokenClassFeatureGenerator |
Generates features for different for the class of the token.
|
| TokenClassFeatureGeneratorFactory |
|
| TokenContextGenerator |
Interface for context generators required for TokenizerME.
|
| TokenEvaluationErrorListener |
|
| TokenFeatureGenerator |
Generates a feature which contains the token itself.
|
| TokenFeatureGeneratorFactory |
|
| Tokenizer |
The interface for tokenizers, which segment a string into its tokens.
|
| TokenizerConverterTool |
Tool to convert multiple data formats into native OpenNLP sentence detector
training format.
|
| TokenizerCrossValidator |
|
| TokenizerCrossValidatorTool |
|
| TokenizerEvaluationMonitor |
|
| TokenizerEvaluator |
|
| TokenizerFactory |
The factory that provides Tokenizer default implementation and
resources.
|
| TokenizerME |
A Tokenizer for converting raw text into separated tokens.
|
| TokenizerMEEvaluatorTool |
A default TokenSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| TokenizerMETool |
|
| TokenizerModel |
|
| TokenizerModelLoader |
|
| TokenizerStream |
|
| TokenizerTrainerTool |
|
| TokenNameFinder |
The interface for name finders which provide name tags for a sequence of tokens.
|
| TokenNameFinderConverterTool |
Tool to convert multiple data formats into native OpenNLP name finder
training format.
|
| TokenNameFinderCrossValidator |
|
| TokenNameFinderCrossValidatorTool |
|
| TokenNameFinderDetailedFMeasureListener |
|
| TokenNameFinderEvaluationMonitor |
|
| TokenNameFinderEvaluator |
|
| TokenNameFinderEvaluatorTool |
A default NameSample-centric implementation of AbstractEvaluatorTool
that prints to an output stream.
|
| TokenNameFinderFactory |
The factory that provides TokenNameFinder default implementations and
resources.
|
| TokenNameFinderFineGrainedReportListener |
Generates a detailed report for the NameFinder.
|
| TokenNameFinderModel |
|
| TokenNameFinderModel.FeatureGeneratorCreationError |
|
| TokenNameFinderModelLoader |
|
| TokenNameFinderTool |
|
| TokenNameFinderTrainerTool |
|
| TokenPatternFeatureGenerator |
Partitions tokens into sub-tokens based on character classes and generates
class features for each of the sub-tokens and combinations of those sub-tokens.
|
| TokenPatternFeatureGeneratorFactory |
|
| TokenSample |
|
| TokenSampleStream |
Class which produces an Iterator<TokenSample> from a file of space delimited token.
|
| TokenSampleStream |
This class is a stream filter which reads in string encoded
samples and creates samples out of them.
|
| TokenSampleStreamFactory<P> |
|
| TokenTag |
|
| TokSpanEventStream |
This class reads the samples via an Iterator
and converts the samples into events which
can be used by the maxent library for training.
|
| Trainer |
Represents a common base for training implementations.
|
| TrainerFactory |
|
| TrainerFactory.TrainerType |
|
| TrainingParameters |
Declares and handles default parameters used for or during training models.
|
| TrainingToolParams |
Common training parameters.
|
| TrigramNameFeatureGenerator |
Adds trigram features based on tokens and token classes.
|
| TrigramNameFeatureGeneratorFactory |
|
| TwentyNewsgroupSampleStream |
|
| TwentyNewsgroupSampleStreamFactory<P> |
|
| TwitterCharSequenceNormalizer |
|
| TwoPassDataIndexer |
Collecting event and context counts by making two passes over the events.
|
| UncloseableInputStream |
|
| UniformPrior |
Provide a maximum entropy model with a uniform Prior.
|
| UrlCharSequenceNormalizer |
|
| Version |
The Version class represents the OpenNLP Tools library version.
|
| WhitespaceTokenizer |
A basic Tokenizer implementation which performs tokenization
using white spaces.
|
| WhitespaceTokenStream |
|
| WindowFeatureGenerator |
|
| WindowFeatureGeneratorFactory |
|
| WordClusterDictionary |
|
| WordClusterDictionary.WordClusterDictionarySerializer |
|
| WordClusterFeatureGenerator |
|
| WordClusterFeatureGeneratorFactory |
Defines a word cluster generator factory; it reads an element containing
'w2vwordcluster' as a tag name; these clusters are typically produced by
word2vec or clark pos induction systems.
|
| WordpieceTokenizer |
A Tokenizer implementation which performs tokenization
using word pieces.
|
| WordTagSampleStream |
A stream filter which reads a sentence per line which contains
words and tags in word_tag format and outputs a POSSample objects.
|
| WordTagSampleStreamFactory<P> |
Note:
Do not use this class, internal use only!
|
| WordTagSampleStreamFactory.Parameters |
|
| WordVector |
A word vector.
|
| WordVectorTable |
A table that maps tokens to word vectors.
|
| WordVectorType |
|
| XmlUtil |
|