All Classes and Interfaces
Class
Description
A base
ObjectStream implementation.An interface for generating features for name entity identification and for
updating document level contexts.
Parser for command line arguments.
Utility class for simple vector arithmetic.
Provides access to model persisted artifacts.
Responsible to create an artifact from an
InputStream.Represents a minimal tuple of information.
Common format parameters.
Common training parameters.
Interface for context generators used with a sequence beam search.
Provides fixed size, pre-allocated, least recently used replacement cache.
A char sequence normalizer, used to adjusting (prune, substitute, add, etc.)
characters in order to remove noise from text
The interface for chunkers which provide chunk tags for a sequence of tokens.
Interface for a
BeamSearchContextGenerator used in syntactic chunking.A marker interface for evaluating
chunkers.Class for holding chunks for a single unit of text.
Encapsulates a classpath entry that is associated with a
model URI
and optional properties.Describes a scanner which detects OpenNLP specific model files in an applications's classpath.
Holds constituents when reading
parses.Class which associates a real valued parameter or expected value with a particular contextual
predicate or feature.
Common cross validator parameters.
Represents an indexer which compresses events in memory and performs feature selection.
Describes generic ways to read data from a
DataInputStream.A
Detokenizer merges tokens back to their detokenized representation.This enum contains an operation for every token to merge the
tokens together to their detokenized form.
A marker interface for evaluating
doccat.Interface for classes which categorize documents.
Interface for processing an entire document allowing a
TokenNameFinder to use context
from the entire document.Class which holds a classified document and its category.
Encoding parameter.
EntityLinkers establish connections with external data to enrich extracted
entities.
Properties wrapper for
EntityLinker implementations.Common evaluation parameters.
The context of a decision point during training.
A specialized
Trainer that is based on a 'EventModelSequence' approach.Indicates that a certain API feature is not stable
and might change with a new release.
The
ExtensionLoader is responsible to load extensions to the OpenNLP library.Exception indicates that an OpenNLP extension could not be loaded.
Interface for generating features for document categorization.
The
FeatureGeneratorResourceProvider provides access to the resources
available in the model.Common evaluation parameters.
The
FMeasure is a utility class for evaluators
which measures precision, recall and the resulting f-measure.Represents a labeler for nodes which contain traces so that these traces can be predicted
by a
Parser.Encoder for head rules associated with parsing.
Allows repeated reads through a stream for certain model building types.
This exception indicates that the provided training data is
insufficient to train a desired model.
Classes, fields, or methods annotated
@Internal are for OpenNLP
internal use only.This exception indicates that a resource violates the expected data format.
Class for holding the document language and its confidence
Validates language codes against ISO 639 standards.
The interface for
LanguageDetector which predicts the Language for a context.A context generator interface for
LanguageDetector.A marker interface for evaluating
language detectors.A language model can calculate the probability p (between 0 and 1) of a
certain sequence of tokens, given its underlying vocabulary.
Holds a classified document and its
Language.Represents a lemmatized sentence.
The common interface for lemmatizers.
Interface for the context generator used for probabilistic
Lemmatizer.A marker interface for evaluating
lemmatizers.Interface for maximum entropy models.
Calculates the arithmetic mean of values
added with the
Mean.add(double) method.A model type to pattern enumeration.
Enumeration of supported model types.
Interface that allows
TagDictionary entries to be added and removed.Interface for generating the context for a
name finder by
specifying a set of feature generators.Encapsulates names for a single unit of text.
Reads
objects from a stream.Data structure for holding parse constituents.
Defines common methods for full-syntactic parsers.
A marker interface for evaluating
parsers.Enumeration of event types for a
Parser.Enumeration of supported
Parser types.Reads a plain text file and returns each line as a
String object.Interface for a
BeamSearchContextGenerator used in POS tagging.Represents an pos-tagged
sentence.The interface for part of speech taggers.
A marker interface for evaluating
pos taggers.This interface allows one to implement a prior distribution for use in
maximum entropy model training.
A marker interface for classes with probabilistic capabilities.
This interface makes an
Iterator resettable.An iterator for a list which returns values in the opposite order as the typical list iterator.
Represents a generic type of processable elements.
Interface for
SentenceDetector context generators.The interface for sentence detectors, which find the sentence boundaries in
a text.
A
SentenceSample contains a document with
begin indexes of the individual sentences.An sentiment specific
EvaluationMonitor to be used by the evaluator.Class for holding text used for sentiment analysis.
Class which models a sequence.
Represents a weighted sequence of outcomes.
A classification model that can label an input
Sequence.A codec for sequences of type
T.Interface for streams of
sequences used to train sequence models.A marker interface so that implementing classes can refer to
the corresponding
ArtifactSerializer implementation.Class for storing start and end integer offsets.
The stemmer is reducing a word to its stem.
Stop criteria for model training.
A marker-interface for a String interner implementation.
Interface to determine which tags are valid for a particular word
based on a tag dictionary.
Exception to terminate the execution of a command line tool.
Classes, fields, or methods annotated
@ThreadSafe are safe to use
in multithreading contexts.Interface for context generators required for
tokenizer implementations.The interface for tokenizers, which segment a string into its tokens.
A marker interface for evaluating
tokenizers.The interface for name finders which provide name tags for a sequence of tokens.
A marker interface for evaluating
name finders.A
TokenSample is text with token spans.Represents a common base for training implementations.
Configuration used for model training.
Enumeration of Training measures.
An interface to capture training progress of a model.
Common training parameters.
A basic
Tokenizer implementation which performs tokenization
using white spaces.A
Tokenizer implementation which performs tokenization
using word pieces.A word vector.
A table that maps tokens to word vectors.