All Classes (Apache OpenNLP Tools 2.0.0 API)

All Classes Interface Summary Class Summary Enum Summary Exception Summary Annotation Types Summary
Class	Description
AbstractBottomUpParser	Abstract class which contains code to tag and chunk parses for bottom up parsing and leaves implementation of advancing parses and completing parses to extend class.
AbstractContextGenerator	Abstract class containing many of the methods used to generate contexts for parsing.
AbstractDataIndexer	Abstract class for collecting event and context counts used in training.
AbstractEventModelSequenceTrainer
AbstractEventStream<T>
AbstractEventTrainer
AbstractModel
AbstractModel.ModelType
AbstractModelReader
AbstractModelWriter
AbstractObjectStream<T>
AbstractParserEventStream	Abstract class extended by parser event streams which perform tagging and chunking.
AbstractSampleStreamFactory<T>	Base class for sample stream factories.
AbstractSequenceTrainer
AbstractToSentenceSampleStream<T>
AbstractTrainer
AdaptiveFeatureGenerator	An interface for generating features for name entity identification and for updating document level contexts.
ADChunkSampleStream	Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the Portuguese Chunker training.
ADChunkSampleStreamFactory	A Factory to create a Arvores Deitadas ChunkStream from the command line utility.
AdditionalContextFeatureGenerator	The `AdditionalContextFeatureGenerator` generates the context from the passed in additional context.
ADNameSampleStream	Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the Portuguese NER training.
ADNameSampleStreamFactory	A Factory to create a Arvores Deitadas NameSampleDataStream from the command line utility.
ADPOSSampleStream	Note: Do not use this class, internal use only!
ADPOSSampleStreamFactory	Note: Do not use this class, internal use only!
ADSentenceSampleStream	Note: Do not use this class, internal use only!
ADSentenceSampleStreamFactory	Note: Do not use this class, internal use only!
ADSentenceStream	Stream filter which merges text lines into sentences, following the Arvores Deitadas syntax.
ADSentenceStream.Sentence
ADSentenceStream.SentenceParser	Parses a sample of AD corpus.
ADTokenSampleStreamFactory	Note: Do not use this class, internal use only!
AggregateCharSequenceNormalizer
AggregatedFeatureGenerator	The `AggregatedFeatureGenerator` aggregates a set of `AdaptiveFeatureGenerator`s and calls them to generate the features.
AggregatedFeatureGeneratorFactory
AncoraSpanishHeadRules	Class for storing the Ancora Spanish head rules associated with parsing.
AncoraSpanishHeadRules.HeadRulesSerializer
AnnotationConfiguration
AnnotatorNoteAnnotation
arabicStemmer	This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
ArrayMath	Utility class for simple vector arithmetic.
ArrayMath	Deprecated.
ArtifactProvider	Provides access to model persisted artifacts.
ArtifactSerializer<T>	Responsible to create an artifact from an `InputStream`.
ArtifactToSerializerMapper	Deprecated.
AttachContextGenerator
AttributeAnnotation
Attributes	The `Attributes` class stores name value pairs.
BagOfWordsFeatureGenerator	Generates a feature for each word in a document.
BaseLink	Stores a minimal tuple of information.
BaseModel	This model is a common based which can be used by the components model classes.
BaseToolFactory	Base class for all tool factories.
BasicContextGenerator	Generate contexts for maxent decisions, assuming that the input given to the getContext() method is a String containing contextual predicates separated by spaces.
BasicFormatParams	Common format parameters.
BasicTrainingParams	Common training parameters.
BeamSearch<T>	Performs k-best search over sequence.
BeamSearchContextGenerator<T>	Interface for context generators used with a sequence beam search.
BigramNameFeatureGenerator
BigramNameFeatureGeneratorFactory
BilouCodec
BilouNameFinderSequenceValidator
BinaryFileDataReader
BinaryGISModelReader	A reader for GIS models stored in binary format.
BinaryGISModelWriter	Model writer that saves models in binary format.
BinaryNaiveBayesModelReader
BinaryNaiveBayesModelWriter	Model writer that saves models in binary format.
BinaryPerceptronModelReader
BinaryPerceptronModelWriter	Model writer that saves models in binary format.
BinaryQNModelReader	A reader for quasi-newton models stored in binary format.
BinaryQNModelWriter
BioCodec
BioNLP2004NameSampleStream	Parser for the training files of the BioNLP/NLPBA 2004 shared task.
BioNLP2004NameSampleStreamFactory
BratAnnotation
BratAnnotationStream	Reads the annotations from the brat .ann annotation file.
BratDocument
BratDocumentParser
BratDocumentStream
BratNameSampleStream	Generates Name Sample objects for a Brat Document object.
BratNameSampleStreamFactory
BrownBigramFeatureGenerator	Generates Brown cluster features for token bigrams.
BrownCluster	Class to load a Brown cluster document: word\tword_class\tprob http://metaoptimize.com/projects/wordreprs/ The file containing the clustering lexicon has to be passed as the value of the dict attribute of each BrownCluster feature generator.
BrownCluster.BrownClusterSerializer
BrownClusterBigramFeatureGeneratorFactory	Generates Brown clustering features for token bigrams.
BrownClusterTokenClassFeatureGeneratorFactory	Generates Brown clustering features for token classes.
BrownClusterTokenFeatureGeneratorFactory	Generates Brown clustering features for current token.
BrownTokenClasses	Obtain the paths listed in the pathLengths array from the Brown class.
BrownTokenClassFeatureGenerator	Generates Brown cluster features for current token and token class.
BrownTokenFeatureGenerator	Generates Brown cluster features for current token.
BuildContextGenerator	Class to generator predictive contexts for deciding how constituents should be combined together.
BuildContextGenerator	Creates the features or contexts for the building phase of parsing.
BuildModelUpdaterTool
ByteArraySerializer
Cache<K,V>	Provides fixed size, pre-allocated, least recently used replacement cache.
CachedFeatureGenerator	Caches features of the aggregated `AdaptiveFeatureGenerator`s.
CachedFeatureGeneratorFactory
catalanStemmer	This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
CensusDictionaryCreatorTool	This tool helps create a loadable dictionary for the `NameFinder`, from data collected from US Census data.
CharacterNgramFeatureGenerator	The `CharacterNgramFeatureGenerator` uses character ngrams to generate features about each token.
CharacterNgramFeatureGeneratorFactory
CharSequenceNormalizer	A char sequence normalizer, used to adjusting (prune, substitute, add, etc.) characters in order to remove noise from text
CheckContextGenerator	Class for generating predictive context for deciding when a constituent is complete.
CheckContextGenerator
CheckModelUpdaterTool
ChunkContextGenerator	Creates predivtive context for the pre-chunking phases of parsing.
Chunker	The interface for chunkers which provide chunk tags for a sequence of tokens.
ChunkerContextGenerator	Interface for the context generator used in syntactic chunking.
ChunkerConverterTool	Tool to convert multiple data formats into native OpenNLP chunker training format.
ChunkerCrossValidator
ChunkerCrossValidatorTool
ChunkerDetailedFMeasureListener
ChunkerEvaluationMonitor
ChunkerEvaluator	The `ChunkerEvaluator` measures the performance of the given `Chunker` with the provided reference `ChunkSample`s.
ChunkerEvaluatorTool
ChunkerEventStream	Class for creating an event stream out of data files for training a chunker.
ChunkerFactory
ChunkerME	The class represents a maximum-entropy-based chunker.
ChunkerMETool
ChunkerModel	The `ChunkerModel` is the model used by a learnable `Chunker`.
ChunkerModelLoader	Loads a Chunker Model for the command line tools.
ChunkerModelSerializer
ChunkerSampleStreamFactory	Factory producing OpenNLP `ChunkSampleStream`s.
ChunkerTrainerTool
ChunkEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
ChunkSample	Class for holding chunks for a single unit of text.
ChunkSampleSequenceStream
ChunkSampleStream	Parses the conll 2000 shared task shallow parser training data.
ChunkSampleStream
CollectionObjectStream<E>
ComparableEvent	A maxent event representation which we can use to sort based on the predicates indexes contained in the events.
ComparablePredicate	A maxent predicate representation which we can use to sort based on the outcomes.
ConfigurablePOSContextGenerator	A context generator for the POS Tagger.
Conll02NameSampleStream	Parser for the dutch and spanish ner training files of the CONLL 2002 shared task.
Conll02NameSampleStream.LANGUAGE
Conll02NameSampleStreamFactory	Note: Do not use this class, internal use only!
Conll03NameSampleStream	An import stream which can parse the CONLL03 data.
Conll03NameSampleStream.LANGUAGE
Conll03NameSampleStreamFactory
ConlluLemmaSampleStream
ConlluLemmaSampleStreamFactory	Note: Do not use this class, internal use only!
ConlluPOSSampleStream
ConlluPOSSampleStreamFactory	Note: Do not use this class, internal use only!
ConlluSentence
ConlluSentenceSampleStream
ConlluSentenceSampleStreamFactory
ConlluStream	The CoNNL-U Format is specified here: http://universaldependencies.org/format.html
ConlluTagset
ConlluTokenSampleStream
ConlluTokenSampleStreamFactory
ConlluWordLine
ConllXPOSSampleStream	Parses the data from the CONLL 06 shared task into POS Samples.
ConllXPOSSampleStreamFactory	Note: Do not use this class, internal use only!
ConllXSentenceSampleStreamFactory	Note: Do not use this class, internal use only!
ConllXTokenSampleStreamFactory	Note: Do not use this class, internal use only!
Cons	Class to hold feature information about a specific parse node.
ConstitParseSampleStream
ConstitParseSampleStreamFactory
Constituent	Class used to hold constituents when reading parses.
Context	Class which associates a real valued parameter or expected value with a particular contextual predicate or feature.
ContextGenerator<T>	Generate contexts for maxent decisions.
CrossValidationPartitioner<E>	Provides access to training and test partitions for n-fold cross validation.
CrossValidationPartitioner.TrainingSampleStream<E>	The `TrainingSampleStream` which iterates over all training elements.
CustomFeatureGenerator	Deprecated.
CVParams	Common cross validator parameters.
DataIndexer	Object which compresses events in memory and performs feature selection.
DataIndexerFactory
DataReader
DataStream	A interface for objects which can deliver a stream of training data to be supplied to an EventStream.
DefaultChunkerContextGenerator	Features based on chunking model described in Fei Sha and Fernando Pereira.
DefaultChunkerSequenceValidator
DefaultEndOfSentenceScanner	Default implementation of the `EndOfSentenceScanner`.
DefaultLanguageDetectorContextGenerator	A context generator for language detector.
DefaultLemmatizerContextGenerator	Simple feature generator for learning statistical lemmatizers.
DefaultLemmatizerSequenceValidator
DefaultNameContextGenerator	Class for determining contextual features for a tag/chunk style named-entity recognizer.
DefaultPOSContextGenerator	A context generator for the POS Tagger.
DefaultPOSSequenceValidator
DefaultSDContextGenerator	Generate event contexts for maxent decisions for sentence detection.
DefaultTokenContextGenerator	Generate events for maxent decisions for tokenization.
DefinitionFeatureGeneratorFactory
DetailedFMeasureEvaluatorParams	EvaluatorParams for Chunker.
DetokenEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
DetokenizationDictionary
DetokenizationDictionary.Operation
Detokenizer	A Detokenizer merges tokens back to their untokenized representation.
Detokenizer.DetokenizationOperation	This enum contains an operation for every token to merge the tokens together to their detokenized form.
DetokenizerEvaluator	The `DetokenizerEvaluator` measures the performance of the given `Detokenizer` with the provided reference `TokenSample`s.
DetokenizerParameter
DetokenizerSampleStreamFactory<T>	Base class for factories which need detokenizer.
DetokenizeSentenceSampleStream
Dictionary	This class is a dictionary.
DictionaryBuilderTool
DictionaryDetokenizer	A rule based detokenizer.
DictionaryDetokenizerTool
DictionaryEntryPersistor	This class is used by for reading and writing dictionaries of all kinds.
DictionaryFeatureGenerator	The `DictionaryFeatureGenerator` uses the `DictionaryNameFinder` to generated features for detected names based on the `InSpanGenerator`.
DictionaryFeatureGeneratorFactory
DictionaryLemmatizer	Lemmatize by simple dictionary lookup into a hashmap built from a file containing, for each line, word\tabpostag\tablemma.
DictionaryNameFinder	This is a dictionary based name finder, it scans text for names inside a dictionary.
DictionarySerializer
DirectorySampleStream	The directory sample stream allows for creating a stream from a directory listing of files.
DoccatConverterTool
DoccatCrossValidator	Cross validator for document categorization
DoccatCrossValidatorTool
DoccatEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
DoccatEvaluationMonitor	`EvaluationMonitor` for doccat.
DoccatEvaluatorTool
DoccatFactory	The factory that provides Doccat default implementations and resources
DoccatFineGrainedReportListener	Generates a detailed report for the POS Tagger.
DoccatModel	A model for document categorization
DoccatModelLoader	Loads a Document Categorizer Model for the command line tools.
DoccatTool
DoccatTrainerTool
DocumentBeginFeatureGenerator
DocumentBeginFeatureGeneratorFactory
DocumentCategorizer	Interface for classes which categorize documents.
DocumentCategorizerEvaluator	The `DocumentCategorizerEvaluator` measures the performance of the given `DocumentCategorizer` with the provided reference `DocumentSample`s.
DocumentCategorizerEventStream	Iterator-like class for modeling document classification events.
DocumentCategorizerME	Maxent implementation of `DocumentCategorizer`.
DocumentNameFinder	Name finding interface which processes an entire document allowing the name finder to use context from the entire document.
DocumentSample	Class which holds a classified document and its category.
DocumentSampleStream	This class reads in string encoded training samples, parses them and outputs `DocumentSample` objects.
DocumentSampleStreamFactory	Factory producing OpenNLP `DocumentSampleStream`s.
DocumentToLineStream	Reads a plain text file and return each line as a `String` object.
DownloadUtil	This class facilitates the downloading of pretrained OpenNLP models.
DownloadUtil.ModelType	The type of model.
DynamicEvalParameters
EmojiCharSequenceNormalizer	Normalizer for emojis.
EmptyLinePreprocessorStream	Stream to to clean up empty lines for empty line separated document streams. - Skips empty line at training data start - Transforms multiple empty lines in a row into one - Replaces white space lines with empty lines - TODO: Terminates last document with empty line if it is missing This stream should be used by the components that mark empty lines to mark document boundaries.
EncodingParameter	Encoding parameter.
EndOfSentenceScanner	Scans Strings, StringBuffers, and char[] arrays for the offsets of sentence ending characters.
EntityLinker<T extends Span>	EntityLinkers establish connections to external data to enrich extracted entities.
EntityLinkerFactory	Generates an EntityLinker implementation via properties file configuration
EntityLinkerProperties	Properties wrapper for the EntityLinker framework
EntityLinkerTool
Entry	An `Entry` is a `StringList` which can optionally be mapped to attributes.
EntryInserter
EvalitaNameSampleStream	Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.
EvalitaNameSampleStream.LANGUAGE
EvalitaNameSampleStreamFactory	Note: Do not use this class, internal use only!
EvalParameters	This class encapsulates the varibales used in producing probabilities from a model and facilitaes passing these variables to the eval method.
EvaluationMonitor<T>
Evaluator<T>	The `Evaluator` is an abstract base class for evaluators.
EvaluatorParams	Common evaluation parameters.
Event	The context of a decision point during training.
EventAnnotation
EventModelSequenceTrainer
EventTraceStream
EventTrainer
Experimental	Indicates that the API is not stable.
ExtensionLoader	The `ExtensionLoader` is responsible to load extensions to the OpenNLP library.
ExtensionNotLoadedException	Exception indicates that an OpenNLP extension could not be loaded.
ExtensionServiceKeys
Factory
Factory
FeatureGenerator	Interface for generating features for document categorization.
FeatureGeneratorResourceProvider	The `FeatureGeneratorResourceProvider` provides access to the resources provided in the model.
FeatureGeneratorUtil	This class provide common utilities for feature generation.
FileEventStream	Class for using a file of events as an event stream.
FileToByteArraySampleStream
FileToStringSampleStream	Provides the ability to read the contents of files contained in an object stream of files.
FilterObjectStream<S,T>	Abstract base class for filtering `ObjectStream`s.
FineGrainedEvaluatorParams	Common evaluation parameters.
FMeasure	The `FMeasure` is an utility class for evaluators which measure precision, recall and the resulting f-measure.
Function	Interface for a function
GapLabeler	Interface for labeling nodes which contain traces so that these traces can be predicted by the parser.
GeneratorFactory	Creates a set of feature generators based on a provided XML descriptor.
GeneratorFactory.AbstractXmlFeatureGeneratorFactory
GenericModelReader
GenericModelSerializer
GenericModelWriter
GISModel	A maximum entropy model which has been trained using the Generalized Iterative Scaling procedure (implemented in GIS.java).
GISModelReader	Abstract parent class for readers of GISModels.
GISModelWriter	Abstract parent class for GISModel writers.
GISTrainer	An implementation of Generalized Iterative Scaling.
Glove	Warning: Experimental new feature, see OPENNLP-1144 for details, the API might be changed anytime.
greekStemmer	This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
HashSumEventStream
HeadRules	Interface for encoding the head rules associated with parsing.
HeadRules	Class for storing the English head rules associated with parsing.
HeadRules.HeadRulesSerializer
Index	This classes indexes `StringList`s.
indonesianStemmer	This class implements the stemming algorithm defined by a snowball script.
InputStreamFactory	Allows repeated reads through a stream for certain types of model building.
InSpanGenerator	Generates features if the tokens are recognized by the provided `TokenNameFinder`.
InsufficientTrainingDataException	This exception indicates that the provided training data is insufficient to train the desired model.
InvalidFormatException	This exception indicates that a resource violates the expected data format.
IrishSentenceBankDocument	A structure to hold an Irish Sentence Bank document, which is a collection of tokenized sentences.
IrishSentenceBankDocument.IrishSentenceBankFlex
IrishSentenceBankDocument.IrishSentenceBankSentence
IrishSentenceBankSentenceStreamFactory
IrishSentenceBankTokenSampleStreamFactory
irishStemmer	This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
Language	Class for holding the document language and its confidence
LanguageDetector	The interface for LanguageDetector which provide the @`Language` according to the context.
LanguageDetectorConfig
LanguageDetectorContextGenerator	A context generator interface for language detector.
LanguageDetectorConverterTool
LanguageDetectorCrossValidator	Cross validator for language detector
LanguageDetectorCrossValidatorTool
LanguageDetectorEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
LanguageDetectorEvaluationMonitor	`EvaluationMonitor` for Language Detector.
LanguageDetectorEvaluator	The `LanguageDetectorEvaluator` measures the performance of the given `LanguageDetector` with the provided reference `LanguageSample`s.
LanguageDetectorEvaluatorTool
LanguageDetectorEventStream	Iterator-like class for modeling language detector events.
LanguageDetectorFactory	Default factory used by Language Detector.
LanguageDetectorFineGrainedReportListener	Generates a detailed report for the POS Tagger.
LanguageDetectorME	Implements learnable Language Detector
LanguageDetectorModel	A model for language detection
LanguageDetectorModelLoader	Loads a Language Detector Model for the command line tools.
LanguageDetectorSampleStream	This class reads in string encoded training samples, parses them and outputs `LanguageSample` objects.
LanguageDetectorSampleStreamFactory	Factory producing OpenNLP `DocumentSampleStream`s.
LanguageDetectorTool
LanguageDetectorTrainerTool
LanguageModel	A language model can calculate the probability p (between 0 and 1) of a certain `sequence of tokens`, given its underlying vocabulary.
LanguageParams
LanguageSample	Class which holds a classified document and its @`Language`.
LanguageSampleStreamFactory<T>	Stream factory for those streams which carry language.
LeipzigLanguageSampleStream
LeipzigLanguageSampleStreamFactory	Note: Do not use this class, internal use only!
LemmaEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
LemmaSample	Represents an lemmatized sentence.
LemmaSampleEventStream	Class for creating an event stream out of data files for training a probabilistic lemmatizer.
LemmaSampleSequenceStream
LemmaSampleStream	Reads data for training and testing the lemmatizer.
Lemmatizer	The interface for lemmatizers.
LemmatizerContextGenerator	Interface for the context generator used for probabilistic lemmatizer.
LemmatizerEvaluationMonitor	Interface for the lemmatizer evaluator.
LemmatizerEvaluator	The `LemmatizerEvaluator` measures the performance of the given `Lemmatizer` with the provided reference `LemmaSample`s.
LemmatizerEvaluatorTool
LemmatizerFactory
LemmatizerFineGrainedReportListener	Generates a detailed report for the Lemmatizer.
LemmatizerME	A probabilistic lemmatizer.
LemmatizerMETool
LemmatizerModel	The `LemmatizerModel` is the model used by a learnable `Lemmatizer`.
LemmatizerModelLoader	Loads a Lemmatizer Model for the command line tools.
LemmatizerSampleStreamFactory	Factory producing OpenNLP `LemmaSampleStream`s.
LemmatizerTrainerTool
LetsmtDocument	A structure to hold the letsmt document.
LetsmtDocument.LetsmtDocumentHandler
LetsmtDocument.LetsmtSentence
LetsmtSentenceStreamFactory
LineSearch	Class that performs line search to find minimum
LineSearch.LineSearchResult	Class to store lineSearch result
LinkedSpan<T extends BaseLink>	An "default" extended span that holds additional information about the Span
LogProbabilities<T>	Class implementing the probability distribution over labels returned by a classifier as a log of probabilities.
LogProbability<T>	Class implementing the probability for a label.
MarkableFileInputStreamFactory	A factory that creates `MarkableFileInputStream` from a `File`
MascDocument
MascDocumentStream
MascNamedEntityParser	A class to process the MASC Named entity stand-off annotation file
MascNamedEntitySampleStream
MascNamedEntitySampleStreamFactory
MascPennTagParser	A class for parsing MASC's Penn tagging/tokenization stand-off annotation
MascPOSSampleStream
MascPOSSampleStreamFactory
MascSentence
MascSentenceSampleStream
MascSentenceSampleStreamFactory
MascToken
MascTokenSampleStream
MascTokenSampleStreamFactory
MascWord
MaxentModel	Interface for maximum entropy models.
Mean	Calculates the arithmetic mean of values added with the `Mean.add(double)` method.
ModelType
ModelUtil	Utility class for handling of `MaxentModel`s.
MosesSentenceSampleStream
MosesSentenceSampleStreamFactory	Factory producing OpenNLP `MosesSentenceSampleStream`s.
Muc6NameSampleStreamFactory
MucNameContentHandler
MucNameSampleStream
MutableContext	Class used to store parameters or expected values associated with this context which can be updated or assigned.
MutableInt	This is a non-thread safe mutable int.
MutableTagDictionary	Interface that allows `TagDictionary` entries to be added and removed.
NaiveBayesEvalParameters	Parameters for the evalution of a naive bayes classifier
NaiveBayesModel	Class implementing the multinomial Naive Bayes classifier model.
NaiveBayesModelReader	Abstract parent class for readers of NaiveBayes.
NaiveBayesModelWriter	Abstract parent class for NaiveBayes writers.
NaiveBayesTrainer	Trains models using the combination of EM algorithm and Naive Bayes classifier which is described in: Text Classification from Labeled and Unlabeled Documents using EM Nigam, McCallum, et al paper of 2000
NameContextGenerator	Interface for generating the context for an name finder by specifying a set of geature generators.
NameEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
NameFinderCensus90NameStream	This class helps to read the US Census data from the files to build a StringList for each dictionary entry in the name-finder dictionary.
NameFinderEventStream	Class for creating an event stream out of data files for training an name finder.
NameFinderME	Class for creating a maximum-entropy-based name finder.
NameFinderSequenceValidator	This class is created by the `BioCodec`.
NameSample	Class for holding names for a single unit of text.
NameSampleCountersStream	Counts tokens, sentences and names by type
NameSampleDataStream	The `NameSampleDataStream` class converts tagged `String`s provided by a `DataStream` to `NameSample` objects.
NameSampleDataStreamFactory	Factory producing OpenNLP `NameSampleDataStream`s.
NameSampleDataStreamFactory.Parameters
NameSampleSequenceStream
NameSampleTypeFilter	A stream which removes Name Samples which do not have a certain type.
NameToSentenceSampleStream	Note: Do not use this class, internal use only!
NameToSentenceSampleStreamFactory	Note: Do not use this class, internal use only!
NameToTokenSampleStream	Note: Do not use this class, internal use only!
NameToTokenSampleStreamFactory	Note: Do not use this class, internal use only!
NegLogLikelihood	Evaluate negative log-likelihood and its gradient from DataIndexer.
NewlineSentenceDetector	The Newline Sentence Detector assumes that sentences are line delimited and recognizes one sentence per non-empty line.
NGramCharModel	The `NGramCharModel` can be used to create character ngrams.
NGramFeatureGenerator	Generates ngram features for a document.
NGramGenerator	Generates an nGram, with optional separator, and returns the grams as a list of strings
NGramLanguageModel	A `LanguageModel` based on a `NGramModel` using Stupid Backoff to get the probabilities of the ngrams.
NGramLanguageModelTool	Command line tool for `NGramLanguageModel`.
NGramModel	The `NGramModel` can be used to crate ngrams and character ngrams.
NGramUtils	Utility class for ngrams.
NKJPSegmentationDocument
NKJPSegmentationDocument.Pointer
NKJPSentenceSampleStream
NKJPSentenceSampleStreamFactory
NKJPTextDocument
NumberCharSequenceNormalizer	Normalizer for numbers
ObjectDataReader
ObjectStream<T>	Reads `Object`s from a stream.
ObjectStreamUtils
OnePassDataIndexer	An indexer for maxent model data which handles cutoffs for uncommon contextual predicates and provides a unique integer index for each of the predicates.
OnePassRealValueDataIndexer	An indexer for maxent model data which handles cutoffs for uncommon contextual predicates and provides a unique integer index for each of the predicates and maintains event values.
OntoNotesFormatParameters
OntoNotesNameSampleStream	Name Sample Stream parser for the OntoNotes 4.0 corpus.
OntoNotesNameSampleStreamFactory
OntoNotesParseSampleStream
OntoNotesParseSampleStreamFactory
OntoNotesPOSSampleStreamFactory
OSGiExtensionLoader	OSGi bundle activator which can use an OSGi service as an OpenNLP extension.
OutcomePriorFeatureGenerator	The definition feature maps the underlying distribution of outcomes.
ParagraphStream	Stream filter which merges text lines into paragraphs.
ParallelNegLogLikelihood	Evaluate negative log-likelihood and its gradient in parallel
Parse	Data structure for holding parse constituents.
Parser	Class for a shift reduce style parser based on Adwait Ratnaparkhi's 1998 thesis.
Parser	Interface for full-syntactic parsers.
Parser	Built/attach parser.
ParserChunkerFactory
ParserChunkerSequenceValidator
ParserConverterTool
ParserCrossValidator
ParserEvaluationMonitor
ParserEvaluator	Class for ParserEvaluator.
ParserEvaluatorTool
ParserEventStream	Wrapper class for one of four parser event streams.
ParserEventStream
ParserEventTypeEnum	Enumerated type of event types for the parser.
ParserFactory
ParserModel	This is an abstract base class for `ParserModel` implementations.
ParserModelLoader	Loads a Parser Model for the command line tools.
ParserTool
ParserTrainerTool
ParserType
ParseSampleStream
ParseSampleStreamFactory	Factory producing OpenNLP `ParseSampleStream`s.
ParseSampleStreamFactory.Parameters
ParseToPOSSampleStream	Note: Do not use this class, internal use only!
ParseToPOSSampleStreamFactory	Note: Do not use this class, internal use only!
ParseToSentenceSampleStreamFactory	Note: Do not use this class, internal use only!
ParseToTokenSampleStreamFactory	Note: Do not use this class, internal use only!
PerceptronModel
PerceptronModelReader	Abstract parent class for readers of Perceptron.
PerceptronModelWriter	Abstract parent class for Perceptron writers.
PerceptronTrainer	Trains models using the perceptron algorithm.
PlainTextByLineStream	Reads a plain text file and return each line as a `String` object.
PlainTextFileDataReader
PlainTextNaiveBayesModelReader
PlainTextNaiveBayesModelWriter	Model writer that saves models in plain text format.
PorterStemmer	Stemmer, implementing the Porter Stemming Algorithm The Stemmer class transforms a word into its root form.
PortugueseContractionUtility	Utility class to handle Portuguese contractions.
POSContextGenerator	The interface for a context generator for the POS Tagger.
POSDictionary	Provides a means of determining which tags are valid for a particular word based on a tag dictionary read from a file.
POSEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
POSEvaluator	The `POSEvaluator` measures the performance of the given `POSTagger` with the provided reference `POSSample`s.
POSModel	The `POSModel` is the model used by a learnable `POSTagger`.
POSModelLoader	Loads a POS Tagger Model for the command line tools.
POSModelSerializer
POSSample	Represents an pos-tagged sentence.
POSSampleEventStream	This class reads the `POSSample`s from the given `Iterator` and converts the `POSSample`s into `Event`s which can be used by the maxent library for training.
POSSampleSequenceStream
PosSampleStream
POSTagger	The interface for part of speech taggers.
POSTaggerConverterTool
POSTaggerCrossValidator
POSTaggerCrossValidatorTool
POSTaggerEvaluationMonitor
POSTaggerEvaluatorTool
POSTaggerFactory	The factory that provides POS Tagger default implementations and resources
POSTaggerFactory.POSDictionarySerializer
PosTaggerFeatureGenerator
PosTaggerFeatureGeneratorFactory
POSTaggerFineGrainedReportListener	Generates a detailed report for the POS Tagger.
POSTaggerME	A part-of-speech tagger that uses maximum entropy.
POSTaggerNameFeatureGenerator	Adds the token POS Tag as feature.
POSTaggerNameFeatureGeneratorFactory
POSTaggerTool
POSTaggerTrainerTool
POSToSentenceSampleStream	Note: Do not use this class, internal use only!
POSToSentenceSampleStreamFactory	Note: Do not use this class, internal use only!
POSToTokenSampleStream	Note: Do not use this class, internal use only!
POSToTokenSampleStreamFactory	Note: Do not use this class, internal use only!
PrefixFeatureGenerator
PrefixFeatureGeneratorFactory
PreviousMapFeatureGenerator	This `AdaptiveFeatureGenerator` generates features indicating the outcome associated with a previously occuring word.
PreviousMapFeatureGeneratorFactory
PreviousTwoMapFeatureGenerator	This `AdaptiveFeatureGenerator` generates features indicating the outcome associated with two previously occuring words.
Prior	This interface allows one to implement a prior distribution for use in maximum entropy model training.
Probabilities<T>	Class implementing the probability distribution over labels returned by a classifier.
Probability<T>	Class implementing the probability for a label.
ProbingLanguageDetectionResult
QNMinimizer	Implementation of L-BFGS which supports L1-, L2-regularization and Elastic Net for solving convex optimization problems.
QNMinimizer.Evaluator	Evaluate quality of training parameters.
QNMinimizer.L2RegFunction	L2-regularized objective function
QNModel
QNModelReader
QNModelWriter
QNTrainer	Maxent model trainer using L-BFGS algorithm.
RealBasicEventStream
RealValueFileEventStream
RegexNameFinder	Name finder based on a series of regular expressions.
RegexNameFinderFactory	Returns a RegexNameFinder based on A selection of defaults or a configuration and a selection of defaults
RegexNameFinderFactory.DEFAULT_REGEX_NAME_FINDER
RegexNameFinderFactory.RegexAble
RelationAnnotation
ResetableIterator<E>	This interface makes an `Iterator` resetable.
ReverseListIterator<T>	An iterator for a list which returns values in the opposite order as the typical list iterator.
SDContextGenerator	Interface for `SentenceDetectorME` context generators.
SDCrossValidator	A cross validator for the sentence detector.
SDEventStream
SegmenterObjectStream<S,T>
SentenceContextGenerator	Creates contexts/features for end-of-sentence detection in Thai text.
SentenceDetector	The interface for sentence detectors, which find the sentence boundaries in a text.
SentenceDetectorConverterTool
SentenceDetectorCrossValidatorTool
SentenceDetectorEvaluationMonitor
SentenceDetectorEvaluator	The `SentenceDetectorEvaluator` measures the performance of the given `SentenceDetector` with the provided reference `SentenceSample`s.
SentenceDetectorEvaluatorTool
SentenceDetectorFactory	The factory that provides SentenceDetecor default implementations and resources
SentenceDetectorME	A sentence detector for splitting up raw text into sentences.
SentenceDetectorTool	A sentence detector which uses a maxent model to predict the sentences.
SentenceDetectorTrainerTool
SentenceEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
SentenceFeatureGenerator	This feature generator creates sentence begin and end features.
SentenceFeatureGeneratorFactory
SentenceModel	The `SentenceModel` is the model used by a learnable `SentenceDetector`.
SentenceSample	A `SentenceSample` contains a document with begin indexes of the individual sentences.
SentenceSampleStream	This class is a stream filter which reads a sentence by line samples from a `Reader` and converts them into `SentenceSample` objects.
SentenceSampleStreamFactory	Factory producing OpenNLP `SentenceSampleStream`s.
Sequence<T>	Class which models a sequence.
Sequence	Represents a weighted sequence of outcomes.
SequenceClassificationModel<T>	A classification model that can label an input sequence.
SequenceCodec<T>
SequenceStream	Interface for streams of sequences used to train sequence models.
SequenceStreamEventStream	Class which turns a sequence stream into an event stream.
SequenceTrainer
SequenceValidator<T>
SerializableArtifact
SgmlParser	SAX style SGML parser.
SgmlParser.ContentHandler
ShrinkCharSequenceNormalizer	Normalizer to shrink repeated spaces / chars
SimplePerceptronSequenceTrainer	Trains models for sequences using the perceptron algorithm.
SimpleTokenizer	Performs tokenization using character classes.
SimpleTokenizerTool
SnowballStemmer
SnowballStemmer.ALGORITHM
Span	Class for storing start and end integer offsets.
SpanAnnotation
Stemmer	The stemmer is reducing a word to its stem.
StringList	The `StringList` is an immutable list of `String`s.
StringPattern	Recognizes predefined patterns in strings.
StringUtil
SuffixFeatureGenerator
SuffixFeatureGeneratorFactory
TagDictionary	Interface to determine which tags are valid for a particular word based on a tag dictionary.
TaggerModelReplacerTool
TokenClassFeatureGenerator	Generates features for different for the class of the token.
TokenClassFeatureGeneratorFactory
TokenContextGenerator	Interface for `TokenizerME` context generators.
TokenEvaluationErrorListener	A default implementation of `EvaluationMonitor` that prints to an output stream.
TokenFeatureGenerator	Generates a feature which contains the token itself.
TokenFeatureGeneratorFactory
Tokenizer	The interface for tokenizers, which segment a string into its tokens.
TokenizerConverterTool
TokenizerCrossValidator
TokenizerCrossValidatorTool
TokenizerEvaluationMonitor
TokenizerEvaluator	The `TokenizerEvaluator` measures the performance of the given `Tokenizer` with the provided reference `TokenSample`s.
TokenizerFactory	The factory that provides `Tokenizer` default implementations and resources.
TokenizerME	A Tokenizer for converting raw text into separated tokens.
TokenizerMEEvaluatorTool
TokenizerMETool
TokenizerModel	The `TokenizerModel` is the model used by a learnable `Tokenizer`.
TokenizerModelLoader	Loads a Tokenizer Model for the command line tools.
TokenizerStream	The `TokenizerStream` uses a tokenizer to tokenize the input string and output `TokenSample`s.
TokenizerTrainerTool
TokenNameFinder	The interface for name finders which provide name tags for a sequence of tokens.
TokenNameFinderConverterTool	Tool to convert multiple data formats into native OpenNLP name finder training format.
TokenNameFinderCrossValidator
TokenNameFinderCrossValidatorTool
TokenNameFinderDetailedFMeasureListener
TokenNameFinderEvaluationMonitor
TokenNameFinderEvaluator	The `TokenNameFinderEvaluator` measures the performance of the given `TokenNameFinder` with the provided reference `NameSample`s.
TokenNameFinderEvaluatorTool
TokenNameFinderFactory
TokenNameFinderFineGrainedReportListener	Generates a detailed report for the NameFinder.
TokenNameFinderModel	The `TokenNameFinderModel` is the model used by a learnable `TokenNameFinder`.
TokenNameFinderModel.FeatureGeneratorCreationError
TokenNameFinderModelLoader	Loads a Token Name Finder Model for the command line tools.
TokenNameFinderTool
TokenNameFinderTrainerTool
TokenPatternFeatureGenerator	Partitions tokens into sub-tokens based on character classes and generates class features for each of the sub-tokens and combinations of those sub-tokens.
TokenPatternFeatureGeneratorFactory
TokenSample	A `TokenSample` is text with token spans.
TokenSampleStream	Class which produces an Iterator<TokenSample> from a file of space delimited token.
TokenSampleStream	This class is a stream filter which reads in string encoded samples and creates `TokenSample`s out of them.
TokenSampleStreamFactory	Factory producing OpenNLP `TokenSampleStream`s.
TokenTag
TokSpanEventStream	This class reads the `TokenSample`s from the given `Iterator` and converts the `TokenSample`s into `Event`s which can be used by the maxent library for training.
TrainerFactory
TrainerFactory.TrainerType
TrainingParameters
TrainingToolParams	Common training parameters.
TrigramNameFeatureGenerator	Adds trigram features based on tokens and token classes.
TrigramNameFeatureGeneratorFactory
TwentyNewsgroupSampleStream
TwentyNewsgroupSampleStreamFactory
TwitterCharSequenceNormalizer	Normalizer for Twitter character sequences
TwoPassDataIndexer	Collecting event and context counts by making two passes over the events.
UncloseableInputStream	An `InputStream` which cannot be closed.
UniformPrior	Provide a maximum entropy model with a uniform prior.
UrlCharSequenceNormalizer	Normalizer that removes URls and email addresses.
Version	The `Version` class represents the OpenNlp Tools library version.
WhitespaceTokenizer	This tokenizer uses white spaces to tokenize the input text.
WhitespaceTokenStream	This stream formats a `TokenSample`s into whitespace separated token strings.
WindowFeatureGenerator	Generates previous and next features for a given `AdaptiveFeatureGenerator`.
WindowFeatureGeneratorFactory
WordClusterDictionary
WordClusterDictionary.WordClusterDictionarySerializer
WordClusterFeatureGenerator
WordClusterFeatureGeneratorFactory	Defines a word cluster generator factory; it reads an element containing 'w2vwordcluster' as a tag name; these clusters are typically produced by word2vec or clark pos induction systems.
WordpieceTokenizer	A WordPiece tokenizer.
WordTagSampleStream	A stream filter which reads a sentence per line which contains words and tags in word_tag format and outputs a `POSSample` objects.
WordTagSampleStreamFactory	Note: Do not use this class, internal use only!
WordTagSampleStreamFactory.Parameters
WordVector	A word vector.
WordVectorTable	A table that maps tokens to word vectors.
WordVectorType
XmlUtil