All Classes and Interfaces

Class
Description
Base class for sample stream factories.
 
Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the Portuguese Chunker training.
A Factory to create a Arvores Deitadas ChunkStream from the command line utility.
 
Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the Portuguese NER training.
A Factory to create a Arvores Deitadas NameSampleDataStream from the command line utility.
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
Stream filter which merges text lines into sentences, following the Arvores Deitadas syntax.
 
Parses a sample of AD corpus.
Represents the AD leaf
Represents the AD node
Represents a tree element, Node or Leaf
Note: Do not use this class, internal use only!
 
Encapsulates a type to class mapping for entities, relations, events, etc.
 
 
A sample stream for the training files of the BioNLP/NLPBA 2004 shared task.
 
 
 
Reads the annotations from the brat .ann annotation file.
Brat (brat rapid annotation tool) is based on the stav visualiser which was originally made in order to visualise BioNLP'11 Shared Task data.
 
 
Generates Name Sample objects for a Brat Document object.
Note: Do not use this class, internal use only!
 
Factory producing OpenNLP ChunkSampleStreams.
 
Parser for the Dutch and Spanish ner training files of the CONLL 2002 shared task.
 
Note: Do not use this class, internal use only!
 
An import stream which can parse the CONLL03 data.
 
 
 
 
Note: Do not use this class, internal use only!
 
 
Note: Do not use this class, internal use only!
 
 
 
Note: Do not use this class, internal use only!
 
The CoNNL-U Format is specified here.
 
 
Note: Do not use this class, internal use only!
 
 
Parses the data from the CONLL 06 shared task into POS Samples.
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
 
 
Note: Do not use this class, internal use only!
Base class for factories which need a Detokenizer.
 
The directory sample stream allows for creating an opennlp.tools.util.ObjectStream<File> from a directory listing of files.
Factory producing OpenNLP DocumentSampleStreams.
 
Reads a plain text file and return each line as a String object.
Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.
 
Note: Do not use this class, internal use only!
 
 
Note: Do not use this class, internal use only!
Provides the ability to read the contents of files contained in an object stream of files.
Utility class for the OpenNLP formats package.
Parser for the GermEval 2014 Named Entity Recognition Shared Task data.
Selects which NER annotation layer to read from the GermEval 2014 data.
Note: Do not use this class, internal use only!
 
A structure to hold an Irish Sentence Bank document, which is a collection of tokenized sentences.
 
 
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
 
Factory producing OpenNLP lang detector sample streams.
 
Stream factory for those streams which carry language.
 
Note: Do not use this class, internal use only!
 
Factory producing OpenNLP LemmaSampleStreams.
 
A structure to hold the letsmt document.
A content handler to receive and process SAX events.
 
Note: Do not use this class, internal use only!
 
A factory that creates MarkableFileInputStream from a File
A simple marker interface for classes that support or refer to the Masc.MASC_FORMAT.
 
 
A class to process the MASC Named entity stand-off annotation file
 
Note: Do not use this class, internal use only!
 
A class for parsing MASC's Penn tagging/tokenization stand-off annotation
 
Note: Do not use this class, internal use only!
 
 
 
Note: Do not use this class, internal use only!
 
A specialized Span to express tokens in documents.
 
Note: Do not use this class, internal use only!
 
 
Moses is a statistical machine translation system that allows you to automatically train translation models for any language pair.
Factory producing OpenNLP MosesSentenceSampleStream objects.
 
 
 
 
 
This class helps to read the US Census data from the files to build a StringList for each dictionary entry in the name-finder dictionary.
Factory producing OpenNLP NameSampleDataStreams.
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
 
 
 
Note: Do not use this class, internal use only!
 
The National corpus of Polish (NKJP) format.
 
Name Sample Stream parser for the OntoNotes 4.0 named entity files.
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
A FilterObjectStream which merges text lines into paragraphs.
Factory producing OpenNLP ParseSampleStreams.
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
 
Utility class to handle Portuguese contractions.
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!
Note: Do not use this class, internal use only!
 
 
 
 
Factory producing OpenNLP SentenceSampleStreams.
 
Factory for creating a sample stream factory for sentiment analysis.
A SAX style SGML parser.
Defines methods to handle content produced by a SgmlParser.
 
Factory producing OpenNLP TokenSampleStreams.
 
An ObjectStream implementation for the Twenty Newsgroups text corpus.
Note: Do not use this class, internal use only!
 
Note: Do not use this class, internal use only!