Package | Description |
---|---|
opennlp.tools.chunker |
Package related to finding non-recursive syntactic annotation such as noun phrase chunks.
|
opennlp.tools.doccat |
Package for classifying a document into a category.
|
opennlp.tools.formats |
Experimental package related to converting various corpora to OpenNLP Format.
|
opennlp.tools.formats.ad | |
opennlp.tools.formats.brat | |
opennlp.tools.formats.convert | |
opennlp.tools.formats.frenchtreebank | |
opennlp.tools.formats.letsmt | |
opennlp.tools.formats.moses | |
opennlp.tools.formats.muc | |
opennlp.tools.formats.ontonotes | |
opennlp.tools.lemmatizer |
Package related with the lemmatizer tool
|
opennlp.tools.namefind |
Package related to finding proper names and numeric amounts.
|
opennlp.tools.parser |
Package containing common code for performing full syntactic parsing.
|
opennlp.tools.postag |
Package related to part-of-speech tagging.
|
opennlp.tools.sentdetect |
Package related to identifying sentece boundries.
|
opennlp.tools.tokenize |
Contains classes related to finding token or words in a string.
|
opennlp.tools.util |
Package containing utility data structures and algorithms used by multiple other packages.
|
Modifier and Type | Class and Description |
---|---|
class |
ChunkSampleStream
Parses the conll 2000 shared task shallow parser training data.
|
Modifier and Type | Class and Description |
---|---|
class |
DocumentSampleStream
This class reads in string encoded training samples, parses them and
outputs
DocumentSample objects. |
Modifier and Type | Class and Description |
---|---|
class |
ConllXPOSSampleStream
Parses the data from the CONLL 06 shared task into POS Samples.
|
class |
LeipzigDoccatSampleStream
Stream filter to produce document samples out of a Leipzig sentences.txt file.
|
Modifier and Type | Class and Description |
---|---|
class |
ADSentenceStream
Stream filter which merges text lines into sentences, following the Arvores
Deitadas syntax.
|
Modifier and Type | Class and Description |
---|---|
class |
BratNameSampleStream
Generates Name Sample objects for a Brat Document object.
|
class |
SegmenterObjectStream<S,T> |
Modifier and Type | Class and Description |
---|---|
class |
AbstractToSentenceSampleStream<T> |
class |
FileToByteArraySampleStream |
class |
FileToStringSampleStream |
class |
NameToSentenceSampleStream
Note: Do not use this class, internal use only!
|
class |
NameToTokenSampleStream
Note: Do not use this class, internal use only!
|
class |
ParseToPOSSampleStream
Note: Do not use this class, internal use only!
|
class |
POSToSentenceSampleStream
Note: Do not use this class, internal use only!
|
class |
POSToTokenSampleStream
Note: Do not use this class, internal use only!
|
Modifier and Type | Class and Description |
---|---|
class |
ConstitParseSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
DetokenizeSentenceSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
MosesSentenceSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
MucNameSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
DocumentToLineStream
Reads a plain text file and return each line as a
String object. |
class |
OntoNotesNameSampleStream
Name Sample Stream parser for the OntoNotes 4.0 corpus.
|
class |
OntoNotesParseSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
LemmaSampleStream
Reads data for training and testing the lemmatizer.
|
Modifier and Type | Class and Description |
---|---|
class |
NameSampleDataStream
The
NameSampleDataStream class converts tagged String s
provided by a DataStream to NameSample objects. |
class |
NameSampleTypeFilter
A stream which removes Name Samples which do not have a certain type.
|
Modifier and Type | Class and Description |
---|---|
class |
ParseSampleStream |
class |
PosSampleStream |
Modifier and Type | Class and Description |
---|---|
class |
WordTagSampleStream
A stream filter which reads a sentence per line which contains
words and tags in word_tag format and outputs a
POSSample objects. |
Modifier and Type | Class and Description |
---|---|
class |
EmptyLinePreprocessorStream
Stream to to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start - Transforms multiple empty lines in a row into one - Replaces white space lines with empty lines - TODO: Terminates last document with empty line if it is missing This stream should be used by the components that mark empty lines to mark document boundaries. |
class |
SentenceSampleStream
This class is a stream filter which reads a sentence by line samples from
a
Reader and converts them into SentenceSample objects. |
Modifier and Type | Class and Description |
---|---|
class |
TokenSampleStream
This class is a stream filter which reads in string encoded samples and creates
TokenSample s out of them. |
class |
WhitespaceTokenStream
This stream formats a
TokenSample s into whitespace
separated token strings. |
Modifier and Type | Class and Description |
---|---|
class |
EventTraceStream |
class |
ParagraphStream
Stream filter which merges text lines into paragraphs.
|
Copyright © 2017 The Apache Software Foundation. All rights reserved.