Package | Description |
---|---|
opennlp.tools.formats |
Experimental package related to converting various corpora to OpenNLP Format.
|
opennlp.tools.formats.convert | |
opennlp.tools.formats.letsmt | |
opennlp.tools.sentdetect |
Package related to identifying sentece boundries.
|
opennlp.tools.tokenize |
Contains classes related to finding token or words in a string.
|
Modifier and Type | Method and Description |
---|---|
protected Detokenizer |
DetokenizerSampleStreamFactory.createDetokenizer(opennlp.tools.cmdline.params.DetokenizerParameter p) |
Constructor and Description |
---|
NameToSentenceSampleStream(Detokenizer detokenizer,
ObjectStream<NameSample> samples,
int chunkSize) |
NameToTokenSampleStream(Detokenizer detokenizer,
ObjectStream<NameSample> samples) |
POSToSentenceSampleStream(Detokenizer detokenizer,
ObjectStream<POSSample> samples,
int chunkSize) |
POSToTokenSampleStream(Detokenizer detokenizer,
ObjectStream<POSSample> samples) |
Constructor and Description |
---|
DetokenizeSentenceSampleStream(Detokenizer detokenizer,
ObjectStream<SentenceSample> samples) |
Constructor and Description |
---|
SentenceSample(Detokenizer detokenizer,
String[][] sentences) |
Modifier and Type | Class and Description |
---|---|
class |
DictionaryDetokenizer
A rule based detokenizer.
|
Constructor and Description |
---|
TokenSample(Detokenizer detokenizer,
String[] tokens) |
Copyright © 2017 The Apache Software Foundation. All rights reserved.