Package opennlp.tools.langdetect
Class DefaultLanguageDetectorContextGenerator
java.lang.Object
opennlp.tools.langdetect.DefaultLanguageDetectorContextGenerator
- All Implemented Interfaces:
Serializable
,LanguageDetectorContextGenerator
public class DefaultLanguageDetectorContextGenerator
extends Object
implements LanguageDetectorContextGenerator
A context generator for
language detector
.- See Also:
-
Constructor Summary
ConstructorDescriptionDefaultLanguageDetectorContextGenerator
(int minLength, int maxLength, CharSequenceNormalizer... normalizers) Creates a customizableDefaultLanguageDetectorContextGenerator
that computes ngrams from text. -
Method Summary
Modifier and TypeMethodDescription<T extends CharSequence>
T[]getContext
(CharSequence document) Retrieves the contexts for adocument
using character ngrams.
-
Constructor Details
-
DefaultLanguageDetectorContextGenerator
public DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers) Creates a customizableDefaultLanguageDetectorContextGenerator
that computes ngrams from text.- Parameters:
minLength
- The min number of ngrams characters. Must be greater than0
.maxLength
- The max number of ngrams characters. Must be greater than0
and must be greater thanminLength
.normalizers
- Zero or more normalizers to be applied in to the text before extracting ngrams.
-
-
Method Details
-
getContext
Description copied from interface:LanguageDetectorContextGenerator
Retrieves the contexts for adocument
using character ngrams.- Specified by:
getContext
in interfaceLanguageDetectorContextGenerator
- Parameters:
document
- The textual input used to extract context from.- Returns:
- An array of contexts on which a model basis its decisions.
-