Package opennlp.tools.langdetect
Class DefaultLanguageDetectorContextGenerator
java.lang.Object
  opennlp.tools.langdetect.DefaultLanguageDetectorContextGenerator
All Implemented Interfaces:
Serializable, LanguageDetectorContextGenerator
public class DefaultLanguageDetectorContextGenerator extends Object implements LanguageDetectorContextGenerator
A context generator for the language detector.
See Also:
- Serialized Form
Constructor Summary
Constructor: DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
Description: Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
Method Summary
Modifier and Type: String[]
Method: getContext(CharSequence document)
Description: Retrieves the contexts for a document using character ngrams.
Constructor Detail
DefaultLanguageDetectorContextGenerator
public DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
Parameters:
minLength - The minimum ngram length, in characters. Must be greater than 0.
maxLength - The maximum ngram length, in characters. Must be greater than 0 and greater than minLength.
normalizers - Zero or more normalizers to be applied to the text before extracting ngrams.
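The parameter rules above can be illustrated with a small, self-contained sketch. It does not use OpenNLP: the class NormalizerChainSketch, its methods, and the use of UnaryOperator<CharSequence> as a stand-in for CharSequenceNormalizer are all hypothetical. Whether the real constructor throws on invalid lengths, and the order in which it applies normalizers, are assumptions made here for illustration only.

```java
import java.util.function.UnaryOperator;

// Hypothetical sketch, not the OpenNLP implementation.
class NormalizerChainSketch {

    // Mirrors the documented constraints on minLength and maxLength.
    // Assumption: violations are reported with IllegalArgumentException.
    static void checkLengths(int minLength, int maxLength) {
        if (minLength <= 0 || maxLength <= 0) {
            throw new IllegalArgumentException("lengths must be greater than 0");
        }
        if (maxLength <= minLength) {
            throw new IllegalArgumentException("maxLength must be greater than minLength");
        }
    }

    // Applies each normalizer in the order given (an assumption);
    // with zero normalizers the text is returned unchanged.
    @SafeVarargs
    static CharSequence normalize(CharSequence text,
                                  UnaryOperator<CharSequence>... normalizers) {
        CharSequence result = text;
        for (UnaryOperator<CharSequence> normalizer : normalizers) {
            result = normalizer.apply(result);
        }
        return result;
    }

    public static void main(String[] args) {
        checkLengths(1, 3);
        CharSequence cleaned = normalize("  Hello   World ",
                s -> s.toString().trim(),                    // strip outer whitespace
                s -> s.toString().replaceAll("\\s+", " "));  // collapse inner runs
        System.out.println(cleaned);
    }
}
```

In this sketch the normalized text, not the raw input, is what would be handed to the ngram extraction step.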
Method Detail
getContext
public String[] getContext(CharSequence document)
Description copied from interface: LanguageDetectorContextGenerator
Retrieves the contexts for a document using character ngrams.
Specified by:
getContext in interface LanguageDetectorContextGenerator
Parameters:
document - The textual input used to extract context from.
Returns:
An array of contexts on which a model bases its decisions.
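To make "contexts using character ngrams" concrete, here is a minimal sketch of character ngram extraction in plain Java. The class CharNGramSketch and its method charNGrams are hypothetical names and do not reproduce the library's code. In particular, this sketch keeps duplicates and emits ngrams ordered by length and then by position; the actual class may deduplicate or order its output differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the OpenNLP implementation.
class CharNGramSketch {

    // Collects every substring whose length lies in [minLength, maxLength],
    // shortest ngrams first, each length scanned left to right.
    static String[] charNGrams(CharSequence document, int minLength, int maxLength) {
        List<String> contexts = new ArrayList<>();
        for (int n = minLength; n <= maxLength; n++) {
            for (int start = 0; start + n <= document.length(); start++) {
                contexts.add(document.subSequence(start, start + n).toString());
            }
        }
        return contexts.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Prints a, b, c, ab, bc (one per line).
        for (String context : charNGrams("abc", 1, 2)) {
            System.out.println(context);
        }
    }
}
```

A document shorter than minLength yields an empty array in this sketch, since no window of the required size fits.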