Package opennlp.tools.langdetect
Class DefaultLanguageDetectorContextGenerator
- java.lang.Object
  - opennlp.tools.langdetect.DefaultLanguageDetectorContextGenerator
- All Implemented Interfaces:
Serializable, LanguageDetectorContextGenerator
public class DefaultLanguageDetectorContextGenerator extends Object implements LanguageDetectorContextGenerator
A context generator for a language detector.
- See Also:
Serialized Form
Constructor Summary
Constructor: DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
Description: Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
Method Summary
Modifier and Type: String[]
Method: getContext(CharSequence document)
Description: Retrieves the contexts for a document using character ngrams.
Constructor Detail
DefaultLanguageDetectorContextGenerator
public DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
- Parameters:
minLength - The minimum ngram length in characters. Must be greater than 0.
maxLength - The maximum ngram length in characters. Must be greater than 0 and greater than minLength.
normalizers - Zero or more normalizers to be applied to the text before ngrams are extracted.
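The normalizers parameter is a varargs list, so a caller may pass none, one, or several. The following is a minimal, library-free sketch of that idea, assuming the normalizers are applied one after another in the order given. The class NormalizerChainSketch, the method normalizeAll, and the use of UnaryOperator as a stand-in for CharSequenceNormalizer are all hypothetical and are not part of OpenNLP.

```java
import java.util.Locale;
import java.util.function.UnaryOperator;

public class NormalizerChainSketch {

    // Hypothetical stand-in: applies zero or more normalizers in the order given.
    // With no normalizers, the text is returned unchanged.
    @SafeVarargs
    public static String normalizeAll(CharSequence text,
                                      UnaryOperator<CharSequence>... normalizers) {
        CharSequence current = text;
        for (UnaryOperator<CharSequence> normalizer : normalizers) {
            current = normalizer.apply(current);
        }
        return current.toString();
    }

    public static void main(String[] args) {
        // Two illustrative normalizers (assumptions, not OpenNLP classes).
        UnaryOperator<CharSequence> lowerCase =
                s -> s.toString().toLowerCase(Locale.ROOT);
        UnaryOperator<CharSequence> collapseSpaces =
                s -> s.toString().replaceAll("\\s+", " ").trim();

        System.out.println(normalizeAll("  Hello   World ", lowerCase, collapseSpaces));
        System.out.println(normalizeAll("Unchanged"));
    }
}
```

Order can matter when normalizers interact, for example when one removes characters that another would otherwise rewrite.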
Method Detail
getContext
public String[] getContext(CharSequence document)
Description copied from interface: LanguageDetectorContextGenerator
Retrieves the contexts for a document using character ngrams.
- Specified by:
getContext in interface LanguageDetectorContextGenerator
- Parameters:
document - The textual input from which contexts are extracted.
- Returns:
An array of contexts on which a model bases its decisions.
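To illustrate what "contexts using character ngrams" can look like, here is a self-contained sketch that collects every distinct substring whose length lies between minLength and maxLength. It is an approximation under stated assumptions, not the OpenNLP implementation: the class CharNGramSketch and the method charNGrams are hypothetical, the argument check simply mirrors the documented parameter constraints, and the real class may order or deduplicate its contexts differently.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class CharNGramSketch {

    // Hypothetical helper: returns the distinct character ngrams of the document,
    // shortest first, each length scanned left to right.
    public static String[] charNGrams(CharSequence document, int minLength, int maxLength) {
        // Mirrors the documented constraints; enforcement here is an assumption.
        if (minLength <= 0 || maxLength <= minLength) {
            throw new IllegalArgumentException("Expected 0 < minLength < maxLength");
        }
        Set<String> contexts = new LinkedHashSet<>();
        for (int n = minLength; n <= maxLength; n++) {
            for (int start = 0; start + n <= document.length(); start++) {
                contexts.add(document.subSequence(start, start + n).toString());
            }
        }
        return contexts.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Prints: a,b,ab,ba
        System.out.println(String.join(",", charNGrams("abab", 1, 2)));
    }
}
```

A document shorter than minLength yields an empty array in this sketch, because no window of the required length fits.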