Class LanguageDetectorME
- java.lang.Object
-
- opennlp.tools.langdetect.LanguageDetectorME
-
- All Implemented Interfaces:
Serializable,LanguageDetector
public class LanguageDetectorME extends Object implements LanguageDetector
Implements learnable Language DetectorThis will process the entire string when called with
predictLanguage(CharSequence)orpredictLanguages(CharSequence).If you want this to stop early, use
probingPredictLanguages(CharSequence)orprobingPredictLanguages(CharSequence, LanguageDetectorConfig). When run in probing mode, this starts at the beginning of the charsequence and runs language detection on chunks of text. If the end of the string is reached or there areLanguageDetectorConfig.getMinConsecImprovements()consecutive predictions for the best language and the confidence increases over those last predictions and if the difference in confidence between the highest confidence language and the second highest confidence language is greater thanLanguageDetectorConfig.getMinDiff(), the language detector will stop and report the results.The authors wish to thank Ken Krugler and Yalder} for the inspiration for many of the design components of this detector.
- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description LanguageDetectorME(LanguageDetectorModel model)Initializes the current instance with a language detector model.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description String[]getSupportedLanguages()LanguagepredictLanguage(CharSequence content)This will process the full content length.Language[]predictLanguages(CharSequence content)This will process the full content length.ProbingLanguageDetectionResultprobingPredictLanguages(CharSequence content)This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.ProbingLanguageDetectionResultprobingPredictLanguages(CharSequence content, LanguageDetectorConfig config)This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.static LanguageDetectorModeltrain(ObjectStream<LanguageSample> samples, TrainingParameters mlParams, LanguageDetectorFactory factory)
-
-
-
Constructor Detail
-
LanguageDetectorME
public LanguageDetectorME(LanguageDetectorModel model)
Initializes the current instance with a language detector model. Default feature generation is used.- Parameters:
model- the language detector model
-
-
Method Detail
-
predictLanguages
public Language[] predictLanguages(CharSequence content)
This will process the full content length.- Specified by:
predictLanguagesin interfaceLanguageDetector- Parameters:
content-- Returns:
- the predicted languages
-
predictLanguage
public Language predictLanguage(CharSequence content)
This will process the full content length.- Specified by:
predictLanguagein interfaceLanguageDetector- Parameters:
content-- Returns:
- the language with the highest confidence
-
getSupportedLanguages
public String[] getSupportedLanguages()
- Specified by:
getSupportedLanguagesin interfaceLanguageDetector
-
probingPredictLanguages
public ProbingLanguageDetectionResult probingPredictLanguages(CharSequence content)
This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.- Parameters:
content- content to be processed- Returns:
- result
-
probingPredictLanguages
public ProbingLanguageDetectionResult probingPredictLanguages(CharSequence content, LanguageDetectorConfig config)
This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.- Parameters:
content- content to processconfig- config to customize detection- Returns:
-
train
public static LanguageDetectorModel train(ObjectStream<LanguageSample> samples, TrainingParameters mlParams, LanguageDetectorFactory factory) throws IOException
- Throws:
IOException
-
-