Class LanguageDetectorME
- All Implemented Interfaces:
- LanguageDetector
LanguageDetector.
 
 This will process the entire string when called with
 predictLanguage(CharSequence) or
 predictLanguages(CharSequence).
 
 If you want this to stop early, use probingPredictLanguages(CharSequence)
 or probingPredictLanguages(CharSequence, LanguageDetectorConfig).
 When run in probing mode, this starts at the beginning of the char sequence
 and runs language detection on chunks of text.  If the end of the
 string is reached or there are LanguageDetectorConfig.getMinConsecImprovements()
 consecutive predictions for the best language and the confidence
 increases over those last predictions and if the difference
 in confidence between the highest confidence language
 and the second highest confidence language is greater than
 LanguageDetectorConfig.getMinDiff(), the language detector will
 stop and report the results.
 
The authors wish to thank Ken Krugler and Yalder} for the inspiration for many of the design components of this detector.
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionString[]predictLanguage(CharSequence content) Predicts theLanguagefor the fullcontentlength.Language[]predictLanguages(CharSequence content) Predicts thelanguagesfor the fullcontentlength.probingPredictLanguages(CharSequence content) This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.probingPredictLanguages(CharSequence content, LanguageDetectorConfig config) This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.static LanguageDetectorModeltrain(ObjectStream<LanguageSample> samples, TrainingParameters mlParams, LanguageDetectorFactory factory) Starts a training of aLanguageDetectorModelwith the given parameters.
- 
Constructor Details- 
LanguageDetectorMEInitializes an instance with a specificLanguageDetectorModel. Default feature generation is used.- Parameters:
- model- the- LanguageDetectorModelto be used.
 
 
- 
- 
Method Details- 
predictLanguagesDescription copied from interface:LanguageDetectorPredicts thelanguagesfor the fullcontentlength.- Specified by:
- predictLanguagesin interface- LanguageDetector
- Parameters:
- content- The textual content to detect potential- languagesfrom.
- Returns:
- the predicted languages
 
- 
predictLanguageDescription copied from interface:LanguageDetectorPredicts theLanguagefor the fullcontentlength.- Specified by:
- predictLanguagein interface- LanguageDetector
- Parameters:
- content- The textual content to detect potential- languagesfrom.
- Returns:
- the language with the highest confidence
 
- 
getSupportedLanguages- Specified by:
- getSupportedLanguagesin interface- LanguageDetector
- Returns:
- Retrieves an array of language (codes) that are supported by a LanguageDetector.
 
- 
probingPredictLanguagesThis will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.- Parameters:
- content- content to be processed
- Returns:
- A computed ProbingLanguageDetectionResult.
 
- 
probingPredictLanguagespublic ProbingLanguageDetectionResult probingPredictLanguages(CharSequence content, LanguageDetectorConfig config) This will stop processing early if the stopping criteria specified inLanguageDetectorConfig.DEFAULT_LANGUAGE_DETECTOR_CONFIGare met.- Parameters:
- content- The textual content to process.
- config- The- LanguageDetectorConfigto customize detection.
- Returns:
- A computed ProbingLanguageDetectionResult.
 
- 
trainpublic static LanguageDetectorModel train(ObjectStream<LanguageSample> samples, TrainingParameters mlParams, LanguageDetectorFactory factory) throws IOException Starts a training of aLanguageDetectorModelwith the given parameters.- Parameters:
- samples- The- ObjectStreamof- LanguageSampleused as input for training.
- mlParams- The- TrainingParametersfor the context of the training.
- factory- The- LanguageDetectorFactoryfor creating related objects defined via- mlParams.
- Returns:
- A valid, trained LanguageDetectorModelinstance.
- Throws:
- IOException- Thrown if IO errors occurred.
 
 
-