Language Detector Model for Apache OpenNLP released

The Apache OpenNLP library is a machine-learning-based toolkit for processing natural language text.

The Apache OpenNLP team is pleased to announce the release of Language Detector Model 1.8.3 for Apache OpenNLP 1.8.3. The Language Detector Model can detect 103 languages and outputs ISO 639-3 codes.

The model and its reports are available for download from our model download page: http://opennlp.apache.org/models.html

This is the first release of the Language Detector Model. It is compatible with Apache OpenNLP 1.8.3 or later.

Note that this model is trained for, and works well with, longer texts containing at least two sentences in the same language.

More information about this release can be found in the README.txt at: https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/README.txt

Details about this model's effectiveness can be found in the following report: https://www.apache.org/dist/opennlp/models/langdetect/1.8.3/langdetect-183.bin.report.txt

--The Apache OpenNLP Team

02 November 2017