Apache OpenNLP 1.9.1 released

The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.1.

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.

It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.

Apache OpenNLP 1.9.1 binary and source distributions are available from our download page.

The OpenNLP library is also distributed through Maven Central. See the Maven Dependency page for more details.

What’s new in Apache OpenNLP 1.9.1

  • Add TrigramNameFeatureGeneratorFactory.

  • Documentation updates.

  • Unit test improvements.

  • TokenFeatureGeneratorFactory now allows setting the lowercase flag.

  • Use ja for Japanese language code rather than jp.

  • Use hash to avoid linear search in DefaultEndOfSentenceScanner.

  • OpenNLP now allows setting the heap size.

  • Builds with Java 11.

  • Use daemon threads in executor services.

  • Allow for iterating through word vector table tokens.
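
To illustrate the idea behind the DefaultEndOfSentenceScanner item above, here is a minimal sketch of replacing a linear search over candidate characters with a hash lookup. It is not the library's actual code: the class name `EosScanSketch`, both method names, and the three end-of-sentence characters are assumptions made for this example only.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class EosScanSketch {

    // Hypothetical end-of-sentence characters, chosen for illustration only.
    private static final char[] EOS_CHARS = {'.', '!', '?'};

    // Built once, so each membership test is a constant-time lookup on average.
    private static final Set<Character> EOS_SET = new HashSet<>();

    static {
        for (char c : EOS_CHARS) {
            EOS_SET.add(c);
        }
    }

    // Linear variant: each character of the text is compared against every candidate.
    static List<Integer> scanLinear(CharSequence text) {
        List<Integer> positions = new ArrayList<>();
        for (int i = 0; i < text.length(); i++) {
            for (char eos : EOS_CHARS) {
                if (text.charAt(i) == eos) {
                    positions.add(i);
                    break;
                }
            }
        }
        return positions;
    }

    // Hash variant: one set lookup per character of the text.
    static List<Integer> scanHashed(CharSequence text) {
        List<Integer> positions = new ArrayList<>();
        for (int i = 0; i < text.length(); i++) {
            if (EOS_SET.contains(text.charAt(i))) {
                positions.add(i);
            }
        }
        return positions;
    }

    public static void main(String[] args) {
        String text = "Hello there. How are you? Fine!";
        System.out.println(scanHashed(text)); // prints [11, 24, 30]
    }
}
```

Both variants return the same positions. The difference only becomes noticeable when the set of candidate characters is large, since the linear variant's cost per character grows with the number of candidates.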

A detailed list of the issues related to this release can be found in the release notes.

For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.

--The Apache OpenNLP Team

31 December 2018