The Apache OpenNLP team is pleased to announce the release of Apache OpenNLP 1.9.0.
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.
It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution.
Apache OpenNLP 1.9.0 binary and source distributions are available for download from our download page: download page
The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: Maven Dependency
This release introduces new features, improvements and bug fixes. Java 1.8 and Maven 3.3.9 are required.
Additionally, the release contains the following changes:
Brat Document Parser should support name type filters
Brat format support fails on multi fragment annotations
Remove MD5 hashes from Release process
Use String[] instead of StringList in LanguageModel API
BRAT Annotator service Fails to start
Token model creation fails without at least one <SPLIT> tag
Update Penn Treebank URL
Explain the new format of feature generator XML config
Unify code to sum up input context features
FeatureGeneratorUtil can recognize Japanese Hiragana and Katakana letters
A detailed list of the issues related to this release can be found in the release notes.
For a complete list of fixed bugs and improvements please see the README.html file included in the distribution.
--The Apache OpenNLP Team
02 July 2018