The Apache OpenNLP team is pleased to announce the release of version 3.0.0-M1 of Apache OpenNLP. The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, and parsing.
The OpenNLP 3.0.0-M1 binary and source distributions are available for download from our download page: https://opennlp.apache.org/download.html The OpenNLP library is distributed by Maven Central as well. See the Maven Dependency page for more details: https://opennlp.apache.org/maven-dependency.html Changes in this version: The 3.x release line of Apache OpenNLP introduces no known breaking changes while significantly modularizing the project to improve library usage and future extensibility. The core API remains stable and fully compatible with 2.x, so existing projects can continue using the opennlp-tools artifact without modifications. Key Highlights and Recommendations: - Modularization: The project is now organized into multiple modules: opennlp-api, opennlp-core, opennlp-cli, opennlp-extensions, ML modules (e.g., opennlp-ml-maxent, opennlp-ml-perceptron), and more. - Users can include only the modules needed, reducing dependency footprint. - Only opennlp-runtime is mandatory for basic functionality. - CLI Stability: Existing command-line usage remains unchanged. For a complete list of fixed bugs and improvements please see the RELEASE_NOTES file included in the distribution. The Apache OpenNLP Team
