The Apache UIMA team is pleased to announce the release of the Apache UIMA-DUCC version 2.2.1.
The Unstructured Information Management Architecture (UIMA) is a component framework supporting development, discovery, composition, and deployment of multi-modal analytics tasked with the analysis of unstructured information. Apache UIMA is an Apache licensed open source implementation of the UIMA specification which is being developed by a technical committee within OASIS, a standards organization. The implementation comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL. DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster management system providing tooling,management, and scheduling facilities to automate the scale-out of applications written to the UIMA framework. Core UIMA provides a generalized framework for applications that process unstructured information such as human language, but does not provide a scale-out mechanism. UIMA-AS provides a scale-out mechanism to distribute UIMA pipelines over a cluster of computing resources, but does not provide job or cluster management of the resources. DUCC defines a formal job model that closely maps to a standard UIMA pipeline. Around this job model DUCC provides cluster management services to automate the scale-out of UIMA pipelines over computing clusters. This release contains a number of improvements and bug fixes. Notable updates in this release include: -The userid of a privileged DUCC installation does not have to be "ducc" -ducc-mon login can be used on systems where users do not have password login -The DUCC head-node daemons may be moved to another host without breaking working applications -The deployment descriptor for a UIMA-AS service can be loaded from the classpath -Interactive applications run correctly with viaducc (fixed lost inputs) -Files created by DUCC jobs inherit the permissions of the launching shell's umask. -DUCC performance breakdown for scaled synchronous pipelines is now correct -Fixed javadoc method headers to enable ducc build with java 8 -Fixed JP communication threads wait logic when JD returns no work -GC stats not available sometimes from remote JP For a complete list of bugs and improvements included in this release please see https://uima.apache.org/d/uima-ducc-2.2.1/issuesFixed/jira-report.html -- Jerry Cwiklik, for the Apache UIMA development team