RELEASE_NOTES.html (61 lines of code) (raw):

<html> <!-- *************************************************************** * Licensed to the Apache Software Foundation (ASF) under one * or more contributor license agreements. See the NOTICE file * distributed with this work for additional information * regarding copyright ownership. The ASF licenses this file * to you under the Apache License, Version 2.0 (the * "License"); you may not use this file except in compliance * with the License. You may obtain a copy of the License at * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, * software distributed under the License is distributed on an * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY * KIND, either express or implied. See the License for the * specific language governing permissions and limitations * under the License. *************************************************************** --> <head> <title>Apache Distributed UIMA Cluster Computing (DUCC) 3.0.0 Release Notes</title> </head> <body> <h1>Apache UIMA-DUCC (Unstructured Information Management Architecture - Distributed UIMA Cluster Computing ) v.3.0.0 Release Notes</h1> <h2>Contents</h2> <p> <a href="#what.is.uima-ducc">1. What is UIMA-DUCC?</a><br/> <a href="#major.changes">2. Major Changes in this Release</a><br/> <a href="#migration">3. Migration from a Prior Release</a><br/> <a href="#migration">4. Limitations</a><br/></p> <h2><a name="what.is.uima-ducc">1. What is UIMA-DUCC?</a></h2> <p> DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster management system providing tooling, management, and scheduling facilities to automate the scale-out of applications written to the UIMA framework. Core UIMA provides a generalized framework for applications that process unstructured information such as human language, but does not provide a scale-out mechanism. UIMA-AS provides a scale-out mechanism to distribute UIMA pipelines over a cluster of computing resources, but does not provide job or cluster management of the resources. DUCC defines a formal job model that closely maps to a standard UIMA pipeline. Around this job model DUCC provides cluster management services to automate the scale-out of UIMA pipelines over computing clusters. </p> <h2><a name="major.changes">2. Major Changes in this Release</a></h2> <p> Apache UIMA DUCC 3.0.0 is a maintenance release containing new features and bug fixes. What's new:<br> <ul> <li>Support for UIMA v2 and v3</li> <li>Added support for reliable DUCC - automatic head node failover</li> <li>Created new pull service that can be run with or without DUCC</li> <li>Enable DUCC to run without shared file system</li> <li>Add new DUCC stop options, including quiesce</li> <li>Add "CASes processed" to annotator performance metrics</li> <li>Upgraded to Cassandra Server v.3.11.3, Cassandra Driver v.3.6.0, Jetty v.9.4.14.v20181114, guava v.18.0, joda v.2.4, commons.lang v.3.1, commons.math v.3.2, netty v.4.0.44, snappy v.1.1.1.7</li> </ul> <br><br> For a complete list of issues fixed and up-to-date information on UIMA-DUCC issues, see our issue tracker: <a href="https://issues.apache.org/jira/issues/?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%223.0.0-Ducc%22%20">https://issues.apache.org/jira/issues/?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%223.0.0-Ducc%22%20</a> </p> <h2><a name="migration">3. Migration from a Prior Release</a></h2> When upgrading from an existing installation the ducc_update script may be used to replace the system files while leaving the site-specific configuration files in place. For more information see <strong>ducc_update</strong> in the Administrative Commands section of the DuccBook. <p> </p> <h2><a name="limitations">4. Limitations</a></h2> On some systems cgroups swap accounting is not enabled and duccmon will show N/A for swap. To confirm, please check memory.stat file in <cgroups base dir>/ducc/ folder. If swap accounting is enabled there should be "swap" property defined. If it's missing, you need to add a kernel parameter swapaccount=1. Details of how to do this can be found <a href="http://unix.stackexchange.com/questions/147158/how-to-enable-swap-accounting-for-memory-cgroup-in-archlinux">here</a>. <p> Due to a bug in uima sdk, the uima AnalysisEngineProcessException cannot be serialized as a Java object. If your analysis engine throws an exception in process(), the ducc framework will stringify it and wrap it in java RuntimeException. If you have a custom error handler plugged in into a job driver you will not be able to test for AnalysisEngineProcessException in a stack trace with a code like this: if ( error instanceof AnalysisEngineProcessException ) ... </p> <p> To use OS-based login with the WebServer while running DUCC with IBM java, the minimum JDK version is Java 8 SR4 FP5 (8.0.4.5). </p> </body> </html>