ConceptMapper/RELEASE_NOTES.html (97 lines of code) (raw):

<!-- *************************************************************** * Licensed to the Apache Software Foundation (ASF) under one * or more contributor license agreements. See the NOTICE file * distributed with this work for additional information * regarding copyright ownership. The ASF licenses this file * to you under the Apache License, Version 2.0 (the * "License"); you may not use this file except in compliance * with the License. You may obtain a copy of the License at * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, * software distributed under the License is distributed on an * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY * KIND, either express or implied. See the License for the * specific language governing permissions and limitations * under the License. *************************************************************** --> <html> <head> <title>Apache UIMA Addon: Concept Mapper: Release Notes</title> </head> <body> <h1>Apache UIMA (Unstructured Information Management Architecture) Addon Concept Mapper v2.10.2 Release Notes</h1> <h2>Contents</h2> <p> <a href="#what.is.uima">What is UIMA?</a><br/> <a href="#what.is.concept.mapper">What is the Concept Mapper</a><br> <a href="#major.changes">Major Changes in this Release</a><br/> <a href="#get.involved">How to Get Involved</a><br/> <a href="#report.issues">How to Report Issues</a><br/> <a href="#list.issues">List of JIRA Issues Fixed in this Release</a><br/> </p> <h2><a id="what.is.uima">1. What is UIMA?</a></h2> <p> Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a framework and SDK for developing such applications. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. UIMA enables such an application to be decomposed into components, for example "language identification" -&gt; "language specific segmentation" -&gt; "sentence boundary detection" -&gt; "entity detection (person/place names etc.)". Each component must implement interfaces defined by the framework and must provide self-describing metadata via XML descriptor files. The framework manages these components and the data flow between them. Components are written in Java or C++; the data that flows between components is designed for efficient mapping between these languages. UIMA additionally provides capabilities to wrap components as network services, and can scale to very large volumes by replicating processing pipelines over a cluster of networked nodes. </p> <p> Apache UIMA is an Apache-licensed open source implementation of the UIMA specification (that specification is, in turn, being developed concurrently by a technical committee within <a href="http://www.oasis-open.org">OASIS</a>, a standards organization). We invite and encourage you to participate in both the implementation and specification efforts. </p> <p> UIMA is a component framework for analysing unstructured content such as text, audio and video. It comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL. </p> <h2><a id="what.is.uima">2. What is the Concept Mapper?</a></h2> <p>ConceptMapper is a highly configurable, high performance dictionary lookup tool, implemented as a UIMA component. <p> Using one of several matching algorithms, it maps entries in a dictionary onto input documents, producing UIMA annotations. <p> The reference manual is <a target="_blank" href="https://uima.apache.org/d/uima-addons-current/ConceptMapper/ConceptMapperAnnotatorUserGuide.html">here</a>. <h2><a id="major.changes">Major Changes in this Release</a></h2> <p>This release updates the Concept Mapper to use the current UIMA annotator APIs. It also adds the improvements: <ul> <li>configures buffered I/O for compiled dictionaries</li> <li>adds the ability to specify the tokenizer descriptor by using a reference to a descriptor in the classpath</li> <li>supports creating the compiled dictionary as a side effect of loading a non-compiled dictionary</li> </ul> Finally, an API Change report is included, documenting the changes in the APIs between this version and version 2.3.1. Most users will be unaffected by this, because they do not use the APIs, but rather configure and run the ConceptMapper as a UIMA Annotator. </p> <p>Changes were also made to build specification to allow this project to be separately released. The binary build was changed to use the normal Maven conventions for naming the JAR artifact.</p> <h2><a id="get.involved">How to Get Involved</a></h2> <p> The Apache UIMA project really needs and appreciates any contributions, including documentation help, source code and feedback. If you are interested in contributing, please visit <a href="http://uima.apache.org/get-involved.html"> http://uima.apache.org/get-involved.html</a>. </p> <h2><a id="report.issues">How to Report Issues</a></h2> <p> The Apache UIMA project uses JIRA for issue tracking. Please report any issues you find at <a href="http://issues.apache.org/jira/browse/uima">http://issues.apache.org/jira/browse/uima</a> </p> </body> </html>