2nd UIMA@GSCL Workshop - Final Call for Participation

Richard Eckart de Castilho Wed, 16 Sep 2009 14:42:40 -0700

========================================================================


Final Call for Participation

Unstructured Information Management Architecture (UIMA)
2nd UIMA@GSCL Workshop

October 1st, 2009
Potsdam, Germany

http://www.ling.uni-potsdam.de/acl-lab/gscl09/workshops.en.html

========================================================================

-------------------
Program
-------------------

09:00 - 10:00   -       UIMA Tutorial, Graham Wilcock

10:00 - 10:30   -       Coffee Break

10:30 - 10:45   -       Opening

10:45 - 11:15   -       ClearTK: A Framework for Statistical Natural Language Processing (Philip V. Ogren, Philipp G. Wetzler, and Steven J. Bethard)

11:15 - 11:45   -       Multimedia Feature Extraction in the SAPIR Project (Aaron Kaplan, Jonathan Mamou, Francesco Gallo, and Benjamin Sznajder)

11:45 - 12:15   -       TextMarker: A Tool for Rule-Based Information Extraction (Peter Kluegl, Martin Atzmueller, and Frank Puppe)


12:15 - 13:00   -       Lunch Break

13:00 - 13:30   -       LuCas - A Lucene CAS Indexer (Erik Faessler, Rico Landefeld, Katrin Tomanek, and Udo Hahn)

13:30 - 14:00   -       Abstracting the types away from a UIMA type system (Karin Verspoor, William Baumgartner Jr., Christophe Roeder, and Lawrence Hunter)


14:00 - 14:30   -       Poster Session

14:30 - 15:00   -       Round Table/Discussion


-----------------------------
Workshop Description
-----------------------------

For many decades, NLP has suffered from low software engineering standards causing a limited degree of re-usability of code and interoperability of different modules within larger NLP systems. While this did not really hamper success in limited task areas (such as implementing a parser), it caused serious problems for the emerging field of language technology where the focus is on building complex integrated software systems, e.g., for information extraction or machine translation. This lack of integration has led to duplicated software development, work-arounds for programs written in different (versions of) programming languages, and ad-hoc tweaking of interfaces between modules developed at different sites.

In recent years, the Unstructured Information Management Architecture (UIMA) framework has been proposed as a middleware platform which offers integration by design through common type systems and standardized communication methods for components analysing streams of unstructured information, such as natural language. The UIMA framework offers a solid processing infrastructure that allows developers to concentrate on the implementation of the actual analytics components. An increasing number of members of the NLP community thus have adopted UIMA as a platform facilitating the creation of reusable NLP components that can be assembled to address different NLP tasks depending on their order, combination and configuration.

This workshop aims at bringing together members of the NLP community that are users, developers or providers of either UIMA components or UIMA-related tools in order to explore and discuss the opportunities and challenges in using UIMA as a platform for modern, well-engineered NLP. In the context of an emerging NLP-oriented UIMA community, the challenge to create not only reusable, but also interoperable components raises particular interest. From a methodological perspective, interoperability relies largely on UIMA type systems. Technically, it includes issues related to the packaging and distribution of UIMA components. Also, tools are important, for example to assemble complex processing work flows, to manage the bodies of data that are to be analysed and to visualize, explore, and further deploy the analysis results. Finally, interoperability is also affected by legal issues, such as potentially incompatible licenses of components and tools.

The availability of ready-to-use components plays a major role in choosing UIMA over other alternatives. To accentuate this, the workshop puts a focus on UIMA-based components and tools that are freely available for research.


--------------
Topics
--------------

Participants are invited to present applications realized using UIMA, general experiences using UIMA as a platform for natural language processing, as well as technical papers on particular aspects of the UIMA framework. Alternatives to and comparisons of other frameworks - e.g. GATE, LingPipe, etc. - with UIMA are of interest, too. More specifically, workshop topics include, but are not limited to:

• UIMA components with a special focus on genericity and type-system independence

• repositories of ready-to-use UIMA-based components
• (generic) type systems for UIMA

• distribution of UIMA components: documentation, licensing and packaging

• sophisticated tools to build and manage complex processing pipelines

• experience reports combining UIMA-based components from different sources, as well as solutions to interoperability issues

• processing of very large data collections: scale-out, parallelization, and performance optimization

• analysis of results: exploration, evaluation, visualization, and statistical analysis

• developing for UIMA: simplified APIs, debugging, unit testing, and limitations of UIMA



---------------------------------
Organizers and Contact
---------------------------------

• JULIE Lab, Friedrich-Schiller-Universität Jena
  • Udo Hahn
  • Katrin Tomanek
• UKP Lab, Technische Universität Darmstadt
  • Iryna Gurevych
  • Richard Eckart de Castilho

Please address any inquiries regarding the workshop to:
[email protected]

---------------------------------
Program Committee
---------------------------------


• Anni R. Coden, IBM T.J. Watson Research Center, USA
• Branimir K. Boguraev, IBM T.J. Watson Research Center, USA
• Graham Wilcock, University of Helsinki, Finland
• Iryna Gurevych, Technische Universität Darmstadt, Germany
• Katrin Tomanek, Friedrich-Schiller-Universität Jena, Germany
• Leo Ferres, University of Concepcion, Chile
• Michael Tanenblatt, IBM T.J. Watson Research Center, USA
• Nicolas Hernandez, Université de Nantes, France
• Philipp Cimiano, Delft University of Technology, Netherlands
• Richard Eckart de Castilho, Technische Universität Darmstadt, Germany
• Sophia Ananiadou, University of Manchester, Great Britain
• Stefan Geißler, TEMIS GmbH, Germany
• Udo Hahn, Friedrich-Schiller-Universität Jena, Germany
