The UKP Lab at Darmstadt University would like to announce its second release:

        DKPro for User-Generated Discourse 1.0 (DKPro-UGD).
User-generated discourse from Web 2.0 poses particular challenges to natural language processing (NLP) due to its noise and error proneness. A data cleansing step preceding the analysis steps in an NLP pipeline can reduce the problems. DKPro for User-Generated Discourse (DKPro-UGD) provides components for data-cleansing as well as dictionaries for artifacts commonly found on user-generated discourse on the web, such as emoticons, swearwords and shorthand.

DKPro-UGD is part of the comprehensive Darmstadt Knowledge Processing Software Repository project (DKPro). DKPro leverages the flexibility of the UIMA framework to facilitate the development and integration of new NLP algorithms and easy configuration of NLP experiments.

The DKPro-UGD includes updated versions of some of the components that were shipped with DKPro-Core 1.0. Most notable a new and more robust TreeTagger wrapper component is included.

The release is freely available for research purposes and can be obtained from the DKPro website at

        http://www.ukp.tu-darmstadt.de/software/dkpro/

If you have questions or comments, please write to [email protected] .

Best regards,

Richard Eckart de Castilho

--
-------------------------------------------------------------------
Richard Eckart de Castilho
Software Engineer
Ubiquitous Knowledge Processing Lab
FB 20 Computer Science Department
Technische Universität Darmstadt
Hochschulstr. 10, D-64289 Darmstadt, Germany
phone +49 (6151) 16 - 6218, fax -5455, room S2/02/E225
[email protected]
www.ukp.tu-darmstadt.de
-------------------------------------------------------------------



Reply via email to