Dear list members,

We are happy to announce the release of version 1.8 of our multilingual, cross-domain, and easy-to-extend temporal tagger HeidelTime. [1]

In the context of the new version, Croatian resources were added - developed and kindly provided by Luka Shukan et al. (University of Zagreb). [2] Furthermore, the Italian resources were significantly improved in the context of the EVALITA-2014 EVENTI task. [3] Finally, we have made some processing speed and stability improvements affecting the UIMA kit and standalone versions.

In the meanwhile, 11 languages are supported (ordered alphabetically):
Arabic, Chinese, Croatian, Dutch, English, French, German, Italian, Russian, Spanish, and Vietnamese.

In addition, HeidelTime distinguishes between news-style documents and narrative-style documents (e.g., Wikipedia articles) in all languages. In addition, English colloquial (e.g., Tweets and SMS) and scientific articles (e.g., clinical trails) are supported.

HeidelTime is available at Google Code [1] as a UIMA component and as a Java standalone version. If you want to briefly test it, there is also an online demo. [4]

In addition to HeidelTime itself, the UIMA HeidelTime kit contains several collection readers and CAS consumers (mainly for processing temporally annotated corpora) as well as analysis engines wrapping several part-of-speech taggers to perform linguistic preprocessing in all supported languages.

Any kind of feedback is highly appreciated!

Best regards,
The HeidelTime Team
http://code.google.com/p/heideltime/
https://twitter.com/HeidelTime



[1] <http://code.google.com/p/heideltime/>http://code.google.com/p/heideltime/wiki/Downloads

[2] Luka Skukan, Goran Glavas(, and Jan S(najder (2014): HeidelTime.Hr: Extracting and Normalizing Temporal Expressions in Croatian. In Proceedings of the 9th Language Technologies Conference, pages 99-103. ( http://nl.ijs.si/isjt14/proceedings/isjt2014_17.pdf)

[3] Giulio Manfredi, Jannik Strötgen, Julian Zell, and Michael Gertz (2014): HeidelTime at EVENTI: Tuning Italian Resources and Addressing TimeML's Empty Tags. In Proceedings of the 4th International Workshop EVALITA-2014, pages 39-43. ( http://dbs.ifi.uni-heidelberg.de/fileadmin/Team/jannik/publications/2014_EVALITA_ManfrediEtAl.pdf)

[4] http://heideltime.ifi.uni-heidelberg.de/heideltime/

--
Jannik Strötgen
Institute of Computer Science
Im Neuenheimer Feld 348
69120 Heidelberg
Germany
Phone: +49 (0) 6221 / 54-5709
eMail: [email protected]

Reply via email to