I'm happy to report we've just finished the newest SenseClusters related
paper, and it turns out to be the first paper that will be published in
2005.

Name Discrimination by Clustering Similar Contexts (Pedersen, Purandare,
and Kulkarni) - Appears in the Proceedings of the Sixth International
Conference on Intelligent Text Processing and Computational Linguistics,
February 13-19, 2005, Mexico City

http://www.d.umn.edu/~tpederse/Pubs/cicling2005.pdf

This represents an application and extension of SenseClusters to the
problem of name discrimination. All of the experiments discussed in this
paper can be carried out via the use of the SenseClusters package
(http://senseclusters.sourceforge.net) version 0.55.

The experimental data consists of pseudo names created from the English
GigaWord corpus using the Name Conflate program
(http://www.d.umn.edu/~kulka020/kanaghaName.html).

You may obtain this pseudo word data (which includes the "correct" name
as well) here: http://www.d.umn.edu/~tpederse/Data/cicling2005-data.tar

Enjoy!

--
Ted Pedersen
http://www.d.umn.edu/~tpederse


-------------------------------------------------------
This SF.Net email is sponsored by: InterSystems CACHE
FREE OODBMS DOWNLOAD - A multidimensional database that combines
robust object and relational technologies, making it a perfect match
for Java, C++,COM, XML, ODBC and JDBC. www.intersystems.com/match8
_______________________________________________
senseclusters-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/senseclusters-users

Reply via email to