I'm happy to report we've just finished the newest SenseClusters related paper, and it turns out to be the first paper that will be published in 2005.
Name Discrimination by Clustering Similar Contexts (Pedersen, Purandare, and Kulkarni) - Appears in the Proceedings of the Sixth International Conference on Intelligent Text Processing and Computational Linguistics, February 13-19, 2005, Mexico City http://www.d.umn.edu/~tpederse/Pubs/cicling2005.pdf This represents an application and extension of SenseClusters to the problem of name discrimination. All of the experiments discussed in this paper can be carried out via the use of the SenseClusters package (http://senseclusters.sourceforge.net) version 0.55. The experimental data consists of pseudo names created from the English GigaWord corpus using the Name Conflate program (http://www.d.umn.edu/~kulka020/kanaghaName.html). You may obtain this pseudo word data (which includes the "correct" name as well) here: http://www.d.umn.edu/~tpederse/Data/cicling2005-data.tar Enjoy! -- Ted Pedersen http://www.d.umn.edu/~tpederse ------------------------------------------------------- This SF.Net email is sponsored by: InterSystems CACHE FREE OODBMS DOWNLOAD - A multidimensional database that combines robust object and relational technologies, making it a perfect match for Java, C++,COM, XML, ODBC and JDBC. www.intersystems.com/match8 _______________________________________________ senseclusters-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/senseclusters-users
