Dear Sirs! Your OpenOffice is really great thing, once day it helped me to do the term paper undertime. I'm sure OpenOffice is helpful not only for me but for thousands users who have chosen open source software. Thank you very much!
And I'm willing to help your community to develop this wonderful project: I can start working on Development of a linguistic component for improvement of the work of OpenOffice’s thesaurus and spell-checker. This theme presents only in your ToDo list http://development.openoffice.org/todo.html not in Summer Projects list http://development.openoffice.org/summerprojects.html. So can I choose the current theme and can I take part in Summer of Code under your supervision? And do I have chances to work with you? Here is the project description: PROJECT TITLE Development of a linguistic component for improvement of the work of OpenOffice’s thesaurus and spell-checker. SYNOPSIS A Text processor is probably one of the most popular and prevalent program on every computer. If draw an analogy with the Internet then a Text processor we might compare with a search engine. And if the world’s best search engine is Google then the best open source set of office applications probably is OpenOffice. My task is to improve such important components in OpenOffice as thesaurus and spell-checker in order to users from different countries can easily select synonyms for necessary words and easily check the spelling. OPENOFFICE.ORG BENEFITS OpenOffice.org will get rather experienced and skilled worker who is very responsible and who is really interested in developing the open source software. It is necessary to note that I devote myself to a problem of automation of morphological and statistical analysis of natural language texts since April 2003. And I have achieved positive results: I have developed the program product which completely analyzes Russian words to morphemes, gives maximum morphological information about them and gather statistics of usage different morphemes and words itself. You can acquaint with this project at http://www.cryptology.ru/Alekseyenko.zip. So I have high experience in morphological analysis and statistics gathering and I hope that it would be helpful in present work. PROJECT DETAILS The first step is compilation of really qualitative base of prefixes and suffixes with corresponding morphological information. As my practice shows qualitative base of morphemes is half of success. I know only English and Russian but I have good friends’ linguists who can help me to make base for other languages. The second step is developing of the module that will directly choose words from the text and mark prefix and suffix in them. I think it’s better to look over all possible variants of parsing by compilation of binary trees and after that in compliance with weight coefficients of morphemes choose the optimum alternative. Besides all other alternatives of parsing can be shown in order to give a chance to user or beta-tester teach the system in case it incorrectly makes the parsing. The third step is a search of synonym to the parsing words. The fourth step is the addition of proper prefix and suffix to the words for creation new synonyms with correct tense, gender, number and so on. PROJECT SCHEDULE I can start working about the 20th of June: as soon as I pass my exams. About a week would be spent on design of linguistic component and familiarization with existent linguistic developments of OpenOffice. By the middle of July I'm going to work out the base of affixes with attendant morphological information. By the beginning of August I'm going to write linguistic component itself. The rest of time I'm planning to devote to work over search of synonyms and compilation of new synonyms. Besides if everything would be successful I will continue my work in OpenOffice.org even after Summer of Code. BIO I study at Computing Engineering Department of Southern Ural State University. My favorite programming languages are C/C++. Why exactly I match you? Firstly, I have enough high experience both at programming and in computational linguistics. Secondly I have thought out more ideas then I was working about this theme. Thirdly, I am very responsible in work: I fulfill it well and in a pointed time. Fourthly, I'm just willing to work side by side with senior OpenOffice developers. Fifthly, I want to make my contribution in development of open source software. -- Yours faithfully, Dmitriy Alekseyenko --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
