Forwarding to the lingucomponent list. Subscribe here: http://lingucomponent.openoffice.org/index.html regards, Richard.
On Friday 10 June 2005 02:50 pm, Dmitriy Alekseyenko wrote: > Dear Sirs! > > Your OpenOffice is really great thing, once day it helped me to do the > term paper undertime. I'm sure OpenOffice is helpful not only for me but for > thousands users who have chosen open source software. Thank you very > much! > > And I'm willing to help your community to develop this wonderful > project: I can start working on Development of a linguistic component > for improvement of the work of OpenOffice’s thesaurus and > spell-checker. > > This theme presents only in your ToDo list > http://development.openoffice.org/todo.html not in Summer Projects > list http://development.openoffice.org/summerprojects.html. > > So can I choose the current theme and can I take part in Summer of > Code under your supervision? And do I have chances to work with you? > > Here is the project description: > > PROJECT TITLE > Development of a linguistic component for improvement of the work of > OpenOffice’s thesaurus and spell-checker. > > SYNOPSIS > A Text processor is probably one of the most popular and prevalent > program on every computer. If draw an analogy with the Internet then a > Text processor we might compare with a search engine. And if the > world’s best search engine is Google then the best open source set of > office applications probably is OpenOffice. My task is to improve such > important components in OpenOffice as thesaurus and spell-checker in > order to users from different countries can easily select synonyms for > necessary words and easily check the spelling. > > OPENOFFICE.ORG BENEFITS > OpenOffice.org will get rather experienced and skilled worker who is > very responsible and who is really interested in developing the open > source software. > It is necessary to note that I devote myself to a problem of > automation of morphological and statistical analysis of natural > language texts since April 2003. And I have achieved positive results: > I have developed the program product which completely analyzes Russian > words to morphemes, gives maximum morphological information about them > and gather statistics of usage different morphemes and words itself. > You can acquaint with this project at > http://www.cryptology.ru/Alekseyenko.zip. So I have high experience in > morphological analysis and statistics gathering and I hope that it > would be helpful in present work. > > PROJECT DETAILS > The first step is compilation of really qualitative base of prefixes > and suffixes with corresponding morphological information. As my > practice shows qualitative base of morphemes is half of success. I > know only English and Russian but I have good friends’ linguists who > can help me to make base for other languages. The second step is > developing of the module that will directly choose words from the text > and mark prefix and suffix in them. I think it’s better to look over > all possible variants of parsing by compilation of binary trees and > after that in compliance with weight coefficients of morphemes choose > the optimum alternative. Besides all other alternatives of parsing can > be shown in order to give a chance to user or beta-tester teach the > system in case it incorrectly makes the parsing. The third step is a > search of synonym to the parsing words. The fourth step is the > addition of proper prefix and suffix to the words for creation new > synonyms with correct tense, gender, number and so on. > > PROJECT SCHEDULE > I can start working about the 20th of June: as soon as I pass my > exams. About a week would be spent on design of linguistic component > and familiarization with existent linguistic developments of > OpenOffice. By the middle of July I'm going to work out the base of > affixes with attendant morphological information. By the beginning of > August I'm going to write linguistic component itself. The rest of > time I'm planning to devote to work over search of synonyms and > compilation of new synonyms. Besides if everything would be successful > I will continue my work in OpenOffice.org even after Summer of Code. > > BIO > I study at Computing Engineering Department of Southern Ural State > University. My favorite programming languages are C/C++. > Why exactly I match you? Firstly, I have enough high experience both > at programming and in computational linguistics. Secondly I have > thought out more ideas then I was working about this theme. Thirdly, I > am very responsible in work: I fulfill it well and in a pointed time. > Fourthly, I'm just willing to work side by side with senior OpenOffice > developers. Fifthly, I want to make my contribution in development of > open source software. > > -- > Yours faithfully, > Dmitriy Alekseyenko > > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
