Dear Sirs!

Your OpenOffice is really great thing, once day it helped me to do the
term paper undertime. I'm sure OpenOffice is helpful not only for me but for
thousands users who have chosen open source software. Thank you very
much!

And I'm willing to help your community to develop this wonderful
project: I can start working on Development of a linguistic component
for improvement of the work of OpenOffice’s thesaurus and
spell-checker.

This theme presents only in your ToDo list
http://development.openoffice.org/todo.html not in Summer Projects
list http://development.openoffice.org/summerprojects.html.

So can I choose the current theme and can I take part in Summer of
Code under your supervision? And do I have chances to work with you?

Here is the project description:

PROJECT TITLE
Development of a linguistic component for improvement of the work of
OpenOffice’s thesaurus and spell-checker. 

SYNOPSIS
A Text processor is probably one of the most popular and prevalent
program on every computer. If draw an analogy with the Internet then a
Text processor we might compare with a search engine. And if the
world’s best search engine is Google then the best open source set of
office applications probably is OpenOffice. My task is to improve such
important components in OpenOffice as thesaurus and spell-checker in
order to users from different countries can easily select synonyms for
necessary words and easily check the spelling.

OPENOFFICE.ORG BENEFITS
OpenOffice.org will get rather experienced and skilled worker who is
very responsible and who is really interested in developing the open
source software.
It is necessary to note that I devote myself to a problem of
automation of morphological and statistical analysis of natural
language texts since April 2003. And I have achieved positive results:
I have developed the program product which completely analyzes Russian
words to morphemes, gives maximum morphological information about them
and gather statistics of usage different morphemes and words itself.
You can acquaint with this project at
http://www.cryptology.ru/Alekseyenko.zip. So I have high experience in
morphological analysis and statistics gathering and I hope that it
would be helpful in present work.         

PROJECT DETAILS
The first step is compilation of really qualitative base of prefixes
and suffixes with corresponding morphological information. As my
practice shows qualitative base of morphemes is half of success. I
know only English and Russian but I have good friends’ linguists who
can help me to make base for other languages. The second step is
developing of the module that will directly choose words from the text
and mark prefix and suffix in them. I think it’s better to look over
all possible variants of parsing by compilation of binary trees and
after that in compliance with weight coefficients of morphemes choose
the optimum alternative. Besides all other alternatives of parsing can
be shown in order to give a chance to user or beta-tester teach the
system in case it incorrectly makes the parsing. The third step is a
search of synonym to the parsing words. The fourth step is the
addition of proper prefix and suffix to the words for creation new
synonyms with correct tense, gender, number and so on.              

PROJECT SCHEDULE
I can start working about the 20th of June: as soon as I pass my
exams. About a week would be spent on design of linguistic component
and familiarization with existent linguistic developments of
OpenOffice. By the middle of July I'm going to work out the base of
affixes with attendant morphological information. By the beginning of
August I'm going to write linguistic component itself. The rest of
time I'm planning to devote to work over search of synonyms and
compilation of new synonyms. Besides if everything would be successful
I will continue my work in OpenOffice.org even after Summer of Code.        
 
BIO
I study at Computing Engineering Department of Southern Ural State
University. My favorite programming languages are C/C++. 
Why exactly I match you? Firstly, I have enough high experience both
at programming and in computational linguistics. Secondly I have
thought out more ideas then I was working about this theme. Thirdly, I
am very responsible in work: I fulfill it well and in a pointed time.
Fourthly, I'm just willing to work side by side with senior OpenOffice
developers. Fifthly, I want to make my contribution in development of
open source software.

--
Yours faithfully,
Dmitriy Alekseyenko


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to