Re: [Apertium-stuff] [Fwd: GSoC - assimilation evaluation]

Kristina Fedorenko Thu, 05 Apr 2012 09:32:32 -0700

Thank you so much for your answers!
>>> 
>>> 1) I'm not sure how the files with the source sentences are obtained. Are 
>>> they hand-crafted / auto-generated? Or are they taken from the same texts 
>>> used in creating the dictionaries? 
>> Not necessarily. They should be sentences taken from texts of the style that 
>> is supposed to be processed by the particular machine translation system.


Just to clarify: to make it work for any given pair, should there be some sort 
of database with parallel texts (will have to be manually created/translated)? 
Or can those texts be provided as inputs?

Also, I have submitted my proposal draft (Kristina Fedorenko: 
http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/krisfed/1). 
Any feedback would be immensely appreciated!

Thanks a lot.
Kris.


On Apr 4, 2012, at 10:31 AM, Mikel L. Forcada wrote:

> Dear Kristina:
> 
> You write:
>> 
>> 1) I'm not sure how the files with the source sentences are obtained. Are 
>> they hand-crafted / auto-generated? Or are they taken from the same texts 
>> used in creating the dictionaries? 
> Not necessarily. They should be sentences taken from texts of the style that 
> is supposed to be processed by the particular machine translation system.
>> 2) Does "reference translation" mean sentence translated by human, and is it 
>> in target language?
> Yes, and yes. The easiest way to do this would be to have a parallel set of 
> sentences in the source and target language, so that there is no specific 
> preparation needed.
>> 3) Will all the possible combinations of information be compared? i.e.
>> just the reference translation / source sentence + reference translation / 
>> machine translation + reference translation / all three ?
> 
> I think tests should contain questions of all  types:
> No hints (just fill in the holes using your imagination=
> Source sentence as a hint
> Target-language output of MT as a hint
> Both source and MT output.
> And, optionally,
> [if available] an additional reference translation.
>> 4) I was thinking of using Python for this, is that alright? I will be happy 
>> to use (and learn if needed) other languages as well.
> 
> Python could be a choice, yes. As you are planning to have a user interface, 
> you should consider the possibility of using some web scripting language for 
> the web interface so that the test can be embedded.
> Try to have a proposal draft as early as possible so that you can get 
> feedback from mentors.
>> I am also worried that I have not participated in open source projects 
>> before, so any feedback/advice is greatly appreciated.
> 
> If there is anything specific that you would like to ask, just email me as I 
> am one of the mentors in that task. But the best is to hang around in the 
> #apertium IRC channel at freenode.net and ask the developers there.
>> Thank you very much for your time.
> 
> Good luck
> Mikel L. Forcada
>> Kris.
> Mikel L. Forcada
> ------------------------------------------------------------------------------
> Better than sec? Nothing is better than sec when it comes to
> monitoring Big Data applications. Try Boundary one-second 
> resolution app monitoring today. Free.
> http://p.sf.net/sfu/Boundary-dev2dev_______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff

------------------------------------------------------------------------------
Better than sec? Nothing is better than sec when it comes to
monitoring Big Data applications. Try Boundary one-second 
resolution app monitoring today. Free.
http://p.sf.net/sfu/Boundary-dev2dev

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] [Fwd: GSoC - assimilation evaluation]

Reply via email to