Re: [Apertium-stuff] [Fwd: GSoC - assimilation evaluation]

Mikel L. Forcada Wed, 04 Apr 2012 02:50:54 -0700

Dear Kristina:

You write:


1) I'm not sure how the files with the source sentences are obtained. Are they 
hand-crafted / auto-generated? Or are they taken from the same texts used in 
creating the dictionaries?

Not necessarily. They should be sentences taken from texts of the stylethat is supposed to be processed by the particular machine translationsystem.

2) Does "reference translation" mean sentence translated by human, and is it in 
target language?

Yes, and yes. The easiest way to do this would be to have a parallel setof sentences in the source and target language, so that there is nospecific preparation needed.

3) Will all the possible combinations of information be compared? i.e.
just the reference translation / source sentence + reference translation / 
machine translation + reference translation / all three ?


I think tests should contain questions of all  types:

  1. No hints (just fill in the holes using your imagination=
  2. Source sentence as a hint
  3. Target-language output of MT as a hint
  4. Both source and MT output.

And, optionally,

  1. [if available] an additional reference translation.

4) I was thinking of using Python for this, is that alright? I will be happy to 
use (and learn if needed) other languages as well.

Python could be a choice, yes. As you are planning to have a userinterface, you should consider the possibility of using some webscripting language for the web interface so that the test can be embedded.

Try to have a proposal draft as early as possible so that you can getfeedback from mentors.

I am also worried that I have not participated in open source projects before, 
so any feedback/advice is greatly appreciated.

If there is anything specific that you would like to ask, just email meas I am one of the mentors in that task. But the best is to hang aroundin the #apertium IRC channel at freenode.net and ask the developers there.

Thank you very much for your time.


Good luck

Mikel L. Forcada

Kris.


Mikel L. Forcada

------------------------------------------------------------------------------
Better than sec? Nothing is better than sec when it comes to
monitoring Big Data applications. Try Boundary one-second 
resolution app monitoring today. Free.
http://p.sf.net/sfu/Boundary-dev2dev

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] [Fwd: GSoC - assimilation evaluation]

Reply via email to