Dear Kristina:

You write:


1) I'm not sure how the files with the source sentences are obtained. Are they 
hand-crafted / auto-generated? Or are they taken from the same texts used in 
creating the dictionaries?

Not necessarily. They should be sentences taken from texts of the style that is supposed to be processed by the particular machine translation system.

2) Does "reference translation" mean sentence translated by human, and is it in 
target language?

Yes, and yes. The easiest way to do this would be to have a parallel set of sentences in the source and target language, so that there is no specific preparation needed.

3) Will all the possible combinations of information be compared? i.e.
just the reference translation / source sentence + reference translation / 
machine translation + reference translation / all three ?

I think tests should contain questions of all  types:

  1. No hints (just fill in the holes using your imagination=
  2. Source sentence as a hint
  3. Target-language output of MT as a hint
  4. Both source and MT output.

And, optionally,

  1. [if available] an additional reference translation.

4) I was thinking of using Python for this, is that alright? I will be happy to 
use (and learn if needed) other languages as well.

Python could be a choice, yes. As you are planning to have a user interface, you should consider the possibility of using some web scripting language for the web interface so that the test can be embedded.

Try to have a proposal draft as early as possible so that you can get feedback from mentors.

I am also worried that I have not participated in open source projects before, 
so any feedback/advice is greatly appreciated.

If there is anything specific that you would like to ask, just email me as I am one of the mentors in that task. But the best is to hang around in the #apertium IRC channel at freenode.net and ask the developers there.

Thank you very much for your time.

Good luck

Mikel L. Forcada

Kris.

Mikel L. Forcada
------------------------------------------------------------------------------
Better than sec? Nothing is better than sec when it comes to
monitoring Big Data applications. Try Boundary one-second 
resolution app monitoring today. Free.
http://p.sf.net/sfu/Boundary-dev2dev
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to