1. change the code to do your own unknown word processing. The function 
        TranslationOptionCollection::ProcessOneUnknownWord() (line 185 in
TranslationOptionCollection.cpp)
   is the place where unknown words are processed


Hieu Hoang
www.hoang.co.uk/hieu


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.


-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
On Behalf Of Vineet Kashyap
Sent: 12 July 2008 11:16
To: [email protected]
Subject: [Moses-support] Unknown words

Hi all

1. is there a way to output unknown words to a separate file instead of
dropping them as i think we can add those words to the dictionary  which
will improve the accuracy ?

2. also, when adding dictionary to the parallel corpus as suggested by
Phillip in the previous post you have one word in the source language and
the other in the target language is that correct?

3. Does BLEU uses a reference file with accurate human translations to
estimate a score ? And if not would it be better to evaluate the system with
such a reference file with accurate translations ? 

4. what value of BLEU means good translations ? in percentage...
   and for comparison purposes how would a human judge a MT system's
   performance ?

5. can we train higher order language models with SRILM with a small corpus
or have to use IRSTLM ?


Thanks a lot in advance for taking the time in answering these questions.

Regards, Vineet

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to