Re: [Moses-support] Getting counts in Moses instead of probabilities

Hieu Hoang Thu, 09 Jul 2015 01:03:13 -0700

The counts are written in the 5th column in the phrase table.
   http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases
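
As a rough illustration of reading that column, here is a minimal Python sketch. It assumes the usual phrase-table layout in which fields are separated by "|||" and the table is gzipped. The function names parse_line and read_counts are made up for this example and are not part of Moses. The sketch returns the numbers in the 5th field as they are, without interpreting what each one means.

```python
import gzip


def parse_line(line):
    """Split one phrase-table line and return (source, target, counts).

    Assumes fields are separated by '|||' and that the 5th field holds
    whitespace-separated numbers (the counts).
    """
    fields = [field.strip() for field in line.rstrip("\n").split("|||")]
    if len(fields) < 5:
        raise ValueError("line has no 5th field: %r" % line)
    counts = [float(value) for value in fields[4].split()]
    return fields[0], fields[1], counts


def read_counts(path):
    """Yield (source, target, counts) for every line of a gzipped phrase table."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            if line.strip():
                yield parse_line(line)
```

For example, read_counts("phrase-table.gz") would iterate over a table at that (hypothetical) path.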

This is for debugging purposes only; the counts don't influence decoding in any way.

If you want to know more about how it works: the counts are stored in the files extract.*.sorted.gz and extract.*.inv.sorted.gz. The counts are summed and the probability is calculated by the score program. The source code for the score program is in

   phrase-extract/score-main.cpp
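
The idea of summing counts and turning them into a probability can be sketched in a few lines of Python. This is only an illustration of relative-frequency estimation, not what score-main.cpp actually does in full. It assumes each extract line looks like "source ||| target ||| alignment" and that the lines are already sorted by source phrase. The function name phrase_probabilities is made up for this example.

```python
from collections import Counter
from itertools import groupby


def phrase_probabilities(sorted_lines):
    """Yield (source, target, count, probability) from sorted extract lines.

    Assumes each line is 'source ||| target ||| ...' and that all lines
    with the same source phrase are adjacent (the file is sorted).
    """
    def split(line):
        fields = [field.strip() for field in line.split("|||")]
        return fields[0], fields[1]

    pairs = (split(line) for line in sorted_lines if line.strip())
    # Process one source phrase at a time, as the sorted order allows.
    for source, group in groupby(pairs, key=lambda pair: pair[0]):
        counts = Counter(target for _, target in group)
        total = sum(counts.values())
        for target, count in counts.items():
            # Relative frequency: count(source, target) / count(source)
            yield source, target, count, count / total
```

Because the input is sorted, only the counts for one source phrase need to be held in memory at any time.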


On 08/07/2015 18:05, Harshit Gupta wrote:

Hi, I am currently working on the Moses platform and, in the phrase tables, I am interested in the counts of phrases instead of the phrase translation probabilities. Can I get these counts?

The Moses manual, in describing how phrase scores are calculated during training, says:

"To estimate the phrase translation probability φ(e|f) we proceed as follows: First, the extract file is sorted. This ensures that all English phrase translations for an foreign phrase are next to each other in the file. Thus, we can process the file, one foreign phrase at a time, *collect counts* and compute φ(e|f) for that foreign phrase f."

Where are these counts collected? Where can I get these counts?

Regards
Harshit

--
Harshit Gupta
Third Year Undergraduate
Electrical Engineering
IIT Madras


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support


--
Hieu Hoang
Researcher
New York University, Abu Dhabi
http://www.hoang.co.uk/hieu
