Hi, > I am familiar with the architecture of Moses, and know that the 2nd and 3rd > steps involve computing alignments in both directions while theĀ 4th step > applies the heuristic(grow,union ...) to obtain the final alignments. These > alignments are further used to extract the phrase-pairs. Now my question is, > what would be the best way to incorporate the alignments into Moses. > One way would be to duplicate the files generated by GIZA++ in both step 2 > and 3, and startĀ the training procedure from step:4. However, I was > wondering is there was a much simpler method to use the customized > alignments in Moses.
If you have your own alignment method, it would be best to skip the word alignment steps of the training steps and start with step 4. http://www.statmt.org/moses/?n=FactoredTraining.HomePage > Also in the process of MT, if I wanted to use a bilingual dictionary, would > it be ideal to use the dictionary in GIZA++ while computing the alignments, > or to augment the corpus with the entries in the dictionary. Most of the > target words for the entries in the dictionary are MWEs, and hence > augmenting the corpus did not bring about any improvements when we conducted > the experiments. Could you kindly suggest an appropriate method to be used > in this context. I am not sure what the problem is here - the inclusion of a dictionary as additional parallel corpus data is the standard method. I am not entirely sure why their translations as MWEs should be a problem. -phi _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
