Re: [Moses-support] Get the input - output word mapping

Hieu Hoang Thu, 05 Nov 2009 11:18:11 -0800

i wrote it to do some other stuff so the alignment info was stored 
inefficiently. my bad


Joerg Tiedemann wrote:
> OK - thanks!
> I don't really need it. And I could just get the word alignment directly 
> from the phrase table I guess (using the segmentation info to get the 
> phrases used). Why did it take so much memory to include it? Because of 
> caching phrase translations?
>
> Jorg
>
> Barry Haddow wrote:
>   
>> Hi Joerg
>>
>> The word alignment info code got removed as it was using too much memory. If 
>> you really need it, then you could go back in svn to the time before the 
>> multi-threaded code was merged in (before r2477, I think)
>>
>> regards
>> Barry
>>
>> On Thursday 05 November 2009 13:53, Jörg Tiedemann wrote:
>>     
>>> Hmm ...
>>> I just tried the -print-alignment-info flag and did not get any word
>>> alignment info. I checked again and I do have the alignment fields in my
>>> phrase table. I also tried a binarized version including word alignments
>>>   but nothing happens. Is there anything else I have to switch on to get
>>> the "verbose" output? I have no problem to get the phrase-mappings (like
>>> 0-3=0-1) in an n-best list when switching it on with
>>> -include-alignment-in-n-best.
>>>
>>> here is my command
>>> moses -f moses.ini -use-alignment-info -print-alignment-info < input
>>>
>>> this is what I get (for a very small test corpus ...):
>>> ....
>>> Collecting options took 2.470 seconds
>>> Search took 8.030 seconds
>>> BEST TRANSLATION: wir sind , die sich aus der gemeinsamen währung , die
>>> weit von uns , um die vorteile und flexibilität einer gemeinsamen
>>> währung einkerkert uns künstliche straitjacket|UNK|UNK|UNK , die über
>>> die völker europas . [11111111111111111111111111111111111111111111]
>>> [total=-257.879] <<-28.000, -34.000, -100.000, -9.775, -1.609, -5.646,
>>> -13.490, -1.609, -4.243, -247.338, -78.555, -108.830, -33.287, -47.820,
>>> 23.998>>
>>> wir sind , die sich aus der gemeinsamen währung , die weit von uns , um
>>> die vorteile und flexibilität einer gemeinsamen währung einkerkert uns
>>> künstliche straitjacket , die über die völker europas .
>>> Translation took 8.040 seconds
>>> Finished translating
>>> ....
>>>
>>> and my phrase-table looks like this:
>>> ! " ||| ! " ||| (0) (1) ||| (0) (1) ||| 0.75 0.0320773 1 0.595087 2.718
>>> ! the total ||| denn die vorgesehene ||| () (1) () ||| () (1) () |||
>>> 0.125 1.86531e-09 0.25 2.46276e-08 2.718
>>> ! the total ||| denn die ||| () (1) () ||| () (1) ||| 0.0163934
>>> 1.86531e-09 0.25 0.000386013 2.718
>>> ....
>>>
>>> and I use the latest Moses (checked out yesterday)
>>> Well, it's not so important. Just wanted to know if there's something
>>> wrong with my command line options ...
>>>
>>> Jörg
>>>
>>>
>>> (By the way, the multi-threaded decoder works like a charm!)
>>>
>>> Nicola Bertoldi wrote:
>>>       
>>>> Waleed,
>>>> if you are interested in a word-to-word alignment (instead of
>>>> phrase-to-phrase alignment) you could give a look to this feature of
>>>> Moses
>>>> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc6
>>>>
>>>> Pay attention that you have to re-train your phrase tables including the
>>>> word-to-word alignments; see also
>>>> http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases
>>>> and
>>>> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#BinaryPhraseTable
>>>>
>>>> regards,
>>>> Nicola
>>>>
>>>> On 11/1/09 4:33 PM, "Felipe Sánchez Martínez" <[email protected]>
>>>> wrote:
>>>>
>>>> Hello,
>>>>
>>>> To obtain the correspondence between source phrases and target phrases
>>>> run the decoder with -report-segmentation. Also note that in
>>>> phrase-based SMT each target phrases is the translation of a source
>>>> phrases, there are no target phrases "aligned" with null.
>>>>
>>>> Regards
>>>> --
>>>> Felipe
>>>>
>>>> El dom, 01-11-2009 a las 17:12 +0200, Waleed Oransa escribió:
>>>>         
>>>>> Hello all,
>>>>>
>>>>> I would like to know if moses can output the mapping between input
>>>>> words and output words. In other way, how can I know the source word
>>>>> (or null) for each word in the translated sentence. I appreciate your
>>>>> prompt response. Thank you!
>>>>>
>>>>> Waleed
>>>>> _______________________________________________
>>>>> Moses-support mailing list
>>>>> [email protected]
>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>           
>>>> --
>>>> Felipe Sánchez Martínez <[email protected]>
>>>> Departamento de Lenguajes y Sistemas Informáticos
>>>> Universidad de Alicante, E-03071 Alicante (Spain)
>>>> Tel.: +34 965 903 400, ext: 2966 Fax: +34 965 909 326
>>>> http://www.dlsi.ua.es/~fsanchez
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> [email protected]
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> [email protected]
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>         
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>   
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] Get the input - output word mapping

Reply via email to