OK - thanks! I don't really need it. And I could just get the word alignment directly from the phrase table I guess (using the segmentation info to get the phrases used). Why did it take so much memory to include it? Because of caching phrase translations?
Jorg Barry Haddow wrote: > Hi Joerg > > The word alignment info code got removed as it was using too much memory. If > you really need it, then you could go back in svn to the time before the > multi-threaded code was merged in (before r2477, I think) > > regards > Barry > > On Thursday 05 November 2009 13:53, Jörg Tiedemann wrote: >> Hmm ... >> I just tried the -print-alignment-info flag and did not get any word >> alignment info. I checked again and I do have the alignment fields in my >> phrase table. I also tried a binarized version including word alignments >> but nothing happens. Is there anything else I have to switch on to get >> the "verbose" output? I have no problem to get the phrase-mappings (like >> 0-3=0-1) in an n-best list when switching it on with >> -include-alignment-in-n-best. >> >> here is my command >> moses -f moses.ini -use-alignment-info -print-alignment-info < input >> >> this is what I get (for a very small test corpus ...): >> .... >> Collecting options took 2.470 seconds >> Search took 8.030 seconds >> BEST TRANSLATION: wir sind , die sich aus der gemeinsamen währung , die >> weit von uns , um die vorteile und flexibilität einer gemeinsamen >> währung einkerkert uns künstliche straitjacket|UNK|UNK|UNK , die über >> die völker europas . [11111111111111111111111111111111111111111111] >> [total=-257.879] <<-28.000, -34.000, -100.000, -9.775, -1.609, -5.646, >> -13.490, -1.609, -4.243, -247.338, -78.555, -108.830, -33.287, -47.820, >> 23.998>> >> wir sind , die sich aus der gemeinsamen währung , die weit von uns , um >> die vorteile und flexibilität einer gemeinsamen währung einkerkert uns >> künstliche straitjacket , die über die völker europas . >> Translation took 8.040 seconds >> Finished translating >> .... >> >> and my phrase-table looks like this: >> ! " ||| ! " ||| (0) (1) ||| (0) (1) ||| 0.75 0.0320773 1 0.595087 2.718 >> ! the total ||| denn die vorgesehene ||| () (1) () ||| () (1) () ||| >> 0.125 1.86531e-09 0.25 2.46276e-08 2.718 >> ! the total ||| denn die ||| () (1) () ||| () (1) ||| 0.0163934 >> 1.86531e-09 0.25 0.000386013 2.718 >> .... >> >> and I use the latest Moses (checked out yesterday) >> Well, it's not so important. Just wanted to know if there's something >> wrong with my command line options ... >> >> Jörg >> >> >> (By the way, the multi-threaded decoder works like a charm!) >> >> Nicola Bertoldi wrote: >>> Waleed, >>> if you are interested in a word-to-word alignment (instead of >>> phrase-to-phrase alignment) you could give a look to this feature of >>> Moses >>> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc6 >>> >>> Pay attention that you have to re-train your phrase tables including the >>> word-to-word alignments; see also >>> http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases >>> and >>> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#BinaryPhraseTable >>> >>> regards, >>> Nicola >>> >>> On 11/1/09 4:33 PM, "Felipe Sánchez Martínez" <[email protected]> >>> wrote: >>> >>> Hello, >>> >>> To obtain the correspondence between source phrases and target phrases >>> run the decoder with -report-segmentation. Also note that in >>> phrase-based SMT each target phrases is the translation of a source >>> phrases, there are no target phrases "aligned" with null. >>> >>> Regards >>> -- >>> Felipe >>> >>> El dom, 01-11-2009 a las 17:12 +0200, Waleed Oransa escribió: >>>> Hello all, >>>> >>>> I would like to know if moses can output the mapping between input >>>> words and output words. In other way, how can I know the source word >>>> (or null) for each word in the translated sentence. I appreciate your >>>> prompt response. Thank you! >>>> >>>> Waleed >>>> _______________________________________________ >>>> Moses-support mailing list >>>> [email protected] >>>> http://mailman.mit.edu/mailman/listinfo/moses-support >>> -- >>> Felipe Sánchez Martínez <[email protected]> >>> Departamento de Lenguajes y Sistemas Informáticos >>> Universidad de Alicante, E-03071 Alicante (Spain) >>> Tel.: +34 965 903 400, ext: 2966 Fax: +34 965 909 326 >>> http://www.dlsi.ua.es/~fsanchez >>> >>> _______________________________________________ >>> Moses-support mailing list >>> [email protected] >>> http://mailman.mit.edu/mailman/listinfo/moses-support >>> >>> >>> >>> _______________________________________________ >>> Moses-support mailing list >>> [email protected] >>> http://mailman.mit.edu/mailman/listinfo/moses-support > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
