OK. Then I am going to investigate with the newest version. I have to admit I haven't tried it yet after Hieu's recent refactoring work. Should have some results shortly.

W dniu 04.07.2013 13:26, Fishkov, Alexander pisze:

Hi, Marcin!

I have rather new version of Moses. To double-check I pulled the recent version from GitHub and rebuild everything from scratch to repeat my experiments. Everything stays the same -- different translations.

*From:*[email protected] [mailto:[email protected]] *On Behalf Of *Marcin Junczys-Dowmunt
*Sent:* Thursday, July 04, 2013 3:00 PM
*To:* [email protected]
*Subject:* Re: [Moses-support] Compact phrase table produces different ranslations than original binary

Hi Alexander,
both phrase tables should produce the same translations (they may differ once every many thousands sentences), so there is definitly something off. Your ini files look fine. How recent is your moses version? Does this happen with the most recent one?
Best,
Marcin


W dniu 04.07.2013 12:24, Fishkov, Alexander pisze:

    Hi!

    I recently wanted to convert existing Moses translation models
    into compact format to reduce memory usage. As far as I
    understood, compact phrase table representation is a 'lossless
    compression' of original phrase table that requires a lot of time.
    However, while testing existing French-English model trained on
    Europarl corpus I encountered disagreements in translation:

    Original sentence: Wikipédia est un projet d'encyclopédie
    collective établie sur Internet, universelle, multilingue et
    fonctionnant sur le principe du wiki.

    Compact translation:  Wikip?dia un project is a collective
    encyclopaedia ?tablie sur Internet universal, multilingual and
    operating sur the principle of wiki.

    Old translation: Wikip?dia is a project of collective
    encyclopaedia established on the Internet, universal, multilingual
    and operating on the principle of wiki.

    As can be seen, compact translation model seems to know less words
    or phrases. Is it a bug or is my understanding of compact
    representation as 'lossless' incorrect?

    Best regards, Alexander.




    _______________________________________________

    Moses-support mailing list

    [email protected]  <mailto:[email protected]>

    http://mailman.mit.edu/mailman/listinfo/moses-support


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to