Hi, Supposedly your phrase table does not contain an entry "Gitarre ||| guitar" because this word pair is always unaligned in your training data. You could try to improve your word alignment quality.
Alternatively, you could implement a procedure in the manner of the "forced single word heuristic" as described in: D. Stein, D. Vilar, S. Peitz, M. Freitag, M. Huck, and H. Ney. A Guide to Jane, an Open Source Hierarchical Translation Toolkit. The Prague Bulletin of Mathematical Linguistics, number 95, pages 5-18, Prague, Czech Republic, April 2011. http://ufal.mff.cuni.cz/pbml/95/art-stein-vilar-ney-jane.pdf (see Fig. 1c). But the latter would rather be a workaround. Cheers, Matthias On Thu, 2014-11-27 at 01:18 +0900, Raj Dabre wrote: > Hello, > > > If I am not wrong this is most likely due to the grow (-diag) method applied > to the word aligned data (both directions) before phrase extraction. > > Furthermore..... one word translations should exist (but not always).... > search for them. > > > > Regards. > > > On Thu, Nov 27, 2014 at 12:53 AM, Vera Aleksic, Linguatec GmbH > <[email protected]> wrote: > Hi, > > I have observed many times that some words do not exist as single > word translations in the phrase table, although they exist in the training > corpus and in multiword phrases. > An example: > German-English translation for "Gitarre" is unknown, i.e. there is no > single word entry for "Gitarre" in the phrase table, although some other > phrases containing this word exist (see below). > How is it possible? > Thanks and best regards, > Vera > > > Gitarre , ||| guitar ; ||| 1 0.0284465 1 0.0654272 2.718 ||| ||| 1 1 > Gitarre darstellt , unter Beanspruchung ||| guitar using ||| 0.25 > 2.7351e-11 1 0.0625119 2.718 ||| ||| 4 1 > Gitarre darstellt , unter ||| guitar using ||| 0.25 1.18917e-05 1 > 0.0625119 2.718 ||| ||| 4 1 > Gitarre darstellt , ||| guitar using ||| 0.25 0.00569228 1 0.0625119 > 2.718 ||| ||| 4 1 > Gitarre darstellt ||| guitar using ||| 0.25 0.0400028 1 0.0625119 > 2.718 ||| ||| 4 1 > Kopfplatte einer Gitarre darstellt , ||| head of a guitar using ||| > 0.5 4.23407e-08 1 0.00471281 2.718 ||| ||| 2 1 > Kopfplatte einer Gitarre darstellt ||| head of a guitar using ||| 0.5 > 2.97552e-07 1 0.00471281 2.718 ||| ||| 2 1 > eine elektrische Gitarre , ||| an electric guitar ; ||| 1 0.00107982 > 1 0.00163632 2.718 ||| ||| 1 1 > einer Gitarre darstellt , unter ||| of a guitar using ||| 0.333333 > 6.4754e-07 1 0.00471281 2.718 ||| ||| 3 1 > einer Gitarre darstellt , ||| of a guitar using ||| 0.333333 > 0.000309961 1 0.00471281 2.718 ||| ||| 3 1 > einer Gitarre darstellt ||| of a guitar using ||| 0.333333 0.00217827 > 1 0.00471281 2.718 ||| ||| 3 1 > elektrische Gitarre , ||| electric guitar ; ||| 1 0.005661 1 > 0.0142097 2.718 ||| ||| 1 1 > wie eine elektrische Gitarre , ||| as an electric guitar ; ||| 1 > 0.000177339 1 0.000809485 2.718 ||| ||| 1 1 > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > > > > -- > Raj Dabre. > Research Student, > > Graduate School of Informatics, > Kyoto University. > CSE MTech, IITB., 2011-2014 > > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
