it looks ok to me, not sure what could be wrong. i've added a daily test to ensure that the placeholder will work in future. Perhaps you can have a look at the moses.ini file and to-translate.txt files to see if there are any differences with yours
https://github.com/moses-smt/moses-regression-tests/tree/master/tests/phrase.placeholder Hieu Hoang Researcher New York University, Abu Dhabi http://www.hoang.co.uk/hieu On 19 May 2015 at 11:53, Carla Parra <[email protected]> wrote: > Dear Hieu, > > thanks for your reply. I attach the config file, my moses.ini (I think > this is the one you want to get), and a few lines of our input file, > already preprocessed. If you want the RAW lines I can also send them to you. > > I don't know if this will be a similar issue, but I tried the same > strategy using the forced translations (<np > translation="German">Deutsch</np>), and this morning I have observed the > same, some tags are suddenly appearing in the translation. > > Thank you very much for your support! > > Carla > > > El 19.05.2015 09:13, Hieu Hoang escribió: > >> what is the exact command you used to decode? Can you please provide >> the moses.ini file and a few lines of your input data for us to look >> at. >> >> Hieu Hoang >> Researcher >> >> New York University, Abu Dhabi >> >> http://www.hoang.co.uk/hieu [3] >> >> >> On 18 May 2015 at 15:35, Carla Parra <[email protected]> >> wrote: >> >> Dear all, >>> >>> we just finished some experiments using placeables, and we have >>> observed >>> several issues that may be worth sharing. I don't know if someone >>> has >>> experienced the same, or you were already aware of this, but just >>> in >>> case: >>> >>> (1) Special characters must be scaped in the "entity" value field. >>> Otherwise, the cause XML parsing errors at tuning (not at training, >>> though!), and wrong values are retrieved from the tags (e.g. we had >>> text >>> with additional quotation marks, and this caused that the >>> translation >>> stopped at the first quotation mark, not yielding the complete >>> "entity" >>> value we had encoded). >>> >>> (2) <ne> tags are added to sentences as if they were computed as >>> tokens >>> during training. (i.e. not ignored, as they just contain the >>> placeables). >>> As an example, the English sentence "Allow simple password", is >>> translated as "Permitir simple contraseña <ne translation="@tag@" >>> entity="</1>">@tag@</ne> ." >>> >>> While the first issue is our fault, we do not know what causes the >>> second one. We have followed the instructions at the MOSES advanced >>> features site and thus specified "extract-settings = "--Placeholder >>> @tag@"" in training and "-placeholder-factor 1 -xml-input >>> exclusive" in >>> the decoder and evaluation. Has anyone experienced the same thing >>> and/or >>> know how to solve this issue? >>> >>> Thank you very much. Best regards, >>> >>> Carla >>> >>> -- >>> Carla Parra Escartín >>> Marie Curie Experienced Researcher - EXPERT ITN >>> http://expert-itn.eu/ [1] >>> Hermes Traducciones >>> _______________________________________________ >>> Moses-support mailing list >>> [email protected] >>> http://mailman.mit.edu/mailman/listinfo/moses-support [2] >>> >> >> >> >> Links: >> ------ >> [1] http://expert-itn.eu/ >> [2] http://mailman.mit.edu/mailman/listinfo/moses-support >> [3] http://www.hoang.co.uk/hieu >> > > -- > Carla Parra Escartín > Marie Curie Experienced Researcher - EXPERT ITN > http://expert-itn.eu/ > Hermes Traducciones >
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
