Hi Sorin, There is some code available that does that but it's implemented as pre- and postprocessing steps:
https://github.com/christianbuck/matecat_util/tree/master/code/tags4moses This uses an external HMM aligner which may need a bit more work as it does not produce the same alignment as giza++ at the moment. If you include word alignment information in your phrase table that might be the better option. In the end this was rarely used and probably needs some brushing up but should be a good starting point. Please ask if you run into problems. cheers, Christian On 04/02/14 15:06, Sorin Slavescu wrote: > Hi all, > > Is there any research, tools or libraries to address the issue of > repositioning non-translatable content like tags, placeholders, entities > into the translated sentence? > For example if the source sentence is "This is a <b>test</b>" and the > translation is "C'est un test" to reposition the <b> tag into the > translation in the right spot to become "C'est un <b>test</b>" > > Thanks, > Sorin > -- > > > ORACLE <http://www.oracle.com> > Sorin Slavescu | Principal Software Engineer > Phone: +35318031937 | E-mail: [email protected] > Oracle Worldwide Product Translation (WPTG) - Tools > Block P5, East Point Business Park | Dublin 3, Ireland > Oracle is committed to developing practices and products that help > protect the environment <http://www.oracle.com/commitment> > > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
