Hi! I think I discovered a subtle bug in the way the formatting is handled. In brief, this is the problem (from Spanish to English):
Input: "quiero una manzana <em>roja</em> del huerto" Output 1: "I want a red <em>apple</em> of the orchard" note that the emphasis is in the wrong place. With spectie's help over IRC, I changed es-en.t1x for the rule "REGLA: DET NOM ADJ" to just swap the blanks, i.e. <b pos="1"/> <b pos="2"/>. Now the output is: Output 2: "I want a <em>red apple</em> of the orchard." If you look at this debug printout, you'll notice that the problem is that the "</em>" marker is outside the chunk during transfer: http://codepad.org/tfm6gqOi Of course, the golden output would be: Output: "I want a <em>red</em> apple of the orchard." Any ideas on how to fix this? Thanks! -- Hector ------------------------------------------------------------------------------ All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity and more. Splunk takes this data and makes sense of it. Business sense. IT sense. Common sense. http://p.sf.net/sfu/splunk-d2dcopy1 _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
