On 29 September 2011 22:13, Hector <[email protected]> wrote:
> Hi!
> I think I discovered a subtle bug in the way the formatting is
> handled. In brief, this is the problem (from Spanish to English):
>

Not a bug, per se; more a limitation in the design. I think it's even
mentioned in the documentation. IIRC, we had a discussion about more
or less the same thing in the last few days.

> Input: "quiero una manzana <em>roja</em> del huerto"
> Output 1: "I want a red <em>apple</em> of the orchard"
>
> note that the emphasis is in the wrong place. With spectie's help over
> IRC, I changed es-en.t1x for the rule "REGLA: DET NOM ADJ" to just
> swap the blanks, i.e. <b pos="1"/> <b pos="2"/>. Now the output is:
>
> Output 2: "I want a <em>red apple</em> of the orchard."

Changing the order of the blanks is, generally, a bad idea. Think
about what would have happened if the input had been 'una
<em>manzana</em> roja'.
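
To make that concrete, here is a toy sketch in Python. It is not how
the transfer module is implemented; it only assumes that formatting
tags travel inside the blanks between words, and that the rule sees
just the two blanks inside the DET NOM ADJ pattern. The name
apply_rule and its arguments are made up for illustration.

```python
def apply_rule(words, blanks, swap_blanks=False):
    """Toy model of a DET NOM ADJ -> DET ADJ NOM reordering rule.

    words  -- [det, nom, adj], already translated
    blanks -- [b1, b2], the text (spaces and format tags) that sat
              between det/nom and nom/adj in the input
    """
    det, nom, adj = words
    b1, b2 = blanks
    if swap_blanks:
        # The workaround from the quoted message: emit the blanks in swapped order.
        b1, b2 = b2, b1
    return det + b1 + adj + b2 + nom


words = ["a", "apple", "red"]

# Input 'una manzana <em>roja</em>': the closing tag follows the
# pattern, so it is outside the rule and gets appended afterwards.
case1 = [" ", " <em>"]
print(apply_rule(words, case1) + "</em>")
print(apply_rule(words, case1, swap_blanks=True) + "</em>")

# Input 'una <em>manzana</em> roja': both tags are inside the pattern.
case2 = [" <em>", "</em> "]
print(apply_rule(words, case2))
print(apply_rule(words, case2, swap_blanks=True))
```

In this toy model the swapped blanks happen to give well-formed markup
for the first input, but for the second input the closing tag is
emitted before the opening one.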

>
> If you look at this debug printout, you'll notice that the problem is
> that the "</em>" marker is outside the chunk during transfer:
>

Yes, otherwise you would have to have space handling at the end of every chunk.

> Of course, the golden output would be:
> Output: "I want a <em>red</em> apple of the orchard."
>

Golden output would be 'I want a red apple *from* the orchard', which
I hope puts things in their proper perspective.

-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
