Hi,

I attached a small patch to lt-proc -b at:
http://bugs.apertium.org/cgi-bin/bugzilla/show_bug.cgi?id=106#c1

I'm not sure if anyone actually uses the --bilingual mode, but it seems
handy for debugging, since it simply does a lookup in the bilingual
dictionary without any transfer rules. However, currently it returns as
unknown anything that has extra symbols, e.g. if your bidix specifies

<e><p><l>tener<s n="vblex"/></l><r>have<s n="vblex"/></r></p></e>

then lt-proc -b will tell you that ^tener<vblex><pri><p1><sg>$ is
unknown and give you

^tener<vblex><pri><p1><sg>/@tener<vblex><pri><p1><sg>$

which is not very useful for debugging transfer. Transfer with no
transfer rules will return the longest match and then just append any
following tags, giving

^have<vblex><pri><p1><sg>$

The patch changes lt-proc -b so that it works like transfer in appending
the superfluous tags, but otherwise works like lt-proc -b in that it
also outputs the source language analysis:

^tener<vblex><pri><p1><sg>/have<vblex><pri><p1><sg>$


(Since there was an unused "queue" variable in fstprocessor.cc:bilingual
I assume this was the intended behaviour, since that also makes it
possible to use lt-proc -b as a module before a bidix-free transfer
module.)

Please let me know if it works...


-Kevin


------------------------------------------------------------------------------
Special Offer-- Download ArcSight Logger for FREE (a $49 USD value)!
Finally, a world-class log management solution at an even better price-free!
Download using promo code Free_Logger_4_Dev2Dev. Offer expires 
February 28th, so secure your free ArcSight Logger TODAY! 
http://p.sf.net/sfu/arcsight-sfd2d
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to