Hi, I attached a small patch to lt-proc -b at: http://bugs.apertium.org/cgi-bin/bugzilla/show_bug.cgi?id=106#c1
I'm not sure if anyone actually uses the --bilingual mode, but it seems handy for debugging, since it simply does a lookup in the bilingual dictionary without any transfer rules. However, it currently reports as unknown anything that carries extra symbols. For example, if your bidix specifies

  <e><p><l>tener<s n="vblex"/></l><r>have<s n="vblex"/></r></p></e>

then lt-proc -b will tell you that ^tener<vblex><pri><p1><sg>$ is unknown and give you

  ^tener<vblex><pri><p1><sg>/@tener<vblex><pri><p1><sg>$

which is not very useful for debugging transfer. Transfer with no transfer rules will return the longest match and then just append any following tags, giving

  ^have<vblex><pri><p1><sg>$

The patch changes lt-proc -b so that it works like transfer in appending the superfluous tags, but otherwise works like lt-proc -b in that it also outputs the source language analysis:

  ^tener<vblex><pri><p1><sg>/have<vblex><pri><p1><sg>$

(Since there was an unused "queue" variable in fstprocessor.cc:bilingual, I assume this was the intended behaviour, especially as it also makes it possible to use lt-proc -b as a module before a bidix-free transfer module.)

Please let me know if it works...

-Kevin
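
For anyone who wants to see the intended lookup behaviour without reading the patch, here is a minimal sketch in Python (not the C++ in fstprocessor.cc). It assumes a toy in-memory bidix; the BIDIX dict and the function names parse_lu, render and bilingual_lookup are hypothetical and are not part of lttoolbox. It tries the longest tag prefix first, appends the leftover tags to the target side, and falls back to the @-marked form when nothing matches.

```python
import re

# Hypothetical toy bidix: (source lemma, source tags) -> (target lemma, target tags).
# A real bidix is a compiled transducer; this dict only illustrates the idea.
BIDIX = {
    ("tener", ("vblex",)): ("have", ("vblex",)),
}


def parse_lu(lu):
    """Split '^lemma<t1><t2>$' into (lemma, (t1, t2, ...))."""
    body = lu.strip()
    if not (body.startswith("^") and body.endswith("$")):
        raise ValueError("not a lexical unit: %r" % lu)
    body = body[1:-1]
    lemma = body.split("<", 1)[0]
    tags = tuple(re.findall(r"<([^<>]+)>", body))
    return lemma, tags


def render(lemma, tags):
    """Inverse of parse_lu, without the surrounding ^ and $."""
    return lemma + "".join("<%s>" % t for t in tags)


def bilingual_lookup(lu, bidix=BIDIX):
    """Return '^source/target$', carrying unmatched trailing tags over."""
    lemma, tags = parse_lu(lu)
    source = render(lemma, tags)
    # Try the longest tag prefix first, then progressively shorter ones.
    for n in range(len(tags), -1, -1):
        hit = bidix.get((lemma, tags[:n]))
        if hit is not None:
            tgt_lemma, tgt_tags = hit
            # Append the tags the bidix entry did not consume.
            target = render(tgt_lemma, tgt_tags + tags[n:])
            return "^%s/%s$" % (source, target)
    # No prefix matched: mark the analysis as unknown with '@'.
    return "^%s/@%s$" % (source, source)


print(bilingual_lookup("^tener<vblex><pri><p1><sg>$"))
```

With the single tener entry above, the call prints the same source/target pair as the patched lt-proc -b example, and a lemma missing from the toy bidix comes back with the @ prefix.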
