Hello everyone!

We're making progress on a module for treating separable "multiword" expressions. The aim is to be able to handle cases like these (input -> reordered -> translated):

^take$ ^the$ ^rubbish$ ^out$ -> ^take# out$ ^the$ ^rubbish$ -> ^sacar$ ^la$ ^basura$

^be$ ^always$ ^late$ -> ^be# late$ ^always$ -> ^llegar# tarde$ ^siempre$

^take$ ^the$ ^rubbish$ ^out of$ ^here$ -> ^take# out$ ^the$ ^rubbish$ ^of$ ^here$ -> ^sacar$ ^la$ ^basura$ ^de$ ^aquí$

The plan is for it to be a finite-state transducer (like the existing monolingual and bilingual dictionaries), but one that can work over words. It will sit in the pipeline between the pretransfer module and the lexical transfer module (apertium-pretransfer | new module | lt-proc -b).
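
To make the reordering step concrete, here is a rough Python sketch of what the new module would do to the stream before lt-proc -b sees it. It is only an illustration and not the module itself: it uses a plain loop instead of a finite-state transducer, it ignores tags, and the rule table SEPARABLE and the function names are made up for this example. It also assumes the particle can be any distance to the right of the head, which a real rule set would probably want to restrict.

```python
import re

# Hypothetical rule table: (head lemma, separated particle) pairs.
# These names and this format are assumptions for illustration only.
SEPARABLE = {("take", "out"), ("be", "late")}


def parse_stream(stream):
    """Split a stream such as '^take$ ^the$' into its lexical units."""
    return re.findall(r"\^(.*?)\$", stream)


def join_separable(units, table=SEPARABLE):
    """Move a separated particle next to its head, marking the join with '#'."""
    out = list(units)
    i = 0
    while i < len(out):
        head = out[i]
        for j in range(i + 1, len(out)):
            # A unit like 'out of' contributes only its first word ('out').
            first, _, rest = out[j].partition(" ")
            if (head, first) in table:
                out[i] = f"{head}# {first}"
                if rest:
                    out[j] = rest  # keep the remainder ('of') in place
                else:
                    del out[j]     # the whole unit was consumed
                break
        i += 1
    return out


def render(units):
    """Turn a list of lexical units back into '^...$' stream form."""
    return " ".join(f"^{u}$" for u in units)


if __name__ == "__main__":
    for line in ("^take$ ^the$ ^rubbish$ ^out$",
                 "^be$ ^always$ ^late$",
                 "^take$ ^the$ ^rubbish$ ^out of$ ^here$"):
        print(render(join_separable(parse_stream(line))))
```

Run on the three English inputs above, this produces the middle column of each example; the translation into Spanish is left to the lexical transfer step.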

So, this email is a call for language pair developers to send us examples of phenomena they would like to treat in their language pairs.

Thanks!

Fran

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
