Hi,

I've been developing the Polish-Silesian Apertium pair for some time, and the translations have become reasonable, so I reckon it's time to publish them. Of the 10,000 most frequent Polish words, it covers nearly 9,000 (the rest are on their way), and it handles more than 21,000 words altogether.

https://github.com/gkkulik/apertium-pol

https://github.com/gkkulik/apertium-pol-szl

https://github.com/gkkulik/apertium-szl

It still needs some fine-tuning, as it sometimes gives slightly amusing output. I improve it regularly: I use it daily to translate news, so I fix any mistakes I spot. I hope you can also give me some tips, since you know much more about the technical aspects of Apertium than I do.

Questions:

I want to improve the translation by developing hand-tagged corpora for both languages. How large do they need to be to be useful?

There used to be a great Apertium developer manual in PDF form, but I cannot find it anywhere. Can anybody point me in the right direction?

When the pair is published in trunk and on the website, I want to make it a media event here in Upper Silesia, which would mean increased traffic to the site. Is that okay? Sorry if this question is silly. :)

Best regards,
Gregory


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
