Francis Tyers <[email protected]> čálii: [...]
> One thing that is remaining is to rapidly expand the bilingual > dictionary. > > Basically for this we just need word pairs in Swedish and Danish. > > apertium-swe.swe.dix: 96642 lemmas > apertium-dan.dan.dix: 51991 lemmas > > apertium-swe-dan.swe-dan.dix: approx. 11202 lemmas > apertium-swe-dan.swe-dan.dix: approx. 11745 lemmas (incl. proper > nouns) > > In comparison: > > apertium-dan-nor.dan-nor.dix: 32018 > apertium-dan-nor.dan-nor.dix: 53864 (incl. proper nouns) > > apertium-nno-nob.nno-nob.dix: 49637 > apertium-nno-nob.nno-nob.dix: 69472 (incl. proper nouns) > > So, proper names would be a good start, but also, there is plenty of > room for adding > more other categories... perhaps from EuroParl + word alignment ? Also, Wikidata might give something (especially relevant for Wikipedia Content Translation, one would think =D), and I could probably find more spelling differences for adding cognates (I've so far only done ö-ø and ä-æ). A larger dan.dix would of course help with that, if we can get that Retskrivningsordbog in. -Kevin
signature.asc
Description: PGP signature
------------------------------------------------------------------------------ Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
_______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
