Francis Tyers <[email protected]> wrote:

[...]

> One thing that is remaining is to rapidly expand the bilingual 
> dictionary.
>
> Basically for this we just need word pairs in Swedish and Danish.
>
>    apertium-swe.swe.dix: 96642 lemmas
>    apertium-dan.dan.dix: 51991 lemmas
>
>    apertium-swe-dan.swe-dan.dix: approx. 11202 lemmas
>    apertium-swe-dan.swe-dan.dix: approx. 11745 lemmas (incl. proper nouns)
>
> In comparison:
>
>    apertium-dan-nor.dan-nor.dix: 32018
>    apertium-dan-nor.dan-nor.dix: 53864 (incl. proper nouns)
>
>    apertium-nno-nob.nno-nob.dix: 49637
>    apertium-nno-nob.nno-nob.dix: 69472 (incl. proper nouns)
>
> So, proper names would be a good start, but there is also plenty of
> room for adding other categories... perhaps from EuroParl + word
> alignment?

Also, Wikidata might give something (especially relevant for Wikipedia
Content Translation, one would think =D), and I could probably find more
spelling differences for adding cognates (I've so far only done ö-ø and
ä-æ). A larger dan.dix would of course help with that, if we can get
that Retskrivningsordbog in.
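
To make the cognate idea concrete, here is a minimal sketch of how the
spelling-difference matching could work. It is not the script actually
used for the pair: the function names and the toy word lists are
hypothetical, and it assumes the Swedish and Danish lemmas have already
been pulled out of the monolingual dictionaries as plain strings. Only
the two correspondences mentioned above (ö-ø and ä-æ) are included.

```python
# Swedish -> Danish letter correspondences. Only the two named above;
# any further rules would have to be added and checked by hand.
SUBSTITUTIONS = {"ö": "ø", "ä": "æ"}


def respell(swe_lemma, substitutions=SUBSTITUTIONS):
    """Apply every substitution to a Swedish lemma, character by character."""
    return "".join(substitutions.get(ch, ch) for ch in swe_lemma)


def propose_pairs(swe_lemmas, dan_lemmas, substitutions=SUBSTITUTIONS):
    """Return (Swedish, Danish) candidate pairs.

    A pair is proposed only when the respelled Swedish lemma differs from
    the original and is attested in the Danish lemma list.
    """
    dan = set(dan_lemmas)
    pairs = []
    for swe in swe_lemmas:
        candidate = respell(swe, substitutions)
        if candidate != swe and candidate in dan:
            pairs.append((swe, candidate))
    return pairs


if __name__ == "__main__":
    # Toy data for illustration only, not taken from the .dix files.
    swe = ["öl", "säl", "hus", "fönster"]
    dan = ["øl", "sæl", "hus", "vindue"]
    print(propose_pairs(swe, dan))
    # [('öl', 'øl'), ('säl', 'sæl')]
```

The candidates would still need checking before going into the bilingual
dictionary, since a matching spelling does not guarantee a matching
meaning or part of speech. The check against the Danish list is also why
a larger dan.dix matters: a respelling that is missing there is never
proposed.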


-Kevin


_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
