Hi,

I added support for the German Wiktionary; it is available in the newest
version. There is a quick test script that should get you 300k+
translations from the German Wiktionary in less than 15 minutes.

The dictionaries in 50 languages built using wikt2dict and other resources
(parallel and comparable corpora) are available here:
http://hlt.sztaki.hu/resources/index.html
Please let me know if you find parsing errors.

I understand that DBpedia Wiktionary does a lot more than wikt2dict, and I
do not plan to compete with it. However, adding 35+ Wiktionaries to it would
have been nearly impossible for me. wikt2dict is a quick (and dirty) way to
extract the translations.

Cheers,
Judit



2013/7/12 Judit, Ács <[email protected]>

> Hi All,
>
> I created a tool to extract translations from different editions of
> Wiktionary. Right now it supports 39 different Wiktionaries. It only
> extracts translations and ignores the rest.
>
> Supported Wiktionaries:
> Azerbaijani, Bulgarian, Catalan, Czech, Danish, Greek, English, Esperanto,
> Spanish, Estonian, Basque, Finnish, French, Galician, Hebrew, Croatian,
> Hungarian, Indonesian, Italian, Georgian, Latin, Lithuanian, Malagasy,
> Dutch, Norwegian, Occitan, Polish, Portuguese, Romanian, Russian, Slovak,
> Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian, Vietnamese and
> Chinese.
>
> Adding a new Wiktionary is done via a configuration file.
>
> Right now the beta version is available for download at:
> https://github.com/juditacs/wikt2dict
>
> Documentation is in progress; until then, the README should be enough to
> get started.
>
> Please test it and send me your feedback and bug reports.
>
> Thanks,
> Judit Ács
>
_______________________________________________
Wiktionary-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiktionary-l