2009/10/17 Lars Aronsson <[email protected]>

> Kelly Jones wrote:
>
> > How can I extract just a word list w/ definitions from wiktionary?
>
> A very simple Perl script for extracting information from the
> Wikimedia XML dumps is found on
> http://meta.wikimedia.org/wiki/User:LA2/Extraktor
>
> If you know Perl, you can modify this script to filter out the
> articles and sections you want, and output them separately.
>
There are also some Perl tools I've put in the Toolserver Subversion
repository, but since I'm the only one using them so far they may be tricky
for others to use: https://fisheye.toolserver.org/browse/enwikt/wiktdump/

wiktsplitnames.pl
<https://fisheye.toolserver.org/browse/enwikt/wiktdump/wiktsplitnames.pl>
will split an English Wiktionary dump file into per-language word lists or
mini dump files, in the most simplistic way.
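
For anyone who would rather not start from the Perl, the general approach
(stream through the XML dump, keep one language section per page, pull out
the definition lines) can be sketched roughly as below. This is only an
illustration in Python, not a port of Extraktor or wiktsplitnames.pl. The
function names, the sample data, the "lines starting with # are definitions"
rule and the colon test for non-main namespaces are all my own assumptions,
and real entries will need more care.

```python
import io
import re
import xml.etree.ElementTree as ET

# Tiny stand-in for a real dump file (hypothetical data, for illustration).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.4/">
  <page>
    <title>cat</title>
    <revision>
      <text>==English==
===Noun===
# A domesticated feline.
#: ''The cat sat on the mat.''
# A spiteful person.

==French==
===Noun===
# Something else.
</text>
    </revision>
  </page>
  <page>
    <title>Template:en-noun</title>
    <revision><text>template body</text></revision>
  </page>
</mediawiki>"""


def local(tag):
    """Strip the XML namespace: '{ns}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def language_section(wikitext, language="English"):
    """Return the text under the level-2 heading ==language==, or ''."""
    kept, inside = [], False
    for line in wikitext.splitlines():
        # Matches level-2 headings only; ===Noun=== does not match.
        m = re.match(r"^==\s*([^=].*?)\s*==\s*$", line)
        if m:
            inside = (m.group(1) == language)
            continue
        if inside:
            kept.append(line)
    return "\n".join(kept)


def definitions(section):
    """Assumption: a definition is a '#' line that is not '#:', '#*' or '##'."""
    return [
        line[1:].strip()
        for line in section.splitlines()
        if line.startswith("#") and not line.startswith(("#:", "#*", "##"))
    ]


def extract(dump, language="English"):
    """Yield (title, [definitions]) for pages with a section for `language`.

    `dump` is a filename or a binary file object. Pages are processed one
    at a time, so the whole dump never has to fit in memory.
    """
    title = text = None
    for _event, elem in ET.iterparse(dump, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            # Crude assumption: a colon in the title means a non-main namespace.
            if title and ":" not in title:
                defs = definitions(language_section(text or "", language))
                if defs:
                    yield title, defs
            title = text = None
            elem.clear()  # free the memory used by this page


if __name__ == "__main__":
    for word, defs in extract(io.BytesIO(SAMPLE)):
        for d in defs:
            print(f"{word}\t{d}")
```

To try it on a real dump, pass the path of the uncompressed XML file to
extract() instead of the in-memory sample. The definitions come out as raw
wikitext, so templates and link markup would still need to be stripped.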

I've also created a feature request on bugzilla: "Regularly publish updated
word lists and definition lists"
https://bugzilla.wikimedia.org/show_bug.cgi?id=21164

Andrew Dunbar (hippietrail)


> --
>  Lars Aronsson ([email protected])
>  Aronsson Datateknik - http://aronsson.se
>
> _______________________________________________
> Wiktionary-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wiktionary-l
>



-- 
http://wiktionarydev.leuksman.com http://linguaphile.sf.net
