Hey Marieke,

You can either use the Wikidata toolkit by Markus Krötzsch, if you want to
work on the dump, or the Wikidata web API, if you only need a few such
mappings at a time.
On Jul 17, 2014 9:24 AM, "Erp, M.G.J. van" <[email protected]> wrote:

> Hi there,
>
> I was wondering how to get the language mappings between different
> wikipedia pages. This information seems to be available on Wikidata as I
> can find it through browsing different pages on Wikidata such as
> http://www.wikidata.org/wiki/Q213710 and the
> https://www.mediawiki.org/wiki/Manual:Langlinks_table mentions a
> langlinks table, but I can't figure out how to get a dump.
>
> The "Wiki interlanguage link records" at
> http://dumps.wikimedia.org/wikidatawiki/20140705/ looked promising but
> that seems to contain user information if I'm not mistaken. For example, "
> select count(*), ll_title from langlinks group by 2 order by 1 desc limit
> 20;” results in:
>
> +----------+--------------------------------------+
> | count(*) | ll_title                             |
> +----------+--------------------------------------+
> |      284 | User:تفکر                            |
> |      272 | user:OffsBlink                       |
> |      215 | User:YourEyesOnly                    |
> |      179 | User:MoiraMoira                      |
> |       65 | User:AvocatoBot                      |
> |       35 | User:Shikai shaw                     |
> |       35 | user:Shuaib-bot                      |
> |       33 | user:לערי ריינהארט                   |
> |       33 | User:Leyo                            |
> |       27 | user:Лобачев Владимир                |
> |       20 | User:Wagino 20100516                 |
> |       18 | user:Gangleri                        |
> |       17 | user:I18n                            |
> |       16 | user:Meursault2004                   |
> |       12 | User:Labant                          |
> |       11 | User:Stryn                           |
> |       11 | User:angelia2041                     |
> |       10 | user:Kelvin                          |
> |       10 | User:JCIV                            |
> |        9 | Template:Mbox                        |
> +----------+———————————————————+
>
> I checked out  the #mediawiki IRC channel someone recommended the
> "Interwiki link tracking records" but those seem to also contain al sorts
> of other links, and I don't see a way to filter out the "in other
> languages" links. It would be great if you could help me out.
>
> Thanks!
>
> Marieke van Erp
>
>
>
> --
> Computational Lexicology & Terminology Lab (CLTL)
> The Network Institute, VU University Amsterdam
>
> De Boelelaan 1105
> 1081 HV  Amsterdam, The Netherlands
> http://www.mariekevanerp.com
> http://www.newsreader-project.eu
>
>
>
> _______________________________________________
> Wikidata-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>
_______________________________________________
Wikidata-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Reply via email to