There was work to try to formalize and collect this data, it's part of this
schema: https://meta.wikimedia.org/wiki/Schema:ExternalLinksChange  But
development is stalled on this code review:
https://gerrit.wikimedia.org/r/#/c/346915/

On Fri, Mar 9, 2018 at 10:55 AM, Jaime Crespo <[email protected]> wrote:

> Hi, Nick,
>
> I am not in analytics, but, assuming you mean the English language
> Wikipedia, the all the internal links and the external links are available
> for download in sql format at:
>
> https://dumps.wikimedia.org/enwiki/20180220/ - (search "pagelinks" and
> "externallinks"). The latest dump is runing now, it may be ready in a few
> days, too. They are quite large because they include the text and other
> properties, but with very little automation, it is easy to count the pairs
> in the direction you want. You may need the "all-titles" or
> "all-titles-on-ns0" files, too, to reference page ids.
>
> Maybe someone else can offer an easier option?
>
> On Fri, Mar 9, 2018 at 4:42 PM, Nick Bell <[email protected]> wrote:
>
>>  Dear Analytics Team,
>>
>> I’m doing a project on Wikipedia for my Maths degree, and I was hoping
>> you could help me acquire some data about Wikipedia.
>>
>> I would like to get the number of incoming internal links and outgoing
>> internal links for every page, if possible. I could limit this if needs be,
>> as I am aware this totals around 11 million values.
>>
>> I have minimal programming experience, so if this is unreasonable or
>> impossible please let me know. I very much appreciate your time considering
>> my request.
>>
>>
>>
>> Many thanks,
>>
>>
>> Nicholas Bell
>>
>> Mathematics Undergraduate
>>
>> University of Bristol
>>
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
>
> --
> Jaime Crespo
> <http://wikimedia.org>
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to