Hi all,

For archive happiness:

Clickstream dataset is now being generated on a monthly basis for 5
Wikipedia languages (English, Russian, German, Spanish, and Japanese). You
can access the data at https://dumps.wikimedia.org/other/clickstream/ and
read more about the release and those who contributed to it at
https://blog.wikimedia.org/2018/01/16/wikipedia-rabbit-hole-clickstream/

Best,
Leila



--
Leila Zia
Senior Research Scientist
Wikimedia Foundation

On Tue, Feb 17, 2015 at 11:00 AM, Dario Taraborelli <
[email protected]> wrote:

> We’re glad to announce the release of an aggregate clickstream dataset
> extracted from English Wikipedia
>
> http://dx.doi.org/10.6084/m9.figshare.1305770
>
> This dataset contains counts of *(referer, article) *pairs aggregated
> from the HTTP request logs of English Wikipedia. This snapshot captures 22
> million *(referer, article)* pairs from a total of 4 billion requests
> collected during the month of January 2015.
>
> This data can be used for various purposes:
> • determining the most frequent links people click on for a given article
> • determining the most common links people followed to an article
> • determining how much of the total traffic to an article clicked on a
> link in that article
> • generating a Markov chain over English Wikipedia
>
> We created a page on Meta for feedback and discussion about this release:
> https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream
>
> Ellery and Dario
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to