Maurice, the data per-article is available starting in October:
http://dumps.wikimedia.org/other/pageviews/2015/2015-10/

We have the data so we could backfill back to May 2015, but not beyond
that.  The backfill process would take quite a long time, however, because
there's a lot of data to crunch through.  So we haven't decided to kick off
that back-filling until people ask us for it.  So do ask for it if that's
useful.

If you need data going further back, we have the pagecounts-all-sites
dataset, which at least includes mobile data as well.  This is available
per-article starting in late 2014.

As you can see, this is confusing which is why I just started a thread on
this list about simplifying it.  Please chime in there if you have an
opinion about the cleanup.

On Thu, Dec 24, 2015 at 8:41 AM, Maurice Vergeer <[email protected]>
wrote:

> Dear Dan,
>
> thanks for this information. I looked at it the new data. Are these data
> also available per subject or only aggregated for the entire site? I am
> mainly interested in statistics about specific pages in Wikipedia.
> For now the traditional measurements are sufficient. I.e. I assume spider
> traffic is sort of random on a daily basis, and because I will do
> timeseries analysis, this will have little affect on the findings
> .
> On a similar topic, when a user is logged in to perform edits to a page
> and he refreshes the wikipedia-page, does this register as a visit? Or do
> only not loggedon visits register as a visit?
>
> To conclude, I think Wikipedia data are very nteresting for scientific
> study. I've seen some studies in information science, but in my field -
> Communicatoin science - very little. I hope to change that with my
> contribution :-)
>
> Again thanks Dan,
> Maurice
>
>
>
>
> On Thu, Dec 24, 2015 at 2:07 PM, Dan Andreescu <[email protected]>
> wrote:
>
>> Maurice, if you're looking for recent data, we have a better source:
>> dumps.wikimedia.org/other/pageviews/
>>
>> This is better than pagecounts-raw because it excludes spider (crawler)
>> traffic, some other automata traffic, and includes mobile traffic.  We have
>> been slow to announce this and change the pages because it's a confusing
>> change.
>>
>> On Thu, Dec 24, 2015 at 4:44 AM, Maurice Vergeer <[email protected]>
>> wrote:
>>
>>> Dear Federico Leva,
>>> thank you very much for the clarification and the quick reply.
>>>
>>> Because I want to relate the pagecounts to events taking place in the
>>> Netherlands (e.g televised political debates), knowing what the timezone is
>>> is important.
>>>
>>> Again thanks and best regards
>>> Maurice Vergeer
>>>
>>> On Thu, Dec 24, 2015 at 10:38 AM, Federico Leva (Nemo) <
>>> [email protected]> wrote:
>>>
>>>> Maurice Vergeer, 24/12/2015 10:16:
>>>>
>>>>> I am looking at your pagecounts as archived on
>>>>> https://dumps.wikimedia.org/other/pagecounts-raw/2015/2015-12/
>>>>> Can you tell me from what timezone the time stamps originate?
>>>>>
>>>>
>>>> Any and all timestamps in dumps.wikimedia.org are in UTC. Apparently
>>>> this is not as obvious as generally thought, so I've added a note:
>>>> https://wikitech.wikimedia.org/w/index.php?title=Analytics%2FData%2FPagecounts-raw&type=revision&diff=240313&oldid=184255
>>>>
>>>> Nemo
>>>>
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> [email protected]
>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>
>>>
>>>
>>>
>>> --
>>> ________________________________________________
>>> Maurice Vergeer
>>> To contact me, see http://mauricevergeer.nl/node/5
>>> To see my publications, see http://mauricevergeer.nl/node/1
>>> ________________________________________________
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>>
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
>
> --
> ________________________________________________
> Maurice Vergeer
> To contact me, see http://mauricevergeer.nl/node/5
> To see my publications, see http://mauricevergeer.nl/node/1
> ________________________________________________
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to