Yeah, here are a few more examples for English Wikipedia:
https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&start=2019-09-01&end=2019-11-01&pages=IPv4|Simple_Mail_Transfer_Protocol|Solar_System|Petrodollar_recycling

If you go into the clickstream dataset (
https://dumps.wikimedia.org/other/clickstream/readme.html) for the
appropriate month, you'll generally see that these spikes are listed as
'other-empty' for referrer, which is more evidence that this is not people
coming from search or other websites.

On Mon, Dec 23, 2019 at 4:33 AM RhinosF1 - <[email protected]> wrote:

> Hi,
>
> Not all bots will follow rules to identify as a bot in user agents so
> can't easily be caught.
>
> Thanks,
> RhinosF1
>
> On Mon, 23 Dec 2019 at 10:26, Keren WMIL <[email protected]> wrote:
>
>> Thanks all for your input. I was pretty sure this data point should be
>> disregarded, but I am still a bit puzzled - I thought pageviews by default
>> reflect user views and do not include agent=spider. What is the difference
>> between spider and bot?
>> Also I was wondering about the fact that the Hebrew caffeine page
>> received a factor of 10 more views compared to the Hungarian examples (450K
>> vs around 30K), and that as Lodewijk pointed out there was none of the
>> spillover which one sees in the Hungarian examples.
>> 🤔
>> Keren
>>
>>
>>
>>
>>
>> ‫בתאריך יום ב׳, 23 בדצמ׳ 2019 ב-1:58 מאת ‪Nuria Ruiz‬‏ <‪
>> [email protected]‬‏>:‬
>>
>>> Hello,
>>>
>>> This spike is probably caused by bot traffic. I would disregard it
>>> entirely. Please see, for example, a similar problem in all top pageviews
>>> in hungarian wikipedia for last month.
>>>
>>> https://phabricator.wikimedia.org/T237282
>>>
>>> Thanks,
>>>
>>> Nuria
>>>
>>> On Sun, Dec 22, 2019 at 2:42 PM Brian Keegan <[email protected]>
>>> wrote:
>>>
>>>> Webmasters sometimes design their 404 pages to link to Wikipedia
>>>> articles, so if their website goes down all their users (human and bot)
>>>> start getting referred to Wikipedia articles. I could easily image there
>>>> being a “This page isn’t available, go grab a cup of coffee” kind of
>>>> placeholder page being up.
>>>>
>>>>
>>>>
>>>> *From: *Analytics <[email protected]> on behalf of
>>>> Jan Ainali <[email protected]>
>>>> *Reply-To: *"A mailing list for the Analytics Team at WMF and
>>>> everybody who has an interest in Wikipedia and analytics." <
>>>> [email protected]>
>>>> *Date: *Sunday, December 22, 2019 at 3:01 PM
>>>> *To: *"A mailing list for the Analytics Team at WMF and everybody who
>>>> has an interest in Wikipedia and analytics." <
>>>> [email protected]>
>>>> *Subject: *Re: [Analytics] Pageviews anomaly‏
>>>>
>>>>
>>>>
>>>> Another observation is that it only spiked from desktop and not from
>>>> mobile which suggests it was not because of a general interest (which would
>>>> cause spikes on all platforms).
>>>>
>>>>
>>>> Best regards,
>>>>
>>>> Jan Ainali
>>>>
>>>> http://ainali.com
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> Den sön 22 dec. 2019 kl 22:01 skrev effe iets anders <
>>>> [email protected]>:
>>>>
>>>> I agree this is odd - especially the fact that both the day before and
>>>> the day after, the article had less than 100 visits
>>>> <https://tools.wmflabs.org/pageviews/?project=he.wikipedia.org&platform=all-access&agent=user&start=2019-09-01&end=2019-09-30&pages=%D7%A7%D7%A4%D7%90%D7%99%D7%9F>.
>>>> Usually there seems to be some spillover at the very least into the next
>>>> day.
>>>>
>>>>
>>>>
>>>> Lodewijk
>>>>
>>>>
>>>>
>>>> On Sun, Dec 22, 2019 at 5:17 AM Keren WMIL <[email protected]>
>>>> wrote:
>>>>
>>>> Dear all,
>>>>
>>>> It's almost Christmas and the new year is coming around. At the end of
>>>> each year we publish a list of the most viewed Hebrew Wikipedia articles in
>>>> the past year.
>>>>
>>>> We have a data point that appears to be anomalous: the article
>>>> caffeine
>>>> <https://tools.wmflabs.org/pageviews/?project=he.wikipedia.org&platform=all-access&agent=user&range=this-year&pages=%D7%A7%D7%A4%D7%90%D7%99%D7%9F>received
>>>> more than 450K views on one day: 26th of September 2019. We can't see any
>>>> reason for such a surge and it is completely disproportionate. Even on
>>>> English Wikipedia caffeine
>>>> <https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&range=this-year&pages=Caffeine>hasn't
>>>> received so many views on one day - not even on the 8th of February
>>>> when Friedlieb Ferdinand Runge who identified caffeine was features on the
>>>> daily Google Doodle.
>>>>
>>>> It seems this data point is erroneous. Is there any way to verify that,
>>>> or inquire where the error stems from?
>>>>
>>>>
>>>>
>>>> Kind regards and seasons greetings,
>>>>
>>>>
>>>>
>>>> Dr. Keren Shatzman
>>>>
>>>> Senior Coordinator, Academia & Projects
>>>> Wikimedia Israel
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> [email protected]
>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> [email protected]
>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> [email protected]
>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>


-- 
Isaac Johnson (he/him/his) -- Research Scientist -- Wikimedia Foundation
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to