Hi,

Not all bots will follow rules to identify as a bot in user agents so can't
easily be caught.

Thanks,
RhinosF1

On Mon, 23 Dec 2019 at 10:26, Keren WMIL <[email protected]> wrote:

> Thanks all for your input. I was pretty sure this data point should be
> disregarded, but I am still a bit puzzled - I thought pageviews by default
> reflect user views and do not include agent=spider. What is the difference
> between spider and bot?
> Also I was wondering about the fact that the Hebrew caffeine page received
> a factor of 10 more views compared to the Hungarian examples (450K vs
> around 30K), and that as Lodewijk pointed out there was none of the
> spillover which one sees in the Hungarian examples.
> 🤔
> Keren
>
>
>
>
>
> ‫בתאריך יום ב׳, 23 בדצמ׳ 2019 ב-1:58 מאת ‪Nuria Ruiz‬‏ <‪
> [email protected]‬‏>:‬
>
>> Hello,
>>
>> This spike is probably caused by bot traffic. I would disregard it
>> entirely. Please see, for example, a similar problem in all top pageviews
>> in hungarian wikipedia for last month.
>>
>> https://phabricator.wikimedia.org/T237282
>>
>> Thanks,
>>
>> Nuria
>>
>> On Sun, Dec 22, 2019 at 2:42 PM Brian Keegan <[email protected]>
>> wrote:
>>
>>> Webmasters sometimes design their 404 pages to link to Wikipedia
>>> articles, so if their website goes down all their users (human and bot)
>>> start getting referred to Wikipedia articles. I could easily image there
>>> being a “This page isn’t available, go grab a cup of coffee” kind of
>>> placeholder page being up.
>>>
>>>
>>>
>>> *From: *Analytics <[email protected]> on behalf of
>>> Jan Ainali <[email protected]>
>>> *Reply-To: *"A mailing list for the Analytics Team at WMF and everybody
>>> who has an interest in Wikipedia and analytics." <
>>> [email protected]>
>>> *Date: *Sunday, December 22, 2019 at 3:01 PM
>>> *To: *"A mailing list for the Analytics Team at WMF and everybody who
>>> has an interest in Wikipedia and analytics." <
>>> [email protected]>
>>> *Subject: *Re: [Analytics] Pageviews anomaly‏
>>>
>>>
>>>
>>> Another observation is that it only spiked from desktop and not from
>>> mobile which suggests it was not because of a general interest (which would
>>> cause spikes on all platforms).
>>>
>>>
>>> Best regards,
>>>
>>> Jan Ainali
>>>
>>> http://ainali.com
>>>
>>>
>>>
>>>
>>>
>>> Den sön 22 dec. 2019 kl 22:01 skrev effe iets anders <
>>> [email protected]>:
>>>
>>> I agree this is odd - especially the fact that both the day before and
>>> the day after, the article had less than 100 visits
>>> <https://tools.wmflabs.org/pageviews/?project=he.wikipedia.org&platform=all-access&agent=user&start=2019-09-01&end=2019-09-30&pages=%D7%A7%D7%A4%D7%90%D7%99%D7%9F>.
>>> Usually there seems to be some spillover at the very least into the next
>>> day.
>>>
>>>
>>>
>>> Lodewijk
>>>
>>>
>>>
>>> On Sun, Dec 22, 2019 at 5:17 AM Keren WMIL <[email protected]>
>>> wrote:
>>>
>>> Dear all,
>>>
>>> It's almost Christmas and the new year is coming around. At the end of
>>> each year we publish a list of the most viewed Hebrew Wikipedia articles in
>>> the past year.
>>>
>>> We have a data point that appears to be anomalous: the article caffeine
>>> <https://tools.wmflabs.org/pageviews/?project=he.wikipedia.org&platform=all-access&agent=user&range=this-year&pages=%D7%A7%D7%A4%D7%90%D7%99%D7%9F>received
>>> more than 450K views on one day: 26th of September 2019. We can't see any
>>> reason for such a surge and it is completely disproportionate. Even on
>>> English Wikipedia caffeine
>>> <https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&range=this-year&pages=Caffeine>hasn't
>>> received so many views on one day - not even on the 8th of February
>>> when Friedlieb Ferdinand Runge who identified caffeine was features on the
>>> daily Google Doodle.
>>>
>>> It seems this data point is erroneous. Is there any way to verify that,
>>> or inquire where the error stems from?
>>>
>>>
>>>
>>> Kind regards and seasons greetings,
>>>
>>>
>>>
>>> Dr. Keren Shatzman
>>>
>>> Senior Coordinator, Academia & Projects
>>> Wikimedia Israel
>>>
>>>
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to