Correction: The number for 404.php shot up on September 13:
https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/404.php/daily/20160901/20160930?purge756777637
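
For anyone who wants to check the date of the jump themselves, here is a minimal Python sketch. It only builds a per-article URL of the same shape as the one above and finds the largest day-over-day increase in an already-decoded JSON response. The function names are my own, and I am assuming the response is a dict with an "items" list whose entries carry "timestamp" (YYYYMMDDHH) and "views" fields, so please treat it as an illustration rather than a reference client.

```python
from urllib.parse import quote

# Base path copied from the per-article URL quoted in this thread.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def per_article_url(article, start, end,
                    project="en.wikipedia", access="desktop", agent="user"):
    """Build a daily per-article URL; start and end are YYYYMMDD strings."""
    # Percent-encode the title. Note that "-" and "." are left unchanged.
    title = quote(article, safe="")
    return f"{BASE}/{project}/{access}/{agent}/{title}/daily/{start}/{end}"


def views_by_day(payload):
    """Map YYYYMMDD -> views, assuming the response shape described above."""
    return {item["timestamp"][:8]: item["views"]
            for item in payload.get("items", [])}


def largest_jump(daily):
    """Return (day, increase) for the largest day-over-day increase, or None."""
    days = sorted(daily)
    best = None
    for prev, cur in zip(days, days[1:]):
        delta = daily[cur] - daily[prev]
        if best is None or delta > best[1]:
            best = (cur, delta)
    return best


if __name__ == "__main__":
    print(per_article_url("404.php", "20160901", "20160930"))
    # Made-up numbers, only to show the call pattern (not real API data).
    sample = {"items": [
        {"timestamp": "2016091200", "views": 100},
        {"timestamp": "2016091300", "views": 5000},
    ]}
    print(largest_jump(views_by_day(sample)))
```

Fetching the URL and decoding the JSON is left out on purpose so the sketch has no network dependency; any HTTP client should do for that step.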

On Thu, Nov 17, 2016 at 4:51 PM, Vipul Naik <vipulna...@gmail.com> wrote:

> Thanks for opening the ticket and for clarifying the issue more.
>
> On a related note, I wonder if you could also document the unusual
> number of pageviews for 404.php as returned by the API. That number
> also shot up in October 2016; see
> http://wikipediaviews.org/displayviewsformultiplemonths.php?page=404.php&allmonths=allmonths&drilldown=all
> for the historical trend.
>
> Vipul
>
> On Thu, Nov 17, 2016 at 1:26 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:
>
>> >Just to verify what you are saying, would it be right to say that the
>> >bug fix caused a lot of pageviews to be moved from the respective
>> >(nonexistent) pages to "-" pageviews?
>>
>> No, the bug fix causes those faulty requests to no longer be stored as
>> pageviews, so it cannot make that number increase. I am not sure we can
>> link the surge of "-" pageviews in October to any specific cause without
>> further research. I have filed a ticket to that effect; hopefully we can
>> get to it before we do away with the raw data:
>> https://phabricator.wikimedia.org/T150990
>>
>> >And, does that mean that the current estimate of "-" pageviews is more
>> >accurate than it used to be prior to the bug fix?
>>
>> No, it doesn't.
>>
>>
>>
>>
>> On Thu, Nov 17, 2016 at 1:17 PM, Vipul Naik <vipulna...@gmail.com> wrote:
>>
>>> Thank you for linking to that bug, Marcel. Just to verify what you are
>>> saying, would it be right to say that the bug fix caused a lot of
>>> pageviews to be moved from the respective (nonexistent) pages to "-"
>>> pageviews? And, does that mean that the current estimate of "-" pageviews
>>> is more accurate than it used to be prior to the bug fix?
>>>
>>> Vipul
>>>
>>> On Wed, Nov 16, 2016 at 4:33 AM, Marcel Ruiz Forns
>>> <mfo...@wikimedia.org> wrote:
>>>
>>>> Maybe the high value in October (45M) has something to do with the
>>>> latest changes in https://phabricator.wikimedia.org/T145922 ?
>>>>
>>>> On Mon, Nov 14, 2016 at 9:25 PM, Nuria Ruiz <nu...@wikimedia.org>
>>>> wrote:
>>>>
>>>>> This is documented now here:
>>>>>
>>>>> https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas
>>>>>
>>>>> On Tue, Nov 8, 2016 at 7:25 AM, Vipul Naik <vipulna...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Hi Joseph,
>>>>>>
>>>>>> Thanks for the clarification.
>>>>>>
>>>>>> Any ideas why this number is much higher for some months? In
>>>>>> particular, on desktop, it's high in the months of July to September
>>>>>> 2015 (around 10 million, compared to the usual 5 million) and then high
>>>>>> again in October 2016 (45 million, about 10x the usual value).
>>>>>>
>>>>>> Data is from
>>>>>> http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all
>>>>>> which summarizes results from the Wikimedia API (and stats.grok.se for
>>>>>> data before July 2015).
>>>>>>
>>>>>> Vipul
>>>>>>
>>>>>> On Tue, Nov 8, 2016 at 3:46 AM, Joseph Allemandou <
>>>>>> jalleman...@wikimedia.org> wrote:
>>>>>>
>>>>>>> Hello Issa,
>>>>>>>
>>>>>>> Thank you for your question.
>>>>>>> The very high number of views of the "-" page is explained by the
>>>>>>> dash being used as a special value for "no page title found" when
>>>>>>> extracting titles from URLs.
>>>>>>> We should definitely document this in the API; I am creating this task:
>>>>>>> https://phabricator.wikimedia.org/T150249
>>>>>>> Best
>>>>>>> Joseph
>>>>>>>
>>>>>>>
>>>>>>> On Tue, Nov 8, 2016 at 12:28 AM, Issa Rice <ricei...@gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Dear Analytics Mailing List,
>>>>>>>>
>>>>>>>> Recently while querying pageviews of various pages, I discovered
>>>>>>>> that the page whose title is a single hyphen character (i.e. with
>>>>>>>> the title "-", with URL <https://en.wikipedia.org/wiki/->, which
>>>>>>>> redirects to <https://en.wikipedia.org/wiki/Hyphen-minus>) receives
>>>>>>>> an unusually high number of pageviews under the Pageview API. Taking
>>>>>>>> October 2015 as an example, the page received 5.4 million pageviews
>>>>>>>> during that month according to the API:
>>>>>>>> <https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/-/daily/20151001/20151031>.
>>>>>>>>
>>>>>>>> However, according to stats.grok.se (which was still operational in
>>>>>>>> the same month), the page received only 1209 pageviews:
>>>>>>>> <http://stats.grok.se/en/201510/->.
>>>>>>>>
>>>>>>>> Looking at the tabulation of pageviews on Wikipedia Views, the
>>>>>>>> increase in pageviews for this page coincides with the change to the
>>>>>>>> Pageview API in July 2015:
>>>>>>>> <http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all>.
>>>>>>>>
>>>>>>>> As I understand it, page titles must be URL-encoded before the
>>>>>>>> query, but "-" URL-encodes to itself.
>>>>>>>>
>>>>>>>> I looked at the API documentation but did not see this behavior
>>>>>>>> listed, so I am wondering where these numbers are coming from.
>>>>>>>>
>>>>>>>> Best regards,
>>>>>>>> Issa
>>>>>>>>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> Analytics mailing list
>>>>>>>> Analytics@lists.wikimedia.org
>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> *Joseph Allemandou*
>>>>>>> Data Engineer @ Wikimedia Foundation
>>>>>>> IRC: joal
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> *Marcel Ruiz Forns*
>>>> Analytics Developer
>>>> Wikimedia Foundation
>>>>
>>>
>>
>