Correction: The number for 404.php shot up on September 13: https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/404.php/daily/20160901/20160930?purge756777637
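For anyone who wants to reproduce the daily series behind that link, here is a minimal sketch of how the per-article URL can be assembled and how the spike day can be picked out of the response. The function names (`build_url`, `fetch_items`, `total_views`, `peak_day`) and the User-Agent string are my own placeholders, not part of any Wikimedia client library. I am assuming the response is a JSON object with an `items` list whose entries carry `timestamp` and `views` fields. The cache-busting `?purge...` suffix from the link above is left out.

```python
import json
import urllib.request
from urllib.parse import quote

API_ROOT = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def build_url(project, access, agent, title, start, end):
    """Build a daily per-article URL; start and end are YYYYMMDD strings."""
    # safe="" percent-encodes "/" as well; "-" and "404.php" pass through unchanged.
    encoded_title = quote(title, safe="")
    return "/".join([API_ROOT, project, access, agent, encoded_title, "daily", start, end])


def fetch_items(url, user_agent="pageview-sketch/0.1 (placeholder contact)"):
    """Fetch the URL and return the assumed 'items' list (needs network access)."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request) as response:
        return json.load(response)["items"]


def total_views(items):
    """Sum the daily view counts."""
    return sum(item["views"] for item in items)


def peak_day(items):
    """Return (timestamp, views) for the day with the most views."""
    top = max(items, key=lambda item: item["views"])
    return top["timestamp"], top["views"]


if __name__ == "__main__":
    url = build_url("en.wikipedia", "desktop", "user", "404.php", "20160901", "20160930")
    print(url)
    # Uncomment to query the live API:
    # items = fetch_items(url)
    # print(total_views(items), peak_day(items))
```

Passing "-" as the title gives the same URL shape as the one in Issa's original message, since the hyphen is left as-is by the encoder.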
On Thu, Nov 17, 2016 at 4:51 PM, Vipul Naik <vipulna...@gmail.com> wrote:

> Thanks for opening the ticket and for clarifying the issue more.
>
> On a related note, I wonder if you could add the documentation for the
> unusual amount of pageviews to 404.php as returned by the API. That number
> also shot up in October 2016; see
> http://wikipediaviews.org/displayviewsformultiplemonths.php?page=404.php&allmonths=allmonths&drilldown=all
> for the historical trend.
>
> Vipul
>
> On Thu, Nov 17, 2016 at 1:26 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:
>
>> >Just to verify what you are saying, would it be right to say that the
>> >bug fix caused a lot of pageviews to be moved from the respective
>> >(nonexistent) pages to "-" pageviews?
>>
>> No, the bugfix makes those faulty requests no longer be stored as
>> pageviews, thus it cannot make that number increase. I am not sure we can
>> link the surge of "-" pageviews in October to any determined cause without
>> further research. I have filed a ticket to that effect; hopefully we can
>> get to it before we do away with raw data:
>> https://phabricator.wikimedia.org/T150990
>>
>> >And, does that mean that the current estimate of "-" pageviews is more
>> >accurate than it used to be prior to the bug fix?
>> No, it doesn't.
>>
>> On Thu, Nov 17, 2016 at 1:17 PM, Vipul Naik <vipulna...@gmail.com> wrote:
>>
>>> Thank you for linking to that bug, Marcel. Just to verify what you are
>>> saying, would it be right to say that the bug fix caused a lot of
>>> pageviews to be moved from the respective (nonexistent) pages to "-"
>>> pageviews? And, does that mean that the current estimate of "-" pageviews
>>> is more accurate than it used to be prior to the bug fix?
>>>
>>> Vipul
>>>
>>> On Wed, Nov 16, 2016 at 4:33 AM, Marcel Ruiz Forns <mfo...@wikimedia.org> wrote:
>>>
>>>> Maybe the high value in October (45M) has something to do with the last
>>>> changes in https://phabricator.wikimedia.org/T145922 ?
>>>>
>>>> On Mon, Nov 14, 2016 at 9:25 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:
>>>>
>>>>> This is documented now here:
>>>>>
>>>>> https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas
>>>>>
>>>>> On Tue, Nov 8, 2016 at 7:25 AM, Vipul Naik <vipulna...@gmail.com> wrote:
>>>>>
>>>>>> Hi Joseph,
>>>>>>
>>>>>> Thanks for the clarification.
>>>>>>
>>>>>> Any ideas why this number is much higher for some months? In
>>>>>> particular, on desktop, it's high in the months of July to September 2015
>>>>>> (around 10 million, compared to the usual 5 million) and then high again
>>>>>> in October 2016 (45 million, about 10x the usual value).
>>>>>>
>>>>>> Data is from
>>>>>> http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all
>>>>>> which summarizes results from the Wikimedia API (and stats.grok.se for
>>>>>> data before July 2015).
>>>>>>
>>>>>> Vipul
>>>>>>
>>>>>> On Tue, Nov 8, 2016 at 3:46 AM, Joseph Allemandou <jalleman...@wikimedia.org> wrote:
>>>>>>
>>>>>>> Hello Issa,
>>>>>>>
>>>>>>> Thank you for your question.
>>>>>>> The very high number of views of the "-" page is explained by this
>>>>>>> dash value being used as a special value for "no page title found" when
>>>>>>> extracting titles from urls.
>>>>>>> We definitely should document this in the API, creating this task:
>>>>>>> https://phabricator.wikimedia.org/T150249
>>>>>>> Best
>>>>>>> Joseph
>>>>>>>
>>>>>>> On Tue, Nov 8, 2016 at 12:28 AM, Issa Rice <ricei...@gmail.com> wrote:
>>>>>>>
>>>>>>>> Dear Analytics Mailing List,
>>>>>>>>
>>>>>>>> Recently while querying pageviews of various pages, I discovered that
>>>>>>>> the page whose title is a single hyphen character (i.e. with the title
>>>>>>>> "-", with URL <https://en.wikipedia.org/wiki/->, which redirects to
>>>>>>>> <https://en.wikipedia.org/wiki/Hyphen-minus>) receives an unusually high
>>>>>>>> number of pageviews under the Pageview API. Taking October 2015 as an
>>>>>>>> example, the page received 5.4 million pageviews during that month
>>>>>>>> according to the API:
>>>>>>>> <https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/-/daily/20151001/20151031>.
>>>>>>>>
>>>>>>>> However, according to stats.grok.se (which was still operational in the
>>>>>>>> same month), the page received only 1209 pageviews:
>>>>>>>> <http://stats.grok.se/en/201510/->.
>>>>>>>>
>>>>>>>> Looking at the tabulation of pageviews on Wikipedia Views, the increase
>>>>>>>> in pageviews for this page coincides with the change to the Pageview
>>>>>>>> API in July 2015:
>>>>>>>> <http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all>.
>>>>>>>>
>>>>>>>> As I understand, page titles must be URL-encoded before the query,
>>>>>>>> but the URL-encoding of "-" is itself.
>>>>>>>>
>>>>>>>> I looked at the API documentation but did not see this behavior listed,
>>>>>>>> so I am wondering where these numbers are coming from.
>>>>>>>>
>>>>>>>> Best regards,
>>>>>>>> Issa
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> Analytics mailing list
>>>>>>>> Analytics@lists.wikimedia.org
>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>>>>
>>>>>>> --
>>>>>>> *Joseph Allemandou*
>>>>>>> Data Engineer @ Wikimedia Foundation
>>>>>>> IRC: joal
>>>>
>>>> --
>>>> *Marcel Ruiz Forns*
>>>> Analytics Developer
>>>> Wikimedia Foundation
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics