Thanks for opening the ticket and for clarifying the issue further. On a related note, could you also add documentation for the unusually high number of pageviews that the API returns for 404.php? That number also shot up in October 2016; see http://wikipediaviews.org/displayviewsformultiplemonths.php?page=404.php&allmonths=allmonths&drilldown=all for the historical trend.
Vipul

On Thu, Nov 17, 2016 at 1:26 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:

> >Just to verify what you are saying, would it be right to say that the
> >bug fix caused a lot of pageviews to be moved from the respective
> >(nonexistent) pages to "-" pageviews?
>
> No, the bugfix makes those faulty requests no longer be stored as
> pageviews, thus it cannot make that number increase. I am not sure we can
> link the surge of "-" pageviews in October to any determined cause without
> further research. I have filed a ticket to that effect; hopefully we can
> get to it before we do away with raw data:
> https://phabricator.wikimedia.org/T150990
>
> >And, does that mean that the current estimate of "-" pageviews is more
> >accurate than it used to be prior to the bug fix?
>
> No, it doesn't.
>
> On Thu, Nov 17, 2016 at 1:17 PM, Vipul Naik <vipulna...@gmail.com> wrote:
>
>> Thank you for linking to that bug, Marcel. Just to verify what you are
>> saying, would it be right to say that the bug fix caused a lot of
>> pageviews to be moved from the respective (nonexistent) pages to "-"
>> pageviews? And, does that mean that the current estimate of "-" pageviews
>> is more accurate than it used to be prior to the bug fix?
>>
>> Vipul
>>
>> On Wed, Nov 16, 2016 at 4:33 AM, Marcel Ruiz Forns <mfo...@wikimedia.org>
>> wrote:
>>
>>> Maybe the high value in October (45M) has something to do with the last
>>> changes in https://phabricator.wikimedia.org/T145922 ?
>>>
>>> On Mon, Nov 14, 2016 at 9:25 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:
>>>
>>>> This is documented now here:
>>>>
>>>> https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas
>>>>
>>>> On Tue, Nov 8, 2016 at 7:25 AM, Vipul Naik <vipulna...@gmail.com>
>>>> wrote:
>>>>
>>>>> Hi Joseph,
>>>>>
>>>>> Thanks for the clarification.
>>>>>
>>>>> Any ideas why this number is much higher for some months? In
>>>>> particular, on desktop, it's high in the months of July to September
>>>>> 2015 (around 10 million, compared to the usual 5 million) and then
>>>>> high again in October 2016 (45 million, about 10x the usual value).
>>>>>
>>>>> Data is from
>>>>> http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all
>>>>> which summarizes results from the Wikimedia API (and stats.grok.se
>>>>> for data before July 2015).
>>>>>
>>>>> Vipul
>>>>>
>>>>> On Tue, Nov 8, 2016 at 3:46 AM, Joseph Allemandou <
>>>>> jalleman...@wikimedia.org> wrote:
>>>>>
>>>>>> Hello Issa,
>>>>>>
>>>>>> Thank you for your question.
>>>>>> The very high number of views of the "-" page is explained by this
>>>>>> dash value being used as a special value for "no page title found"
>>>>>> when extracting titles from URLs.
>>>>>> We definitely should document this in the API, creating this task:
>>>>>> https://phabricator.wikimedia.org/T150249
>>>>>> Best
>>>>>> Joseph
>>>>>>
>>>>>> On Tue, Nov 8, 2016 at 12:28 AM, Issa Rice <ricei...@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Dear Analytics Mailing List,
>>>>>>>
>>>>>>> Recently while querying pageviews of various pages, I discovered
>>>>>>> that the page whose title is a single hyphen character (i.e. with
>>>>>>> the title "-", with URL <https://en.wikipedia.org/wiki/->, which
>>>>>>> redirects to <https://en.wikipedia.org/wiki/Hyphen-minus>) receives
>>>>>>> an unusually high number of pageviews under the Pageview API.
>>>>>>> Taking October 2015 as an example, the page received 5.4 million
>>>>>>> pageviews during that month according to the API:
>>>>>>> <https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/-/daily/20151001/20151031>.
>>>>>>>
>>>>>>> However, according to stats.grok.se (which was still operational
>>>>>>> in the same month), the page received only 1209 pageviews:
>>>>>>> <http://stats.grok.se/en/201510/->.
>>>>>>>
>>>>>>> Looking at the tabulation of pageviews on Wikipedia Views, the
>>>>>>> increase in pageviews for this page coincides with the change to
>>>>>>> the Pageview API in July 2015:
>>>>>>> <http://wikipediaviews.org/displayviewsformultiplemonths.php?page=-&allmonths=allmonths&drilldown=all>.
>>>>>>>
>>>>>>> As I understand, page titles must be URL-encoded before the query,
>>>>>>> but the URL-encoding of "-" is itself.
>>>>>>>
>>>>>>> I looked at the API documentation but did not see this behavior
>>>>>>> listed, so I am wondering where these numbers are coming from.
>>>>>>>
>>>>>>> Best regards,
>>>>>>> Issa
>>>>>>
>>>>>> --
>>>>>> *Joseph Allemandou*
>>>>>> Data Engineer @ Wikimedia Foundation
>>>>>> IRC: joal
>>>
>>> --
>>> *Marcel Ruiz Forns*
>>> Analytics Developer
>>> Wikimedia Foundation

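For anyone wanting to check the URL-encoding point from Issa's original message, here is a minimal Python sketch. It assumes the per-article URL layout exactly as it appears in the URL quoted in the thread; the helper names `encode_title` and `per_article_url` are hypothetical and not part of any Wikimedia client. It only builds the URL and sends no request.

```python
from urllib.parse import quote

# Base path copied from the per-article URL quoted in the thread.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def encode_title(title: str) -> str:
    """Percent-encode a page title for use as a single URL path segment.

    safe="" means "/" is encoded too; unreserved characters such as "-"
    and "." are left unchanged by urllib.parse.quote.
    """
    return quote(title, safe="")


def per_article_url(title, start, end, project="en.wikipedia",
                    access="desktop", agent="user", granularity="daily"):
    """Build a per-article URL in the same shape as the one in the thread.

    The default parameter values mirror that URL; they are illustrative.
    """
    return "/".join([BASE, project, access, agent,
                     encode_title(title), granularity, start, end])


if __name__ == "__main__":
    # "-" encodes to itself, so the special "no page title found" value and
    # the real page titled "-" end up at the same URL.
    print(encode_title("-"))
    print(per_article_url("-", "20151001", "20151031"))
```

Because "-" is an unreserved character, encoding cannot distinguish the real page from the placeholder value; the distinction would have to be made in the data itself.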
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics