Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-12-06 Thread Nuria Ruiz
> Also, just out of curiosity and to better understand the issue, what would be an example of a real-life request URL that results in such a "no page title found" error when extracting the title?
Special page requests, for example. Normally pages like "Special:Blah" are "actions", not pages

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-12-05 Thread Tilman Bayer
On Mon, Nov 14, 2016 at 12:25 PM, Nuria Ruiz wrote:
> This is documented now here:
> https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas
Thanks for the documentation. Does this only affect data provided by the API, or also the page_title field in the

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Correction: The number for 404.php shot up on September 13:
https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/404.php/daily/20160901/20160930?purge756777637
On Thu, Nov 17, 2016 at 4:51 PM, Vipul Naik wrote:
> Thanks for opening the
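The per-article URL in the message above follows the Pageview API path layout project/access/agent/article/granularity/start/end. As a rough illustration only, the following Python sketch assembles such a URL; the helper name per_article_url is hypothetical and not part of any Wikimedia tooling, and the sketch only builds the string without sending a request.

```python
from urllib.parse import quote

# Base path taken from the URL quoted in the thread.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def per_article_url(project, access, agent, article, granularity, start, end):
    """Build a per-article pageviews URL (hypothetical helper, string only)."""
    # Percent-encode the title so characters such as "/" or "?" in a
    # title do not break the path.
    article_part = quote(article, safe="")
    return "/".join(
        [BASE, project, access, agent, article_part, granularity, start, end]
    )


if __name__ == "__main__":
    # The 404.php query from the thread, and the same query for the "-" title.
    print(per_article_url("en.wikipedia", "desktop", "user", "404.php",
                          "daily", "20160901", "20160930"))
    print(per_article_url("en.wikipedia", "desktop", "user", "-",
                          "daily", "20160901", "20160930"))
```

Swapping "404.php" for "-" gives the query for the single-hyphen title that this thread is about.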

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Thanks for opening the ticket and for clarifying the issue further. On a related note, I wonder if you could add documentation for the unusually high number of pageviews for 404.php as returned by the API. That number also shot up in October 2016; see

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Nuria Ruiz
> Just to verify what you are saying, would it be right to say that the bug fix caused a lot of pageviews to be moved from the respective (nonexistent) pages to "-" pageviews?
No, the bugfix causes those faulty requests to no longer be stored as pageviews, so it cannot make that number

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Thank you for linking to that bug, Marcel. Just to verify what you are saying, would it be right to say that the bug fix caused a lot of pageviews to be moved from the respective (nonexistent) pages to "-" pageviews? And does that mean that the current estimate of "-" pageviews is more

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-16 Thread Marcel Ruiz Forns
Maybe the high value in October (45M) has something to do with the latest changes in https://phabricator.wikimedia.org/T145922 ?
On Mon, Nov 14, 2016 at 9:25 PM, Nuria Ruiz wrote:
> This is documented now here:

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-14 Thread Nuria Ruiz
This is documented now here: https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas
On Tue, Nov 8, 2016 at 7:25 AM, Vipul Naik wrote:
> Hi Joseph,
> Thanks for the clarification.
> Any ideas why this number is much higher for some months? In particular,

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-08 Thread Vipul Naik
Hi Joseph, Thanks for the clarification. Any ideas why this number is much higher for some months? In particular, on desktop, it's high in the months of July to September 2015 (around 10 million, compared to the usual 5 million) and then high again in October 2016 (45 million, about 10x the

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-08 Thread Joseph Allemandou
Hello Issa, Thank you for your question. The very high number of views of the "-" page is explained by this dash value being used as a special value for "no page title found" when extracting titles from URLs. We should definitely document this in the API; I am creating this task:
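The sentinel behavior described in this message can be illustrated with a small Python sketch. This is not the actual Wikimedia extraction code: the function name extract_title, the /wiki/ path prefix check, and the title query parameter lookup are all assumptions made for illustration. The only detail taken from the thread is that "-" stands in for "no page title found".

```python
from urllib.parse import parse_qs, unquote, urlsplit

# Sentinel used when no title can be extracted (the value named in the thread).
NO_TITLE = "-"


def extract_title(url: str) -> str:
    """Return a page title from a request URL, or "-" if none is found.

    Hypothetical logic: try a /wiki/<title> path first, then a
    ?title=<title> query parameter, then fall back to the sentinel.
    """
    parts = urlsplit(url)
    prefix = "/wiki/"
    if parts.path.startswith(prefix):
        title = unquote(parts.path[len(prefix):])
        if title:
            return title
    # parse_qs drops blank values by default, so "?title=" yields no entry.
    titles = parse_qs(parts.query).get("title")
    if titles:
        return titles[0]
    return NO_TITLE


if __name__ == "__main__":
    for url in (
        "https://en.wikipedia.org/wiki/Main_Page",
        "https://en.wikipedia.org/w/index.php?title=Foo",
        "https://en.wikipedia.org/w/index.php?action=raw",
    ):
        print(url, "->", extract_title(url))
```

Under these assumptions, every request without a recoverable title is counted under the same "-" key, which is how a single pseudo-page can accumulate millions of views.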