Tbayer added a comment.
In https://phabricator.wikimedia.org/T135164#2290869, @Addshore wrote: > Okay, so the first queries we used while looking also caught pages such as Special:RecentChangesLinked/Q13215 which are tracked as views to different pages! "%" is the wildcard character for LIKE <https://docs.jboss.org/hibernate/orm/4.1/devguide/en-US/html/ch11.html#d5e3182> in HQL and SQL. So `uri_path LIKE "/wiki/Special:RecentChangesLinked%"` would be expected to catch these, yes. BTW, was there a particular reason why you were querying the webrequest <https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest> table and not the much smaller pageview_hourly <https://wikitech.wikimedia.org/wiki/Analytics/Data/Pageview_hourly> table? It gives you the same numbers with much less computational effort: SELECT agent_type, SUM(view_count) AS count FROM wmf.pageview_hourly WHERE year = 2016 AND month = 5 AND day = 10 AND project = "wikidata" AND page_title = "Special:RecentChangesLinked" GROUP BY agent_type ORDER BY count LIMIT 999999; ... agent_type count spider 72 user 91328 2 rows selected (108.284 seconds) TASK DETAIL https://phabricator.wikimedia.org/T135164 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Tbayer Cc: Tbayer, Lydia_Pintscher, JAllemandou, Aklapper, madhuvishy, Addshore, Zppix, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331, jeremyb _______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
