Tbayer added a comment.

  In https://phabricator.wikimedia.org/T135164#2290869, @Addshore wrote:
  
  > Okay, so the first queries we used while looking also caught pages such as 
Special:RecentChangesLinked/Q13215 which are tracked as views to different 
pages!
  
  
  "%" is the wildcard character for LIKE 
<https://docs.jboss.org/hibernate/orm/4.1/devguide/en-US/html/ch11.html#d5e3182>
 in HQL and SQL. So `uri_path LIKE "/wiki/Special:RecentChangesLinked%"` would 
be expected to catch these, yes.
  
  BTW, was there a particular reason why you were querying the webrequest 
<https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest> table and not 
the much smaller pageview_hourly 
<https://wikitech.wikimedia.org/wiki/Analytics/Data/Pageview_hourly> table? It 
gives you the same numbers with much less computational effort:
  
    SELECT
        agent_type,
        SUM(view_count) AS count
    FROM
      wmf.pageview_hourly
    WHERE
      year = 2016
      AND month = 5
      AND day = 10
      AND project = "wikidata"  
      AND page_title = "Special:RecentChangesLinked"
    GROUP BY agent_type
    ORDER BY count
    LIMIT 999999;
    ...
    agent_type  count
    spider      72
    user        91328
    2 rows selected (108.284 seconds)

TASK DETAIL
  https://phabricator.wikimedia.org/T135164

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Tbayer
Cc: Tbayer, Lydia_Pintscher, JAllemandou, Aklapper, madhuvishy, Addshore, 
Zppix, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331, jeremyb



_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to