GoranSMilovanovic added a comment.

  @Manuel
  
  > Let's discuss how we could use these tags in combination with other inf 
(e.g isBot and the total number of edits).
  
  I am currently working on this from SQL, because the august 2021 snapshot of 
`wmf.mediawiki_history` is not yet produced in our Data Lake.
  
  It might take some time to get to the results since a rather tricky join 
(`change_tag_def` --> `change_tag` --> `revision_actor_temp` --> `actor` --> 
`user`) needs to be performed in order to find out how many of the new tags 
were used by who in `wikidatawiki` since `20210801000000`.
  
  I will not be using the `user.user_editcount` since it is a rough 
approximation of the number of revisions made on behalf of a user; rather join 
`user` on `revision_actor_temp` --> `actor`.
  Also, I will need to filter out bots manually in this approach, and I have 
already created two fields in that respect (`botByName`, `botByGroup`), 
similarly to what is done in `wmf.mediawiki_history` in the Data Lake.

TASK DETAIL
  https://phabricator.wikimedia.org/T285459

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: Lydia_Pintscher, Manuel, Tobi_WMDE_SW, GoranSMilovanovic, Aklapper, 
Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to