GoranSMilovanovic added a comment.
@Manuel > Let's discuss how we could use these tags in combination with other inf (e.g isBot and the total number of edits). I am currently working on this from SQL, because the august 2021 snapshot of `wmf.mediawiki_history` is not yet produced in our Data Lake. It might take some time to get to the results since a rather tricky join (`change_tag_def` --> `change_tag` --> `revision_actor_temp` --> `actor` --> `user`) needs to be performed in order to find out how many of the new tags were used by who in `wikidatawiki` since `20210801000000`. I will not be using the `user.user_editcount` since it is a rough approximation of the number of revisions made on behalf of a user; rather join `user` on `revision_actor_temp` --> `actor`. Also, I will need to filter out bots manually in this approach, and I have already created two fields in that respect (`botByName`, `botByGroup`), similarly to what is done in `wmf.mediawiki_history` in the Data Lake. TASK DETAIL https://phabricator.wikimedia.org/T285459 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: Lydia_Pintscher, Manuel, Tobi_WMDE_SW, GoranSMilovanovic, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
