Htriedman added a comment.
@GoranSMilovanovic Just took a look. Those files are alright to share with people who have signed NDA with the Foundation. **They are not ok to share publicly**, since they contain exact counts of editors and edits, rather than aggregated buckets of counts (11-20 editors instead of 14, 100-200 edits instead of 151, etc.). Following along with the schema established <https://phabricator.wikimedia.org/T131280> when releasing geoeditors/public <https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Geoeditors/Public>, the data released publicly might look like this: | country code | month | editors | editors (ns0) | edits | edits (ns0) | | FR | 01-2021 | 51-60 | 41-50 | 401-500 | 401-500 | | ES | 06-2021 | 31-40 | 31-40 | 201-300 | 201-300 | | AR | 03-2021 | 1-10 | 1-10 | 5-100 | 5-100 | | Rows with edit counts under 5 should also not be included in the released tabular data, as they are easily reidentifiable. TASK DETAIL https://phabricator.wikimedia.org/T291186 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Htriedman Cc: Mohammed_Sadat_WMDE, Milimetric, Tobi_WMDE_SW, Ladsgroup, GoranSMilovanovic, Manuel, Aklapper, Invadibot, maantietaja, Akuckartz, 4748kitoko, Jcross, JFishback_WMF, Nandana, Akovalyov, Jony, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, JAllemandou, terrrydactyl, Wikidata-bugs, aude, Mbch331, jeremyb
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
