Htriedman added a comment.

  @GoranSMilovanovic
  
  Just took a look. Those files are alright to share with people who have 
signed an NDA with the Foundation. **They are not ok to share publicly**, since 
they contain exact counts of editors and edits, rather than aggregated buckets 
of counts (11-20 editors instead of 14, 101-200 edits instead of 151, etc.).
  
  Following the schema established in 
<https://phabricator.wikimedia.org/T131280> for releasing geoeditors/public 
<https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Geoeditors/Public>,
 the data released publicly might look like this:
  
  | country code | month   | editors | editors (ns0) | edits   | edits (ns0) |
  | ------------ | ------- | ------- | ------------- | ------- | ----------- |
  | FR           | 01-2021 | 51-60   | 41-50         | 401-500 | 401-500     |
  | ES           | 06-2021 | 31-40   | 31-40         | 201-300 | 201-300     |
  | AR           | 03-2021 | 1-10    | 1-10          | 5-100   | 5-100       |
  
  Rows with edit counts under 5 should also be excluded from the released 
tabular data, as they are easily re-identifiable.
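
  A rough sketch of how that bucketing and filtering could be applied, in 
case it helps. Everything here is an assumption for illustration: the field 
names (`editors`, `editors_ns0`, `edits`, `edits_ns0`), the helper names, the 
bucket widths (10 for editors, 100 for edits, inferred from the example table), 
and the choice to drop a row when either edit count is under 5. It is not the 
actual geoeditors/public pipeline.

```python
MIN_EDITS = 5        # rows below this edit count are not released
EDITOR_WIDTH = 10    # assumed bucket width for editor counts (1-10, 11-20, ...)
EDIT_WIDTH = 100     # assumed bucket width for edit counts (5-100, 101-200, ...)


def bucket(count, width, floor=1):
    """Return a label such as '11-20' for the bucket that contains count.

    The lowest bucket starts at `floor` instead of 1, which gives '5-100'
    for edit counts when floor=5.
    """
    if count < floor:
        raise ValueError(f"count {count} is below the minimum of {floor}")
    upper = -(-count // width) * width      # round up to a multiple of width
    lower = max(upper - width + 1, floor)
    return f"{lower}-{upper}"


def to_public_rows(rows):
    """Turn exact-count rows into bucketed rows that are safer to release."""
    public = []
    for row in rows:
        # Assumption: drop the row if either edit count is under the minimum.
        if row["edits"] < MIN_EDITS or row["edits_ns0"] < MIN_EDITS:
            continue
        public.append({
            "country_code": row["country_code"],
            "month": row["month"],
            "editors": bucket(row["editors"], EDITOR_WIDTH),
            "editors_ns0": bucket(row["editors_ns0"], EDITOR_WIDTH),
            "edits": bucket(row["edits"], EDIT_WIDTH, floor=MIN_EDITS),
            "edits_ns0": bucket(row["edits_ns0"], EDIT_WIDTH, floor=MIN_EDITS),
        })
    return public
```

  With this sketch, a row with 14 editors and 151 edits would come out as 
11-20 editors and 101-200 edits, and a row with 3 edits would be dropped 
entirely.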

TASK DETAIL
  https://phabricator.wikimedia.org/T291186

To: Htriedman
Cc: Mohammed_Sadat_WMDE, Milimetric, Tobi_WMDE_SW, Ladsgroup, 
GoranSMilovanovic, Manuel, Aklapper, Invadibot, maantietaja, Akuckartz, 
4748kitoko, Jcross, JFishback_WMF, Nandana, Akovalyov, Jony, Lahi, Gq86, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, JAllemandou, 
terrrydactyl, Wikidata-bugs, aude, Mbch331, jeremyb