[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-21 Thread Jan_Dittrich
Jan_Dittrich added a comment.


  I'll just ask you – I guess it won't fluctuate widely from month to month.
  
  Am Di., 21. Jan. 2020 um 12:06 Uhr schrieb GoranSMilovanovic <
  no-re...@phabricator.wikimedia.org>:
  
  > GoranSMilovanovic added a comment. View Task
  > https://phabricator.wikimedia.org/T242631
  > @Jan_Dittrich https://phabricator.wikimedia.org/p/Jan_Dittrich/ Great!
  > Would like to have the ETL procedure put on a crontab and run a regular
  > monthly update, or shall we say just ask me when you need the data again?
  > *TASK DETAIL*
  > https://phabricator.wikimedia.org/T242631
  > *EMAIL PREFERENCES*
  > https://phabricator.wikimedia.org/settings/panel/emailpreferences/
  > *To: *GoranSMilovanovic
  > *Cc: *WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi,
  > Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper,
  > Scott_WUaS, Wikidata-bugs, aude, Mbch331

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, Jan_Dittrich
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-21 Thread GoranSMilovanovic
GoranSMilovanovic added a comment.


  @Jan_Dittrich Great! Would like to have the ETL procedure put on a crontab 
and run a regular monthly update, or shall we say just ask me when you need the 
data again?

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-21 Thread Jan_Dittrich
Jan_Dittrich added a comment.


  Thanks! This is what I needed. In case I need precise numbers for 
area-under-curve or so, I’ll create another task (I "integrated" via cumulative 
sums in Excel by now, so I might be off a measurement or so, but it is still 
enough as estimates)

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, Jan_Dittrich
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-18 Thread GoranSMilovanovic
GoranSMilovanovic added a comment.


  @Jan_Dittrich @WMDE-leszek
  
  - results with anonymized user_ids shared with @Jan_Dittrich via e-mail (cc: 
@WMDE-leszek);
  - awaiting feedback;
  - no public results before we ask for a public data set review from the 
#analytics  if this is to go 
on crontab and produce regular updates.

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-14 Thread Jan_Dittrich
Jan_Dittrich added a comment.


  > a column where each particular editor is represented by one row while the 
(pseudonymous) user column refers to some ID value which thus anonymizes the 
real user ID/username? I guess (2)?
  
  Thanks for pointing this out! It just indicates that, while one user should 
be one row, I do not need the userID or user name, just some key that is unique 
is the table. 
  Also (but I think you guessed so already), it would be great if a bot/not bot 
column would be there OR if bots would be directly exculuded (if I get the data 
as CSV, I take the extra column)

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, Jan_Dittrich
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T242631: Get user/editcount data to determine count at percentiles

2020-01-14 Thread GoranSMilovanovic
GoranSMilovanovic added a comment.


  @Jan_Dittrich
  
  The only thing that I do not understand here is the following planned column:
  
  > (pseudonymous) users
  
  Do you need (1) a split between anonymous vs. non-anonymous editors in this 
column, or (2) a column where each particular editor is represented by one row 
while the `(pseudonymous) user` column refers to some ID value which thus 
anonymizes the real user ID/username? I guess (2)?
  
  > I want to know how "typical" a certain edit count among editors who have 
been active in some timeframe in the last months.
  
  This is doable as a monthly update from the `wmf.mediawiki_history` table and 
could be delivered as a Notebook Report + the data set.
  I can provide visualizations and statistical summaries of the data set to 
help you address the "typicality" of particular edit count classes.

TASK DETAIL
  https://phabricator.wikimedia.org/T242631

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: WMDE-leszek, Aklapper, Jan_Dittrich, darthmon_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs