Hi Jerôme, most of the actions you refer to are not stored as edits by mediawiki. They can be accessed via the logging table [1] (with log_type 'delete' or 'block'), which is replicated on tool labs (you can apply for a tool labs account if you don't have one [2]).
HTH Dario [1] https://www.mediawiki.org/wiki/Manual:Logging_table [2] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help On Oct 10, 2013, at 10:02 AM, "Klein,Max" <[email protected]> wrote: > Hello Jerome, > > I'm not sure this is the best way, but pywikipediabot [1] has a library > called pagegenerators.py and there is a function def > UserContributionsGenerator(username) (around line 706). That would allow you > to iterate through theses user names, and I bet there will be a special > marking for deletions/undeletions. If not, worst comes to worse you can use a > regular expression for those words. > > [1] https://meta.wikimedia.org/wiki/pywikipediabot > > When you use have a pywikibot-hammer everything looks like a pywikibot-nail! > > > Maximilian Klein > Wikipedian in Residence, OCLC > +17074787023 > > From: [email protected] > <[email protected]> on behalf of Jérôme Hergueux > <[email protected]> > Sent: Thursday, October 10, 2013 3:11 AM > To: [email protected] > Subject: [Wiki-research-l] How to collect all the admin-specific edits for a > subset of Wp admins > > Dear all, > > I am starting this thread in the hope that some of the great Wiki researchers > on this list could advise me on a data collection problem. > > Here is the question: for a each of 120 Wikipedia admins (for whom I have the > usernames and unique numeric ids), I would like to reliably count the number > of times they (i) deleted a page (ii) undeleted (i.e. restored) a page (iii) > protected a page (iv) blocked a user and (v) unblocked a user. > Those types of edits all correspond to a specific "action" in the Wikipedia > API documentation page (http://en.wikipedia.org/w/api.php): > action=delete,action=undelete, action=protect, action=block and > action=unblock. > I don't know, however, what would be the best strategy to go about collecting > those edits. Does anyone have an idea about which data collection strategy I > should adopt in this case? Is there a way to query the Wikipedia API > directly, or should I look for some specific markers in the edit summaries? > > I would be very grateful for any advice of feedback! > Thanks much for your attention and time. :) > > Best, > > Jérôme. > _______________________________________________ > Wiki-research-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
signature.asc
Description: Message signed with OpenPGP using GPGMail
_______________________________________________ Wiki-research-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
