In my short experience studying this, there are some articles that might be deleted very quickly, not even waiting 30 minutes.
If I look here at monthly dumps, http://dumps.wikimedia.org/enwiki/ then we would be missing many articles that were created and deleted between the two dumps. We could look in total what articles were deleted Historical data, I guess they could be extracted from older dumps, to extract the list of articles tagged with categories based on a series of dumps. right now I am pulling them every thirty minutes, it would be possible to scan historical dumps and find any articles that are no longer in the newer dumps. The deletion logs here http://en.wikipedia.org/w/index.php?title=Special:Log/delete we could scan those, but as i said, how to get the text? the CPU usage for something like that would go way over my current processing. we could as I said install in on the toolserver, I would have to work on the code for a bit first. so, this comes down to the question, do we have a full log of the deleted articles ? thanks, mike On Mon, Jun 11, 2012 at 5:49 AM, Samuel Klein <meta...@gmail.com> wrote: > This is great. Thank you, Mike! It would be nice to see this done > for historically speedied articles, too. Sam. > > On Sun, Jun 10, 2012 at 3:37 AM, Mike Dupont > <jamesmikedup...@googlemail.com> wrote: > > Hi, > > I have launched speedydeletion.wika.com , it is updated every 30 minutes > > with the proposed deletions and speedy deletion articles (not notable and > > hoaxes, not others). > > it is running on the en.wikipedia.org. the sources for the script are > all > > on git hub and are a merger of pywikipediabot and the wikiteam codebases. > > hope you enjoy it, > > thanks, > > mike > > -- > > James Michael DuPont > > Member of Free Libre Open Source Software Kosova http://flossk.org > > Contributor FOSM, the CC-BY-SA map of the world http://fosm.org > > Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3 > > _______________________________________________ > > Wikimedia-l mailing list > > Wikimedia-l@lists.wikimedia.org > > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l > > > > -- > Samuel Klein identi.ca:sj w:user:sj +1 617 > 529 4266 > > _______________________________________________ > Wikimedia-l mailing list > Wikimedia-l@lists.wikimedia.org > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l > -- James Michael DuPont Member of Free Libre Open Source Software Kosova http://flossk.org Contributor FOSM, the CC-BY-SA map of the world http://fosm.org Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3 _______________________________________________ Wikimedia-l mailing list Wikimedia-l@lists.wikimedia.org Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l