In my short experience studying this, some articles are deleted very
quickly, sometimes in less than 30 minutes.

If I look at the monthly dumps at http://dumps.wikimedia.org/enwiki/ then
we would be missing many articles that were created and deleted between
two dumps. We could look at which articles were deleted in total.

Historical data could, I guess, be extracted from older dumps, by pulling
the list of articles tagged with the relevant categories from a series of
dumps.
Right now I am pulling them every thirty minutes. It would also be possible
to scan historical dumps and find any articles that are no longer in the
newer dumps.
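
A rough sketch of that dump comparison, assuming we already have a list of
page titles for each dump as a gzipped text file with one title per line.
The file names and function names here are made up for illustration and are
not taken from my script. Note that a title missing from the newer dump may
have been moved rather than deleted, so the result is only a candidate list.

```python
import gzip


def load_titles(path):
    """Read one page title per line from a gzipped text file (assumed layout)."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle if line.strip()]


def titles_missing_from_newer(older_titles, newer_titles):
    """Return titles present in the older dump but absent from the newer one."""
    newer = set(newer_titles)
    # A set difference finds the candidates; sorting keeps the output stable.
    return sorted(set(older_titles) - newer)


if __name__ == "__main__":
    # Hypothetical file names, one title list per dump.
    old = load_titles("titles-older-dump.gz")
    new = load_titles("titles-newer-dump.gz")
    for title in titles_missing_from_newer(old, new):
        print(title)
```

This only tells us which titles disappeared between two dumps. Anything
created and deleted between the same two dumps would still be missed.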

The deletion logs are here:
http://en.wikipedia.org/w/index.php?title=Special:Log/delete
We could scan those, but as I said, how do we get the text?
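
For scanning the log itself, here is a minimal sketch that uses the
MediaWiki API (`list=logevents` with `letype=delete`) instead of scraping
the Special:Log page. The function names and the choice of fields are
assumptions for illustration. It only returns log metadata (title and
timestamp), so it does not answer the question of where to get the text.

```python
from urllib.parse import urlencode

API_URL = "https://en.wikipedia.org/w/api.php"


def deletion_log_url(limit=50, continue_token=None):
    """Build an API URL that lists recent entries of the deletion log."""
    params = {
        "action": "query",
        "list": "logevents",
        "letype": "delete",
        "lelimit": limit,
        "leprop": "title|type|timestamp|comment",
        "format": "json",
    }
    if continue_token:
        # Value taken from the "continue" section of the previous response.
        params["lecontinue"] = continue_token
    return API_URL + "?" + urlencode(params)


def deleted_titles(response):
    """Extract (title, timestamp) pairs for plain deletions from a parsed reply."""
    events = response.get("query", {}).get("logevents", [])
    # The delete log also records restores, so keep only action == "delete".
    return [
        (event["title"], event["timestamp"])
        for event in events
        if event.get("action") == "delete" and "title" in event
    ]
```

The URL from `deletion_log_url()` can be fetched with any HTTP client, and
the decoded JSON passed to `deleted_titles()`.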

The CPU usage for something like that would go way over my current
processing capacity. We could, as I said, install it on the toolserver, but
I would have to work on the code for a bit first.
So this comes down to the question: do we have a full log of the deleted
articles?

thanks,
mike


On Mon, Jun 11, 2012 at 5:49 AM, Samuel Klein <meta...@gmail.com> wrote:

> This is great.  Thank you, Mike!  It would be nice to see this done
> for historically speedied articles, too.  Sam.
>
> On Sun, Jun 10, 2012 at 3:37 AM, Mike  Dupont
> <jamesmikedup...@googlemail.com> wrote:
> > Hi,
> > I have launched speedydeletion.wika.com , it is updated every 30 minutes
> > with the proposed deletions and speedy deletion articles (not notable and
> > hoaxes, not others).
> > it is running on the en.wikipedia.org. the sources for the script are all
> > on git hub and are a merger of pywikipediabot and the wikiteam codebases.
> > hope you enjoy it,
> > thanks,
> > mike
> > --
> > James Michael DuPont
> > Member of Free Libre Open Source Software Kosova http://flossk.org
> > Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
> > Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3
> > _______________________________________________
> > Wikimedia-l mailing list
> > Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
>
>
>
> --
> Samuel Klein          identi.ca:sj           w:user:sj          +1 617 529 4266
>



-- 
James Michael DuPont
Member of Free Libre Open Source Software Kosova http://flossk.org
Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3
