Hi Max -- let me ping the API folks. I don't think we researchers can make
the final call on this.

-Toby

On Fri, Dec 12, 2014 at 2:53 PM, Maximilian Klein <isa...@gmail.com> wrote:
>
> Hello Researchers,
>
> I've been playing with Recent Changes Stream Interface
> <https://wikitech.wikimedia.org/wiki/RCStream> recently, and have started
> trying to use the API's "*action=compare*" to look at every diff of every
> wiki in real time. The goal is to produce real-time analytics on the
> content that's being added or deleted. The only problem is that is will
> really hammer the API with lots of reads since it doesn't have a batch
> interface. Can I spawn multiple network threads and do 10+ reads per second
> forever without the API complaining? Can I warn someone about this and get
> a special exemption for research purposes?
>
> The other thing to do would be to use "*action=query*" to get the
> revisions in batches and do the diffing myself, but then i'm not guaranteed
> to be diffing in the same way that the site is.
>
> What techniques would you recommend?
>
>
> Make a great day,
> Max Klein ‽ http://notconfusing.com/
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to