The sleep time as implemented in pywikipedia is influenced by three factors:

 - the minimum time between requests, minthrottle=1 (by default) in
   user-config.py
 - the minimum time between page saves, put_throttle=10
 - the database lag on the DB servers: pages will not be saved unless the
   lag is less than maxlag=5

(all values in seconds).
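
As a rough illustration of how these three settings interact (this is a
simplified sketch, not pywikipedia's actual throttle code; the function
names seconds_to_wait and may_save are made up for the example):

```python
def seconds_to_wait(now, last_request, last_save, is_save,
                    minthrottle=1, put_throttle=10):
    """Hypothetical helper: how long to sleep before the next request.

    All arguments are in seconds; `now`, `last_request` and `last_save`
    are timestamps on the same clock.
    """
    # Every request has to respect the minimum gap between requests.
    wait = minthrottle - (now - last_request)
    if is_save:
        # Page saves additionally have to respect the gap between saves.
        wait = max(wait, put_throttle - (now - last_save))
    # Never return a negative sleep time.
    return max(0.0, wait)


def may_save(lag, maxlag=5):
    """Hypothetical helper: saving is only allowed while lag < maxlag."""
    return lag < maxlag
```

With the defaults, a save attempted 3 seconds after the previous save would
have to wait another 7 seconds, while a plain read request only waits out
whatever is left of minthrottle.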

In the case of Wikidata, the maximum number of saves/edits per minute is 60
(iirc), so you could reduce put_throttle to 1 by adding

put_throttle=1

to your user-config.py.
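
To get a feel for what this means for your run, here is a back-of-the-envelope
estimate using the numbers from your message (40,000 items, about 30 claims +
sources each). It assumes every claim or source is a separate save and ignores
network and server time, so treat it as a lower bound; estimated_days is just
a name chosen for this example:

```python
SECONDS_PER_DAY = 24 * 60 * 60


def estimated_days(items, saves_per_item, put_throttle):
    """Lower bound on the run time in days, counting only throttle sleeps."""
    total_seconds = items * saves_per_item * put_throttle
    return total_seconds / SECONDS_PER_DAY


# Default put_throttle of 10 seconds.
print(round(estimated_days(40000, 30, 10), 1))  # 138.9
# put_throttle reduced to 1 second.
print(round(estimated_days(40000, 30, 1), 1))   # 13.9
```

The same arithmetic matches what you are seeing now: 30 saves at roughly 10
seconds each is about 5 minutes per item.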

Best,
Merlijn


On 20 August 2013 20:16, Chinmay Naik <[email protected]> wrote:

> Hi,
> I am Chinmay Naik, a GSoC intern and operator of ProteinBoxBot (runs on
> pywikipedia - formerly rewrite branch). When I run the bot, there is a
> sleep time of around 10 s (7 s on average) between two successive writes.
> The bot will handle around 40,000 Wikidata items corresponding to Gene
> Wiki articles from http://en.wikipedia.org/wiki/Category:Human_proteins.
> Also, I will be uploading around 30 (claims + sources) for each
> Wikidata item.
>
> If you take a look at the recent edits (
> https://www.wikidata.org/w/index.php?title=Special:Contributions/ProteinBoxBot&offset=&limit=250&target=ProteinBoxBot),
> it takes around 5 minutes to completely write all claims + sources to each
> Wikidata item. This is a very large time lag and will hugely affect
> deployment. Is there any way to reduce this sleep time? Any pointers on
> this would be helpful.
>
> Thanks,
> Chinmay
>
> _______________________________________________
> Pywikipedia-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
>
>
