Please, update hourly, to avoid duplicated downloads. 2009/7/3 Frederic Schutz <sch...@mathgen.ch>
> On Fri, Jul 3, 2009 at 1:02 PM, emijrp<emi...@gmail.com> wrote: > > > To update a template similar to {{Popular articles}} of English > Wikipedia. > > Now, I'm downloading one .gz (40 MB) each hour, so, it wouldn't be > neccesary > > if this directory is updated in "real time". > > If nothing has changed since last time I checked, it is one of my cron > jobs that does the update, and I am happy to run it every hour if > needed (and if that is not a problem). > > By the way, the directory has grown quite a bit and is getting > difficult to use (even an "ls" takes ages to run), so I should > probably change the layout a bit (e.g. having subdirectories for > archives). At some point, we may have to delete the older files, or > compress them (that's what Erik Zachte does for the "official" > statistics), but I think there is enough space for now (let me know if > any of you, especially ts-admins, think otherwise). > > One short-term plan is, instead of simply downloading the files, to > replicate part of the infrastucture set up by Erik (provide compressed > and/or processed files) so that it is easier to use the data on the > toolserver. Well, it was a short-term plan in January and then I was > kept away from this work by other comitments... > > Frédéric > > _______________________________________________ > Toolserver-l mailing list > Toolserver-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/toolserver-l >
_______________________________________________ Toolserver-l mailing list Toolserver-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/toolserver-l