To me the implementation depends on what alkamid actually wants to do.
 For keeping some of SuggestBot's data sources up-to-date I use the
site object's recentchanges() generator to grab data (and although one
can only get a limited amount at each step, I've never had troubles
exhausting the generator), where it's easy to check the edit timestamp
to stop iterating when necessary.  I then store page titles in a
set(), which can be fed to a PagesFromTitlesGenerator, and I chain
said generator with a PreloadingGenerator to get the latest revisions.

In my experience only a minority of a Wikipedia edition's articles are
updated on a weekly basis, so using allpages() results in a lot of
unnecessary data.


Cheers,
Morten

On 5 February 2012 17:28, Dr. Trigon <[email protected]> wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
>> past week? I thought of using the AllPagesPageGenerator and
>> executing editTime() on each page, but this method gives me only
>> zeros if the page was not read before (e.g. I have to call
>> page.get() first in order for editTime() to work properly). Is
>> there any edit-time-related piece of information I can get from a
>> generated list of pages? Or maybe there is another page generator
>> suitable for me?
>
> Everything using 'getall' from 'wikipedia.py' (imported as 'pywikibot')
> does give you the first history entry WITHOUT having to trigger
> page.get(). E.g. the 'PreloadingGenerator' and as you can chain the
> generators you can first setup your generator as 'gen1' and then pass
> 'gen1' to a 'PreloadingGenerator' (may be in a 'ThreadedGenerator'...)
> in order to get the first history entry of every page... In
> 'sum_disc.py' of the DrTrigonBot repo is an example for this.
>
> Greetings
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.12 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
>
> iEYEARECAAYFAk8vEKcACgkQAXWvBxzBrDAMTwCfe7kKUHrtgsE+EguKAuiWoODb
> zr4An2M5d6G0XZJGMntDLS54DL6XGdug
> =37Hk
> -----END PGP SIGNATURE-----
>
> _______________________________________________
> Pywikipedia-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l

_______________________________________________
Pywikipedia-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l

Reply via email to