-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Magnus Manske wrote:
> On Fri, Oct 24, 2008 at 5:59 PM, Brion Vibber <[EMAIL PROTECTED]> wrote:
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>>
>> Johannes Beigel wrote:
>>> Is there a way (or a plan to implement one) to retrieve the list of
>>> unique contributors for a given article (from a given revision down to
>>> the first one)? Ideally this would accept parameters for the mentioned
>>> filtering. I guess inside of MediaWiki code this can be handled very
>>> efficiently (using appropriate database queries) and would eliminate
>>> the need to transfer lots of redundant data over the socket.
>> Given that this could require filtering through hundreds of thousands of
>> unique revisions for a single request, I don't think we currently have a
>> good plan for that. :)
> 
> I just ran a DISTINCT mysql query for all non-IP editors of
> [[en:George W. Bush]] on the toolserver, and that took 3 seconds.
> There are 41790 revisions.

Indeed, it's not as bad as I was afraid. I'm still a little leery that
the EXPLAIN lists "Using temporary" though. :P

> Considering that this would be a worst case article, and that it ran
> on the overtaxed toolserver, it does seem possible. Maybe if we'd have
> one MySQL slave / Apache dedicated for this task?

Probably fine to pull from the same slaves already dedicated for
contributions queries (relevant indexes are already pulled into memory).

Figuring out how to get something other than a raw list of thousands of
editors for a "nice" author list remains a harder task. :)

- -- brion
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkkInEIACgkQwRnhpk1wk45YFQCgqGtWOps8dAU/qbjQJA290qDJ
6pMAnRBgO0erMT2fmB2GxHnWXj7t/bdi
=QiWY
-----END PGP SIGNATURE-----

_______________________________________________
Mediawiki-api mailing list
Mediawiki-api@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api

Reply via email to