https://bugzilla.wikimedia.org/show_bug.cgi?id=25602

--- Comment #9 from Ariel T. Glenn <[email protected]> ---
(In reply to comment #8)
> This bug covers far more than two fields.

For the user table we are talking about two fields.

> If you'd like a full list of missing
> tables and fields from the public dumps, I can put one together.

If folks want other partial tables, they should request them in separate bugs. 
The discussion around privacy and/or necessity for each will likely be
different.

> The English Wikipedia has over 19,000,000 registered users. While the
> MediaWiki Web API can be used to retrieve some of this information, are we
> really suggesting that polling the API 3,800 times (this assumes batches of
> 5,000) is the best way to dump the user table? That seems kind of insane.

With a script doing the work, as long as it respects maxlag, who cares?  Just
fire it up, go do other things, and come back at some point to see when it's
done.
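
For what it's worth, here is a rough sketch of what such a script might look
like. It is only an illustration, not a tested production tool: it assumes the
list=allusers query module with the newer "continue" style of continuation,
and the endpoint URL, the User-Agent string, the auprop fields, the fixed
5-second retry and all function names are my own placeholder choices. Batches
of 5,000 also assume an account with the higher API limits.

```python
import json
import time
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"  # assumed endpoint
RETRY_SECONDS = 5  # assumed fixed wait after a maxlag error


def batches_needed(total_users, batch_size):
    """Number of requests needed to page through total_users (ceiling division)."""
    return -(-total_users // batch_size)


def http_get_json(params):
    """Perform one GET request against the API and return the parsed JSON."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # Hypothetical User-Agent; a real script should identify its operator.
    req = urllib.request.Request(url, headers={"User-Agent": "user-list-sketch/0.1"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def iter_all_users(fetch=http_get_json, batch_size=5000, maxlag=5, sleep=time.sleep):
    """Yield user records batch by batch, backing off whenever maxlag is hit."""
    params = {
        "action": "query",
        "list": "allusers",
        "aulimit": batch_size,
        "auprop": "editcount|registration",  # illustrative choice of fields
        "maxlag": maxlag,
        "format": "json",
    }
    cont = {}
    while True:
        data = fetch({**params, **cont})
        error = data.get("error")
        if error:
            if error.get("code") == "maxlag":
                # Servers are lagged: wait, then retry the same request.
                sleep(RETRY_SECONDS)
                continue
            raise RuntimeError(error)
        yield from data["query"]["allusers"]
        cont = data.get("continue")
        if not cont:
            return  # no continuation means this was the last batch
```

Looping over iter_all_users() and writing each record to a file is all the
"fire it up and come back later" part amounts to, and
batches_needed(19000000, 5000) gives the 3,800 requests mentioned above.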

If you are concerned about multiple users doing this same set of requests
instead of a single producer providing the list for download, I can see that as
a legitimate complaint.  But that too would be a different bug: it would be
nice if users had a space where they could put data sets that they generate,
for re-use by others.  I think that could be managed by interested users
off-site though, without WMF intervention.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
