On 8 January 2015 at 02:31, Gergo Tisza <[email protected]> wrote:
> On Wed, Jan 7, 2015 at 6:26 PM, Oliver Keyes <[email protected]> wrote:
>>
>> places to get edits? Well....the revision table? I'm sort of confused
>> as to what you're looking for, I guess, that the db wouldn't have.
>
>
> There are a thousand or so wikis; it would be nice if there was a single
> table with all the edits. I guess I can generate a query with a thousand
> unions...

We've talked about having a unified db; the problem is some tables
exist on some wikis and not on others (and keeping it synced up would
be a pain).
The most we have is a machine with all of the dbs on it, and a table
containing all the dbnames; it should be fairly trivial to write
something in Python
or similar to iterate through them.

>
> The harder problem is that it would be nice to group by editor activity
> levels. One of the concerns about MediaViewer was that it makes harder for
> new editors to understand file pages and start editing them; so it would be
> a plausible hypothesis that the number of file edits by new editors would
> drop sharply after making MV default, but the total file edit count wouldn't
> be visibly affected because it would be dominated by power users who already
> know how to curate image metadata.
>
> So I would like to look at something like the number of first edits per
> month, or the number of edits by editors who at the time had less than 10
> edits... recovering that kind of data from the revision table seems
> extremely difficult.

Yeah, that is difficult. Aaron has, I believe, precomputed some things; Aaron?

>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>



-- 
Oliver Keyes
Research Analyst
Wikimedia Foundation

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to