This is great. I'd like to go on record saying that this is leaning towards a data warehouse kind of approach - basically pre-aggregating useful datasets. So we might want to do this in a more organized way down the line.
On Thu, Jun 12, 2014 at 2:57 PM, Oliver Keyes <[email protected]> wrote: > This is fricking awesome! > > > On 12 June 2014 10:58, Aaron Halfaker <[email protected]> wrote: > >> I created a new table on analytics-store.eqiad.wmnet. It contains the >> monthly edit counts for all wikis. See a brief overview below. >> >> Note that the "revisions" column contains a count of all revisions -- >> archived or not. The "archived" column contains a count of archived >> revisions. So revisions - archived == non-archived revisions. >> >> analytics-store.eqiad.wmnet [staging]> explain editor_month; >> +-------------------+----------------+------+-----+---------+-------+ >> | Field | Type | Null | Key | Default | Extra | >> +-------------------+----------------+------+-----+---------+-------+ >> | wiki | varbinary(50) | NO | PRI | | | >> | month | varbinary(7) | NO | PRI | | | >> | user_id | int(11) | NO | PRI | 0 | | >> | user_name | varbinary(191) | YES | | NULL | | >> | user_registration | varbinary(14) | YES | | NULL | | >> | archived | int(11) | YES | | NULL | | >> | revisions | int(11) | YES | | NULL | | >> +-------------------+----------------+------+-----+---------+-------+ >> 7 rows in set (0.01 sec) >> >> analytics-store.eqiad.wmnet [staging]> select * from editor_month limit 3; >> >> +--------+---------+---------+------------+-------------------+----------+-----------+ >> | wiki | month | user_id | user_name | user_registration | archived >> | revisions | >> >> +--------+---------+---------+------------+-------------------+----------+-----------+ >> | enwiki | 2001-01 | 34 | WojPob | 20010129110725 | 0 >> | 13 | >> | enwiki | 2001-01 | 99 | RoseParks | 20010121021221 | 0 >> | 7 | >> | enwiki | 2001-01 | 479 | JimboWales | 20010123223416 | 0 >> | 13 | >> >> +--------+---------+---------+------------+-------------------+----------+-----------+ >> 3 rows in set (0.03 sec) >> >> Feedback is welcome. One of the next things, I'd like to do is remove >> the "-" from the month column as it ruins comparison with MW timestamps. >> >> -Aaron >> >> _______________________________________________ >> wmfresearch mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wmfresearch >> >> > > > -- > Oliver Keyes > Research Analyst > Wikimedia Foundation > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > >
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
