For those interested: i've merged Nemo's patch, so anyone interested in doing queries for a category can use the script now without needing an additional list of files.
https://github.com/hay/wiki-tools/blob/master/etc/mediacounts-stats.py -- Hay On Wed, Mar 25, 2015 at 4:11 PM, Federico Leva (Nemo) <[email protected]> wrote: > Hay (Husky), 25/03/2015 11:03: >> >> Answering my own question: until somebody puts up a stats.grok.se-like >> interface for the mediacounts, i've hacked together a Python script >> that can be used to 'query' the TSV files with a file, or a list of >> files: >> >> https://github.com/hay/wiki-tools/blob/master/etc/mediacounts-stats.py > > > And I sent a small silly patch to give a category name like > https://commons.wikimedia.org/wiki/Category:Media_from_BEIC as input. > Example output attached for the lazy. > Some data I found particularly interesting: > 1) the sum of columns 11–14 (big thumbs), > 2) the ratio between (1) and column 3 (total transfers), > 3) column 24 (no Wikimedia referrer). > Total transfers in this small sample seem even higher than > pageviews. (1) counts thumbs above 400 pixels, which are usually not > embedded by default: (2) should tell how many users probably clicked or did > something else. (3) may indicate which files "went viral". > > Nemo > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
