One can use the pageview_hourly <https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Pageview_hourly> table for this.
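
As a rough illustration, here is a minimal Python sketch that builds a HiveQL query for the most viewed pages in one country on one day. It assumes the table is reachable as wmf.pageview_hourly and that the columns project, page_title, country_code, agent_type, view_count, year, month and day are named as on the documentation page linked above; please check them there before running anything. The helper name top_articles_query is hypothetical. If project in this table is stored without the mobile (.m.) part, as I believe it is, then grouping by project and page_title sums desktop and mobile views together.

```python
def top_articles_query(country_code, year, month, day, limit=20):
    """Build a HiveQL query string for the most viewed pages in one country on one day.

    Table and column names are assumptions taken from the pageview_hourly
    documentation; verify them before use.
    """
    # Accept only a two-letter ASCII code so nothing unexpected is
    # interpolated into the query text.
    if not (len(country_code) == 2 and country_code.isascii() and country_code.isalpha()):
        raise ValueError("country_code should be a two-letter ISO code, e.g. 'IL'")
    return f"""
SELECT project, page_title, SUM(view_count) AS views
FROM wmf.pageview_hourly
WHERE year = {int(year)} AND month = {int(month)} AND day = {int(day)}
  AND country_code = '{country_code.upper()}'
  AND agent_type = 'user'
GROUP BY project, page_title
ORDER BY views DESC
LIMIT {int(limit)}
""".strip()


if __name__ == "__main__":
    # Example: top 20 pages viewed from Israel on 2018-07-08.
    print(top_articles_query("IL", 2018, 7, 8))
```

The printed query can be pasted into Hive or Beeline. Filtering on the year, month and day partition columns keeps the scan small, and agent_type = 'user' is meant to leave out traffic identified as spiders.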

On Mon, Jul 9, 2018 at 1:18 AM, Amir E. Aharoni <[email protected]> wrote:

> Hi,
>
> Is there a way to find what are the most popular articles per country?
>
> Finding the most popular articles per language is easy with the Pageviews
> tool, but languages and countries are of course not the same.
>
> One thing I tried is going to Turnilo, webrequest_sampled_128, and
> filtering by country. But here it gets troublesome:
> * Splitting can be done by Uri host, which is *more or less* the project,
> or by Uri path, which is *more or less* the article (but see below), and I
> couldn't find a convenient way to combine them.
> * Mobile (.m.) and desktop hosts are separate. It may actually sometimes
> be useful to see differences (or lack thereof) between desktop and mobile,
> but combining them is often useful, too. This can probably be done with
> regular expressions, but this brings us to the biggest problem:
> * Filtering by Uri path would be useful if it didn't have so many paths
> for images, beacons, etc. Filtering using the regular expression
> "\/wiki\/.+" may be the right thing functionally, but in practice it's very
> slow or doesn't work at all.
> * I don't know what exactly is logged in webrequest_sampled_128, but the
> name hints that it doesn't include everything. A sample may be OK for
> countries with a lot of traffic like U.S. or Spain, but for countries with
> smaller traffic this may start being a problem.
>
> Any better ideas?
>
> Thanks!
>
> --
> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
> http://aharoni.wordpress.com
> “We're living in pieces,
> I want to live in peace.” – T. Moore
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>

--
Tilman Bayer
Senior Analyst
Wikimedia Foundation
IRC (Freenode): HaeB
