Thank you both for the great pointers! One more question: if I were interested in the counts based on the country of origin of a user, is this data publicly available?
Best,
Gheorghe

On Sep 29, 2016 01:36, "Joseph Allemandou" <[email protected]> wrote:

> Hello Gheorghe,
> What Dan said, plus a goodie for easy manual comparisons: the pageview
> viewer tool <https://tools.wmflabs.org/pageviews>
> For instance, "Suicide Squad (film)" vs "Jason Bourne (film)", user only
> pageviews (no explicit bots) for July 2016:
> https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&start=2016-07-01&end=2016-07-31&pages=Suicide_Squad_(film)|Jason_Bourne_(film)
> Cheers
> Joseph
>
> On Thu, Sep 29, 2016 at 2:37 AM, Dan Andreescu <[email protected]>
> wrote:
>
>> Hello Gheorghe, that dataset is deprecated and we have a much cleaner one
>> in the same format. Check out:
>>
>> * the new landing page for analytics dumps:
>> dumps.wikimedia.org/other/analytics and specifically:
>> dumps.wikimedia.org/other/pageviews
>>
>> * the in-depth documentation of the different datasets we provide:
>> wikitech.wikimedia.org/wiki/Analytics/Data
>>
>> *From: *Gheorghe Postelnicu
>> *Sent: *Wednesday, September 28, 2016 20:32
>> *To: *[email protected]
>> *Reply To: *A mailing list for the Analytics Team at WMF and everybody
>> who has an interest in Wikipedia and analytics.
>> *Subject: *[Analytics] Question re. PageCounts
>>
>> Hello,
>>
>> First of all, many thanks for this wonderful project!
>>
>> I am writing as I downloaded the July pagecounts data from:
>>
>> https://dumps.wikimedia.org/other/pagecounts-raw/2016/2016-07/
>>
>> As I was browsing it, I was surprised to notice that some entities, such
>> as the movie "Suicide Squad", only seem to have gotten very sparse views in
>> July - see below. In comparison, the clicks for Jason Bourne seem to have
>> been much higher for the same period. Below are lines from the logs.
>>
>> Am I doing something wrong?
>>
>> Many thanks in advance,
>> Gheorghe
>>
>> *Suicide Squad*:
>>
>> (pagecounts-20160727-020000.gz,en Suicide_squad_(film) 1 6614)
>> (pagecounts-20160727-160000.gz,en Suicide_squad_(film) 1 25599)
>> (pagecounts-20160728-220000.gz,en Suicide_squad_(film) 2 32210)
>> (pagecounts-20160731-210000.gz,en Suicide_squad_(film) 11 72721)
>>
>> *Jason Bourne*:
>>
>> (pagecounts-20160731-210000.gz,sv Jason_Bourne_(film) 12 124894)
>> (pagecounts-20160731-210000.gz,tr Jason_Bourne_(film) 78 1852192)
>> (pagecounts-20160731-220000.gz,en File:Jason_Bourne_(film).jpg 2 19067)
>> (pagecounts-20160731-220000.gz,en Jason_Bourne_(film) 2119 73275075)
>> (pagecounts-20160731-220000.gz,en Talk:Jason_Bourne_(film) 1 10059)
>> (pagecounts-20160731-220000.gz,fr Jason_Bourne_(film) 55 1226127)
>> (pagecounts-20160731-220000.gz,hu Jason_Bourne_(film) 3 34335)
>> (pagecounts-20160731-220000.gz,it Jason_Bourne_(film) 29 579129)
>> (pagecounts-20160731-220000.gz,nl Jason_Bourne_(film) 11 125928)
>>
>
> --
> *Joseph Allemandou*
> Data Engineer @ Wikimedia Foundation
> IRC: joal
>
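
For anyone comparing the quoted log lines by hand, here is a minimal sketch of how they could be parsed and summed per project and title. It assumes each record is "project title count bytes" wrapped in the "(file.gz, ... )" form pasted above; the function names parse_record and total_views are hypothetical, and the sample is only a subset of the quoted lines. Titles are matched exactly, so "Suicide_squad_(film)" (lowercase "s") counts as a different key from the "Suicide_Squad_(film)" used in the pageviews URL.

```python
from collections import defaultdict

# A few records copied from the quoted message. One of them is missing
# its opening parenthesis, as in the original paste.
SAMPLE = [
    "(pagecounts-20160727-020000.gz,en Suicide_squad_(film) 1 6614)",
    "(pagecounts-20160727-160000.gz,en Suicide_squad_(film) 1 25599)",
    "(pagecounts-20160728-220000.gz,en Suicide_squad_(film) 2 32210)",
    "(pagecounts-20160731-210000.gz,en Suicide_squad_(film) 11 72721)",
    "pagecounts-20160731-210000.gz,sv Jason_Bourne_(film) 12 124894)",
    "(pagecounts-20160731-220000.gz,en Jason_Bourne_(film) 2119 73275075)",
]


def parse_record(raw):
    """Split one pasted record into (file, project, title, count, size).

    Assumed layout (not confirmed by the thread):
    (file.gz,project title count bytes)
    """
    body = raw.strip()
    # Drop the wrapping parentheses if present; the closing one follows
    # the byte count, so it is not part of the title.
    if body.startswith("("):
        body = body[1:]
    if body.endswith(")"):
        body = body[:-1]
    file_name, rest = body.split(",", 1)
    project, remainder = rest.split(" ", 1)
    # The last two fields are numeric; everything before them is the title.
    title, count, size = remainder.rsplit(" ", 2)
    return file_name, project, title, int(count), int(size)


def total_views(records):
    """Sum the count field per (project, title), matching titles exactly."""
    totals = defaultdict(int)
    for raw in records:
        _, project, title, count, _ = parse_record(raw)
        totals[(project, title)] += count
    return dict(totals)


totals = total_views(SAMPLE)
for (project, title), views in sorted(totals.items()):
    print(project, title, views)
```

Because the grouping key is the exact title string, differently capitalised spellings of the same article would have to be merged explicitly before comparing totals.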
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
