We get 120,000 requests a second, so we're not storing them all for six months. We do, however, have sampled logs going back that far.

On 7 January 2015 at 20:59, Nuria Ruiz <[email protected]> wrote:
> Probably someone can provide details as to your other questions, this is
> what I think i can help with:
>
>>Does Hadoop pageview data go back that far?
> Hadoop data is only for the last 30 days.
>
>>Back when MediaViewer was launched, I added a namespace parameter to
>> NavigationTiming to be able to track per-namespace pageviews,
> Navigation timing is heavily sampled so I am not sure you could estimate
> pageviews with the scarce dataset it provides, I would say it is not
> possible.
>
> On Wed, Jan 7, 2015 at 5:52 PM, Gergo Tisza <[email protected]> wrote:
>>
>> I would like to graph the correlation between file namespace page views
>> and MediaViewer image views. Back when MediaViewer was launched, I added a
>> namespace parameter to NavigationTiming to be able to track per-namespace
>> pageviews, but I messed up and it only got deployed around the time
>> MediaViewer was enabled on Commons, so we have no data for the early steps
>> of the deploy process.
>>
>> Do you know of any other source for per-namespace pageview data that is
>> still available for the 2014 April-June period? Technically the raw
>> pagecount files contain the information but aggregating those would be a
>> horribly complicated way of getting this information. Does Hadoop pageview
>> data go back that far?
>>
>> thanks
>> Gergő
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics

--
Oliver Keyes
Research Analyst
Wikimedia Foundation
