Thanks for the clarification, Joseph. Bo
On Tue, Mar 1, 2016 at 2:02 PM, Joseph Allemandou <[email protected]> wrote: > Hi Again, > > @Dan: We will indeed reload data into cassandra. > > @Bo: Actually the two datasets are fairly different. > > The one called pagecounts is slowly getting deprecated toward the one called > pageview, defined by Research people at WMF: > https://meta.wikimedia.org/wiki/Research:Page_view > > The pageview dumps are actually a 'legacy format' view of the new pageview > :) > > > Code for the legacy extraction: > https://github.com/wikimedia/analytics-refinery/blob/master/oozie/pagecounts-all-sites/load/insert_hourly_pagecounts.hql > Code for the new pageview definition: > https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-core/src/main/java/org/wikimedia/analytics/refinery/core/PageviewDefinition.java > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
