Thanks for the clarification, Joseph.

Bo

On Tue, Mar 1, 2016 at 2:02 PM, Joseph Allemandou
<[email protected]> wrote:
> Hi Again,
>
> @Dan: We will indeed reload data into cassandra.
>
> @Bo: Actually the two datasets are fairly different.
>
> The one called pagecounts is slowly getting deprecated toward the one called
> pageview, defined by Research people at WMF:
> https://meta.wikimedia.org/wiki/Research:Page_view
>
> The pageview dumps are actually a 'legacy format' view of the new pageview
> :)
>
>
> Code for the legacy extraction:
> https://github.com/wikimedia/analytics-refinery/blob/master/oozie/pagecounts-all-sites/load/insert_hourly_pagecounts.hql
> Code for the new pageview definition:
> https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-core/src/main/java/org/wikimedia/analytics/refinery/core/PageviewDefinition.java
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to