Hi,

just to keep archives happy ... the below issue is now T97588

  https://phabricator.wikimedia.org/T97588

.

Have fun,
Christian




On Sat, Apr 18, 2015 at 12:47:20AM +0200, Christian Aistleitner wrote:
> Hi Analytics dev team,
> 
> just a heads up that it's a week that the pagecounts-all-sites (and
> pagecounts-raw) did not have the 20150409-160000 file generated [1].
> 
> To ease data quality assurances and avoid faulty aggregates, the
> pageview aggregator scripts that do the aggregation for dashiki's
> “Reader / Daily Pageviews” block for a week on missing data (unless
> they are being told that for a given day, missing data is ok).
> 
> For the above hourly pagecounts-all-sites file, this week of blocking
> has now passed without action.
> 
> Hence, the aggregator scripts will start aggregating again (to some
> degree), but the undeclared hole for the 2015-04-09 in the data will
> naturally start to bubble up.
> 
> If that hour's file cannot get generated, adding this date to the
> BAD_DATES.csv of the aggregator data repository, will unblock the
> aggregator cron job and make weekly, monthly, aggregates consider
> 2015-04-09 as day without data.
> 
> If that hour's file gets generated, be aware that aggregator per
> default only automatically backfills for a week. So from today on, you
> need to explicitly run the script to backfill for 2015-04-09.
> 
> Have fun,
> Christian
> 
> 
> P.S.: Since I guess the question of monitoring will arise ... the
> missing pagecounts file has alerted people at least twice by email.
> The subsequent aggregator blocking has been logged.
> But you can add yourself in the MAILTO of the aggregator cron at
>   modules/statistics/manifests/aggregator.pp
> in puppet, if you want an additional notification for that.
> 
> [1]
>   http://dumps.wikimedia.org/other/pagecounts-all-sites/2015/2015-04/
>   http://dumps.wikimedia.org/other/pagecounts-raw/2015/2015-04/
> 
> -- 
> ---- quelltextlich e.U. ---- \\ ---- Christian Aistleitner ----
>                            Companies' registry: 360296y in Linz
> Christian Aistleitner
> Kefermarkterstrasze 6a/3     Email:  [email protected]
> 4293 Gutau, Austria          Phone:          +43 7946 / 20 5 81
>                              Fax:            +43 7946 / 20 5 81
>                              Homepage: http://quelltextlich.at/
> ---------------------------------------------------------------

-- 
---- quelltextlich e.U. ---- \\ ---- Christian Aistleitner ----
                           Companies' registry: 360296y in Linz
Christian Aistleitner
Kefermarkterstrasze 6a/3     Email:  [email protected]
4293 Gutau, Austria          Phone:          +43 7946 / 20 5 81
                             Fax:            +43 7946 / 20 5 81
                             Homepage: http://quelltextlich.at/
---------------------------------------------------------------

Attachment: signature.asc
Description: Digital signature

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to