https://bugzilla.wikimedia.org/show_bug.cgi?id=56030

--- Comment #3 from Erik Zachte <[email protected]> ---
"The wikistats data is resource intensive and can't be computed for month X-1
in time for this meeting." 
The actual bottleneck is the dumps, which can take up to three weeks into the
next month to arrive. Only a live stream of all relevant tables to Hadoop could
fix that.

"to simply move the computation to hadoop" I'm not sure how simple that would
be, though. I've seen some wild optimism in earlier estimates for Hadoop, but
this is certainly where we want to go.

"Also for some months, pageview data doesn't get delivered because there are
problems with wikistats data that require manual intervention; in those cases
we use month X-2 pageview data." This is a mistake. Page view counts are
updated every day. There is a manual step in Limn (and for some files in
Wikistats, but page view data could be sent automatically right now). So only
when the Metrics Meeting falls on the 1st or 2nd day of the month does the
latest pageview data not make it into Limn. Updating Limn after the Metrics
Meeting could also help.

