I have the feeling there’s no need to keep 114 GB of raw client-side 
instrumentation data for tofu detection.
Copying Amir, Gilles and Jon who are the respective owners of the schemas in 
Sean’s list. 

On Jul 2, 2014, at 7:44 PM, Oliver Keyes <[email protected]> wrote:

> The odd name is frustrating to me too :/. I'd be interested to see if we need 
> the MV tables (or, the really old data in them): as I understand it those are 
> aggregated for public consumption fairly regularly.
> 
> 
> On 2 July 2014 22:21, Sean Pringle <[email protected]> wrote:
> Hi :)
> 
> The following table is easily the largest in eventlogging and growing fastest:
> 
> 114G     UniversalLanguageSelector-tofu_7629564
> 
> Is there a plan for purging old data from this one? I realize it's mostly new 
> data; just wondering if growth will be unbounded.
> 
> Why does it have an odd name "-tofu"? Is it intended?
> 
> There is a duplicate table called UniversalLanguageSelecTor-tofu_7629564 -- 
> note the uppercase T -- with a single row. Is that needed?
> 
> The next biggest are:
> 
> 67G     PageContentSaveComplete_5588433.ibd
> 61G     MediaViewer_8572637.ibd
> 57G     MediaViewer_8245578.ibd
> 33G     MobileWebClickTracking_5929948.ibd
> 
> BR
> Sean
> 
> --- 
> DBA @ WMF
> 
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
> 
> -- 
> Oliver Keyes
> Research Analyst
> Wikimedia Foundation

