Re: [Analytics] Resources stat1005

2017-08-14 Thread Luca Toscano
Hi Adrian, you should open a Phabricator task like the following: https://phabricator.wikimedia.org/T158053 to get into the NDA LDAP group (if you really need it, as Nuria mentioned :). Luca 2017-08-13 0:52 GMT+02:00 Adrian Bielefeldt < adrian.bielefe...@mailbox.tu-dresden.de>: > Hi Andrew, > > thanks

Re: [Analytics] Article creation stats

2017-08-14 Thread Morten Wang
Thanks for sending me to https://phabricator.wikimedia.org/T149021! That seems to answer the question I forgot to ask: does the mediawiki_history table include the creation of deleted pages? It looks like it does. I'll reuse the query and findings from that task then. Always great to find shortcuts

Re: [Analytics] Resources stat1005

2017-08-14 Thread Nuria Ruiz
Adrian, you already have access to the cluster, which is where you should move your processing; the link to YARN was just to show resource consumption. Thanks, Nuria On Sat, Aug 12, 2017 at 3:52 PM, Adrian Bielefeldt < adrian.bielefe...@mailbox.tu-dresden.de> wrote: > Hi Andrew, > > thank

Re: [Analytics] Article creation stats

2017-08-14 Thread Nuria Ruiz
>Would there happen to be a dataset of that available somewhere? Data is available on the public Labs replicas, but the SQL is complicated to write and likely to time out due to the volume of data it is combining. Data is also available on the Hadoop Data Lake, which is not public yet (it is our plan to make it so