Hi Adrian,
you should open a phab task like the following:
https://phabricator.wikimedia.org/T158053 to get into the nda LDAP group
(if you really need it as Nuria mentioned :).
Luca
2017-08-13 0:52 GMT+02:00 Adrian Bielefeldt <
adrian.bielefe...@mailbox.tu-dresden.de>:
> Hi Andrew,
>
> thanks
Thanks for sending me to https://phabricator.wikimedia.org/T149021! That
seems to answer the question I forgot to ask: does the mediawiki_history
table include creation of deleted pages, and it looks like it does. I'll
reuse the query and findings from that task then. Always great to find
shortcuts
Adrian,
You already have access to use the cluster, which is where you should move
your processing, the link to yarn was just to show resource consumption.
Thanks,
Nuria
On Sat, Aug 12, 2017 at 3:52 PM, Adrian Bielefeldt <
adrian.bielefe...@mailbox.tu-dresden.de> wrote:
> Hi Andrew,
>
> thank
>Would there happen to be a dataset of that available somewhere?
Data is available on public labs replicas but sql is complicated to write
and likely to time out due the volume of data that is combing. Data is also
available on Hadoop Data Lake which is not public yet (it is our plan to
make it so