Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-26 Thread CherRaye Glenn
This is awesome! Thank you team! On Tue, Feb 25, 2020 at 7:35 AM Goran Milovanovic <goran.milovanovic_...@wikimedia.de> wrote: > Great job Luca. Thank you very much. > > I have started to diversify all WMDE Analytics jobs (mainly Wikidata > related things) across the stat100* machines. > While

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-25 Thread Goran Milovanovic
Great job Luca. Thank you very much. I have started to diversify all WMDE Analytics jobs (mainly Wikidata related things) across the stat100* machines. While I still mainly use stat1007, two modules of the WDCM system are already

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Neil Shah-Quinn
Thank you very much, Luca! To make this nice documentation easier to discover, I moved it to Analytics/Systems/Clients along with the other information on the clients from Analytics/Data access. On Tue, 18 Feb 2020 at 17:11, Isaac

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Isaac Johnson
Thanks for pulling together these directions Luca! I did a little clean-up and will try to remember to do so more routinely. Adding to what Diego said, I also started using stat1007 because it has the most access to resources (dumps, Hadoop, MariaDB), and then my virtual environments, config

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Andrew Otto
I added a 'GPU?' column too. :) THANKS LUCA! On Tue, Feb 18, 2020 at 11:51 AM Luca Toscano wrote: > Hey Diego, > > added a section at the end of the page with the info requested, let me > know if anything is missing :) > > Luca > > On Tue, 18 Feb 2020 at 17:37, Diego Saez-Trumper <

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Luca Toscano
Hey Diego, added a section at the end of the page with the info requested, let me know if anything is missing :) Luca On Tue, 18 Feb 2020 at 17:37, Diego Saez-Trumper <di...@wikimedia.org> wrote: > Thanks for this Luca. > > I tend to use stat1007 because I know that machine

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Diego Saez-Trumper
Thanks for this Luca. I tend to use stat1007 because I know that machine has a lot of RAM/CPU and HDFS access. For the other statX machines, I'm not sure which of them have what resources (I know at least one of them doesn't have HDFS access). There is a table where I can look at a summary of resources per

Re: [Analytics] [Research-Internal] Tutorials on disk space usage for notebook/stat boxes

2020-02-18 Thread Marcel Ruiz Forns
Looks great Luca! Handy commands... On Tue, Feb 18, 2020 at 8:53 AM Luca Toscano wrote: > Hi everybody! > > I created the following doc: > https://wikitech.wikimedia.org/wiki/Analytics/Tutorials/Analytics_Client_Nodes > > It contains two FAQs: > - How do I ensure that there is enough space on