Re: [scikit-learn] titanic dataset, use for book

2019-06-28 Thread Sole Galli
Thank you! that's very helpful :) On Thu, 27 Jun 2019 at 12:27, Roman Yurchak via scikit-learn < scikit-learn@python.org> wrote: > Meanwhile, loading the CSV from OpenML (https://www.openml.org/d/40945) > would also work, > > pd.read_csv('https://www.openml.org/data/get_csv/16826755/phpMYEkMl') >

Re: [scikit-learn] titanic dataset, use for book

2019-06-27 Thread Roman Yurchak via scikit-learn
Meanwhile, loading the CSV from OpenML (https://www.openml.org/d/40945) would also work, pd.read_csv('https://www.openml.org/data/get_csv/16826755/phpMYEkMl') -- Roman On 25/06/2019 17:04, Andreas Mueller wrote: > By the time your book comes out, it's likely to be merged, but might not > be r

Re: [scikit-learn] titanic dataset, use for book

2019-06-25 Thread Andreas Mueller
Hi Sole. I would suggest not to use this version of the titanic dataset. It's a personal repository of mine and might not exist forever. Ideally you (and we) would use fetch_openml. However, the current version doesn't have support for returning dataframes. That's addressed in https://github.com/s

[scikit-learn] titanic dataset, use for book

2019-06-24 Thread Sole Galli
Hello Scikit-learn team, I am currently writing a book for Packt on feature engineering, where I plan to show how to use the newest sklearn transformers. Could I confirm with you whether I can use the titanic dataset located here: titanic_url = ('https://raw.githubusercontent.com/amueller/'