Machine learning - I would suggest that you pick up a fine book that explains 
machine learning. That's the way I went about - pick up each type of machine 
learning concept - say Linear regression then understand the why/when/how etc 
and infer results etc. 

Then apply the learning to a small data set using python or R or scala without 
Spark. This is to familiarize the learning.

Then run the same with MLlib and see it with a big data set on Spark. I would 
call this consolidation. 

Few things to remember - all Machine learning algorithms are not available On 
spark. There is a list of machine learning supported in spark. Kindly look at 
that. Also look at how to integrate mahout / h20 with spark and see how you can 
run the machine learning stuff supported by mahout with spark.

And then your journey begins :-).

Regards,
Harmeet




On Jun 12, 2016, at 0:31, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:

> yes absolutely Ted.
> 
> Thanks for highlighting it
> 
> 
> 
> Dr Mich Talebzadeh
>  
> LinkedIn  
> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>  
> http://talebzadehmich.wordpress.com
>  
> 
> On 11 June 2016 at 19:00, Ted Yu <yuzhih...@gmail.com> wrote:
> Another source is the presentation on various ocnferences.
> e.g.
> http://www.slideshare.net/databricks/apache-spark-mllib-20-preview-data-science-and-production
> 
> FYI
> 
> On Sat, Jun 11, 2016 at 8:47 AM, Mich Talebzadeh <mich.talebza...@gmail.com> 
> wrote:
> Interesting.
> 
> The pace of development in this field is such that practically every single 
> book in Big Data landscape gets out of data before the ink dries on it  :)
> 
> I concur that they serve as good reference for starters but in my opinion the 
> best way to learn is to start from on-line docs (and these are pretty 
> respectful when it comes to Spark) and progress from there.
> 
> If you have a certain problem then put to this group and I am sure someone 
> somewhere in this forum has come across it. Also most of these books' authors 
> actively contribute to this mailing list.
> 
> 
> HTH
> 
> 
> Dr Mich Talebzadeh
>  
> LinkedIn  
> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>  
> http://talebzadehmich.wordpress.com
>  
> 
> On 11 June 2016 at 16:10, Ted Yu <yuzhih...@gmail.com> wrote:
> https://www.amazon.com/Machine-Learning-Spark-Powerful-Algorithms/dp/1783288515/ref=sr_1_1?ie=UTF8&qid=1465657706&sr=8-1&keywords=spark+mllib
> 
> https://www.amazon.com/Spark-Practical-Machine-Learning-Chinese/dp/7302420424/ref=sr_1_3?ie=UTF8&qid=1465657706&sr=8-3&keywords=spark+mllib
> 
> https://www.amazon.com/Advanced-Analytics-Spark-Patterns-Learning/dp/1491912766/ref=sr_1_2?ie=UTF8&qid=1465657706&sr=8-2&keywords=spark+mllib
> 
> 
> On Sat, Jun 11, 2016 at 8:04 AM, Deepak Goel <deic...@gmail.com> wrote:
> 
> Hey
> 
> Namaskara~Nalama~Guten Tag~Bonjour
> 
> I am a newbie to Machine Learning (MLIB and other libraries on Spark)
> 
> Which would be the best book to learn up?
> 
> Thanks
> Deepak
>    -- 
> Keigu
> 
> Deepak
> 73500 12833
> www.simtree.net, dee...@simtree.net
> deic...@gmail.com
> 
> LinkedIn: www.linkedin.com/in/deicool
> Skype: thumsupdeicool
> Google talk: deicool
> Blog: http://loveandfearless.wordpress.com
> Facebook: http://www.facebook.com/deicool
> 
> "Contribute to the world, environment and more : http://www.gridrepublic.org
> "
> 
> 
> 
> 

Reply via email to