Re: Building Desktop application for ALS-MlLib/ Training ALS

2014-12-15 Thread Xiangrui Meng
On Sun, Dec 14, 2014 at 3:06 AM, Saurabh Agrawal
saurabh.agra...@markit.com wrote:


 Hi,



 I am a new bee in spark and scala world



 I have been trying to implement Collaborative filtering using MlLib supplied
 out of the box with Spark and Scala



 I have 2 problems



 1.   The best model was trained with rank = 20 and lambda = 5.0, and
 numIter = 10, and its RMSE on the test set is 25.718710831912485. The best
 model improves the baseline by 18.29%. Is there a scientific way in which
 RMSE could be brought down? What is a descent acceptable value for RMSE?


The grid search approach used in the AMPCamp tutorial is pretty
standard. Whether an RMSE is good or not really depends on your
dataset.

 2.   I picked up the Collaborative filtering algorithm from
 http://ampcamp.berkeley.edu/5/exercises/movie-recommendation-with-mllib.html
 and executed the given code with my dataset. Now, I want to build a desktop
 application around it.

 a.   What is the best language to do this Java/ Scala? Any possibility
 to do this using C#?


We support Java/Scala/Python. Start with the one your are most
familiar with. C# is not supported.

 b.  Can somebody please share any relevant documents/ source or any
 helper links to help me get started on this?


For ALS, you can check the API documentation.



 Your help is greatly appreciated



 Thanks!!



 Regards,

 Saurabh Agrawal


 
 This e-mail, including accompanying communications and attachments, is
 strictly confidential and only for the intended recipient. Any retention,
 use or disclosure not expressly authorised by Markit is prohibited. This
 email is subject to all waivers and other terms at the following link:
 http://www.markit.com/en/about/legal/email-disclaimer.page

 Please visit http://www.markit.com/en/about/contact/contact-us.page? for
 contact information on our offices worldwide.

 MarkitSERV Limited has its registered office located at Level 4, Ropemaker
 Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and regulated
 by the Financial Conduct Authority with registration number 207294

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: Building Desktop application for ALS-MlLib/ Training ALS

2014-12-15 Thread Abhi Basu
In case you must write c# code, you can call python code from c# or use
IronPython. :)

On Mon, Dec 15, 2014 at 12:04 PM, Xiangrui Meng men...@gmail.com wrote:

 On Sun, Dec 14, 2014 at 3:06 AM, Saurabh Agrawal
 saurabh.agra...@markit.com wrote:
 
 
  Hi,
 
 
 
  I am a new bee in spark and scala world
 
 
 
  I have been trying to implement Collaborative filtering using MlLib
 supplied
  out of the box with Spark and Scala
 
 
 
  I have 2 problems
 
 
 
  1.   The best model was trained with rank = 20 and lambda = 5.0, and
  numIter = 10, and its RMSE on the test set is 25.718710831912485. The
 best
  model improves the baseline by 18.29%. Is there a scientific way in which
  RMSE could be brought down? What is a descent acceptable value for RMSE?
 

 The grid search approach used in the AMPCamp tutorial is pretty
 standard. Whether an RMSE is good or not really depends on your
 dataset.

  2.   I picked up the Collaborative filtering algorithm from
 
 http://ampcamp.berkeley.edu/5/exercises/movie-recommendation-with-mllib.html
  and executed the given code with my dataset. Now, I want to build a
 desktop
  application around it.
 
  a.   What is the best language to do this Java/ Scala? Any
 possibility
  to do this using C#?
 

 We support Java/Scala/Python. Start with the one your are most
 familiar with. C# is not supported.

  b.  Can somebody please share any relevant documents/ source or any
  helper links to help me get started on this?
 

 For ALS, you can check the API documentation.

 
 
  Your help is greatly appreciated
 
 
 
  Thanks!!
 
 
 
  Regards,
 
  Saurabh Agrawal
 
 
  
  This e-mail, including accompanying communications and attachments, is
  strictly confidential and only for the intended recipient. Any retention,
  use or disclosure not expressly authorised by Markit is prohibited. This
  email is subject to all waivers and other terms at the following link:
  http://www.markit.com/en/about/legal/email-disclaimer.page
 
  Please visit http://www.markit.com/en/about/contact/contact-us.page? for
  contact information on our offices worldwide.
 
  MarkitSERV Limited has its registered office located at Level 4,
 Ropemaker
  Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and
 regulated
  by the Financial Conduct Authority with registration number 207294

 -
 To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
 For additional commands, e-mail: user-h...@spark.apache.org



-- 
Abhi Basu


Building Desktop application for ALS-MlLib/ Training ALS

2014-12-13 Thread Saurabh Agrawal


Hi,



I am a new bee in spark and scala world



I have been trying to implement Collaborative filtering using MlLib supplied 
out of the box with Spark and Scala



I have 2 problems



1.   The best model was trained with rank = 20 and lambda = 5.0, and 
numIter = 10, and its RMSE on the test set is 25.718710831912485. The best 
model improves the baseline by 18.29%. Is there a scientific way in which RMSE 
could be brought down? What is a descent acceptable value for RMSE?

2.   I picked up the Collaborative filtering algorithm from 
http://ampcamp.berkeley.edu/5/exercises/movie-recommendation-with-mllib.html 
and executed the given code with my dataset. Now, I want to build a desktop 
application around it.

a.   What is the best language to do this Java/ Scala? Any possibility to 
do this using C#?

b.  Can somebody please share any relevant documents/ source or any helper 
links to help me get started on this?



Your help is greatly appreciated



Thanks!!



Regards,

Saurabh Agrawal


This e-mail, including accompanying communications and attachments, is strictly 
confidential and only for the intended recipient. Any retention, use or 
disclosure not expressly authorised by Markit is prohibited. This email is 
subject to all waivers and other terms at the following link: 
http://www.markit.com/en/about/legal/email-disclaimer.page

Please visit http://www.markit.com/en/about/contact/contact-us.page? for 
contact information on our offices worldwide.

MarkitSERV Limited has its registered office located at Level 4, Ropemaker 
Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and regulated by 
the Financial Conduct Authority with registration number 207294


Building Desktop application for ALS-MlLib/ Training ALS

2014-12-13 Thread Saurabh Agrawal


Hi,



I am a new bee in spark and scala world



I have been trying to implement Collaborative filtering using MlLib supplied 
out of the box with Spark and Scala



I have 2 problems



1.   The best model was trained with rank = 20 and lambda = 5.0, and 
numIter = 10, and its RMSE on the test set is 25.718710831912485. The best 
model improves the baseline by 18.29%. Is there a scientific way in which RMSE 
could be brought down? What is a descent acceptable value for RMSE?

2.   I picked up the Collaborative filtering algorithm from 
http://ampcamp.berkeley.edu/5/exercises/movie-recommendation-with-mllib.html 
and executed the given code with my dataset. Now, I want to build a desktop 
application around it.

a.   What is the best language to do this Java/ Scala? Any possibility to 
do this using C#?

b.  Can somebody please share any relevant documents/ source or any helper 
links to help me get started on this?



Your help is greatly appreciated



Thanks!!



Regards,

Saurabh Agrawal


This e-mail, including accompanying communications and attachments, is strictly 
confidential and only for the intended recipient. Any retention, use or 
disclosure not expressly authorised by Markit is prohibited. This email is 
subject to all waivers and other terms at the following link: 
http://www.markit.com/en/about/legal/email-disclaimer.page

Please visit http://www.markit.com/en/about/contact/contact-us.page? for 
contact information on our offices worldwide.

MarkitSERV Limited has its registered office located at Level 4, Ropemaker 
Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and regulated by 
the Financial Conduct Authority with registration number 207294