Re: Newbie question

2014-03-08 Thread Martin, Nick
+ Mahout user

Sent from my iPhone

On Mar 8, 2014, at 10:42 AM, Mahmood Naderan 
nt_mahm...@yahoo.commailto:nt_mahm...@yahoo.com wrote:

Hi
Maybe this is a newbie question but I want to know does Hadoop/Mahout use 
pthread models?

Regards,
Mahmood


Newbie Question

2013-01-25 Thread Robby Stamper
I have successfully run the Breiman example from 
https://cwiki.apache.org/confluence/display/MAHOUT/Breiman+Example

How do I view the tree?  Do I need to write a program that instantiates 
ForestVisualizer.java?   Is there another program for visualizing the results 
of Mahout output?

Many Thanks,

Robby
font size=2
The information transmitted is intended solely for the individual or entity to 
which it is addressed and may contain confidential and/or privileged material.  
Any review, retransmission, dissemination or other use of or taking action in 
reliance upon this information by persons or entities other than the intended 
recipient is prohibited.  If you have received this email in error please 
contact the sender and delete the material from your computer (s).   All 
incoming and outgoing e-mail is reviewed and archived by James River Capital 
Corp. and may be produced at the request of regulators or in connection with 
civil litigation.  James River Capital Corp. accepts no liability for any 
errors or omissions arising as a result of transmission.brbr
This document is neither an offer to sell nor a solicitation of an offer to buy 
any fund, product, security or other investment.  An offer of investment can be 
made only via the approved offering documents, which contains important 
information concerning risk factors and other material aspects of the 
investment and must be read carefully before investing.  Any person investing 
must be able to bear the risks involved and must meet the relevant  suitability 
requirements. Some or all alternative investment programs may not be suitable 
for certain investors. No assurance can be given that the investment objectives 
of the investment will be achieved.  Any historical price or value is as of the 
date indicated.  Information is provided as of the date of this material only 
and are subject to change without notice.  BPAST RESULTS ARE NOT NECESSARILY 
INDICATIVE OF FUTURE PERFORMANCE./bbrbr
font size=2

Re: Newbie question on modeling a Recommender using Mahout when the matrix is sparse

2012-09-13 Thread Sean Owen
Well there are only 7 products in the universe! If you ask for 10
recommendations, you will always get all unrated items back in the
recommendations. That's always true unless the algorithm can't
actually establish a value for some items.

What result were you expecting, less than 10 recs? less than 7?

On Thu, Sep 13, 2012 at 6:55 AM, Gokul Pillai gokoolt...@gmail.com wrote:
 I am trying out Mahout to come up with product recommendations for users
 based on data that show what products they use today.
 The data is not web-scale, just about 300,000 users and 7 products. Few
 comments about the data here:
 1. Since users either have or not have a particular product, the value in
 the matrix is either 1 or 0 for all the columns (rows being the userids)
 2. All the users have one basic product, so I discounted this from the
 data-model passed to the Mahout recommender since I assume that if everyone
 has the same product, its effect on the recommendations are trivial.
 3. The matrix itself is sparse, the total counts of users having each
 product is :
 A=31847, 54754,1897 |23154 |2201 |2766 |33585

 Steps followed:
 1. Created a data-source from the user-product table in the database
 File ratingsFile = new
 File(datasets/products.csv);
 DataModel model = new FileDataModel(ratingsFile);
   2.  Created a recommender on this data
 CachingRecommender recommender = new CachingRecommender(new
 SlopeOneRecommender(model));
 3. Loop through all users and get the top ten recommendations:
 ListRecommendedItem recommendations =
 recommender.recommend(userId, 10);

 Issue faced:
 The problem I am facing is that the recommendations that come out are way
 too simple - meaning that all that it seems like what is being recommended
 is if a user does not have product A, then recommend it, if they dont have
 product B, then recommend it and so on. Basically a simple inverse of
 their ownership status.

 Obviously, I am not doing something right here. How can I do the modeling
 better to get the right recommendations. Or is it that my dataset (30
 users times 7 products) is too small for Mahout to work with?

 Look forward to your comments. Thanks.


Re: Newbie question on modeling a Recommender using Mahout when the matrix is sparse

2012-09-13 Thread Gokul Pillai
Very true, good catch. I think I was interpreting the results the wrong way.
I expect only the top 5, so I changed the parameter to 5 instead of 10
and the results are as expected now.

Thanks.

On Wed, Sep 12, 2012 at 11:36 PM, Sean Owen sro...@gmail.com wrote:

 Well there are only 7 products in the universe! If you ask for 10
 recommendations, you will always get all unrated items back in the
 recommendations. That's always true unless the algorithm can't
 actually establish a value for some items.

 What result were you expecting, less than 10 recs? less than 7?

 On Thu, Sep 13, 2012 at 6:55 AM, Gokul Pillai gokoolt...@gmail.com
 wrote:
  I am trying out Mahout to come up with product recommendations for users
  based on data that show what products they use today.
  The data is not web-scale, just about 300,000 users and 7 products. Few
  comments about the data here:
  1. Since users either have or not have a particular product, the value in
  the matrix is either 1 or 0 for all the columns (rows being the
 userids)
  2. All the users have one basic product, so I discounted this from the
  data-model passed to the Mahout recommender since I assume that if
 everyone
  has the same product, its effect on the recommendations are trivial.
  3. The matrix itself is sparse, the total counts of users having each
  product is :
  A=31847, 54754,1897 |23154 |2201 |2766 |33585
 
  Steps followed:
  1. Created a data-source from the user-product table in the database
  File ratingsFile = new
  File(datasets/products.csv);
  DataModel model = new FileDataModel(ratingsFile);
2.  Created a recommender on this data
  CachingRecommender recommender = new CachingRecommender(new
  SlopeOneRecommender(model));
  3. Loop through all users and get the top ten recommendations:
  ListRecommendedItem recommendations =
  recommender.recommend(userId, 10);
 
  Issue faced:
  The problem I am facing is that the recommendations that come out are way
  too simple - meaning that all that it seems like what is being
 recommended
  is if a user does not have product A, then recommend it, if they dont
 have
  product B, then recommend it and so on. Basically a simple inverse of
  their ownership status.
 
  Obviously, I am not doing something right here. How can I do the modeling
  better to get the right recommendations. Or is it that my dataset (30
  users times 7 products) is too small for Mahout to work with?
 
  Look forward to your comments. Thanks.



Newbie question on modeling a Recommender using Mahout when the matrix is sparse

2012-09-12 Thread Gokul Pillai
I am trying out Mahout to come up with product recommendations for users
based on data that show what products they use today.
The data is not web-scale, just about 300,000 users and 7 products. Few
comments about the data here:
1. Since users either have or not have a particular product, the value in
the matrix is either 1 or 0 for all the columns (rows being the userids)
2. All the users have one basic product, so I discounted this from the
data-model passed to the Mahout recommender since I assume that if everyone
has the same product, its effect on the recommendations are trivial.
3. The matrix itself is sparse, the total counts of users having each
product is :
A=31847, 54754,1897 |23154 |2201 |2766 |33585

Steps followed:
1. Created a data-source from the user-product table in the database
File ratingsFile = new
File(datasets/products.csv);
DataModel model = new FileDataModel(ratingsFile);
  2.  Created a recommender on this data
CachingRecommender recommender = new CachingRecommender(new
SlopeOneRecommender(model));
3. Loop through all users and get the top ten recommendations:
ListRecommendedItem recommendations =
recommender.recommend(userId, 10);

Issue faced:
The problem I am facing is that the recommendations that come out are way
too simple - meaning that all that it seems like what is being recommended
is if a user does not have product A, then recommend it, if they dont have
product B, then recommend it and so on. Basically a simple inverse of
their ownership status.

Obviously, I am not doing something right here. How can I do the modeling
better to get the right recommendations. Or is it that my dataset (30
users times 7 products) is too small for Mahout to work with?

Look forward to your comments. Thanks.