Re: FP-Growth deprecated

Pat Ferrel Tue, 30 Jun 2015 10:31:52 -0700

Yes, that is one of the most common uses of fp-growth.

Instead a better way to look at this might be using cooccurrence. If you 
collect the items bought in every shopping cart, one row per cart and boolean 
value for every item bought, then perform mahout’s spark-itemsimilarity this 
will produce a matrix row keyed by items and rows consisting of items most 
often bought with the key item. The Driver output is sorted so the most common 
items are the first in the list.

Then for a given shopping cart with items in it, you have a knn problem where 
you want to find items that were bought with the same list in the current 
shopping cart. This is done by indexing the output of spark-itemsimilarity with 
a search engine and querying with the current contents of the cart.

This is exactly how we do recommendations for individuals but instead of 
training with user-item interactions and querying with user history we are 
training with shopping carts and querying with the current contents.

See this reference and replace every reference to a user with some shopping 
cart id: http://mahout.apache.org/users/algorithms/recommender-overview.html
http://mahout.apache.org/users/algorithms/intro-cooccurrence-spark.html

On Jun 30, 2015, at 12:36 AM, guo weizhan <[email protected]> wrote:

We want FP-Growth to do the Market Basket Analysis, is there any other
algorithm we can use?

Thanks,
Guo

2015-06-22 8:37 GMT+08:00 Pat Ferrel <[email protected]>:

> What is your application?
> 
> On Jun 17, 2015, at 7:06 AM, guo weizhan <[email protected]> wrote:
> 
> Hi All,
> 
> I found the FP-Growth was deprecated since 0.8, but we want this algorithm
> to do the association analysis. Do I have to use the old version or  Is
> there any other association analysis I can use in the lastest version?
> 
> 
> Thanks,
> Guo
> 
>

Re: FP-Growth deprecated

Reply via email to