[ 
https://issues.apache.org/jira/browse/MAHOUT-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170146#comment-14170146
 ] 

Pat Ferrel commented on MAHOUT-1618:
------------------------------------

https://guide.finderbots.com

This is an integration inside a video guide written in RoR using MongoDB. The 
models created by spark-itemsimilarity are put into the catalog in MongoDB and 
Solr indexes those models. The language used to put the models in Mongo could 
be Python (though it was actually Java in this case).

I do not suggest you use this as template but you get the idea. Your idea is 
fine though. A single language tutorial is better in some ways because it is 
easier to read and the integration is simpler. 

Solr is the central part so I'd work from there out. Solr indexes the model 
created by spark-itemsimilarity. You then find user history and use it as a 
query on the model. So using your template you'd have to build models with 
Mahout and index them with Solr. Then in perhaps a second  application you'd 
identify a user to get their history, which is the query or allow people to 
hand enter item ids corresponding to hypothetical history. 

In this case you have:
1) a batch/background application that runs spark-itemsimilarity *and* indexes 
the data with Solr. This could be a Scala program if you want or might even be 
a bash script.
2) an application that builds the query from user history or allows you to 
enter hypothetical user history. It would be nice if this was interactive to 
allow the user to pick a user by ID or enter a list of item IDs as hypothetical 
history and get recs. 

These two apps would illustrate all integration points in a minimal manner. If 
you want to be more ambitious go for it. You could build both together using 
Play2 to create a web app in Scala that has a button for "build and index 
models" and an input field for entering user history or user ID with another 
button for "make recommendations".



> Cooccurrence Recommender example and documentation 
> ---------------------------------------------------
>
>                 Key: MAHOUT-1618
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1618
>             Project: Mahout
>          Issue Type: Documentation
>          Components: Examples
>    Affects Versions: collections-1.0
>            Reporter: Thejas Prasad
>            Priority: Trivial
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to