[
https://issues.apache.org/jira/browse/BIGTOP-1272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14087011#comment-14087011
]
jay vyas edited comment on BIGTOP-1272 at 8/6/14 12:20 AM:
-----------------------------------------------------------
- Is this patch 100% ready for testing, including README updates? If so, I can
have a look ASAP. Otherwise I'm busy with some other stuff at the moment, so I'd
rather just wait for a full, clean patch to do the review. It's a complex
deploy, so the devil will be in the details of following the README and
ensuring that it works as specified.
- On another note: if the patch still needs a lot of work, I'm starting to
wonder how important Mahout MapReduce implementations are to the broader
community, given the move to Mahout Spark implementations that is occurring.
- Open to ideas. Is anyone in need of Mahout MapReduce tests, or are we
all moving to do all our machine learning on Spark? And [~bhashit], do you
think a clean implementation is just around the corner?
was (Author: jayunit100):
Is this patch 100% ready for testing, including README updates? If so, I can
have a look ASAP. Otherwise I'm busy with some other stuff at the moment, so I'd
rather just wait for a full, clean patch to do the review. It's a complex
deploy, so the devil will be in the details of following the README and
ensuring that it works as specified.
On another note: if the patch still needs a lot of work, I'm starting to
wonder how important Mahout MapReduce implementations are to the broader
community, given the move to Mahout Spark implementations that is occurring.
Open to ideas. Is anyone in need of Mahout MapReduce tests, or are we all
moving to do all our machine learning on Spark? And [~bhashit], do you think a
clean implementation is just around the corner?
> BigPetStore: Productionize the Mahout recommender
> -------------------------------------------------
>
> Key: BIGTOP-1272
> URL: https://issues.apache.org/jira/browse/BIGTOP-1272
> Project: Bigtop
> Issue Type: New Feature
> Components: Blueprints
> Affects Versions: backlog
> Reporter: jay vyas
> Attachments: BIGTOP-1272.patch, BIGTOP-1272.patch, BIGTOP-1272.patch,
> BIGTOP-1272.patch, BIGTOP-1272.patch, arch.jpeg, build.gradle
>
>
> BIGTOP-1271 adds patterns into the data that guarantee that a meaningful
> type of product recommendation can be given for at least *some* customers,
> since we know that there are going to be many customers who only bought 1
> product, and also customers that bought 2 or more products -- even in a
> dataset of size 10, due to the Gaussian distribution of purchases that is
> also in the dataset generator.
> The current Mahout recommender code is statically valid: it runs to
> completion in local unit tests if a Hadoop 1x tarball is present, but
> otherwise it hasn't been tested at scale. So, let's get it working. This
> JIRA will also comprise:
> - deciding whether to use Mahout 2x for unit tests (the default on the Mahout
> Maven repo is the 1x impl) and whether or not Bigtop should host a Mahout 2x
> jar. After all, Bigtop builds a Mahout 2x jar as part of its packaging
> process, and BigPetStore might thus need a Mahout 2x jar in order to test
> against the right set of Bigtop releases.