jay vyas created BIGTOP-1270:
--------------------------------

             Summary: BigPetStore: Productionize the Hive portion
                 Key: BIGTOP-1270
                 URL: https://issues.apache.org/jira/browse/BIGTOP-1270
             Project: Bigtop
          Issue Type: New Feature
          Components: Blueprints
            Reporter: jay vyas


The Hive portion of the BigPetStore blueprint app builds a "view" over the 
cleaned data that Mahout can then use to do product recommendations.

The Hive code in BigPetStore only runs locally. Let's add the necessary 
configuration hooks and/or (if we have to) externalize the Hive script itself 
from Java so that it's easy to run directly on a cluster.
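
As a rough illustration of what externalizing the script could look like, the 
sketch below builds a Hive CLI invocation that runs a standalone script file 
and passes cluster-specific paths in as substitution variables. It only 
assembles the argument list and does not launch anything. The script name 
"bigpetstore.hql", the variable names and the paths are all hypothetical and 
not taken from the BigPetStore code; only the "-f" and "--hivevar" options of 
the hive command line are assumed.

```python
def build_hive_command(script_path, variables):
    """Return an argv list that runs a Hive script with substitution variables."""
    argv = ["hive", "-f", script_path]
    # Sort the names so the resulting command line is deterministic.
    for name in sorted(variables):
        argv += ["--hivevar", "%s=%s" % (name, variables[name])]
    return argv


# Hypothetical script name and paths, for illustration only.
cmd = build_hive_command(
    "bigpetstore.hql",
    {"input_dir": "/data/cleaned", "output_dir": "/data/hive_out"},
)
```

With this approach the same script could run locally or on a cluster, with 
only the variable values changing between environments.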

And let's actually run it on some kind of a cluster at scale.  The contract for 
the Hive portion is an output file with three numbers per line, like this: 

{noformat}
100 30021 1
100 212341 1
...
{noformat}

This signifies that customer 100 likes both products "30021" and "212341".  
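
To make the contract above concrete, here is a minimal sketch of a checker 
that parses such lines and groups the liked products by customer. It assumes 
the three numbers are separated by whitespace, as in the sample, and that the 
first two are the customer and product ids; the third number is carried along 
without interpretation. The function names are hypothetical and not part of 
BigPetStore.

```python
from collections import defaultdict


def parse_line(line):
    """Parse one 'customer product value' record into a tuple of ints."""
    fields = line.split()
    if len(fields) != 3:
        raise ValueError("expected 3 fields, got %d: %r" % (len(fields), line))
    customer, product, value = (int(f) for f in fields)
    return customer, product, value


def products_by_customer(lines):
    """Group product ids by customer id, skipping blank lines."""
    liked = defaultdict(set)
    for line in lines:
        if not line.strip():
            continue
        customer, product, _ = parse_line(line)
        liked[customer].add(product)
    return dict(liked)


sample = ["100 30021 1", "100 212341 1"]
result = products_by_customer(sample)
```

A check like this could serve as a quick validation of the Hive output before 
it is handed to Mahout.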





--
This message was sent by Atlassian JIRA
(v6.2#6252)
