jay vyas created BIGTOP-1292:
--------------------------------
Summary: Add Spark to the bigpetstore architecture
Key: BIGTOP-1292
URL: https://issues.apache.org/jira/browse/BIGTOP-1292
Project: Bigtop
Issue Type: New Feature
Components: Blueprints
Reporter: jay vyas
Priority: Minor
Spark's in-memory architecture would make a great demonstration inside
bigpetstore. I suggest we add a "Spark alternative" to the Pig data cleaning
and analytics phase.
The official patch would ideally follow BIGTOP-1269, because Gradle has
great Scala and multi-language support.
The input will be the bigpetstore tuples:
{noformat}
BigPetStore,storeCode_AK,1 tim,cooley,Tue Jan 13 17:40:11 EST 1970,10.5,dog-food
BigPetStore,storeCode_AZ,1 dallas,mann,Thu Dec 18 03:36:01 EST 1969,10.5,dog-food
BigPetStore,storeCode_CA,1 cody,hendricks,Fri Dec 12 03:10:29 EST 1969,7.5,cat-food
BigPetStore,storeCode_CA,2 hannah,snyder,Tue Jan 13 08:20:59 EST 1970,7.5,cat-food
BigPetStore,storeCode_CA,3 chester,lyons,Mon Jan 19 12:01:29 EST 1970,10.5,dog-food
BigPetStore,storeCode_CO,1 trent,terry,Tue Jan 06 14:04:41 EST 1970,15.1,choke-collar
BigPetStore,storeCode_CT,1 wendy,ortega,Mon Dec 15 21:32:53 EST 1969,19.1,fuzzy-collar
BigPetStore,storeCode_NY,1 leroy,gentry,Sat Dec 20 01:05:31 EST 1969,19.75,fish-food
BigPetStore,storeCode_NY,2 leroy,gentry,Wed Jan 21 14:29:32 EST 1970,20.1,steel-leash
BigPetStore,storeCode_OK,1 trisha,camacho,Thu Dec 11 22:30:44 EST 1969,13.1,duck-caller
{noformat}
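To make the cleaning step concrete, here is a minimal Python sketch of how one of these tuples might be parsed and then summed per state. It is only an illustration, not the proposed patch: the field names (txn_id, first_name, and so on), the helper names parse_record and total_by_state, and the reading of the suffix after "storeCode_" as a state code are all assumptions, and it assumes each record occupies a single line with whitespace between the key and the value. In a Spark job, a function like parse_record could be mapped over the input lines.

```python
from collections import defaultdict, namedtuple

# Hypothetical field names: the issue does not name the columns.
Transaction = namedtuple(
    "Transaction",
    ["store", "state", "txn_id", "first_name", "last_name",
     "timestamp", "price", "product"],
)


def parse_record(line):
    """Parse one bigpetstore tuple into a Transaction.

    Assumes the layout '<store>,storeCode_<STATE>,<n> <first>,<last>,<date>,<price>,<product>'
    with whitespace (a space or a tab) between the key and the value.
    """
    # Split only on the first run of whitespace; the date itself contains spaces.
    key, value = line.strip().split(None, 1)
    store, store_code, txn_id = key.split(",")
    first_name, last_name, timestamp, price, product = value.split(",")
    # Assumption: the text after 'storeCode_' is the state abbreviation.
    state = store_code.split("_", 1)[1]
    # The timestamp is kept as a string; time zone names such as EST
    # are not parsed reliably by datetime.strptime on every platform.
    return Transaction(store, state, int(txn_id), first_name, last_name,
                       timestamp, float(price), product)


def total_by_state(lines):
    """Sum the price column per state for an iterable of raw lines."""
    totals = defaultdict(float)
    for line in lines:
        record = parse_record(line)
        totals[record.state] += record.price
    return dict(totals)


if __name__ == "__main__":
    sample = [
        "BigPetStore,storeCode_CA,1 cody,hendricks,Fri Dec 12 03:10:29 EST 1969,7.5,cat-food",
        "BigPetStore,storeCode_CA,2 hannah,snyder,Tue Jan 13 08:20:59 EST 1970,7.5,cat-food",
        "BigPetStore,storeCode_AK,1 tim,cooley,Tue Jan 13 17:40:11 EST 1970,10.5,dog-food",
    ]
    print(total_by_state(sample))
```

Keeping the parser as a plain function with no framework dependency means it can be unit tested on its own before being wired into any Spark or Pig pipeline.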
The final output will be something like what is shown in the text box at
http://jayunit100.github.io/bigpetstore/