[
https://issues.apache.org/jira/browse/BIGTOP-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Konstantin Boudnik updated BIGTOP-1212:
---------------------------------------
Affects Version/s: 0.7.0
> Pick or build a framework for building fake data sets
> -----------------------------------------------------
>
> Key: BIGTOP-1212
> URL: https://issues.apache.org/jira/browse/BIGTOP-1212
> Project: Bigtop
> Issue Type: New Feature
> Components: Blueprints
> Affects Versions: 0.7.0
> Reporter: jay vyas
> Fix For: 0.8.0
>
>
> - We've already seen that the mahout smoke tests are fragile with respect to
> requiring many external input data sets.
> - Also in BigPetStore BIGTOP-1089 , we are building custom fake data
> generators so that we can build arbitrarily large data sets of customer
> transactions with patterns in them.
> So -- lest either (1) build a framework or (2) adopt one, that is modular
> enough to extend for different smoke test scenarios.
> ADVANTAGES:
> - VM tests can run the exact same smokes that real tests run , and just
> generate smaller input data sets. Right now, we cant do this with static
> external data sets .
> - We can start eliminating fragile external dependencies of smoke tests (i.e.
> the mahout ones), and replace them with own data sets on the fly, no need
> for wgetting them from 3rd parties
> - BigPetStore can focus on demo'ing the bigtop based hadoop ecosystem
> deployment, rather than on generating data.
--
This message was sent by Atlassian JIRA
(v6.2#6252)