[ 
https://issues.apache.org/jira/browse/BIGTOP-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13761469#comment-13761469
 ] 

jay vyas commented on BIGTOP-1057:
----------------------------------

Good point, Roman. Yes, it's certainly okay for small clusters: 

teragen 10 = 10 rows of 100 bytes -> 1,000 bytes -> 1 KB
teragen 1000 = 1000 rows of 100 bytes -> 100,000 bytes -> 100 KB

And the default value is 1000, so we are in good shape :)

$> ls -altrh data/teragen1000
total 114K
drwxr-xr-x. 39 root root 8.0K Sep  8 11:16 ..
drwxr-xr-x.  3 root root 4.1K Sep  8 11:16 _logs
-rwxrwxrwx.  1 root root  49K Sep  8 11:16 part-00000
-rwxrwxrwx.  1 root root  49K Sep  8 11:16 part-00001
-rwxrwxrwx.  1 root root    0 Sep  8 11:16 _SUCCESS
drwxrwxrwx.  3 root root 4.1K Sep  8 11:16 .
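
For anyone who wants to sanity-check the arithmetic against the listing, here is a minimal sketch. It assumes the fixed 100-byte row size mentioned above and assumes the rows are split evenly across the two part files; the function names are made up for illustration and are not part of the patch.

```python
ROW_BYTES = 100  # row size used in the arithmetic above


def teragen_bytes(rows: int) -> int:
    """Total bytes of output for a given row count."""
    return rows * ROW_BYTES


def per_file_kib(rows: int, num_files: int) -> float:
    """Size of each part file in KiB, assuming an even split (an assumption)."""
    return teragen_bytes(rows) / num_files / 1024


if __name__ == "__main__":
    for rows in (10, 1000):
        print(f"teragen {rows}: {teragen_bytes(rows)} bytes")
    # Two part files, as in the listing above.
    print(f"per part file: {per_file_kib(1000, 2):.2f} KiB")
```

With 1000 rows and two part files, each file comes out just under 49 KiB, which is consistent with the 49K that `ls -h` shows for part-00000 and part-00001.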

                
> Add TeraGen / TeraSort Benchmarking
> -----------------------------------
>
>                 Key: BIGTOP-1057
>                 URL: https://issues.apache.org/jira/browse/BIGTOP-1057
>             Project: Bigtop
>          Issue Type: Test
>          Components: Tests
>            Reporter: jay vyas
>            Assignee: jay vyas
>         Attachments: BIGTOP-1057.1.patch, BIGTOP-1057.1.patch, 
> BIGTOP-1057.1.patch
>
>
> Benchmarking is indeed on the roadmap for Bigtop, as per a recent mailing 
> list thread.  Let's add TeraSort / TeraGen as a starting point.  
> Two things to note when adding this module: 
> First - This can be done 
> 1) either in a new test-artifacts submodule (e.g. called benchmarking), or 
> else,
> 2) it can be added individually into ecosystem components.  
> Second - Parameterizing the tests from the pom file means that 
> the initial tests are really just smoke tests, and they can be changed over 
> time to become real performance benchmarks. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
