[jira] [Commented] (HADOOP-13286) add a scale test to do gunzip and linecount

Steve Loughran (JIRA) Sun, 19 Jun 2016 07:47:28 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15338542#comment-15338542
 ]


Steve Loughran commented on HADOOP-13286:
-----------------------------------------

My key goal here was to force through something which absolutely models a real 
use; this is essentially the same test which downstream tests demonstrated 
problems with the HADOOP-13203 test. A key thing is that I don't know how the 
gzip codec reads data (blocks? stream?) —and how it will continue to read data 
in future

II'd like to keep it for that reason, pulling something else instead if you 
think it duplicates.

> add a scale test to do gunzip and linecount
> -------------------------------------------
>
>                 Key: HADOOP-13286
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13286
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 2.8.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>         Attachments: HADOOP-13286-branch-2-001.patch
>
>
> the HADOOP-13203 patch proposal showed that there were performance problems 
> downstream which weren't surfacing in the current scale tests.
> Trying to decompress the .gz test file and then go through it with LineReader 
> models a basic use case: parse a .csv.gz data source. 
> Add this, with metric printing



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HADOOP-13286) add a scale test to do gunzip and linecount

Reply via email to