[jira] Commented: (MAPREDUCE-830) Providing BZip2 splitting support for Text data

Hadoop QA (JIRA) Thu, 10 Sep 2009 14:39:20 -0700

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753832#action_12753832
 ]


Hadoop QA commented on MAPREDUCE-830:
-------------------------------------

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12418869/M830-3.patch
  against trunk revision 813585.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 6 new or modified tests.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    -1 javac.  The patch appears to cause tar ant target to fail.

    -1 findbugs.  The patch appears to cause Findbugs to fail.

    +1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

    -1 core tests.  The patch failed core unit tests.

    -1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/24/testReport/
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/24/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/24/console

This message is automatically generated.

> Providing BZip2 splitting support for Text data
> -----------------------------------------------
>
>                 Key: MAPREDUCE-830
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-830
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.21.0
>            Reporter: Abdul Qadeer
>            Assignee: Abdul Qadeer
>             Fix For: 0.21.0
>
>         Attachments: M830-2.patch, M830-3.patch, MapReduce-830-version1.patch
>
>
> HADOOP-4012 (https://issues.apache.org/jira/browse/HADOOP-4012) is providing 
> support to handle BZip2 compressed data such that the input compressed file 
> is split at arbitrary points.  This JIRA uses that functionality in 
> LineRecordReader.  The benefit of this work is that, if user provides 
> compressed BZip2 Text data, it will be split by Hadoop and hence will be 
> processed by multiple mappers.  So BZip2 compressed data will be able to 
> fully utilize the cluster power.  Currently BZip2 compressed Text file goes 
> to one mapper and is not split.  So the enhancement in this JIRA provides 
> splitting support  and a considerable performance gains.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-830) Providing BZip2 splitting support for Text data

Reply via email to