[ 
https://issues.apache.org/jira/browse/PIG-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13575972#comment-13575972
 ] 

Rohini Palaniswamy commented on PIG-3179:
-----------------------------------------

I meant something like this. 
Input-split: 
file=hdfs://gridx.yahoo.com:8020/tmp/bz-6086044/msh_grouped.bz2/part-r-00032.bz2,
 start-offset=0,length=11814548
Each input split in a single line.

Question to folks -  Is someone one parsing this log information? This is 
something that can also be logged to a hdfs file apart from stderr of the map 
task. Just asking to know whether this will cause any backward incompatibility 
issues?
                
> Task Information Header only prints out the first split for each task
> ---------------------------------------------------------------------
>
>                 Key: PIG-3179
>                 URL: https://issues.apache.org/jira/browse/PIG-3179
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Koji Noguchi
>            Assignee: Koji Noguchi
>            Priority: Trivial
>         Attachments: pig-3179-v01.patch
>
>
> When a task's PigSplit is containing more than wrappedSplit, it only logs the 
> first fileinfo.
> When debugging, I saw 
> {noformat}
> ===== Task Information Header =====
> Command: bash ....
> Start time: Mon Feb 11 16:41:21 UTC 2013
> Input-split file: hdfs://abc.bcd.efg:8020/tmp/hij/part-r-00000.bz2
> Input-split start-offset: 0Input-split length: 11854247
> {noformat}
> but the actual error was happing while reading part-r-00007.bz2.  It would 
> have been nice if the log showed all the info that task was going to read.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to