[ 
https://issues.apache.org/jira/browse/TEZ-3237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15281573#comment-15281573
 ] 

Tsuyoshi Ozawa commented on TEZ-3237:
-------------------------------------

[~rajesh.balamohan] make sense to me.

> Corrupted shuffle transfers to disk are not detected during transfer
> --------------------------------------------------------------------
>
>                 Key: TEZ-3237
>                 URL: https://issues.apache.org/jira/browse/TEZ-3237
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.0
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>         Attachments: TEZ-3237.001.patch
>
>
> When a shuffle transfer is larger than the single transfer limit it gets 
> written straight to disk during the transfer.  Unfortunately there are no 
> checksum validations performed during that transfer, so if the data is 
> corrupted at the source or during transmit it goes undetected.  Only later 
> when the task tries to consume the transferred data is the error detected, 
> but at that point it's too late to blame the source task for the error.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to