[
https://issues.apache.org/jira/browse/TEZ-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118463#comment-14118463
]
Hitesh Shah commented on TEZ-1345:
----------------------------------
[~zjffdu] I am not sure how this last patch fixes the issue.
Consider a case where a vertex has 2 initializers and 10 tasks. Each
initializer will generate 10 events each ( one for each task ). If Initializer
1 generates 10 events but only 3 are logged to disk and Initializer 2 only
generates 1 out of 10 and the 1 gets logged to disk, this will ensure that you
will see both input names when recovering events but you will not know that a
restartFromScratch() is needed as all 20 events have not been recovered.
> Add checks to guarantee all init events are written to recovery to consider
> vertex initialized
> ----------------------------------------------------------------------------------------------
>
> Key: TEZ-1345
> URL: https://issues.apache.org/jira/browse/TEZ-1345
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Hitesh Shah
> Assignee: Jeff Zhang
> Attachments: Tez-1345-2.patch, Tez-1345-3.patch, Tez-1345-4.patch,
> Tez-1345.patch
>
>
> Related to issue discovered in TEZ-1033
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)