[ 
https://issues.apache.org/jira/browse/TEZ-2581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14976371#comment-14976371
 ] 

Bikas Saha commented on TEZ-2581:
---------------------------------

tez-dag/src/main/java/org/apache/tez/dag/app/TaskCommunicatorManager.java
Typo - fore recovery
In the patch we are sending completion events to TA directly. Is the ordering 
of DM events and completion events via serializing through the Vertex not 
needed anymore? Does the same ordering apply to the event being sent to the TA 
-> DM events must be sent before completion events? If yes, then can the code 
have some comments for that. A test that verifies the ordering constraint would 
be even better.
Create a common method to filter taGeneratedEvents/

tez-dag/src/main/java/org/apache/tez/dag/app/dag/event/DAGEventRecoverEvent.java
Why is desire state still needed

tez-dag/src/main/java/org/apache/tez/dag/app/dag/event/VertexEventRecoverVertex.java
Why is this still needed


tez-dag/src/main/java/org/apache/tez/dag/history/events/VertexInitGeneratedEvent.java
 &
tez-dag/src/main/java/org/apache/tez/dag/history/events/TaskAttemptGeneratedEvent.java
Event serde code should be in Event or some common place (e.g. ProtoConverters) 
- not in RecoverConverters where others will not try to search for.
IMO, event constructors should be dumb and not have filters for what events are 
relevant. That is the reponsibility of the code that saves/loads/uses this data.
{code}      if (EnumSet.of(EventType.DATA_MOVEMENT_EVENT,
          EventType.COMPOSITE_DATA_MOVEMENT_EVENT,
          EventType.ROOT_INPUT_INITIALIZER_EVENT,
          EventType.VERTEX_MANAGER_EVENT)
              .contains(event.getEventType())) {
        this.events.add(event);
      }{code}

tez-dag/src/main/java/org/apache/tez/dag/history/logging/impl/HistoryEventJsonConversion.java
Remove dead/commented code - //      case 
VERTEX_DATA_MOVEMENT_EVENTS_GENERATED: and others
Unrelated removal?
{code}    otherInfo.put(ATSConstants.START_TIME, event.getStartTime());
-    otherInfo.put(ATSConstants.SCHEDULED_TIME, event.getScheduledTime());
{code}
NUM_TASKS in eventInfo and also in otherInfo?
Missing inputSpec, locations - its ok to skip them if they are not relevant but 
please put comments saying its done on purpose


> Umbrella for Tez Recovery Redesign
> ----------------------------------
>
>                 Key: TEZ-2581
>                 URL: https://issues.apache.org/jira/browse/TEZ-2581
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Jeff Zhang
>            Assignee: Jeff Zhang
>         Attachments: TEZ-2581-WIP-1.patch, TEZ-2581-WIP-2.patch, 
> TEZ-2581-WIP-3.patch, TEZ-2581-WIP-4.patch, TEZ-2581-WIP-5.patch, 
> TEZ-2581-WIP-6.patch, TezRecoveryRedesignProposal.pdf, 
> TezRecoveryRedesignV1.1.pdf
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to