[ 
https://issues.apache.org/jira/browse/TEZ-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011422#comment-14011422
 ] 

Hitesh Shah commented on TEZ-800:
---------------------------------

Comments:

{code}
     }
-
+    
     // inform the vertices
{code}
   - trailing whitespace added.

{code}
LOG.info("Inferring vertex: " + outVertex.getVertexName() + " to be "
+                  + v.getParallelism() + " from 1-1 connection with vertex "
+                  + v.getVertexName());
{code}
   - The log message is not clear. It will show up as "Inferring vertex: v1 to
be 10 from 1-1 connection with vertex v2". Maybe add "parallelism", "numTasks",
or something similar after the 10? Or reword it as "Inferring vertex
parallelism for v1 to be ...".
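
A minimal sketch of the second wording, in case it helps. The class and the inferMessage helper are hypothetical and exist only for illustration; the actual patch builds the string inline inside LOG.info().

```java
public class InferLogMessage {
  // Hypothetical helper (not part of the patch): builds the reworded message
  // so that the number is clearly identified as a parallelism value.
  public static String inferMessage(String vertexName, int parallelism,
      String sourceVertexName) {
    return "Inferring parallelism for vertex: " + vertexName + " to be "
        + parallelism + " from 1-1 connection with vertex " + sourceVertexName;
  }

  public static void main(String[] args) {
    // prints: Inferring parallelism for vertex: v1 to be 10 from 1-1 connection with vertex v2
    System.out.println(inferMessage("v1", 10, "v2"));
  }
}
```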

For checkAndInferOneToOneParallelism(), should this be done using a BFS/DFS 
approach? The current approach probably needs some comments to clarify how many 
loops it will need to converge. Also, there is no handling for vertices that 
have multiple inbound edges of different types, or even multiple inbound edges 
that are all of type 1:1. I am assuming some of those cases will be caught by 
the consistency check that follows. 
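
To make the BFS suggestion concrete, here is a rough sketch under stated assumptions. Node, oneToOneTargets and infer() are hypothetical stand-ins, not the Tez Vertex/DAG classes or the method in the patch, and the sketch only propagates forward from a source with known parallelism to a destination that has -1.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

public class OneToOneInference {
  /** Hypothetical stand-in for a DAG vertex; not the Tez Vertex class. */
  public static final class Node {
    public final String name;
    public int parallelism; // -1 means "not yet known"
    public final List<Node> oneToOneTargets = new ArrayList<>();

    public Node(String name, int parallelism) {
      this.name = name;
      this.parallelism = parallelism;
    }
  }

  /**
   * Breadth-first propagation of known parallelism along 1-1 edges.
   * A vertex enters the queue at most once (initially, or when it is
   * resolved), so the loop ends after a bounded number of steps.
   */
  public static void infer(List<Node> vertices) {
    Deque<Node> queue = new ArrayDeque<>();
    for (Node v : vertices) {
      if (v.parallelism != -1) {
        queue.add(v); // seed with every vertex that is already resolved
      }
    }
    while (!queue.isEmpty()) {
      Node src = queue.poll();
      for (Node dst : src.oneToOneTargets) {
        if (dst.parallelism == -1) {
          dst.parallelism = src.parallelism;
          queue.add(dst);
        } else if (dst.parallelism != src.parallelism) {
          // Covers a vertex with several 1-1 inbound edges that disagree.
          throw new IllegalStateException("1-1 edge " + src.name + " -> "
              + dst.name + " connects parallelism " + src.parallelism
              + " and " + dst.parallelism);
        }
      }
    }
  }

  public static void main(String[] args) {
    Node v1 = new Node("v1", 10);
    Node v2 = new Node("v2", -1);
    Node v3 = new Node("v3", -1);
    v1.oneToOneTargets.add(v2);
    v2.oneToOneTargets.add(v3);
    infer(Arrays.asList(v3, v2, v1)); // vertices listed in reverse order
    System.out.println(v2.parallelism + " " + v3.parallelism); // prints: 10 10
  }
}
```

Because the queue is seeded from every vertex whose parallelism is already known, the result does not depend on the order in which the vertices are listed, and there is no question of how many passes are needed.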

In the tests, it would be good to cover re-ordering of the vertices when 
they are added to the DAG. For example, if the DAG is v1->v2->v3, do 
dag.add(v3); add(v2); add(v1).

Is there a reason why this was not caught by the AM itself and parallelism set 
as needed in the state machines?
 
> One-one edge with parallelism -1 fails if source vertex parallelism is not -1 
> as well
> -------------------------------------------------------------------------------------
>
>                 Key: TEZ-800
>                 URL: https://issues.apache.org/jira/browse/TEZ-800
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.3.0
>            Reporter: Rohini Palaniswamy
>            Assignee: Bikas Saha
>         Attachments: TEZ-800.1.patch
>
>
>           // no input initializers. At this moment, only other case is 1-1 edge
>           // with uninitialized sources
>           boolean hasOneToOneUninitedSource = false;
>           for (Map.Entry<Vertex, Edge> entry :
>               vertex.sourceVertices.entrySet()) {
>             if (entry.getValue().getEdgeProperty().getDataMovementType() ==
>                 DataMovementType.ONE_TO_ONE) {
>               if (entry.getKey().getTotalTasks() == -1) {
>                 hasOneToOneUninitedSource = true;
>                 break;
>               }
>             }
>           }
> This check requires the source vertex of the 1-1 edge to also have -1 
> parallelism.



