[ https://issues.apache.org/jira/browse/TEZ-4508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17751675#comment-17751675 ]
László Bodor commented on TEZ-4508:
-----------------------------------

Assigned this ticket to you, [~okumin]. Thanks for the contribution! Let me understand this whole area better so that I can review it.

> Allow the FAIR_PARALLELISM mode to accept multiple source vertices
> ------------------------------------------------------------------
>
>                 Key: TEZ-4508
>                 URL: https://issues.apache.org/jira/browse/TEZ-4508
>             Project: Apache Tez
>          Issue Type: Sub-task
>    Affects Versions: 0.10.2
>            Reporter: okumin
>            Assignee: okumin
>            Priority: Major
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently, FairShuffleVertexManager with FAIR_PARALLELISM fails when two or
> more edges are configured with SCATTER_GATHER.
> https://github.com/apache/tez/blob/rel/release-0.10.2/tez-runtime-library/src/main/java/org/apache/tez/dag/library/vertexmanager/FairShuffleVertexManager.java#L198-L204
>
> Looking at [TEZ-3500|https://issues.apache.org/jira/browse/TEZ-3500], we
> assume that multiple sources occur when the vertex performs a JOIN. In that
> case, I agree that we definitely need more than the current
> FairShuffleVertexManager.
>
> However, the current fair routing works well enough with multiple sources
> when the sources are symmetric. One case I have in mind is UNION ALL +
> bucketed INSERT.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)