Hyunsik Choi created TAJO-982:
---------------------------------

             Summary: Improve Fetcher to get multiple shuffle outputs through a request
                 Key: TAJO-982
                 URL: https://issues.apache.org/jira/browse/TAJO-982
             Project: Tajo
          Issue Type: Bug
          Components: data shuffle
            Reporter: Hyunsik Choi
             Fix For: 0.9.0


Currently, Fetcher can request only one shuffle output per fetch. This 
implementation can degrade performance even when the intermediate data is 
actually small.

For example, if the input data set of the first stage is large and the 
intermediate data is very small, QueryMaster will choose only a few nodes for 
the next execution block. Since each node keeps a limited number of fetch 
threads, it will take a long time for the nodes of the next stage to fetch all 
of the intermediate data.
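
To make the effect concrete, here is a rough back-of-envelope sketch. It 
assumes a simplified model that is not taken from the Tajo code base: every 
fetch costs one request, each fetch thread handles one request at a time, and 
the work is spread evenly. The class name, method name, and all numbers are 
hypothetical.

```java
public final class FetchRounds {
    /**
     * Number of sequential "rounds" of requests needed when each request
     * fetches exactly one shuffle output (simplified, hypothetical model).
     */
    static long rounds(long shuffleOutputs, int nodes, int threadsPerNode) {
        long slots = (long) nodes * threadsPerNode;   // concurrent fetches
        return (shuffleOutputs + slots - 1) / slots;  // ceiling division
    }

    public static void main(String[] args) {
        // 10,000 small shuffle outputs fetched by 2 nodes with 4 threads each.
        System.out.println(rounds(10_000, 2, 4));   // prints 1250
        // The same outputs fetched by 50 nodes with 4 threads each.
        System.out.println(rounds(10_000, 50, 4));  // prints 50
    }
}
```

Under this model the number of rounds depends on the count of shuffle outputs, 
not on their size, which is why small intermediate data can still be slow to 
fetch when few nodes are chosen.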

If Fetcher could get multiple shuffle outputs through a single request, it 
would resolve the slowness that occurs in such cases.
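
One possible way to picture the idea is to group the pending shuffle outputs 
by the host that serves them, so that each host needs one request instead of 
one request per output. The sketch below is only an illustration under that 
assumption; ShuffleOutput, groupByHost, and the host names are hypothetical 
and do not reflect Tajo's actual Fetcher API or request format.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public final class BatchedFetchPlan {
    /** Hypothetical descriptor: the host serving an output and its task id. */
    static final class ShuffleOutput {
        final String host;
        final int taskId;

        ShuffleOutput(String host, int taskId) {
            this.host = host;
            this.taskId = taskId;
        }
    }

    /** Groups task ids by host; each map entry stands for one batched request. */
    static Map<String, List<Integer>> groupByHost(List<ShuffleOutput> outputs) {
        Map<String, List<Integer>> byHost = new LinkedHashMap<>();
        for (ShuffleOutput o : outputs) {
            byHost.computeIfAbsent(o.host, h -> new ArrayList<>()).add(o.taskId);
        }
        return byHost;
    }

    public static void main(String[] args) {
        List<ShuffleOutput> pending = new ArrayList<>();
        pending.add(new ShuffleOutput("worker-1", 0));
        pending.add(new ShuffleOutput("worker-2", 1));
        pending.add(new ShuffleOutput("worker-1", 2));
        pending.add(new ShuffleOutput("worker-1", 3));

        // Four outputs collapse into two requests, one per host.
        System.out.println(groupByHost(pending)); // {worker-1=[0, 2, 3], worker-2=[1]}
    }
}
```

With one output per request the example would need four requests; grouped by 
host it needs two. A real implementation would likely also cap the batch size 
so that a single request does not grow without bound.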



--
This message was sent by Atlassian JIRA
(v6.2#6252)
