[
https://issues.apache.org/jira/browse/DRILL-5223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15840120#comment-15840120
]
ASF GitHub Bot commented on DRILL-5223:
---------------------------------------
GitHub user ppadma opened a pull request:
https://github.com/apache/drill/pull/730
DRILL-5223:Drill should ensure balanced workload assignment at node l…
…evel in order to get better query performance.
Please see DRILL-5223 for details:
https://issues.apache.org/jira/browse/DRILL-5223
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ppadma/drill DRILL-5223
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/730.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #730
----
commit a1ad4113f59e87a4885c271a53afea648bb6f9c3
Author: Padma Penumarthy <[email protected]>
Date: 2017-01-21T01:57:10Z
DRILL-5223:Drill should ensure balanced workload assignment at node level
in order to get better query performance
----
> Drill should ensure balanced workload assignment at node level in order to
> get better query performance
> -------------------------------------------------------------------------------------------------------
>
> Key: DRILL-5223
> URL: https://issues.apache.org/jira/browse/DRILL-5223
> Project: Apache Drill
> Issue Type: Bug
> Components: Query Planning & Optimization
> Reporter: Jinfeng Ni
>
> Drill's work assignment logic currently aims to achieve workload balance
> across different minor fragment (or slices) and honor data affinity in order
> to get as much local read as possible.
> However, when the # of work units could not be evenly divided by # of minor
> fragments, the remaining work units would tender to go to the first subset of
> drill endpoints. This means the drill endpoints assigned with the remaining
> work units could have larger workload than the rest of them. When
> MuxExchange is enabled (by default), all the minor fragments on the same node
> have to send data to a single Muxer per node, and unbalanced workload
> assignment at node level could impact query elapse time. which is
> essentially decided by the slowest drill endpoint.
> Some prototype experimental run shows that with more balanced workload
> assignment, Drill shows quite significant improvement for most of TPC-H
> queries.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)