GitHub user ppadma opened a pull request:

    https://github.com/apache/drill/pull/766

    DRILL-5304: Queries fail intermittently when there is skew in data distribution
    
    Change the assignment logic so that we first assign up to minCount to
    every node before going up to maxCount on any node.

    Also fixed a small issue in the parallelization code: the number of
    fragments to run on nodes with affinity was rounded down, so we sometimes
    scheduled fewer fragments on nodes with affinity than on nodes without
    affinity.
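
    For reviewers, here is a minimal sketch of the two ideas described above.
    It is not the code from this pull request. The class name, method names
    and signatures are hypothetical, data locality is ignored, and the
    proportional-share formula in the second method is an assumption made
    only to show the effect of rounding up instead of down.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from the Drill code base.
final class TwoPassAssignment {

  // Distributes totalUnits over nodes in two passes:
  // pass 1 gives every node up to minCount, pass 2 tops nodes up to maxCount.
  static Map<String, Integer> assign(List<String> nodes, int totalUnits,
                                     int minCount, int maxCount) {
    Map<String, Integer> counts = new LinkedHashMap<>();
    int remaining = totalUnits;

    // Pass 1: no node may exceed minCount yet, so early nodes cannot starve later ones.
    for (String node : nodes) {
      int take = Math.min(minCount, remaining);
      counts.put(node, take);
      remaining -= take;
    }

    // Pass 2: hand out whatever is left, up to maxCount per node.
    for (String node : nodes) {
      int take = Math.min(maxCount - counts.get(node), remaining);
      counts.put(node, counts.get(node) + take);
      remaining -= take;
    }

    if (remaining > 0) {
      throw new IllegalArgumentException("maxCount is too small for " + totalUnits + " units");
    }
    return counts;
  }

  // Assumed formula: share of fragments for nodes with affinity, rounded up.
  // Computes ceil(totalFragments * affinityNodes / totalNodes) in integer arithmetic.
  static int fragmentsForAffinityNodes(int totalFragments, int affinityNodes, int totalNodes) {
    return (int) (((long) totalFragments * affinityNodes + totalNodes - 1) / totalNodes);
  }
}
```

    With nodes A, B, C, 10 units, minCount 3 and maxCount 4, filling each
    node to maxCount first would give 4, 4, 2 and leave C below minCount,
    while the two-pass version gives 4, 3, 3. Under the assumed share
    formula, 7 fragments with 2 of 3 nodes having affinity gives 4 when
    rounded down and 5 when rounded up.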

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ppadma/drill DRILL-5304

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/766.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #766
    
----
commit 49cf9f0b54d8c0ea15c3d6a59f99b8e23870104e
Author: Padma Penumarthy <[email protected]>
Date:   2017-02-28T02:32:24Z

    DRILL-5304: Queries fail intermittently when there is skew in data distribution

----


