[ 
https://issues.apache.org/jira/browse/DRILL-4147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15036014#comment-15036014
 ] 

ASF GitHub Bot commented on DRILL-4147:
---------------------------------------

GitHub user hsuanyi opened a pull request:

    https://github.com/apache/drill/pull/288

    DRILL-4147: Change UnionPrel's DrillDistributionTrait to be ANY to al…

    …low Union-All to be done in parallel

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/hsuanyi/incubator-drill DRILL-4147

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/288.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #288
    
----
commit 9f31a4c04c2cb219237519070b35d5fae3010908
Author: Hsuan-Yi Chu <[email protected]>
Date:   2015-12-02T00:46:51Z

    DRILL-4147: Change UnionPrel's DrillDistributionTrait to be ANY to allow 
Union-All to be done in parallel

----


> Union All operator runs in a single fragment
> --------------------------------------------
>
>                 Key: DRILL-4147
>                 URL: https://issues.apache.org/jira/browse/DRILL-4147
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: amit hadke
>            Assignee: Sean Hsuan-Yi Chu
>
> A User noticed that running select  from a single directory is much faster 
> than union all on two directories.
> (https://drill.apache.org/blog/2014/12/09/running-sql-queries-on-amazon-s3/#comment-2349732267)
>  
> It seems like UNION ALL operator doesn't parallelize sub scans (its using 
> SINGLETON for distribution type). Everything is ran in single fragment.
> We may have to use SubsetTransformer in UnionAllPrule.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to