[ 
https://issues.apache.org/jira/browse/TAJO-584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13894742#comment-13894742
 ] 

Tajo QA commented on TAJO-584:
------------------------------

{color:red}*-1 overall.*{color}  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12627650/TAJO-584_20140208_01%3A51%3A59.patch
  against master revision 4179a7c.

    {color:green}+1 @author.{color}  The patch does not contain any @author 
tags.

    {color:green}+1 tests included.{color}  The patch appears to include 18 new 
or modified test files.

    {color:green}+1 javac.{color}  The applied patch does not increase the 
total number of javac compiler warnings.

    {color:green}+1 javadoc.{color}  The applied patch does not increase the 
total number of javadoc warnings.

    {color:green}+1 checkstyle.{color}  The patch generated 0 code style errors.

    {color:red}-1 findbugs.{color}  The patch appears to introduce 214 new 
Findbugs (version 1.3.9) warnings.

    {color:green}+1 release audit.{color}  The applied patch does not increase 
the total number of release audit warnings.

    {color:green}+1 core tests.{color}  The patch passed unit tests in 
tajo-catalog/tajo-catalog-common tajo-client tajo-core/tajo-core-backend 
tajo-core/tajo-core-pullserver tajo-storage.

Test results: 
https://builds.apache.org/job/PreCommit-TAJO-Build/113//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TAJO-Build/113//artifact/incubator-tajo/patchprocess/newPatchFindbugsWarningstajo-catalog-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TAJO-Build/113//artifact/incubator-tajo/patchprocess/newPatchFindbugsWarningstajo-core-backend.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-TAJO-Build/113//artifact/incubator-tajo/patchprocess/newPatchFindbugsWarningstajo-core-pullserver.html
Console output: https://builds.apache.org/job/PreCommit-TAJO-Build/113//console

This message is automatically generated.

> Improve distributed merge sort
> ------------------------------
>
>                 Key: TAJO-584
>                 URL: https://issues.apache.org/jira/browse/TAJO-584
>             Project: Tajo
>          Issue Type: Improvement
>          Components: distributed query plan, physical operator
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.8-incubating
>
>         Attachments: TAJO-584.patch, TAJO-584_20140208_01:51:59.patch
>
>
> In Tajo, sort operator is similar to merge sort, and it works in a 
> distributed manner. The first sort phase sorts each fragment in local 
> machine, the intermediate data are shuffled in range partition, and then the 
> second sort phase in each node sorts the range-partitioned data.
> However, the second sort phase reads all shuffled data via one scanner. It 
> misses the opportunity to exploit already-sorted data. This patch improves 
> the second sort phase to merge directly multiple already-sorted intermediate 
> data sets. It significantly reduces the response time of sort queries.
> I carried out some simple benchmark with the following query on TPC-H 100GB 
> data sets:
> {code:sql}
> select l_orderkey from lineitem order by l_orderkey;
> {code}
> The lineitem table occupies 75GB. The query response time are dramatically 
> reduced from 480 to 260 secs. This patch exploits the design of TAJO-36. So, 
> this patch requires TAJO-36.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to