[ 
https://issues.apache.org/jira/browse/TAJO-574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888629#comment-13888629
 ] 

Hudson commented on TAJO-574:
-----------------------------

SUCCESS: Integrated in Tajo-master-build #48 (See 
[https://builds.apache.org/job/Tajo-master-build/48/])
TAJO-574: Add a sort-based physical executor for column partition store. 
(hyunsik: 
https://git-wip-us.apache.org/repos/asf?p=incubator-tajo.git&a=commit&h=10c599f4b057308eca7ac8d5d7cc2542a69f0524)
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/HashBasedColPartitionStoreExec.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/ColumnPartitionedTableStoreExec.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/ColPartitionStoreExec.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/enforce/Enforcer.java
* CHANGES.txt
* tajo-core/tajo-core-backend/src/main/proto/TajoWorkerProtocol.proto
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/PhysicalExecutorVisitor.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/BasicPhysicalExecutorVisitor.java
* 
tajo-catalog/tajo-catalog-common/src/main/java/org/apache/tajo/catalog/statistics/StatisticsUtil.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/SortBasedColPartitionStoreExec.java
* 
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/PhysicalPlannerImpl.java
* 
tajo-core/tajo-core-backend/src/test/java/org/apache/tajo/engine/planner/physical/TestPhysicalPlanner.java


> Add a sort-based physical executor for column partition store
> -------------------------------------------------------------
>
>                 Key: TAJO-574
>                 URL: https://issues.apache.org/jira/browse/TAJO-574
>             Project: Tajo
>          Issue Type: New Feature
>          Components: physical operator
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.8-incubating
>
>         Attachments: TAJO-574.patch
>
>
> ColumnPartitionStoreExec keeps numerous open files while it is storing all 
> data. In addition, it's random write gives burden to HDFS namenode.
> To solve this problem, I would like to propose a sort-based physical executor 
> for column partition store. It assumes that input tuples are sorted in an 
> ascending or descending order of partition keys. It means that it needs extra 
> sort operation. But, it opens only one file simultaneously. It writes all 
> data sequentially. In many cases, it would be the best choice for column 
> partition store.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to