[
https://issues.apache.org/jira/browse/TAJO-574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888629#comment-13888629
]
Hudson commented on TAJO-574:
-----------------------------
SUCCESS: Integrated in Tajo-master-build #48 (See
[https://builds.apache.org/job/Tajo-master-build/48/])
TAJO-574: Add a sort-based physical executor for column partition store.
(hyunsik:
https://git-wip-us.apache.org/repos/asf?p=incubator-tajo.git&a=commit&h=10c599f4b057308eca7ac8d5d7cc2542a69f0524)
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/HashBasedColPartitionStoreExec.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/ColumnPartitionedTableStoreExec.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/ColPartitionStoreExec.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/enforce/Enforcer.java
* CHANGES.txt
* tajo-core/tajo-core-backend/src/main/proto/TajoWorkerProtocol.proto
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/PhysicalExecutorVisitor.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/BasicPhysicalExecutorVisitor.java
*
tajo-catalog/tajo-catalog-common/src/main/java/org/apache/tajo/catalog/statistics/StatisticsUtil.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/physical/SortBasedColPartitionStoreExec.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/engine/planner/PhysicalPlannerImpl.java
*
tajo-core/tajo-core-backend/src/test/java/org/apache/tajo/engine/planner/physical/TestPhysicalPlanner.java
> Add a sort-based physical executor for column partition store
> -------------------------------------------------------------
>
> Key: TAJO-574
> URL: https://issues.apache.org/jira/browse/TAJO-574
> Project: Tajo
> Issue Type: New Feature
> Components: physical operator
> Reporter: Hyunsik Choi
> Assignee: Hyunsik Choi
> Fix For: 0.8-incubating
>
> Attachments: TAJO-574.patch
>
>
> ColumnPartitionStoreExec keeps numerous open files while it is storing all
> data. In addition, it's random write gives burden to HDFS namenode.
> To solve this problem, I would like to propose a sort-based physical executor
> for column partition store. It assumes that input tuples are sorted in an
> ascending or descending order of partition keys. It means that it needs extra
> sort operation. But, it opens only one file simultaneously. It writes all
> data sequentially. In many cases, it would be the best choice for column
> partition store.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)