[ https://issues.apache.org/jira/browse/HIVE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12689794#action_12689794 ]
He Yongqiang commented on HIVE-352: ----------------------------------- Thanks, Raghotham Murthy. Besides these two posts, there are also several useful papers,like C-Store: A Column-oriented DBMS Column-Stores vs. Row-Stores- How Different Are They Really-sigmod08 A Comparison of C-Store and Row-Store in a Common Framework Materialization Strategies in a Column-Oriented DBMS. Integrating compression and execution in column-oriented database systems In these papers, which are written mostly(all?) by people in vertica, they place most emphasis on the column-oriented execution layer together with a column storage layer. I totally agree with these opinions. And actually we observed that operators with map-reduce approach have many differences with the ones implemented in systems like CStore. And we also found that bitmap compression can extremely reduce the execution time. So i guess we can first try to support a column storage layer, and then we can add some column oriented operators and column-specific compression algorithms. I will try to provide a small prototype of the storage layer as soon as possible. > Make Hive support column based storage > -------------------------------------- > > Key: HIVE-352 > URL: https://issues.apache.org/jira/browse/HIVE-352 > Project: Hadoop Hive > Issue Type: New Feature > Reporter: He Yongqiang > > column based storage has been proven a better storage layout for OLAP. > Hive does a great job on raw row oriented storage. In this issue, we will > enhance hive to support column based storage. > Acctually we have done some work on column based storage on top of hdfs, i > think it will need some review and refactoring to port it to Hive. > Any thoughts? -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.