[ 
https://issues.apache.org/jira/browse/TAJO-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14164613#comment-14164613
 ] 

ASF GitHub Bot commented on TAJO-838:
-------------------------------------

Github user jihoonson commented on the pull request:

    https://github.com/apache/tajo/pull/192#issuecomment-58456061
  
    Hi Hyunsik, thanks for your comment.
    
    The index size is 2.4 GB while the size of input table is 41 GB. As you 
commented, the index size is quite large, because the index is built for only 
one column. 
    
    Furthermore, I used a straight forward approach to utilize the current bst 
index implementation. That is, multiple bst indexes are created for each hdfs 
block, and stored in HDFS. This may cause remote accesses during index access. 
    
    After finishing this issue, we can discuss the required features related to 
the next generation index structure. 
    
    Sincerely,
    Jihoon


> Improve query planner to utilize index
> --------------------------------------
>
>                 Key: TAJO-838
>                 URL: https://issues.apache.org/jira/browse/TAJO-838
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: planner/optimizer
>            Reporter: Jihoon Son
>            Assignee: Jihoon Son
>            Priority: Minor
>
> Index can improve the query performance when the selectivity of query is high.
> Thus, query planner should decide whether index is used or not for a given 
> query.
> The selectivity can be guessed using statistics.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to