[ 
https://issues.apache.org/jira/browse/TAJO-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061573#comment-15061573
 ] 

ASF GitHub Bot commented on TAJO-2007:
--------------------------------------

Github user hyunsik commented on the pull request:

    https://github.com/apache/tajo/pull/900#issuecomment-165352379
  
    @jihoonson Thank you for your comments. As we discussed in an offline, it 
may be hard to handle the problem in this issue. Instead, I just created the 
another jira [1] and add ``FIXME`` tags in the unit tests. We can resolve this 
problem after we introduce hints.
    
    [1] https://issues.apache.org/jira/browse/TAJO-2026


> By default, Optimizer should use the table volume in TableStat.
> ---------------------------------------------------------------
>
>                 Key: TAJO-2007
>                 URL: https://issues.apache.org/jira/browse/TAJO-2007
>             Project: Tajo
>          Issue Type: Improvement
>          Components: Planner/Optimizer
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.12.0, 0.11.1
>
>
> Currently, the optimizer by default gets table volumes through storage 
> manager and employ them for join optimization. But, in some cases, it causes 
> performance degradation because aggregating all file volumes is not cheap in 
> large partitioned tables on S3 or HDFS.
> So, this patch improves TableStatUpdateRewriter to use table volumes of 
> TableStat by default, and it also adds a session variable 'USE_TABLE_VOLUME' 
> to allow the optimizer to use the table volume through storage handler.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to