[ https://issues.apache.org/jira/browse/IMPALA-6897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16673804#comment-16673804 ]

Tim Armstrong commented on IMPALA-6897:
---------------------------------------

Could we simplify this further and just include the number of files and the 
total file size in the profile for all the files scanned?
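
A rough sketch of what that summary could look like, for illustration only. 
This is not Impala code: the names ScanFileSummary, summarize_scanned_files 
and format_profile_line are hypothetical, and the input is assumed to be a 
plain list of per-file sizes in bytes.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class ScanFileSummary:
    """Hypothetical per-query summary: file count and total bytes scanned."""
    num_files: int
    total_bytes: int

    @property
    def avg_file_bytes(self) -> float:
        # Guard against division by zero when no files were scanned.
        return self.total_bytes / self.num_files if self.num_files else 0.0


def summarize_scanned_files(file_sizes: Iterable[int]) -> ScanFileSummary:
    """Count the files and add up their sizes in a single pass."""
    num_files = 0
    total_bytes = 0
    for size in file_sizes:
        num_files += 1
        total_bytes += size
    return ScanFileSummary(num_files, total_bytes)


def format_profile_line(summary: ScanFileSummary) -> str:
    """Render the summary as one line of text (the format is an assumption)."""
    return (f"Files scanned: {summary.num_files}, "
            f"total size: {summary.total_bytes} bytes")


summary = summarize_scanned_files([1024, 2048, 512])
print(format_profile_line(summary))
```

With just these two numbers in the profile, a reader can work out the average 
file size and judge whether a scan was dominated by small files, without the 
catalog having to pick a threshold.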

> Catalog server should flag tables with large number of small files
> ------------------------------------------------------------------
>
>                 Key: IMPALA-6897
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6897
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Catalog
>    Affects Versions: Impala 2.13.0
>            Reporter: bharath v
>            Priority: Major
>              Labels: ramp-up, supportability
>
> Since the Catalog has all the file metadata available, it should help 
> flag tables with a large number of small files. This information can be 
> propagated to the coordinators and reflected in the query profiles, 
> as we already do for "missing stats".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

