[
https://issues.apache.org/jira/browse/IMPALA-6897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16673804#comment-16673804
]
Tim Armstrong commented on IMPALA-6897:
---------------------------------------
Could we simplify this further and just include the number of files and the
total file size, across all the files scanned, in the profile?
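
A minimal sketch of that idea, not Impala code: given the sizes of the files a
scan touches, report only the file count and the total size, and let the reader
derive the average. The names ScanFileSummary, summarize and profile_line are
hypothetical, as is the format of the profile line.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class ScanFileSummary:
    """Hypothetical per-scan summary: file count and total bytes only."""
    num_files: int
    total_bytes: int

    @property
    def avg_bytes(self) -> float:
        # Derived value; a low average with a high count suggests small files.
        return self.total_bytes / self.num_files if self.num_files else 0.0


def summarize(file_sizes: Iterable[int]) -> ScanFileSummary:
    """Accumulate the count and total size of the scanned files in one pass."""
    num_files = 0
    total_bytes = 0
    for size in file_sizes:
        num_files += 1
        total_bytes += size
    return ScanFileSummary(num_files, total_bytes)


def profile_line(summary: ScanFileSummary) -> str:
    """Render the summary as one line; the format is an assumption."""
    return (f"Files scanned: {summary.num_files}, "
            f"total size: {summary.total_bytes} bytes")
```

With only these two numbers in the profile, no threshold for "small" has to be
chosen up front; whoever reads the profile can judge the average file size.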
> Catalog server should flag tables with large number of small files
> ------------------------------------------------------------------
>
> Key: IMPALA-6897
> URL: https://issues.apache.org/jira/browse/IMPALA-6897
> Project: IMPALA
> Issue Type: Improvement
> Components: Catalog
> Affects Versions: Impala 2.13.0
> Reporter: bharath v
> Priority: Major
> Labels: ramp-up, supportability
>
> Since the Catalog has all the file metadata available, it should help
> flag tables with a large number of small files. This information can be
> propagated to the coordinators and reflected in the query profiles,
> as we do for "missing stats".
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)