[ 
https://issues.apache.org/jira/browse/IMPALA-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18037963#comment-18037963
 ] 

Sanjana Malhotra commented on IMPALA-14521:
-------------------------------------------

Draft content:
h2.  
h2. Limiting File Metadata Fetched in Catalog Requests

To prevent Catalog Service (catalogd) Out-of-Memory (OOM) errors when fetching 
metadata for tables with millions of files, the new configuration flag 
{{catalog_partial_fetch_max_files}} has been introduced.

This flag limits the maximum number of file descriptors returned in a single 
catalog fetch response.
 * *Default Value:* 1,000,000 files.

 * {*}Behavior{*}: If a request exceeds this limit, Catalogd truncates the 
response at the partition level. The Impala coordinator then automatically 
sends subsequent requests to fetch the remaining metadata, and it detects any 
version changes to force a query replan, ensuring metadata consistency.

> Document for catalog_partial_fetch_max_files
> --------------------------------------------
>
>                 Key: IMPALA-14521
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14521
>             Project: IMPALA
>          Issue Type: Documentation
>            Reporter: Quanlong Huang
>            Assignee: Sanjana Malhotra
>            Priority: Major
>
> catalog_partial_fetch_max_files is a startup flag of catalogd added in 
> IMPALA-11402. We need to document it somewhere.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to