WencongLiu opened a new issue, #3260:
URL: https://github.com/apache/paimon/issues/3260

   ### Search before asking
   
   - [X] I searched in the [issues](https://github.com/apache/paimon/issues) 
and found nothing similar.
   
   
   ### Motivation
   
   Currently, `FlinkRecomputeStatisticsProgram` first tries to obtain 
statistics through `Catalog#listPartitions()`. Only if that call returns an 
empty result does it fall back to `SupportsStatisticReport#reportStatistics`.
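   
The fallback order described above can be sketched roughly as follows. This is 
only an illustration under stated assumptions, not Flink's or Paimon's actual 
code: the class `StatsFallbackSketch`, the method `resolveRowCount`, and its 
three functional parameters are hypothetical stand-ins for the catalog and the 
source, and statistics are reduced to a single row count.

```java
import java.util.List;
import java.util.function.LongSupplier;
import java.util.function.Supplier;
import java.util.function.ToLongFunction;

// Hypothetical sketch of the fallback order; none of these names exist in Flink or Paimon.
final class StatsFallbackSketch {

    /**
     * Returns a row count, preferring catalog partition statistics and falling
     * back to the source-reported statistics only when no partitions are listed.
     *
     * @param listPartitions   stand-in for Catalog#listPartitions()
     * @param catalogStats     stand-in for computing statistics from listed partitions
     * @param reportStatistics stand-in for SupportsStatisticReport#reportStatistics
     */
    static long resolveRowCount(
            Supplier<List<String>> listPartitions,
            ToLongFunction<List<String>> catalogStats,
            LongSupplier reportStatistics) {
        List<String> partitions = listPartitions.get();
        if (partitions.isEmpty()) {
            // Empty result: use the statistics reported by the source instead.
            return reportStatistics.getAsLong();
        }
        // Non-empty result: statistics come from the catalog's partitions.
        return catalogStats.applyAsLong(partitions);
    }
}
```

In this sketch, the source-reported path is reached only when the partition 
listing is empty, which is why the listing behavior determines which path runs.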
   
   Paimon's implementation of `Catalog#listPartitions()` scans the entire table 
to gather statistics, which is very slow. In contrast, its implementation of 
`SupportsStatisticReport#reportStatistics` only collects statistics for the 
filtered data, which is much faster.
   
   Therefore, Paimon's implementation of `Catalog#listPartitions()` should 
return an empty result when it is called by `FlinkRecomputeStatisticsProgram`, 
so that the program falls back to the faster `reportStatistics` path.
   
   ### Solution
   
   _No response_
   
   ### Anything else?
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [X] I'm willing to submit a PR!

