ssandona commented on issue #9870: URL: https://github.com/apache/hudi/issues/9870#issuecomment-1764416380
Hi, for **OptionC** I did not specify any value for `hoodie.metadata.index.column.stats.column.list` so according to the [Hudi doc](https://hudi.apache.org/docs/0.13.1/configurations) statistics are collected for all the tables fields. I guess you are not able to replicate as this is related to range pruning and so to the specific min/max boundaries of the parquet files. In addition I see you are using Hudi 0.14.0 so I don't know if this was fixed there. - Can you test the same code for Hudi 0.13.1? - Is there an easy way to extract column statistics and print them out so I can check on my end if those are properly calculated? I would like to understand if in my case stats are wrongly calculated or wrongly used -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
