maxburke commented on issue #1433: URL: https://github.com/apache/arrow-datafusion/issues/1433#issuecomment-995339732
That's really interesting; thanks for finding that @alamb ! We aren't doing anything special with our Parquet generation; we basically write them straight from Arrow (via the Rust libraries), so perhaps there's an underlying bug in the Rust Parquet statistics generation code that we can start investigating. Being able to skip parquet pruning would be useful as well. From a curiosity point of view, what should the min/max values be for string columns? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
