mapleFU commented on issue #8643: URL: https://github.com/apache/arrow-rs/issues/8643#issuecomment-3475747548
> I don't quite understand this proposal I don't know should we separate to: 1. Decode all row-groups with limited columns ( or a selected set of row-groups, if compute engine has row-groups info in metadata), maybe only decode the filter columns min-max stats? 2. If we have all row-groups with limited columns, prune row-groups by statisitcs 3. Then, decode the Column Index by filter columns and limited row-groups, and fill it to `ParquetMetadata`? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
