alamb commented on PR #7307: URL: https://github.com/apache/arrow-rs/pull/7307#issuecomment-3446585278
> One concern I have with the approach here is how to provide exact row numbers if we start selectively reading row group metadata. If we don't have metadata for all preceding row groups, we can't know the starting row number. This at least argues for reverting back to using an `Option` for the start index. I don't think we will be able to provide row numbers if we don't have all the preceding row group metadata Given the main usecase I have heard so far is indexing and delete vectors, which require exact and accurate row numbers, I think it would be better if the reader simply returned an error if it was configured to read row numbers but didn't have enough information to do so -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
