n3world commented on pull request #10202: URL: https://github.com/apache/arrow/pull/10202#issuecomment-870855058
> > That would be ok with me, but the row number would not be available for parallel reading, right? > > Hmm, an interesting point. The parallel implementation would indeed struggle to know the correct row number. A byte offset to the start of the row would be possible however. Updating the reporting to be row number, where available, and byte offset of the start of the row would be useful for the parallel parsers which cannot easily know row number. If I were to do that it seems like it should be a separate MR done before this and on top of/after ARROW-11889. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org