raduteo opened a new pull request #7548: URL: https://github.com/apache/arrow/pull/7548
Follow up on the "Two proposals for expanding arrow Table API (virtual arrays and random access)" thread I have laid out a number of components illustrating how I see an arrow DataFrame implementation connecting to the current arrow structure while also allowing for virtual data source, thread safe mutation, shallow copying and fragmentation control. The best way to read the header is probably is either: - bottom up, starting from DataFrameOperationExamples::puttingItAllTogether for those inclined to see it all in action first or - top down, where I hopefully did an ok job introducing each component. I tried to strike a balance between being overwhelmingly verbose and hopelessly vague, so please flag any aspect that need clarification and I will update accordingly ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org