GitHub user mkleen edited a comment on the discussion: JSON support in Arrow and DataFusion
This paper is highly relevant for handling nested datastructures efficiently with parquet. https://db.in.tum.de/~rey/papers/nestedparquet_rey.pdf GitHub link: https://github.com/apache/datafusion/discussions/9103#discussioncomment-13352952 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
