Luosuu commented on issue #40981: URL: https://github.com/apache/arrow/issues/40981#issuecomment-2038076001
Thanks so much for providing such helpful materials; I think I understand this better now.

The first point seems fine if I only care about a specific column (in that case I don't think I need to reassemble the entire row structure, since I don't care about the other columns).

For the second point, if I understand correctly, you are referring to https://arrow.apache.org/blog/2022/10/08/arrow-parquet-encoding-part-2/ so for this Parquet schema:

```
message schema {
  optional int32 a;
  required group b {
    optional int32 b1;
    required int32 b2;
  }
  optional group c {
    required int32 c1;
  }
  optional group d {
    required int32 d1;
    optional int32 d2;
  }
}
```

"a", "b", "c", and "d" are stored in different pages. Do I understand this right?

Thank you very much again.
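
To make the question concrete, here is a minimal sketch of how I read the shredding model in the linked post: each primitive leaf field, rather than each top-level field, becomes its own column. The nested-tuple `SCHEMA` and the `leaf_columns` helper are hypothetical names made up for illustration (plain Python, not a pyarrow or Parquet API), and the sketch assumes every `optional` field on a path adds one definition level and that there are no repeated fields.

```python
# Hypothetical in-memory form of the schema above (not a pyarrow API):
# each field is (name, repetition, children); children is None for a primitive.
SCHEMA = [
    ("a", "optional", None),
    ("b", "required", [
        ("b1", "optional", None),
        ("b2", "required", None),
    ]),
    ("c", "optional", [
        ("c1", "required", None),
    ]),
    ("d", "optional", [
        ("d1", "required", None),
        ("d2", "optional", None),
    ]),
]


def leaf_columns(fields, prefix=(), def_level=0):
    """Yield (dotted_path, max_definition_level) for every primitive leaf."""
    for name, repetition, children in fields:
        path = prefix + (name,)
        # Assumption: each optional field on the path adds one definition level.
        level = def_level + (1 if repetition == "optional" else 0)
        if children is None:
            yield ".".join(path), level
        else:
            # Groups are not stored themselves; recurse into their children.
            yield from leaf_columns(children, path, level)


columns = list(leaf_columns(SCHEMA))
for path, level in columns:
    print(f"{path}: max definition level {level}")
```

If this reading is right, the schema yields six leaf columns (`a`, `b.b1`, `b.b2`, `c.c1`, `d.d1`, `d.d2`), each stored in its own column chunk with its own pages, rather than one set of pages per top-level field.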
