mmaitre314 commented on issue #5064:
URL: https://github.com/apache/arrow-rs/issues/5064#issuecomment-1806931523
That indeed looks related to #2394. Reading through the issue conversation,
I also hit the error ```called `Option::unwrap()` on a `None` value```
mentioned in related #3745 (although I don't have a good repro right now).
It sounds like there isn't enough volunteer time to expand `get_row_iter()`.
Would reviewing pull requests in that code be an option? If issues can be
tackled as byte-size chunks, I can likely contribute. So far testing decoding
`REPEATED` actually looked pretty good. For instance, this non-trivial schema
seems to be handled fine (at least in the case of one repetition):
```
message schema {
REQUIRED BYTE_ARRAY value1 (UTF8);
OPTIONAL BYTE_ARRAY value2 (UTF8);
REQUIRED INT64 value3;
REPEATED group level1 {
REQUIRED BYTE_ARRAY value4 (UTF8);
OPTIONAL INT64 value5;
OPTIONAL BYTE_ARRAY value6 (UTF8);
}
}
```
Doc/example contribution could also be fair game (I was also thinking about
adding something around ObjectStore + Parquet async -- I had a hard time
figuring this one out but it actually works really well and I think I
understand that part enough now to expand the docs).
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]