novemberkilo edited a comment on issue #1036:
URL: https://github.com/apache/arrow-rs/issues/1036#issuecomment-997521410
@nevi-me @alamb I started with `json2parquet` and found the shape of the
RecordBatch that corresponded to `{"emptylist": []}` (see below). This then
guided me to writing the test that I've committed for now. I [get the same
panic and error
message](https://github.com/apache/arrow-rs/runs/4577227397?check_suite_focus=true#step:6:1648)
so I think I am on the right track. Any suggestions for where the actual fix
might be? I'm spelunking around but if either of you (or anyone else familiar
with the code here) can help orient me, that would help.
I ran `json2parquet` on `{"emptylist": []}` and placed a `dbg!` on what is
sent to the writer:
```
[src/main.rs:182] &batch = Ok(
RecordBatch {
schema: Schema {
fields: [
Field {
name: "emptylist",
data_type: List(
Field {
name: "item",
data_type: Null,
nullable: true,
dict_id: 0,
dict_is_ordered: false,
metadata: None,
},
),
nullable: true,
dict_id: 0,
dict_is_ordered: false,
metadata: None,
},
],
metadata: {},
},
columns: [
ListArray
[
NullArray(0),
],
],
},
)
thread 'main' panicked at 'Cannot filter indices on a non-primitive array,
found List(true)',
/home/navin/.cargo/registry/src/github.com-1ecc6299db9ec823/parquet-6.3.0/src/arrow/levels.rs:757:18
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]