[
https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17611823#comment-17611823
]
Matthias Vallentin edited comment on ARROW-17839 at 10/1/22 9:37 AM:
---------------------------------------------------------------------
Thanks for the guidance, Joris!
Regarding the dictionary, nowhere in the code I use {{{}int8{}}}. Where do I
implicitly "commit" to {{int8}} without knowing it?
EDIT: I think I found the issue. Going through {{pa.array(...,
type=dictionary_type)}} is not creating indices of the type as given by
{{{}dictionary_type{}}}. I had to go through {{pa.DictionaryArray.from_arrays}}
with explicitly typed arrays. (The detail fix is here:
https://github.com/tenzir/vast/pull/2606/files)
was (Author: mavam):
Thanks for the guidance, Joris!
Regarding the dictionary, nowhere in the code I use {{{}int8{}}}. Where do I
implicitly "commit" to \{{int8}} without knowing it?
> [Python] Cannot create RecordBatch with nested struct containing extension
> type
> -------------------------------------------------------------------------------
>
> Key: ARROW-17839
> URL: https://issues.apache.org/jira/browse/ARROW-17839
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 9.0.0
> Environment: macOS 12.5.1 on an Apple M1 Ultra.
> Reporter: Matthias Vallentin
> Priority: Blocker
> Attachments: enum.py, example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to
> extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:java}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address
> ({{{}pa.binary(16){}}}).
> Please find attached a self-contained example that illustrates the issue.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)