[
https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612492#comment-17612492
]
Joris Van den Bossche commented on ARROW-17839:
-----------------------------------------------
> Going through pa.array(..., type=dictionary_type) is not creating indices of
> the type as given by dictionary_type
That would be a bug IMO, but I can't directly reproduce this. In the below
example, I specify int8, and it does return a dictionary type using int8:
{code:python}
>>> pa.array(['a', 'b', 'a'], pa.dictionary(pa.int8(), pa.string())).type
DictionaryType(dictionary<values=string, indices=int8, ordered=0>)
{code}
> [Python] Cannot create RecordBatch with nested struct containing extension
> type
> -------------------------------------------------------------------------------
>
> Key: ARROW-17839
> URL: https://issues.apache.org/jira/browse/ARROW-17839
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 9.0.0
> Environment: macOS 12.5.1 on an Apple M1 Ultra.
> Reporter: Matthias Vallentin
> Priority: Blocker
> Attachments: enum.py, example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to
> extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:java}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address
> ({{{}pa.binary(16){}}}).
> Please find attached a self-contained example that illustrates the issue.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)