[ 
https://issues.apache.org/jira/browse/ARROW-17839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612492#comment-17612492
 ] 

Joris Van den Bossche commented on ARROW-17839:
-----------------------------------------------

> Going through pa.array(..., type=dictionary_type) is not creating indices of 
> the type as given by dictionary_type

That would be a bug IMO, but I can't reproduce it directly. In the example 
below, I specify int8, and pa.array() does return a dictionary type with int8 
indices:

{code:python}
>>> import pyarrow as pa
>>> pa.array(['a', 'b', 'a'], pa.dictionary(pa.int8(), pa.string())).type
DictionaryType(dictionary<values=string, indices=int8, ordered=0>)
{code}

> [Python] Cannot create RecordBatch with nested struct containing extension 
> type
> -------------------------------------------------------------------------------
>
>                 Key: ARROW-17839
>                 URL: https://issues.apache.org/jira/browse/ARROW-17839
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 9.0.0
>         Environment: macOS 12.5.1 on an Apple M1 Ultra.
>            Reporter: Matthias Vallentin
>            Priority: Blocker
>         Attachments: enum.py, example.py
>
>
> I'm running into the following issue:
> {code:java}
> pyarrow.lib.ArrowNotImplementedError: Unsupported cast to 
> extension<vast.address<AddressType>> from fixed_size_binary[16]{code}
> Use case: I want to create a record batch that contains this type:
> {code:python}
> pa.struct([("address", AddressType()), ("length", pa.uint8())]){code}
> Here, {{AddressType}} is an extension type that models an IP address 
> ({{pa.binary(16)}}).
> Please find attached a self-contained example that illustrates the issue.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)