[ 
https://issues.apache.org/jira/browse/ARROW-10172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17208105#comment-17208105
 ] 

Artem KOZHEVNIKOV commented on ARROW-10172:
-------------------------------------------

btw, casting to large_string is not supported either (it may be related):
{code:python}
str_array.cast(pa.large_string())
ArrowNotImplementedError                  Traceback (most recent call last)
<ipython-input-2-7399d0bc10ae> in <module>
----> 1 str_array.cast(pa.large_string())

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/table.pxi in 
pyarrow.lib.ChunkedArray.cast()

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/compute.py in 
cast(arr, target_type, safe)
     85     else:
     86         options = _pc.CastOptions.unsafe(target_type)
---> 87     return call_function("cast", [arr], options)
     88 
     89 

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/_compute.pyx in 
pyarrow._compute.call_function()

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/_compute.pyx in 
pyarrow._compute.Function.call()

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/error.pxi in 
pyarrow.lib.pyarrow_internal_check_status()

/opt/conda/envs/model/lib/python3.7/site-packages/pyarrow/error.pxi in 
pyarrow.lib.check_status()

ArrowNotImplementedError: Unsupported cast from string to large_utf8 using 
function cast_large_string
{code}

> concat_arrays requires upcast for large array
> ---------------------------------------------
>
>                 Key: ARROW-10172
>                 URL: https://issues.apache.org/jira/browse/ARROW-10172
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 1.0.1
>            Reporter: Artem KOZHEVNIKOV
>            Priority: Major
>
> I'm sorry if this was already reported, but there's an overflow issue in 
> the concatenation of large arrays:
> {code:python}
> In [1]: import pyarrow as pa
> In [2]: str_array = pa.array(['a' * 128] * 10**8)
> In [3]: large_array = pa.concat_arrays([str_array] * 50)
> Segmentation fault (core dumped)
> {code}
> I suppose that this should be handled by upcasting to large_string.
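
For scale, here is a rough back-of-the-envelope check of the sizes in the snippet above. It is only a sketch: it assumes the regular string type addresses its data with 32-bit signed offsets (large_string uses 64-bit ones), and the helper name fits_int32_offsets is made up for illustration. It does not call pyarrow and says nothing about where the segfault actually occurs.

```python
# Rough size check for the reproduction above; plain Python, no pyarrow needed.
# Assumption: a `string` array stores int32 offsets, so it can address at most
# 2**31 - 1 bytes of character data.
INT32_MAX = 2**31 - 1

bytes_per_value = 128      # len('a' * 128), one byte per character in UTF-8
values_per_array = 10**8   # length of str_array
n_copies = 50              # number of arrays passed to concat_arrays

bytes_per_array = bytes_per_value * values_per_array
bytes_total = bytes_per_array * n_copies


def fits_int32_offsets(n_bytes):
    """Hypothetical helper: True if n_bytes is addressable with int32 offsets."""
    return n_bytes <= INT32_MAX


print(bytes_per_array, fits_int32_offsets(bytes_per_array))
print(bytes_total, fits_int32_offsets(bytes_total))
```

Under that assumption, neither the single input nor the concatenated result fits in 32-bit offsets, which is consistent with the suggestion to upcast to large_string.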



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
