[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Antoine Pitrou (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Antoine Pitrou updated ARROW-6882:
--
Fix Version/s: 0.15.1

> cannot create a chunked_array from dictionary_encoding result
> -
>
> Key: ARROW-6882
> URL: https://issues.apache.org/jira/browse/ARROW-6882
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.15.0
>Reporter: Artem KOZHEVNIKOV
>Assignee: Joris Van den Bossche
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0, 0.15.1
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> I've experienced a strange error raise when trying to apply 
> `pa.chunked_array` directly on the indices of dictionary_encoding (code is 
> below). Making a memory view solves the problem.
> {code:python}
> import pyarrow as pa
> ca = pa.array(['a', 'a', 'b', 'b', 'c'])  
>  
> fca = ca.dictionary_encode()  
>  
> fca.indices   
>  
> 
> [
>   0,
>   0,
>   1,
>   1,
>   2
> ]
> pa.chunked_array([fca.indices])   
>  
> ---
> ArrowInvalid  Traceback (most recent call last)
>  in 
> > 1 pa.chunked_array([fca.indices])
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
>  in pyarrow.lib.chunked_array()
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
>  in pyarrow.lib.check_status()
> ArrowInvalid: Unexpected dictionary values in array of type int32
> # with another memory view it's  OK
> pa.chunked_array([fca.indices.view(fca.indices.type)]) 
> Out[45]: 
> 
> [
>   [
> 0,
> 0,
> 1,
> 1,
> 2
>   ]
> ]
>  {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-6882:
--
Labels: pull-request-available  (was: )

> cannot create a chunked_array from dictionary_encoding result
> -
>
> Key: ARROW-6882
> URL: https://issues.apache.org/jira/browse/ARROW-6882
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.15.0
>Reporter: Artem KOZHEVNIKOV
>Assignee: Joris Van den Bossche
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.15.1
>
>
> I've experienced a strange error raise when trying to apply 
> `pa.chunked_array` directly on the indices of dictionary_encoding (code is 
> below). Making a memory view solves the problem.
> {code:python}
> import pyarrow as pa
> ca = pa.array(['a', 'a', 'b', 'b', 'c'])  
>  
> fca = ca.dictionary_encode()  
>  
> fca.indices   
>  
> 
> [
>   0,
>   0,
>   1,
>   1,
>   2
> ]
> pa.chunked_array([fca.indices])   
>  
> ---
> ArrowInvalid  Traceback (most recent call last)
>  in 
> > 1 pa.chunked_array([fca.indices])
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
>  in pyarrow.lib.chunked_array()
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
>  in pyarrow.lib.check_status()
> ArrowInvalid: Unexpected dictionary values in array of type int32
> # with another memory view it's  OK
> pa.chunked_array([fca.indices.view(fca.indices.type)]) 
> Out[45]: 
> 
> [
>   [
> 0,
> 0,
> 1,
> 1,
> 2
>   ]
> ]
>  {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Artem KOZHEVNIKOV (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Artem KOZHEVNIKOV updated ARROW-6882:
-
Description: 
I've experienced a strange error raise when trying to apply `pa.chunked_array` 
directly on the indices of dictionary_encoding (code is below). Making a memory 
view solves the problem.
{code:python}
import pyarrow as pa
ca = pa.array(['a', 'a', 'b', 'b', 'c'])
   
fca = ca.dictionary_encode()
   
fca.indices 
   

[
  0,
  0,
  1,
  1,
  2
]

pa.chunked_array([fca.indices]) 
   
---
ArrowInvalid  Traceback (most recent call last)
 in 
> 1 pa.chunked_array([fca.indices])

~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
 in pyarrow.lib.chunked_array()

~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
 in pyarrow.lib.check_status()

ArrowInvalid: Unexpected dictionary values in array of type int32

# with another memory view it's  OK
pa.chunked_array([fca.indices.view(fca.indices.type)]) 
Out[45]: 

[
  [
0,
0,
1,
1,
2
  ]
]
 {code}

  was:
I've experienced a strange error raise when trying to apply `pa.chunked_array` 
directly on the indices of dictionary_encoding (code is below). Making a memory 
view solves the problem.
{code:python}
import pyarrow as pa
ca = pa.array(['a', 'a', 'b', 'b', 'c'])
   
fca = ca.dictionary_encode()
   
fca.indices 
   

[
  0,
  0,
  1,
  1,
  2
]

pa.chunked_array([fca.indices]) 
   
---
ArrowInvalid  Traceback (most recent call last)
 in 
> 1 pa.chunked_array([fca.indices])

~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
 in pyarrow.lib.chunked_array()

~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
 in pyarrow.lib.check_status()

ArrowInvalid: Unexpected dictionary values in array of type int32

# with another memory view it's  OK
pa.chunked_array([pa.Array.from_buffers(type=pa.int32(), 
length=len(fca.indices), buffers=fca.indices.buffers())]) 
Out[45]: 

[
  [
0,
0,
1,
1,
2
  ]
]
 {code}


> cannot create a chunked_array from dictionary_encoding result
> -
>
> Key: ARROW-6882
> URL: https://issues.apache.org/jira/browse/ARROW-6882
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.15.0
>Reporter: Artem KOZHEVNIKOV
>Priority: Major
> Fix For: 0.15.1
>
>
> I've experienced a strange error raise when trying to apply 
> `pa.chunked_array` directly on the indices of dictionary_encoding (code is 
> below). Making a memory view solves the problem.
> {code:python}
> import pyarrow as pa
> ca = pa.array(['a', 'a', 'b', 'b', 'c'])  
>  
> fca = ca.dictionary_encode()  
>  
> fca.indices   
>  
> 
> [
>   0,
>   0,
>   1,
>   1,
>   2
> ]
> pa.chunked_array([fca.indices])   
>  
> ---
> ArrowInvalid  Traceback (most recent call last)
>  in 
> > 1 pa.chunked_array([fca.indices])
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
>  in pyarrow.lib.chunked_array()
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
>  in pyarrow.lib.check_status()
> ArrowInvalid: Unexpected dictionary values in array of type int32
> # with 

[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van den Bossche updated ARROW-6882:
-
Fix Version/s: 0.15.1

> cannot create a chunked_array from dictionary_encoding result
> -
>
> Key: ARROW-6882
> URL: https://issues.apache.org/jira/browse/ARROW-6882
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.15.0
>Reporter: Artem KOZHEVNIKOV
>Priority: Major
> Fix For: 0.15.1
>
>
> I've experienced a strange error raise when trying to apply 
> `pa.chunked_array` directly on the indices of dictionary_encoding (code is 
> below). Making a memory view solves the problem.
> {code:python}
> import pyarrow as pa
> ca = pa.array(['a', 'a', 'b', 'b', 'c'])  
>  
> fca = ca.dictionary_encode()  
>  
> fca.indices   
>  
> 
> [
>   0,
>   0,
>   1,
>   1,
>   2
> ]
> pa.chunked_array([fca.indices])   
>  
> ---
> ArrowInvalid  Traceback (most recent call last)
>  in 
> > 1 pa.chunked_array([fca.indices])
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/table.pxi
>  in pyarrow.lib.chunked_array()
> ~/Projects/miniconda3/envs/pyarrow/lib/python3.7/site-packages/pyarrow/error.pxi
>  in pyarrow.lib.check_status()
> ArrowInvalid: Unexpected dictionary values in array of type int32
> # with another memory view it's  OK
> pa.chunked_array([pa.Array.from_buffers(type=pa.int32(), 
> length=len(fca.indices), buffers=fca.indices.buffers())]) 
> Out[45]: 
> 
> [
>   [
> 0,
> 0,
> 1,
> 1,
> 2
>   ]
> ]
>  {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)