[jira] [Created] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-94:


 Summary: Expand list example to clarify null vs empty list
 Key: ARROW-94
 URL: https://issues.apache.org/jira/browse/ARROW-94
 Project: Apache Arrow
  Issue Type: Bug
  Components: Format
Reporter: Micah Kornfield


The current spec seems to make the possibility of an empty list impossible (or 
at least conflates it with null).  




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223286#comment-15223286
 ] 

Wes McKinney commented on ARROW-94:
---

The only difference is the null bit -- you must propagate the current offset 
even through a stretch of nulls so that random access is preserved. Best to 
clarify in the spec, as you aren't the first person to bring this up. 
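
A minimal sketch of the point above, assuming a simplified in-memory layout (a Python list of booleans standing in for the validity bitmap, plus plain lists for offsets and values). The helper name `get_list` and the sample data are hypothetical and not taken from the spec. A null slot and an empty list both span zero values in the offsets, so only the validity bit tells them apart.

```python
def get_list(values, offsets, validity, i):
    """Return slot i of a list array: None if null, otherwise a (possibly empty) list."""
    if not validity[i]:
        return None
    # Offsets are carried forward across nulls, so slot i can always be
    # located with offsets[i] and offsets[i + 1] (random access).
    return values[offsets[i]:offsets[i + 1]]


# Logical data: [[1, 2], [], null, [3]]
values = [1, 2, 3]
offsets = [0, 2, 2, 2, 3]  # one more entry than there are slots
validity = [True, True, False, True]

decoded = [get_list(values, offsets, validity, i) for i in range(len(validity))]
```

Here slot 1 (empty) and slot 2 (null) have identical offset pairs `(2, 2)`; they decode differently only because of the validity entry.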

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  





[jira] [Commented] (ARROW-85) C++: memcmp can be avoided in Equal when comparing with the same Buffer

2016-04-03 Thread Kai Zheng (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223572#comment-15223572
 ] 

Kai Zheng commented on ARROW-85:


https://github.com/apache/arrow/pull/57

> C++: memcmp can be avoided in Equal when comparing with the same Buffer
> ---
>
> Key: ARROW-85
> URL: https://issues.apache.org/jira/browse/ARROW-85
> Project: Apache Arrow
>  Issue Type: Improvement
>Reporter: Kai Zheng
>
> It looks too expensive to use memcmp to compare two buffers. Instead, the 
> starting address and length/capacity would be good enough to use. Higher-level 
> code that relies on memcmp behaviour can do that comparison at a higher level.
> Update: memcmp should be avoided in Equal when comparing with the same 
> Buffer. In other cases, it is still needed to determine whether the contents 
> are the same.
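
The idea in the update can be sketched in Python as follows. This is an illustration under stated assumptions, not the C++ change in the pull request: the `Buffer` class, its `equals` method and the `byte_compares` counter are hypothetical names, and an object identity check stands in for comparing the starting address and length.

```python
class Buffer:
    """Hypothetical stand-in for a buffer wrapping a block of bytes."""

    def __init__(self, data):
        self.data = memoryview(data)
        self.byte_compares = 0  # illustration only: counts full content comparisons

    def equals(self, other):
        # Fast path: the same Buffer is trivially equal to itself,
        # so the byte-by-byte comparison can be skipped.
        if self is other:
            return True
        # Different sizes can never be equal; no need to look at the contents.
        if self.data.nbytes != other.data.nbytes:
            return False
        # Slow path: the contents still have to be compared.
        self.byte_compares += 1
        return self.data == other.data


a = Buffer(b"abc")
b = Buffer(b"abc")

same = a.equals(a)
compares_after_same = a.byte_compares
equal_copy = a.equals(b)
compares_after_copy = a.byte_compares
```

Comparing `a` with itself never reaches the content comparison, while comparing `a` with the distinct but equal `b` does.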





[jira] [Resolved] (ARROW-86) Python: Implement zero-copy Arrow-to-Pandas conversion

2016-04-03 Thread Wes McKinney (JIRA)

 [ 
https://issues.apache.org/jira/browse/ARROW-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-86.
---
Resolution: Fixed

Issue resolved by pull request 52
[https://github.com/apache/arrow/pull/52]

> Python: Implement zero-copy Arrow-to-Pandas conversion
> --
>
> Key: ARROW-86
> URL: https://issues.apache.org/jira/browse/ARROW-86
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>
> For this to work, we need:
>  * a check if the Arrow Array could be represented as Pandas Series using the 
> same memory buffer
>  * a {{numpy.ndarray}} subclass that keeps a reference to the Arrow Array
> With the NumPy C-API it is already possible to create a {{numpy.ndarray}} 
> instance that uses the Arrow Array memory buffer internally, but there is no 
> way to track whether the {{numpy.ndarray}} is still alive or whether the 
> underlying memory can be freed.
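
The lifetime problem described above can be illustrated with a standard-library analogue. This is not the NumPy subclass from the pull request: `ArrowLikeArray` and `OwnedView` are hypothetical stand-ins, and a `memoryview` plays the role of the zero-copy array. The sketch assumes CPython's reference counting; the point is only that a view holding a reference to its owner keeps the owner's memory alive for as long as the view exists.

```python
import gc
import weakref


class ArrowLikeArray:
    """Hypothetical stand-in for an Arrow Array that owns a memory buffer."""

    def __init__(self, payload):
        self.buffer = bytearray(payload)


class OwnedView:
    """Zero-copy view that keeps a strong reference to the owning array."""

    def __init__(self, owner):
        self._owner = owner                  # keeps the owner alive
        self.view = memoryview(owner.buffer)  # shares memory, no copy


arr = ArrowLikeArray(b"\x01\x02\x03")
tracker = weakref.ref(arr)  # lets us observe when the owner is collected
view = OwnedView(arr)

view.view[0] = 9                 # write through the view...
shared = arr.buffer[0] == 9      # ...and the owner sees it: same memory

del arr
gc.collect()
alive_while_viewed = tracker() is not None

del view
gc.collect()
alive_after_release = tracker() is not None
```

Dropping the last view is what finally allows the owner, and with it the buffer, to be released.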





[jira] [Commented] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Micah Kornfield (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223529#comment-15223529
 ] 

Micah Kornfield commented on ARROW-94:
--

I agree, you can piece it together from the spec, but it would be good to be a 
little bit more explicit.  I can try to update the spec; the one thing that 
might be difficult is updating the images. Are there source files for them, or 
are they hand-drawn?

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  





[jira] [Assigned] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Micah Kornfield (JIRA)

 [ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield reassigned ARROW-94:


Assignee: Micah Kornfield

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  





[jira] [Commented] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223544#comment-15223544
 ] 

Wes McKinney commented on ARROW-94:
---

There are OmniGraffle files I can send you. I'm thinking it might be better to 
transition to plain text "illustrations", i.e. 

{code}
data: ["foo", "", null, null, "bar2"]

List

bitmap   00010011
length: 5
null_count: 2
offsets: 0  3  3  3  3  7
children:
  - Int32
-
length: 7
null_count: 0
values: f o o b a r 2
{code}

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  





[jira] [Comment Edited] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Wes McKinney (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223544#comment-15223544
 ] 

Wes McKinney edited comment on ARROW-94 at 4/3/16 11:05 PM:


There are OmniGraffle files I can send you. I'm thinking it might be better to 
transition to plain text "illustrations", i.e. 

{code}
data ["foo", "", null, null, "bar2"]

List

bitmap   00010011
length: 5
null_count: 2
offsets: 0  3  3  3  3  7
children:
  - uint8 (char)
-
length: 7
null_count: 0
values: f o o b a r 2
{code}
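
As a rough check of the illustration above, the following sketch encodes the same data into a validity bitmap, offsets and a values buffer. The function `encode_string_list` and its return layout are hypothetical; it assumes least-significant-bit numbering for the bitmap (slot 0 is the lowest bit) and UTF-8 bytes for the characters.

```python
def encode_string_list(data):
    """Encode a list of optional strings into bitmap, offsets and values."""
    bitmap = 0
    offsets = [0]
    values = bytearray()
    for i, item in enumerate(data):
        if item is not None:
            bitmap |= 1 << i                   # mark slot i as valid
            values.extend(item.encode("utf-8"))
        # Nulls and empty strings add no bytes, but the current offset
        # is still propagated so every slot has a start and an end.
        offsets.append(len(values))
    return {
        "bitmap": bitmap,
        "length": len(data),
        "null_count": sum(item is None for item in data),
        "offsets": offsets,
        "values": bytes(values),
    }


encoded = encode_string_list(["foo", "", None, None, "bar2"])
bitmap_text = format(encoded["bitmap"], "08b")
```

With these assumptions the result matches the illustration: the empty string and the two nulls all span zero bytes, and only the bitmap distinguishes them.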


was (Author: wesmckinn):
There are OmniGraffle files I can send you. I'm thinking it might be better to 
transition to plain text "illustrations", i.e. 

{code}
data ["foo", "", null, null, "bar2"]

List

bitmap   00010011
length: 5
null_count: 2
offsets: 0  3  3  3  3  7
children:
  - Int32
-
length: 7
null_count: 0
values: f o o b a r 2
{code}

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  





[jira] [Commented] (ARROW-94) Expand list example to clarify null vs empty list

2016-04-03 Thread Micah Kornfield (JIRA)

[ 
https://issues.apache.org/jira/browse/ARROW-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223648#comment-15223648
 ] 

Micah Kornfield commented on ARROW-94:
--

Sounds good, I will take a pass through and do the conversion as part of the PR 
for this item.  I might also try to add some hyperlinks back to 
definitions/salient points so new readers won't have to cache as much in their 
heads.

> Expand list example to clarify null vs empty list
> -
>
> Key: ARROW-94
> URL: https://issues.apache.org/jira/browse/ARROW-94
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Format
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>
> The current spec seems to make the possibility of an empty list impossible 
> (or at least conflates it with null).  


