[jira] [Commented] (ARROW-1993) [Python] Add function for determining implied Arrow schema from pandas.DataFrame

Krisztian Szucs (JIRA) Fri, 21 Sep 2018 03:57:33 -0700


    [ 
https://issues.apache.org/jira/browse/ARROW-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623410#comment-16623410
 ]


Krisztian Szucs commented on ARROW-1993:
----------------------------------------

[~xhochy] the PR seems incomplete and abandoned, should We delay it to 0.12 or 
reimplement it?

> [Python] Add function for determining implied Arrow schema from 
> pandas.DataFrame
> --------------------------------------------------------------------------------
>
>                 Key: ARROW-1993
>                 URL: https://issues.apache.org/jira/browse/ARROW-1993
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Assignee: Uwe L. Korn
>            Priority: Major
>              Labels: beginner, pull-request-available
>             Fix For: 0.11.0
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> Currently the only option is to use {{Table/Array.from_pandas}} which does 
> significant unnecessary work and allocates memory. If only the schema is of 
> interest, then we could do less work and not allocate memory.
> We should provide the user a function {{pyarrow.Schema.from_pandas}} which 
> takes a DataFrame as an input and returns the respective Arrow schema. The 
> functionality for determing the schema is already available in the Python 
> code, it is at moment just very tightly bound to the conversion 
> infrastructure.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (ARROW-1993) [Python] Add function for determining implied Arrow schema from pandas.DataFrame

Reply via email to