Bryan Cutler created ARROW-2101:
-----------------------------------

             Summary: [Python] from_pandas reads 'str' types as binary Arrow 
data with Python 2
                 Key: ARROW-2101
                 URL: https://issues.apache.org/jira/browse/ARROW-2101
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.8.0
            Reporter: Bryan Cutler


Using Python 2, converting Pandas with 'str' data to Arrow results in Arrow 
data of binary type, even if the user supplies type information.  conversion of 
'unicode' type works to create Arrow data of string types.  For example

{code}
In [25]: pa.Array.from_pandas(pd.Series(['a'])).type
Out[25]: DataType(binary)

In [26]: pa.Array.from_pandas(pd.Series(['a']), type=pa.string()).type
Out[26]: DataType(binary)

In [27]: pa.Array.from_pandas(pd.Series([u'a'])).type
Out[27]: DataType(string)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to