Juan David Agudelo created ARROW-7996:
-----------------------------------------
Summary: Error serializing empty pandas DataFrame with pyarrow
Key: ARROW-7996
URL: https://issues.apache.org/jira/browse/ARROW-7996
Project: Apache Arrow
Issue Type: Bug
Reporter: Juan David Agudelo
The following code, which serializes a list containing an empty pandas DataFrame to a file and then reads it back, does not work:
{code:python}
import pandas
import pyarrow
df = pandas.DataFrame({"timestamp": [], "value_123": [], "context_123": []})
data = [df]
context = pyarrow.default_serialization_context()
serialized_data = context.serialize(data)
file_path = "file.txt"
with open(file_path, "wb") as f:
    serialized_data.write_to(f)
with open(file_path, "rb") as f:
    context = pyarrow.default_serialization_context()
    decoded_data = context.deserialize(f.read())
{code}
It throws the following error:
{code:java}
ArrowInvalid: strides must not involve buffer over run
{code}
I am using Python 3.6.9 on Ubuntu 18.04 with pyarrow 0.16.0.