Clément Bouscasse created ARROW-2082: ----------------------------------------
Summary: SegFault in pyarrow.parquet.write_table with specific options Key: ARROW-2082 URL: https://issues.apache.org/jira/browse/ARROW-2082 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.8.0 Environment: tested on MacOS High Sierra with python 3.6 and Ubuntu Xenial (Python 3.5) Reporter: Clément Bouscasse I originally filed an issue in the pandas project but we've tracked it down to arrow itself, when called via pandas in specific circumstances: [https://github.com/pandas-dev/pandas/issues/19493] basically using {code:java} df.to_parquet('filename.parquet', flavor='spark'){code} gives a seg fault if `df` contains a datetime column. Under the covers, pandas translates this to the following call: {code:java} pq.write_table(table, 'output.parquet', flavor='spark', compression='snappy', coerce_timestamps='ms') {code} which gives me an instant crash. There is a repo on the github ticket. -- This message was sent by Atlassian JIRA (v7.6.3#76005)