[ https://issues.apache.org/jira/browse/ARROW-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16443243#comment-16443243 ]
Travis Brady commented on ARROW-2459: ------------------------------------- [~joshuastorck] In general I'd say PyArrow should never segfault. Throw a `ValueError` or something, but a hard crash of the interpreter is not acceptable in production. > pyarrow: Segfault with pyarrow.deserialize_pandas > ------------------------------------------------- > > Key: ARROW-2459 > URL: https://issues.apache.org/jira/browse/ARROW-2459 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Environment: OS X, Linux > Reporter: Travis Brady > Priority: Major > > Following up from [https://github.com/apache/arrow/issues/1884] wherein I > found that calling deserialize_pandas in the linked app.py script in the repo > linked below causes the app.py process to segfault. > I initially observed this on OS X, but have since confirmed that the behavior > exists on Linux as well. > Repo containing example: [https://github.com/travisbrady/sanic-arrow] > And more generally: what is the right way to get a Java-based HTTP > microservice to talk to a Python-based HTTP microservice using Arrow as the > serialization format? I'm exchanging DataFrame type objects (they are > pandas.DataFrame's on the Python side) between the two services for real-time > scoring in a few xgboost models implemented in Python. -- This message was sent by Atlassian JIRA (v7.6.3#76005)