krishna deepak created ARROW-13649:
--------------------------------------
Summary: pyarrow is causing segfault randomly
Key: ARROW-13649
URL: https://issues.apache.org/jira/browse/ARROW-13649
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 5.0.0
Environment: openSUSE Leap 15.2
conda python3.9 env
Reporter: krishna deepak
I'm using pyarrow to read Feather files, and I randomly get the following
segfault:
*** SIGSEGV received at time=1629226305 on cpu 3 ***
PC: @ 0x7fa9e177272a (unknown) arrow::BitUtil::SetBitmap()
@ 0x7fa9f5dec2d0 (unknown) (unknown)
Segmentation fault (core dumped)
I initially thought it was caused by a bug in my Cython code, but even
after removing all Cython calls, I still get this error randomly.
The Python code is a very simple read:
{{# Aliases implied by the calls below.
import pyarrow.compute as pc
import pyarrow.dataset as ds

# Read the whole index table for the year.
index_data = ds.dataset(
    INDEX_DATA_PATH / self.ticker / str(year) / 'indexed_table.feather',
    format='feather')
index_data = index_data.to_table()
trade_days = self.get_trading_days(year)
# Read the options data, filtered to the year's trading days.
options_data = ds.dataset(
    OPTIONS_DATA_PATH / self.ticker / self.expiry_type / str(year),
    format='feather')
options_data = options_data.to_table(
    filter=(
        (ds.field('dt') >= trade_days[0]) & (ds.field('dt') <= trade_days[-1])
    ),
    columns=options_data_columns
)
# Collect the distinct expiry dates as sorted Python objects.
expiry_dts = [x.as_py() for x in pc.unique(options_data['expiry_dt'])]
expiry_dts.sort()}}
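Because the crash is intermittent and the native backtrace above shows only arrow::BitUtil::SetBitmap(), it may help to record which Python call was running when the signal arrived. The following is a minimal sketch using the standard library's faulthandler module. The helper name enable_crash_traceback and the log path are hypothetical and not part of the original code; this is a diagnostic aid, not a fix.

```python
import faulthandler


def enable_crash_traceback(log_path):
    """Hypothetical helper: write Python tracebacks to log_path on a fatal signal.

    faulthandler installs handlers for SIGSEGV, SIGFPE, SIGABRT, SIGBUS and
    SIGILL, and dumps the Python stack of every thread when one is received.
    """
    log_file = open(log_path, "w")
    faulthandler.enable(file=log_file, all_threads=True)
    # The caller must keep this file object open: faulthandler writes to its
    # file descriptor at crash time.
    return log_file


# Example use (assumed path), called once before the dataset reads:
# crash_log = enable_crash_traceback("pyarrow_segfault.log")
```

If a later run crashes, the log should show whether the fault occurred inside one of the to_table() calls or inside pc.unique().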