[
https://issues.apache.org/jira/browse/ARROW-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661605#comment-17661605
]
Rok Mihevc commented on ARROW-4582:
-----------------------------------
This issue has been migrated to [issue
#21126|https://github.com/apache/arrow/issues/21126] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [C++/Python] Memory corruption on Pandas->Arrow conversion
> ----------------------------------------------------------
>
> Key: ARROW-4582
> URL: https://issues.apache.org/jira/browse/ARROW-4582
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Python
> Affects Versions: 0.11.0, 0.11.1, 0.12.0
> Reporter: Uwe Korn
> Assignee: Uwe Korn
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.1, 0.13.0
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> When converting DataFrames with numerical columns to Arrow tables we were
> seeing random segfaults in core Python code. This only happened in
> environments where we had a high level of parallelisation or slow code
> execution (e.g. in AddressSanitizer builds).
> The reason for these segfaults was that we were incrementing the reference
> count of the underlying NumPy buffer but were not holding the GIL while
> changing the reference count.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)