[
https://issues.apache.org/jira/browse/ARROW-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16239566#comment-16239566
]
Uwe L. Korn commented on ARROW-1769:
------------------------------------
We generate temporary DataFrames inside of {{write_to_dataset}} in the above
case. This could probably be fixed by
https://github.com/pandas-dev/pandas/issues/15746
> Python: pyarrow.parquet.write_to_dataset creates cyclic references
> ------------------------------------------------------------------
>
> Key: ARROW-1769
> URL: https://issues.apache.org/jira/browse/ARROW-1769
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.7.1
> Reporter: Uwe L. Korn
> Fix For: 0.8.0
>
>
> See https://github.com/apache/arrow/issues/1285 for the initial issue. Having
> cyclic references is a valid state in Python as they can be cleaned up by the
> garbage collector. But as the garbage collector normally runs at a point
> which is not clear to the user and we deal here normally with larger objects,
> we should get rid of the cyclic reference to evict data as soon as possible
> from main memory.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)