0x0L opened a new issue, #47702:
URL: https://github.com/apache/arrow/issues/47702
### Describe the bug, including details regarding any error messages, version, and platform.
Hello,
On both macOS and Linux, with Python 3.12 and 3.13 and pandas 2.3.0 and 2.3.3,
the following script leaks memory with pyarrow 21.0.0. pyarrow 20.0.0 is not
affected.
```python
import os

import numpy as np
import psutil
import pyarrow as pa

n = 10_000_000
tbl = pa.table({"a": np.arange(n)})
proc = psutil.Process(os.getpid())

# Convert the same table over and over; RSS should stay flat if nothing leaks.
while True:
    tbl.to_pandas()
    print(f"{proc.memory_info().rss / 1024**3:.2f} GB")
```
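The script above loops forever, which makes it awkward to compare versions automatically. A bounded variant might look like the sketch below. The helper names `rss_samples` and `demo`, the `read_rss` parameter, and the iteration counts are hypothetical and not part of the original report.

```python
import gc
import os


def rss_samples(fn, iterations=20, warmup=3, read_rss=None):
    """Call fn() repeatedly and return the RSS in bytes observed after each call."""
    if read_rss is None:
        # Assumption: psutil is installed, as in the original script.
        import psutil

        proc = psutil.Process(os.getpid())

        def read_rss():
            return proc.memory_info().rss

    # Let allocator caches and lazy imports settle before sampling.
    for _ in range(warmup):
        fn()

    samples = []
    for _ in range(iterations):
        fn()
        gc.collect()  # rule out garbage that is merely awaiting collection
        samples.append(read_rss())
    return samples


def demo():
    """Hypothetical driver that reuses the table from the report."""
    import numpy as np
    import pyarrow as pa

    tbl = pa.table({"a": np.arange(10_000_000)})
    samples = rss_samples(tbl.to_pandas)
    growth = samples[-1] - samples[0]
    print(f"RSS grew by {growth / 1024**2:.1f} MiB over {len(samples)} calls")
```

Calling `demo()` under pyarrow 20.0.0 and again under 21.0.0 would give two growth figures to compare. RSS is a noisy signal, so a small nonzero difference is not by itself evidence of a leak.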
Using `jemalloc` mitigates the issue but does not fix it completely.
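
The report does not say how `jemalloc` was enabled (for example via `LD_PRELOAD`, the `ARROW_DEFAULT_MEMORY_POOL` environment variable, or the Python API). One possible way, sketched here as an assumption, is to switch Arrow's default memory pool from Python. The helper name `switch_arrow_pool` and its `pa` parameter are hypothetical. Note that this only affects allocations made through Arrow's pool, not those made by NumPy or pandas.

```python
def switch_arrow_pool(backend="jemalloc", pa=None):
    """Try to make `backend` Arrow's default pool; return the active backend name."""
    if pa is None:
        import pyarrow as pa

    # pyarrow exposes jemalloc_memory_pool(), mimalloc_memory_pool() and
    # system_memory_pool(); look up the factory matching the requested backend.
    factory = getattr(pa, f"{backend}_memory_pool")
    try:
        pa.set_memory_pool(factory())
    except NotImplementedError:
        # This pyarrow build was compiled without the requested allocator;
        # keep whatever pool is currently active.
        pass
    return pa.default_memory_pool().backend_name
```

Calling `switch_arrow_pool("jemalloc")` before building the table and printing the returned name would confirm which allocator the reproduction actually ran with.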
### Component(s)
Python