0x0L opened a new issue, #47702:
URL: https://github.com/apache/arrow/issues/47702

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   Hello,
   
   On both macOS/linux py3.12/3.13 pandas 2.3.0 & 2.3.3, the following script 
leaks memory with pyarrow 21.0.0. 20.0.0 is ok.
   
   ```python
   import os
   import psutil
   import numpy as np
   import pyarrow as pa
   
   n = 10_000_000
   tbl = pa.table({"a": np.arange(n)})
   proc = psutil.Process(os.getpid())
   
   while True:
       tbl.to_pandas()
       print(f"{proc.memory_info().rss / 1024**3:.2f} GB")
   ```
   
   Using `jemalloc` mitigates the issue but does not fix it completely.
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to