Tom-Newton commented on issue #44342:
URL: https://github.com/apache/arrow/issues/44342#issuecomment-2597925727

   I think I'm actually hitting the same problem on x86 CPUs, but only some. I 
can reproduce on  `Intel(R) Xeon(R) Silver 4114` and `AMD EPYC 7V12` but not on 
`AMD Ryzen 9 5950X`. It also requires a very obscure set of dependencies to 
reproduce and I can reproduce on `pyarrow==17.0.0`, `pyarrow==18.0.0`, 
`pyarrow==18.1.0` and `pyarrow==19.0.0`.
   
   
   I'm fairly confident its the same issue because `gdb` looks almost the same 
as the original report.
   ```
   [Detaching after vfork from child process 836479]
   [Detaching after vfork from child process 836480]
   [Detaching after vfork from child process 836484]
   [New Thread 0x7ff1761ff700 (LWP 836622)]
   --Type <RET> for more, q to quit, c to continue without paging--c
   
   Thread 80 "jemalloc_bg_thd" received signal SIGSEGV, Segmentation fault.
   [Switching to Thread 0x7ff1761ff700 (LWP 836622)]
   0x00007ff178f06881 in background_thread_entry () from 
/<redacted>/pip-svc_pyarrow/site-packages/pyarrow/libarrow.so.1801
   (gdb) backtrace
   #0  0x00007ff178f06881 in background_thread_entry ()
      from /<redacted>/pip-svc_pyarrow/site-packages/pyarrow/libarrow.so.1801
   #1  0x00007ff332f71609 in start_thread (arg=<optimized out>) at 
pthread_create.c:477
   #2  0x00007ff332d30353 in clone () at 
../sysdeps/unix/sysv/linux/x86_64/clone.S:95
   (gdb) bt full
   #0  0x00007ff178f06881 in background_thread_entry ()
      from /<redacted>/pip-svc_pyarrow/site-packages/pyarrow/libarrow.so.1801
   No symbol table info available.
   #1  0x00007ff332f71609 in start_thread (arg=<optimized out>) at 
pthread_create.c:477
           ret = <optimized out>
           pd = <optimized out>
           unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140675045652224, 
7270211746057996573, 140737080934526, 
                   140737080934527, 140737080934528, 140675045650368, 
-7276102654996461283, -7277079231991199459}, 
                 mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data 
= {prev = 0x0, cleanup = 0x0, 
                 canceltype = 0}}}
           not_first_call = 0
   #2  0x00007ff332d30353 in clone () at 
../sysdeps/unix/sysv/linux/x86_64/clone.S:95
   No locals.
   (gdb) 
   ```
   
   Building the wheel with `jemalloc` disabled solves the problem. I'm also 
tempted to do a `git bisect` to try to determine exactly which commit started 
this problem. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to