pitrou commented on issue #43497:
URL: https://github.com/apache/arrow/issues/43497#issuecomment-3018933031

   I do not exactly get a hang on PyArrow 20.0, though it crashes at the end:
   ```
   $ python ~/arrow/dev/issue_43497.py 
   
   
   Schema from dataset:
   URL: string
   TEXT: string
   WIDTH: double
   HEIGHT: double
   similarity: double
   punsafe: double
   pwatermark: double
   AESTHETIC_SCORE: double
   hash: int64
   
   Data from dataset:
                                                    URL                         
                      TEXT   WIDTH  ...  pwatermark  AESTHETIC_SCORE            
     hash
   0  https://endscan.com/media/36373/fb0bf7b2abe7ac...  View 47 photos of this 
3 bed, 4 bath, and 2,49...  2080.0  ...    0.100449         5.040063   
929872200875109155
   1  https://static0.colliderimages.com/wordpress/w...                         
john barrowman - photo #12  2011.0  ...    0.732896         5.544570  
8338800302313723098
   2  https://images.squarespace-cdn.com/content/v1/...  A black and white 
limited edition portraits of...  1655.0  ...    0.081598         5.969655  
7578604913656441916
   
   [3 rows x 9 columns]
   Fatal Python error: PyGILState_Release: auto-releasing thread-state, but no 
thread-state for this thread
   Python runtime state: finalizing (tstate=0x0000000000ba6ac8)
   
   Abandon
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to