pitrou commented on issue #43497:
URL: https://github.com/apache/arrow/issues/43497#issuecomment-3018933031
I don't get a hang on PyArrow 20.0, but the process does crash at the end, during interpreter finalization:
```
$ python ~/arrow/dev/issue_43497.py
Schema from dataset:
URL: string
TEXT: string
WIDTH: double
HEIGHT: double
similarity: double
punsafe: double
pwatermark: double
AESTHETIC_SCORE: double
hash: int64
Data from dataset:
                                                 URL                                               TEXT   WIDTH  ...  pwatermark  AESTHETIC_SCORE                 hash
0  https://endscan.com/media/36373/fb0bf7b2abe7ac...  View 47 photos of this 3 bed, 4 bath, and 2,49...  2080.0  ...    0.100449         5.040063   929872200875109155
1  https://static0.colliderimages.com/wordpress/w...                         john barrowman - photo #12  2011.0  ...    0.732896         5.544570  8338800302313723098
2  https://images.squarespace-cdn.com/content/v1/...  A black and white limited edition portraits of...  1655.0  ...    0.081598         5.969655  7578604913656441916
[3 rows x 9 columns]
Fatal Python error: PyGILState_Release: auto-releasing thread-state, but no thread-state for this thread
Python runtime state: finalizing (tstate=0x0000000000ba6ac8)
Abandon
```