[
https://issues.apache.org/jira/browse/ARROW-11390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17272708#comment-17272708
]
Lance Dacey commented on ARROW-11390:
-------------------------------------
That makes sense. I checked further and the base image I was using is this:
https://github.com/jupyter/docker-stacks/blob/master/pyspark-notebook/Dockerfile
Which pins pyarrow at 2.0:
{code:java}
RUN conda install --quiet --yes --satisfied-skip-solve \
'pyarrow=2.0.*' && \
{code}
I'll try again now that 3.0 is on conda-forge
> [Python] pyarrow 3.0 issues with turbodbc
> -----------------------------------------
>
> Key: ARROW-11390
> URL: https://issues.apache.org/jira/browse/ARROW-11390
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 3.0.0
> Environment: pyarrow 3.0.0
> fsspec 0.8.4
> adlfs v0.5.9
> pandas 1.2.1
> numpy 1.19.5
> turbodbc 4.1.1
> Reporter: Lance Dacey
> Priority: Major
> Labels: python, turbodbc
>
> This is more of a turbodbc issue I think, but perhaps someone here would have
> some idea of what changed to cause potential issues.
> {code:java}
> cursor = connection.cursor()
> cursor.execute("select top 10 * from dbo.tickets")
> table = cursor.fetchallarrow(){code}
> I am able to run table.num_rows and it will print out 10.
> If I run table.to_pandas() or table.schema or try to write the table to a
> dataset, my kernel dies with no explanation. I reverted back to pyarrow 2.0
> and the same code works again.
> [https://github.com/blue-yonder/turbodbc/issues/289]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)