dudzicp opened a new issue, #131:
URL: https://github.com/apache/arrow-datafusion-python/issues/131

   **Describe the bug**
   I am unable to display the contents of delta tables stored locally
   
   **To Reproduce**
   
   ```
   [tool.poetry.dependencies]
   python = "^3.10"
   datafusion = "^0.7.0"
   deltalake = "^0.6.4"
   ```
   
   then run the following code:
   
   ```
   import pyarrow as pa
   import pyarrow.dataset as ds
   
   from deltalake import DeltaTable
   import datafusion
   
   ctx = datafusion.SessionContext()
   
   delta_table = 
DeltaTable("/home/pdudzic/work/deltalake/refined/rcsb/entity_poly_numbered/")
   pa_dataset = dt.to_pyarrow_dataset()
   
   ctx.register_dataset("pa_dataset", pa_dataset)
   
   tmp = ctx.sql("SELECT * FROM pa_dataset limit 10")
   tmp.show()
   ```
   When executed in notebook in vs code, this script can run for >20 min and I 
am unable to interrupt the execution.
   
   **Expected behavior**
   Top rows displayed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to