timsaucer commented on issue #715:
URL: 
https://github.com/apache/datafusion-python/issues/715#issuecomment-2131229316

   My statement above about testing on rust side is likely incorrect. I ran the 
same test above but loading the dataframe from a parquet file instead of 
creating in memory and the expected behavior is reproduced.
   
   If you amend these lines to the bottom of the minimal example
   
   ```
   df.write_parquet("save_out.parquet")
   
   df_reread = ctx.read_parquet("save_out.parquet")
   
   df_reread.show()
   df_reread.select(col("a")["outer_1"]["inner_2"]).show()
   ```
   
   You get the expected result
   ```
   DataFrame()
   +-------------------------------------+
   | a                                   |
   +-------------------------------------+
   | {outer_1: {inner_1: 1, inner_2: 2}} |
   | {outer_1: {inner_1: 1, inner_2: }}  |
   | {outer_1: }                         |
   +-------------------------------------+
   DataFrame()
   +-----------------------------+
   | ?table?.a[outer_1][inner_2] |
   +-----------------------------+
   | 2                           |
   |                             |
   |                             |
   +-----------------------------+
   ```
   
   It also shows the original table is reproduced. I'll continue digging but I 
no longer am convinced this is a python binding issue.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to