[GitHub] [arrow] AlenkaF commented on issue #37794: Add option to read schema of a hive partitioned parquet file (in python)

via GitHub Tue, 26 Sep 2023 05:03:44 -0700


AlenkaF commented on issue #37794:
URL: https://github.com/apache/arrow/issues/37794#issuecomment-1735409795


   Reading the documentation I think `pq.read_schema` is meant to read schema 
from a single Parquet file
   
https://arrow.apache.org/docs/python/generated/pyarrow.parquet.read_schema.html
   
   But a dataset can be partitioned. If the data is written out to a 
partitioned directory and you want to inspect the schema, directory etc., this 
might help:
   https://arrow.apache.org/docs/python/dataset.html#dataset-discovery


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow] AlenkaF commented on issue #37794: Add option to read schema of a hive partitioned parquet file (in python)

Reply via email to