Dear Arrow developers,

Our lab is planning to use pyarrow to store some biological information in
Parquet files. We also have to store some metadata alongside, e.g. which
sample the data comes from, how it was obtained and processed, etc.

Parquet seems to support file-wide metadata, but I cannot find how to
write it via pyarrow. The closest thing I could find is how to write
row-group metadata (https://github.com/pandas-dev/pandas/pull/20534), but
this seems like overkill, since our metadata is the same for all row
groups in the file.

Is there any way to write file-wide Parquet metadata with pyarrow?

Thank you!
Anton.
