kylebarron opened a new issue, #41608:
URL: https://github.com/apache/arrow/issues/41608
### Describe the enhancement requested
In pyarrow, it would be great to have the ability to update Parquet
key-value metadata of an existing `ParquetWriter` instance before closing the
file. For example:
```py
import pyarrow.parquet as pq
with pq.ParquetWriter("output.parquet", schema) as writer:
    writer.write_batch(...)
    writer.add_key_value_metadata({b"hello": "world"})
```
This is akin to the arrow-rs
[`ArrowWriter::append_key_value_metadata`](https://docs.rs/parquet/latest/parquet/arrow/arrow_writer/struct.ArrowWriter.html#method.append_key_value_metadata),
which lets you mutate an existing writer to add key-value metadata before
writing the final metadata.
My use case is writing GeoParquet files efficiently.
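
To make the GeoParquet motivation concrete, here is a minimal sketch under some assumptions not spelled out above: the file-level `geo` metadata key holds a JSON document that can include a per-column `bbox`, and that bounding box is only known once every batch has been seen. The helpers `update_bbox` and `build_geo_metadata`, the column name `geometry`, and the sample points are hypothetical. The sketch uses only the standard library and leaves the actual writer calls as comments.

```python
import json


def update_bbox(bbox, points):
    """Expand a running [xmin, ymin, xmax, ymax] box to cover one batch of (x, y) points."""
    if not points:
        return bbox
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    batch = [min(xs), min(ys), max(xs), max(ys)]
    if bbox is None:
        return batch
    return [
        min(bbox[0], batch[0]),
        min(bbox[1], batch[1]),
        max(bbox[2], batch[2]),
        max(bbox[3], batch[3]),
    ]


def build_geo_metadata(bbox, geometry_types):
    """Build a key-value metadata dict with a GeoParquet-style "geo" JSON value."""
    geo = {
        "version": "1.0.0",
        "primary_column": "geometry",  # hypothetical column name
        "columns": {
            "geometry": {
                "encoding": "WKB",
                "geometry_types": sorted(geometry_types),
                "bbox": bbox,
            }
        },
    }
    return {"geo": json.dumps(geo)}


# Hypothetical batches of point coordinates, standing in for record batches.
batches = [
    [(0.0, 1.0), (2.0, 3.0)],
    [(-1.0, 5.0), (4.0, 0.5)],
]

bbox = None
for points in batches:
    # writer.write_batch(...) would be called here for each batch.
    bbox = update_bbox(bbox, points)

metadata = build_geo_metadata(bbox, {"Point"})
# With the method requested in this issue, the final step would be:
# writer.add_key_value_metadata(metadata)
```

Without a way to attach metadata after the batches are written, the bounding box would have to be computed in a separate pass over the data before opening the writer.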
### Component(s)
Python