Fokko opened a new issue, #6564: URL: https://github.com/apache/iceberg/issues/6564
### Feature Request / Improvement This is a placeholder ticket for implementing write support for PyIceberg. Since we don't want PyIceberg to write the actual data, and only handle the metadata part of the Iceberg table format, we need to get an overview of the frameworks that we most likely want to integrate with (PyArrow, Dask (fastparquet?), etc). Missing pieces: - Able to efficiently get statistics (upper and lower bound) for the written files - Able to write Avro files (ManifestList, ManifestEntry) - Proper integration tests between Java and Python to make sure that we don't brick any tables (start in https://github.com/apache/iceberg/pull/6398) ### Query engine None -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
