devinjdangelo opened a new pull request, #7244:
URL: https://github.com/apache/arrow-datafusion/pull/7244

   ## Which issue does this PR close?
   
   None, but progresses towards the goals of 
https://github.com/apache/arrow-datafusion/issues/5076 and 
https://github.com/apache/arrow-datafusion/issues/7079. Follow on to 
https://github.com/apache/arrow-datafusion/pull/7141 and #7212 .
   
   ## Rationale for this change
   
   Inserting to parquet backed tables is a common use case. The file format 
itself does not support appending to an existing file, s o only appending new 
files to a table is supported.
   
   This implementation relies on `AsyncArrowWriter` to handle coordination 
between the Parquet serializer and ObjectStore writer. So, the implementation 
for parquet insert into support diverges a bit from JSON and CSV. 
   
   ## What changes are included in this PR?
   
   - Added parquet write config options to `ExecutionOptions`
   - Allow specifying parquet schema so empty tables can be created
   - Move write specific code to write.rs mod
   - Add test coverage for inserting to parquet table
   
   ## Are these changes tested?
   
   Yes.
   
   ## Are there any user-facing changes?
   
   Inserting to parquet backed tables is possible.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to