devinjdangelo commented on issue #1718:
URL: https://github.com/apache/arrow-rs/issues/1718#issuecomment-1707141716

   I am interested in working on this. Does anyone know of existing 
parallelized Parquet write implementations in other languages that we could 
reference? I am particularly interested in which of these approaches is best:
   
   1. Serialize multiple columns in a single row group in parallel
   2. Serialize multiple row groups in parallel
   3. A combination of 1 and 2
   
   Option 2 could be a challenge if we don't know up front how many row 
groups the file will contain. 
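
   To make option 1 concrete, here is a minimal sketch using only the Rust 
standard library. It is not the arrow-rs or parquet crate API: `encode_column` 
and `encode_row_group_parallel` are hypothetical names, and the "encoding" is 
just little-endian bytes standing in for a real column chunk. The idea is that 
each column is encoded into its own buffer on a separate thread, and the 
buffers are then appended in column order while recording offsets.

```rust
use std::thread;

/// Hypothetical stand-in for a column encoder: each i64 becomes 8
/// little-endian bytes. A real Parquet column chunk would also carry
/// pages, encodings, and statistics.
fn encode_column(values: &[i64]) -> Vec<u8> {
    values.iter().flat_map(|v| v.to_le_bytes()).collect()
}

/// Option 1: encode every column of one row group on its own thread,
/// then append the buffers in column order.
/// Returns the row group bytes plus an (offset, length) pair per column.
fn encode_row_group_parallel(columns: &[Vec<i64>]) -> (Vec<u8>, Vec<(usize, usize)>) {
    // Scoped threads can borrow `columns` without cloning the data.
    let chunks: Vec<Vec<u8>> = thread::scope(|s| {
        let handles: Vec<_> = columns
            .iter()
            .map(|col| s.spawn(move || encode_column(col)))
            .collect();
        // Joining in spawn order keeps the column order deterministic.
        handles
            .into_iter()
            .map(|h| h.join().expect("encoder thread panicked"))
            .collect()
    });

    // The final write stays sequential: offsets are only known once the
    // preceding chunks have been appended.
    let mut out = Vec::new();
    let mut index = Vec::with_capacity(chunks.len());
    for chunk in chunks {
        index.push((out.len(), chunk.len()));
        out.extend_from_slice(&chunk);
    }
    (out, index)
}
```

   Calling `encode_row_group_parallel(&[vec![1, 2, 3], vec![4, 5, 6]])` yields 
one contiguous buffer and one offset entry per column. The same 
buffer-then-append pattern might carry over to option 2, since offsets are 
assigned only at append time, but that is an assumption rather than something 
verified against the Parquet writer.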


