jnturton opened a new pull request, #2716:
URL: https://github.com/apache/drill/pull/2716

   # [DRILL-8366](https://issues.apache.org/jira/browse/DRILL-8366): Late 
release of compressor memory in the Parquet writer
   
   ## Description
   
   The Parquet writer waits until the end of the entire write before releasing 
its compression codec factory. The factory in turn releases compressors which 
release direct memory buffers used during compression. This deferred release 
leads a build up of direct memory use and can cause large write jobs to fail. 
The Parquet writer can instead release the abovementioned each time that a 
file/row group is flushed.
   
   ## Documentation
   N/A
   
   ## Testing
   
   Manually confirm the release of allocated compression buffers after each 
flush in the debug log output.
   Manually monitor memory usage during a big Parquet write job.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@drill.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to