[GitHub] [airflow] potiuk edited a comment on pull request #20238: Optimize dockerfiles for local rebuilds

GitBox Tue, 11 Jan 2022 01:37:41 -0800


potiuk edited a comment on pull request #20238:
URL: https://github.com/apache/airflow/pull/20238#issuecomment-1009762408



   > FWIW, the gzipped size comes to about 300 megs. I suppose when the image 
is mapped to a filesystem then it has to uncompress each layer. There's some 
discussion about that here: 
[moby/moby#24515](https://github.com/moby/moby/issues/24515).
   
   Yeah - compressed size is much smaller - and that's the size that "matters" 
when the file gets pulled really. The moby discussion is about users who are 
already dynamically decompressing the data they store. This I think is kind of 
antipattern when you use images - the image layers are compressed by default - 
maybe not most efficient compression on the planet because it is a "generic" 
compression - but it does the job. in vast majority of cases I saw it is 1:3 
compression rate for most binary data and 1:10 at least for text data (roughly 
- I never made a detailed calculation), But if someone attempts to store 
compressed data that reaches similar levels of compression in an already 
compressed image layer, it's a pretty much loss (unless you care about the 
final space used when image is decompressed and you decompress on-the-flight 
and never store the decompressed data). 
   
   So in essence I just take the compression done by container layers as 
"granted" and don't try to tweak around it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [airflow] potiuk edited a comment on pull request #20238: Optimize dockerfiles for local rebuilds

Reply via email to