Do we have a consolidated resource with information and recommendations about use of the above? For instance, I ran a simple test using PerformanceEvaluation, examining just the size of data on disk for 1G of input data. The matrix below has some surprising results:
+--------------------+--------------+ | MODIFIER | SIZE (bytes) | +--------------------+--------------+ | none | 1108553612 | +--------------------+--------------+ | compression:SNAPPY | 427335534 | +--------------------+--------------+ | compression:LZO | 270422088 | +--------------------+--------------+ | compression:GZ | 152899297 | +--------------------+--------------+ | codec:PREFIX | 1993910969 | +--------------------+--------------+ | codec:DIFF | 1960970083 | +--------------------+--------------+ | codec:FAST_DIFF | 1061374722 | +--------------------+--------------+ | codec:PREFIX_TREE | 1066586604 | +--------------------+--------------+ Where does a wayward soul look for guidance on which combination of the above to choose for their application? Thanks, Nick
