wgtmac opened a new pull request, #3371:
URL: https://github.com/apache/parquet-java/pull/3371

   - Support sorting columns by size in descending order.
   - Support printing the size ratio as a percentage.
   
   # Before this change
   ```
   ~/Projects/parquet-testing/data (master*) » parquet-cli column-size alltypes_plain.snappy.parquet
   
   float_col-> Size In Bytes: 51 Size In Ratio: 0.08947369
   int_col-> Size In Bytes: 51 Size In Ratio: 0.08947369
   string_col-> Size In Bytes: 53 Size In Ratio: 0.092982456
   bool_col-> Size In Bytes: 26 Size In Ratio: 0.045614034
   date_string_col-> Size In Bytes: 55 Size In Ratio: 0.096491225
   smallint_col-> Size In Bytes: 51 Size In Ratio: 0.08947369
   timestamp_col-> Size In Bytes: 63 Size In Ratio: 0.110526316
   tinyint_col-> Size In Bytes: 51 Size In Ratio: 0.08947369
   bigint_col-> Size In Bytes: 59 Size In Ratio: 0.10350877
   id-> Size In Bytes: 51 Size In Ratio: 0.08947369
   double_col-> Size In Bytes: 59 Size In Ratio: 0.10350877
   ```
   
   # After this change
   ```
   ~/Projects/parquet-testing/data (master*) » parquet-cli column-size alltypes_plain.snappy.parquet -s -p
   
   timestamp_col-> Size In Bytes: 63 Size In Ratio: 11.0526%
   bigint_col-> Size In Bytes: 59 Size In Ratio: 10.3509%
   double_col-> Size In Bytes: 59 Size In Ratio: 10.3509%
   date_string_col-> Size In Bytes: 55 Size In Ratio: 9.6491%
   string_col-> Size In Bytes: 53 Size In Ratio: 9.2982%
   float_col-> Size In Bytes: 51 Size In Ratio: 8.9474%
   int_col-> Size In Bytes: 51 Size In Ratio: 8.9474%
   smallint_col-> Size In Bytes: 51 Size In Ratio: 8.9474%
   tinyint_col-> Size In Bytes: 51 Size In Ratio: 8.9474%
   id-> Size In Bytes: 51 Size In Ratio: 8.9474%
   bool_col-> Size In Bytes: 26 Size In Ratio: 4.5614%
   ```
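
For readers who want to see the two behaviours in isolation, here is a minimal sketch in plain Java. It is not the code from this pull request: the class `ColumnSizeSketch` and its methods `sortBySizeDescending` and `formatRatio` are hypothetical names chosen for illustration, and the sizes are copied from the listings above. It assumes a stable sort over the original column order and a four-decimal percentage format, which happens to reproduce the "after" listing.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not the parquet-cli implementation.
final class ColumnSizeSketch {

  // Returns the entries ordered by size, largest first. List.sort is stable,
  // so columns with equal sizes keep their original relative order.
  public static List<Map.Entry<String, Long>> sortBySizeDescending(Map<String, Long> sizes) {
    List<Map.Entry<String, Long>> entries = new ArrayList<>(sizes.entrySet());
    entries.sort(Map.Entry.<String, Long>comparingByValue().reversed());
    return entries;
  }

  // Formats size/total either as a raw float ratio or as a percentage
  // with four decimal places (an assumed format, inferred from the output above).
  public static String formatRatio(long size, long total, boolean asPercentage) {
    double ratio = (double) size / total;
    return asPercentage
        ? String.format(Locale.ROOT, "%.4f%%", ratio * 100)
        : String.valueOf((float) ratio);
  }

  public static void main(String[] args) {
    // Column sizes in bytes, in the order of the "before" listing.
    Map<String, Long> sizes = new LinkedHashMap<>();
    sizes.put("float_col", 51L);
    sizes.put("int_col", 51L);
    sizes.put("string_col", 53L);
    sizes.put("bool_col", 26L);
    sizes.put("date_string_col", 55L);
    sizes.put("smallint_col", 51L);
    sizes.put("timestamp_col", 63L);
    sizes.put("tinyint_col", 51L);
    sizes.put("bigint_col", 59L);
    sizes.put("id", 51L);
    sizes.put("double_col", 59L);

    long total = sizes.values().stream().mapToLong(Long::longValue).sum();
    for (Map.Entry<String, Long> e : sortBySizeDescending(sizes)) {
      System.out.println(e.getKey() + "-> Size In Bytes: " + e.getValue()
          + " Size In Ratio: " + formatRatio(e.getValue(), total, true));
    }
  }
}
```

Using `Locale.ROOT` keeps the decimal separator a period regardless of the machine's default locale.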


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

