dongjoon-hyun opened a new pull request #664:
URL: https://github.com/apache/orc/pull/664


   ### What changes were proposed in this pull request?
   
   This PR aims to support ZSTD compression at benchmark ORC data generation.
   
   ### Why are the changes needed?
   
   Apache ORC 1.6+ supports ZSTD compression.
   
   ### How was this patch tested?
   
   Manually
   ```
   $ cd java/bench
   $ mvn clean package
   $ ./fetch-data.sh // Note that this takes a long time.
   
   $ java -jar core/target/orc-benchmarks-core-*-uber.jar generate data -c zstd 
-f orc
   
   $ ls -alhR data/generated
   data/generated:
   total 20K
   drwxrwxr-x 5 dongjoon dongjoon 4.0K Mar 22 17:14 .
   drwxrwxr-x 4 dongjoon dongjoon 4.0K Mar 22 17:05 ..
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:14 github
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:11 sales
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:08 taxi
   
   data/generated/github:
   total 4.2G
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:14 .
   drwxrwxr-x 5 dongjoon dongjoon 4.0K Mar 22 17:14 ..
   -rw-r--r-- 1 dongjoon dongjoon 4.1G Mar 22 17:29 orc.zstd
   -rw-r--r-- 1 dongjoon dongjoon  33M Mar 22 17:29 .orc.zstd.crc
   
   data/generated/sales:
   total 2.5G
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:11 .
   drwxrwxr-x 5 dongjoon dongjoon 4.0K Mar 22 17:14 ..
   -rw-r--r-- 1 dongjoon dongjoon 2.5G Mar 22 17:14 orc.zstd
   -rw-r--r-- 1 dongjoon dongjoon  20M Mar 22 17:14 .orc.zstd.crc
   
   data/generated/taxi:
   total 452M
   drwxrwxr-x 2 dongjoon dongjoon 4.0K Mar 22 17:08 .
   drwxrwxr-x 5 dongjoon dongjoon 4.0K Mar 22 17:14 ..
   -rw-r--r-- 1 dongjoon dongjoon 448M Mar 22 17:11 orc.zstd
   -rw-r--r-- 1 dongjoon dongjoon 3.5M Mar 22 17:11 .orc.zstd.crc
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to