How to make ORC use libz.so instead of libzip.so

Lihao Xu Thu, 07 Feb 2019 07:19:42 -0800



Hi all,

We are conducting a project involving replacing (Linux) system's

libz.so with our own hardware based implementation, but this requires usto replace libzip.so with our own so that small zip processing doesn't gothrough hardware, as hardware actually cannot process these requestscorrectly due to structural differences between hardware and softwareimplementations of the deflate algorithm. Anyhow these changes work forother format files when MapRed is used where compression operations ofuser data do go through hardware library, but not ORC files. We found outthat ORC files actually go through libzip.so instead of libz.so. So myquestions are, in order to make ORC compression/decompression processinggoes to libz.so ( hardware ) instead of libzip.so ( software ):

1. are there any places in Hadoop/Hive configuration files we can changeto make this happen?

2. if not, what should be changed in libzip.so so that requests for ORCformat can be forwarded to libz.so instead? An equivalent question is: inlinzip.so, is there a way to detect incoming data format? If so, whichsources in jdk should be looked at?


Many thanks in advance for any help.


--
======Not=Sent=From=Any=Phone=or=Pad=But=Stored=On=Hydra=======
Lihao Xu                                email: li...@ieee.org

How to make ORC use libz.so instead of libzip.so

Reply via email to