Hi all,

We are conducting a project involving replacing (Linux) system's
libz.so with our own hardware based implementation, but this requires us to replace libzip.so with our own so that small zip processing doesn't go through hardware, as hardware actually cannot process these requests correctly due to structural differences between hardware and software implementations of the deflate algorithm. Anyhow these changes work for other format files when MapRed is used where compression operations of user data do go through hardware library, but not ORC files. We found out that ORC files actually go through libzip.so instead of libz.so. So my questions are, in order to make ORC compression/decompression processing goes to libz.so ( hardware ) instead of libzip.so ( software ):

1. are there any places in Hadoop/Hive configuration files we can change to make this happen?

2. if not, what should be changed in libzip.so so that requests for ORC format can be forwarded to libz.so instead? An equivalent question is: in linzip.so, is there a way to detect incoming data format? If so, which sources in jdk should be looked at?

Many thanks in advance for any help.


--
======Not=Sent=From=Any=Phone=or=Pad=But=Stored=On=Hydra=======
Lihao Xu                                email: li...@ieee.org

Reply via email to