Is there an example of LZO and is that what I want customers to deliver as, correct? As when LZO comes in and is layed over hadoop, it is split into chunks on different nodes so I can decompress in parallel?
We have a customer who sends 10 million line file. I see some other stats from production on a higher end system that is taking 5 hours to decompress so naturally I would like to make this faster. Thanks, Dean This message and any attachments are intended only for the use of the addressee and may contain information that is privileged and confidential. If the reader of the message is not the intended recipient or an authorized representative of the intended recipient, you are hereby notified that any dissemination of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by e-mail and delete the message and any attachments from your system.
