For those people using LZO compression: While I know there is
http://github.com/kevinweil/hadoop-lzo The native stuff makes it a bit of a hurdle. Especially if you are just running on Amazon Elastic Map Reduce it's way easier to just run this java-only indexer instead. http://github.com/tcurdt/lzo-index It's not much tested yet and I am sure it will still need some work ...but I thought I just announce it and maybe get some more testers and maybe some feedback. cheers -- Torsten
