On 07/23/15 22:04, Jason Altekruse wrote:
> I'm very glad to hear that it exceeded your expectations. An important
> point I would like to add, when you unzipped the file you likely allowed
> Drill to read not only on both nodes, but also on multiple threads on each
> node. When the file was compressed, only a single thread was reading and
> processing it.

Also, bzip2 does not work out of the box in Drill, and parallel reading of a compressed file does not seem to be possible.

So when compression is needed, it seems that either Parquet is required, or further tests are needed on how to calculate a query plan for a compressed file (if that is even possible at all).

Anyway, thanks for the help. Using uncompressed CSV did the trick for my first problem.

Best Regards
Jürgen
