On 07/23/15 22:04, Jason Altekruse wrote:
> I'm very glad to hear that it exceeded your expectations. An important
> point I would like to add, when you unzipped the file you likely allowed
> drill to ready not only on both nodes, but also on multiple threads on each
> node. When the file was compressed, only a single thread was reading and
> processing it.


Also bzip2 does not work out of the box in drill. Parallelization seems
not possible

So, when it comes to the need of compression it seems parquet is needed
or there are further tests made howto calculate an query plan for a
compressed file. (if this is even possible at all)

Anyway, thanks for the help, using uncompressed csv did the trick for my
first problem anyway

Best Regards

Jürgen

Reply via email to