I'm querying 20 GB of gzipped JSON split across ~5600 small files, with sizes ranging from 1 MB to 30 MB. Drill is running on AWS on 4 m4.xlarge nodes, and it takes around 50 minutes before the query starts executing.

Any idea what could be causing this delay? What's the best way to debug this?

Thanks,

--
Oscar Morante
"Self-education is, I firmly believe, the only kind of education there is."
                                                         -- Isaac Asimov.
