Hello, I m currently trying to use Drill to query MinIO (S3 API) but I am having a lot of problems related to the time it takes (I got a lot of timeouts). Both services (one instance each) are running in docker in my local computer.
The problem is that the first query takes like 40+ seconds and, after it has finished, it takes less than 1 second. I am querying a very small parquet file. As an instance, these are two queries that I executed. The first query planning took 27.08 seconds: 01/10/2020 13:42:04 anonymous SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 5.421 sec 0eff029cf8dc 01/10/2020 13:37:27 anonymous SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 33.508 sec 0eff029cf8dc This is not an isolated case. It happens everytime I try to use it. I run a new docker clean image each time. Also, if I try to execute the same query multiple times (because of timeout) I get the same problem till the first query (48.296s planning in this case) finishes. Some times I even get slow queries after thath (3+ seconds). 01/28/2020 10:33:59 anonymous <http://localhost:9000/profiles/21cff1e7-a8a8-6128-d329-ac369bd69c32> SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 3.494 sec 86acfa9818e1 01/28/2020 10:21:14 anonymous <http://localhost:9000/profiles/21cff4e5-fe98-2db8-c617-d15c96470235> SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 4.595 sec 86acfa9818e1 01/28/2020 10:20:33 anonymous <http://localhost:9000/profiles/21cff50d-ae9c-1629-f80f-db5c3a253762> SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 31.801 sec 86acfa9818e1 01/28/2020 10:20:16 anonymous <http://localhost:9000/profiles/21cff51e-e55f-456e-c399-51289fadb77a> SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 49.098 sec 86acfa9818e1 01/28/2020 10:20:03 anonymous <http://localhost:9000/profiles/21cff52e-2792-79e8-48b5-f258e6efb02b> SELECT N_NAME as COUNTRY FROM minio_jupyter.`nation.parquet` WHERE N_REGIONKEY = 2 Succeeded 01 min 2.494 sec 86acfa9818e1 Thank you for your help, Marc