Dear team,
We are planning to use Apache Drill in our project to query Parquet
files that reside in the file system or in OpenStack Swift. We would use it
in our web application for analytics.
We need the following questions clarified before we can decide how to proceed.
1. Suppose a directory holds 1000 Parquet files and the results we need
are in only 5 of them. Does Drill read the metadata of all 1000 Parquet
files, or only that of the 5 relevant files?
2. Is it possible to install Apache Drill in cluster mode, for scaling,
without using HDFS?
Thanks,
Basil