I have no such experience.
The performance loss could vary from minor to profound depending on your
query, network and disk setup.
On Tue, May 9, 2017 at 11:56 PM, Rahul Raj
wrote:
> Any experience of running drill on GlusterFS or similar storage systems?
>
Any experience of running drill on GlusterFS or similar storage systems?
How much performance loss would incur because of unavailability of data
locality?
Regards,
Rahul
On Wed, May 10, 2017 at 11:11 AM, Abhishek Girish
wrote:
> Do you wish to use Drill in distributed mode
Do you wish to use Drill in distributed mode with each node having it's own
local file system or do you plan to use it with a different data source
which is also a distributed file system (but not HDFS / MapR-FS)?
If the former, yes you should be able to form a Drill cluster by bringing
up
Using Drill against any kind of distributed data store is a fine thing. If
data locality matters, then it is nice if Drill can see what data is where.
Regardless, using Drill with out HDFS works great.
I should point out that using Drill with MapR is technically using it
without HDFS, but since
Hey Matt,
Yes we can use Drill in distribute mode or install on a cluster we did that
but for dev purpose in prod environment we had hadoop still you can do that
and steps are pretty much available in
https://drill.apache.org/docs/installing-drill-on-the-cluster/
Best Regards,
ANKIT BEOHAR
On