Does anyone have experience running Drill on GlusterFS or similar storage
systems? How much performance would be lost because of the lack of data
locality?

Regards,
Rahul

On Wed, May 10, 2017 at 11:11 AM, Abhishek Girish <[email protected]>
wrote:

> Do you wish to use Drill in distributed mode with each node having its own
> local file system or do you plan to use it with a different data source
> which is also a distributed file system (but not HDFS / MapR-FS)?
>
> If the former, yes you should be able to form a Drill cluster by bringing
> up Drillbits in standalone mode on multiple disjoint nodes. You will still
> need ZooKeeper for cluster coordination. But understand that since each
> node can only talk to files on its local file system, the Drill cluster
> will not have a unified view of, or access to, the files for distributed
> processing. Your queries may fail, as a Drillbit might fail to access data.
> To experiment, you can make sure the directories and files you need to
> query are identical on each node. However, this is untested and I'm not
> sure if it will indeed work.
>
> If it's the latter, can you share what data source you have in mind?
>
> On Mon, May 8, 2017 at 11:41 AM, Matt <[email protected]> wrote:
>
> > I have seen some posts in the past about Drill nodes mounted "close to
> > the data", and am wondering if it's possible to use Drill as a cluster
> > without HDFS?
> >
> > Using ZK would not be an issue in itself, and there are apparently
> > options like https://github.com/mhausenblas/dromedar
> >
> > Any experiences with this?
> >
>

