Any experience of running drill on GlusterFS or similar storage systems? How much performance loss would incur because of unavailability of data locality?
Regards, Rahul On Wed, May 10, 2017 at 11:11 AM, Abhishek Girish <[email protected]> wrote: > Do you wish to use Drill in distributed mode with each node having it's own > local file system or do you plan to use it with a different data source > which is also a distributed file system (but not HDFS / MapR-FS)? > > If the former, yes you should be able to form a Drill cluster by bringing > up Drillbits in standalone mode on multiple disjoint nodes. You will still > need ZooKeeper for cluster coordination. But understand that since each > node can only talk to files on it's local file system, the Drill cluster > will not have a unified view and access of the files for distributed > processing. Your queries may fail, as a Drillbit might fail to access data. > To experiment, you can make sure the directories and files you need to > query are identical on each node. However, this is untested and I'm not > sure if it will indeed work. > > If it's the latter, can you share what data source you have in mind? > > On Mon, May 8, 2017 at 11:41 AM, Matt <[email protected]> wrote: > > > I have seen some posts in the past about Drill nodes mounted "close to > the > > data", and am wondering if its possible to use Drill as a cluster without > > HDFS? > > > > Using ZK would not be an issue in itself, and there are apparently > options > > like https://github.com/mhausenblas/dromedar > > > > Any experiences with this? > > > -- **** This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom it is addressed. If you are not the named addressee then you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and delete this e-mail from your system.****
