Re: SOLR Data Locality

2017-03-20 Thread Shawn Heisey
On 3/17/2017 11:14 AM, Imad Qureshi wrote: > I understand that but unfortunately that's not an option right now. We > already have 16 TB of index in HDFS. > > So let me rephrase this question. How important is data locality for SOLR. Is > performance impacted if SOLR data is on a remote node? W

Re: SOLR Data Locality

2017-03-17 Thread Toke Eskildsen
Imad Qureshi wrote: > I understand that but unfortunately that's not an option right now. > We already have 16 TB of index in HDFS. > > So let me rephrase this question. How important is data locality for > SOLR. Is performance impacted if SOLR data is on a remote node? The short answer is yes,

Re: SOLR Data Locality

2017-03-17 Thread Imad Qureshi
Hi Mike I understand that but unfortunately that's not an option right now. We already have 16 TB of index in HDFS. So let me rephrase this question. How important is data locality for SOLR. Is performance impacted if SOLR data is on a remote node? Thanks Imad > On Mar 17, 2017, at 12:02 PM,

Re: SOLR Data Locality

2017-03-17 Thread Mike Thomsen
I've only ever used the HDFS support with Cloudera's build, but my experience turned me off to use HDFS. I'd much rather use the native file system over HDFS. On Tue, Mar 14, 2017 at 10:19 AM, Muhammad Imad Qureshi < imadgr...@yahoo.com.invalid> wrote: > We have a 30 node Hadoop cluster and each