RE: Solr with Hadoop

2013-07-18 Thread Saikat Kanjilal
I'm familiar with and have used the DSE cluster, and I'm in the process of evaluating Cloudera Search. In general, Cloudera Search has tight integration with HDFS and takes care of replication and sharding transparently by using the pre-existing HDFS replication and sharding, however

Re: preferred container for running SolrCloud

2013-07-11 Thread Saikat Kanjilal
We're running under jetty. Sent from my iPhone On Jul 11, 2013, at 6:06 PM, Ali, Saqib docbook@gmail.com wrote: 1) Jboss 2) Jetty 3) Tomcat 4) Other.. ?

RE: preferred container for running SolrCloud

2013-07-11 Thread Saikat Kanjilal
? On Thu, Jul 11, 2013 at 7:01 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: We're running under jetty. Sent from my iPhone On Jul 11, 2013, at 6:06 PM, Ali, Saqib docbook@gmail.com wrote: 1) Jboss 2) Jetty 3) Tomcat 4) Other.. ?

RE: preferred container for running SolrCloud

2013-07-11 Thread Saikat Kanjilal
, 2013 at 7:01 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: We're running under jetty. Sent from my iPhone On Jul 11, 2013, at 6:06 PM, Ali, Saqib docbook@gmail.com wrote: 1) Jboss 2) Jetty 3) Tomcat 4) Other.. ?

RE: Content based recommender using lucene/solr

2013-06-28 Thread Saikat Kanjilal
Why not just use Mahout to do this? There is an item similarity algorithm in Mahout that does exactly this :) https://builds.apache.org/job/Mahout-Quality/javadoc/org/apache/mahout/cf/taste/hadoop/similarity/item/ItemSimilarityJob.html You can use Mahout in distributed and non-distributed mode

RE: Content based recommender using lucene/solr

2013-06-28 Thread Saikat Kanjilal
, 2013 at 12:42 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: Why not just use Mahout to do this? There is an item similarity algorithm in Mahout that does exactly this :) https://builds.apache.org/job/Mahout-Quality/javadoc/org/apache/mahout/cf/taste/hadoop/similarity/item

RE: Creating a new core programmicatically in solr

2013-06-04 Thread Saikat Kanjilal
I'm aware of the CoreAdminRequest API. However, given that our Solr cluster machines have their own internal configurations, I'd prefer to use the HTTP approach rather than having to specify the instanceDir or the solrServer. One issue I was thinking of was the double quotes needed

RE: Creating a new core programmicatically in solr

2013-06-04 Thread Saikat Kanjilal
I need to simulate this curl command line with Java code: curl http://10.42.6.74:8983/solr/admin/cores?action=CREATE&name=NEW_SCHEMA.solr; Obviously doing a simple HttpGet with the appropriate query parameters is not the answer. I don't believe your example is going to work because I am

RE: Creating a new core programmicatically in solr

2013-06-04 Thread Saikat Kanjilal
of the &, which tells the shell to run the preceding command in the background. The quotes around the full URL escape the &. -- Jack Krupansky -Original Message- From: Saikat Kanjilal Sent: Tuesday, June 04, 2013 12:56 PM To: solr-user@lucene.apache.org Subject: RE: Creating a new
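
For illustration only, and not taken from the thread: in Java the & is just a parameter separator inside a string, so there is nothing to escape the way there is in a shell. Below is a minimal sketch, assuming a hypothetical helper class CoreCreateUrl, a placeholder base URL, and only the action and name parameters visible in the curl command quoted in this thread. A real CREATE call may need more parameters than the truncated message shows.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper (not part of Solr or SolrJ): builds a CoreAdmin URL
// from a base Solr URL and an ordered map of query parameters.
final class CoreCreateUrl {

    // Joins parameters with '&' and URL-encodes each key and value.
    static String build(String solrBase, Map<String, String> params) {
        StringBuilder url = new StringBuilder(solrBase).append("/admin/cores?");
        boolean first = true;
        for (Map.Entry<String, String> e : params.entrySet()) {
            if (!first) {
                url.append('&'); // plain separator; no shell is involved here
            }
            url.append(encode(e.getKey())).append('=').append(encode(e.getValue()));
            first = false;
        }
        return url.toString();
    }

    // URL-encodes one query component as UTF-8.
    private static String encode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException ex) {
            // UTF-8 is required to be supported by every Java runtime.
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        Map<String, String> params = new LinkedHashMap<>();
        params.put("action", "CREATE");
        params.put("name", "NEW_SCHEMA");
        // The base URL is a placeholder; substitute your own host and port.
        System.out.println(build("http://localhost:8983/solr", params));
    }
}
```

The resulting string can be handed to whatever HTTP client is already in use. Encoding each value separately means a core name with spaces or other reserved characters cannot break the query string.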

RE: Keeping a rolling window of indexes around solr

2013-05-28 Thread Saikat Kanjilal
, Saikat Kanjilal sxk1...@hotmail.com wrote: Hello Solr community folks, I am doing some investigative work around how to roll and manage indexes inside our Solr configuration. To date I've come up with an architecture that separates a set of masters that are focused on writes and get

Re: Keeping a rolling window of indexes around solr

2013-05-28 Thread Saikat Kanjilal
Volume of data: 1 log insert every 30 seconds; queries are done sporadically and asynchronously, at a much lower frequency (every few days). Also, the majority of the requests are indeed going to be within a slice of time (typically hours or at most a few days). Type of queries: Keyword or

Keeping a rolling window of indexes around solr

2013-05-24 Thread Saikat Kanjilal
Hello Solr community folks, I am doing some investigative work around how to roll and manage indexes inside our Solr configuration. To date I've come up with an architecture that separates a set of masters that are focused on writes and get replicated periodically and a set of slave shards

RE: Keeping a rolling window of indexes around solr

2013-05-24 Thread Saikat Kanjilal
window. Date: Fri, 24 May 2013 09:07:38 -0600 From: elyog...@elyograg.org To: solr-user@lucene.apache.org Subject: Re: Keeping a rolling window of indexes around solr On 5/24/2013 8:56 AM, Shawn Heisey wrote: On 5/24/2013 8:25 AM, Saikat Kanjilal wrote: Anyways would love to hear thoughts

Re: Keeping a rolling window of indexes around solr

2013-05-24 Thread Saikat Kanjilal
: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Fri, May 24, 2013 at 10:25 AM, Saikat Kanjilal sxk1...@hotmail.com wrote: Hello Solr community