How does ReplicationHandler backup work?

2009-08-28 Thread vivek sar
Hi, As one of our requirement we need to backup Master indexes to Slave periodically. I've been able to successfully sync the index using fetchIndex command,

Re: Adding cores dynamically

2009-08-25 Thread vivek sar
There were two main reasons we went with multi-core solution, 1) We found the indexing speed starts dipping once the index grow to a certain size - in our case around 50G. We don't optimize, but we have to maintain a consistent index speed. The only way we could do that was keep creating new

Re: Replication over multi-core solr

2009-08-19 Thread vivek sar
Licinio, Please open a separate thread - as it's a different issue - and I can respond there. -vivek 2009/8/19 Licinio Fernández Maurelo licinio.fernan...@gmail.com: Hi Vivek, currently we want to add cores dynamically when the active one reaches some capacity, can you give me some hints

Re: Adding cores dynamically

2009-08-19 Thread vivek sar
Lici, We're doing similar thing with multi-core - when a core reaches capacity (in our case 200 million records) we start a new core. We are doing this via web service call (Create web service), http://wiki.apache.org/solr/CoreAdmin This is all done in java code - before writing we check

Replication over multi-core solr

2009-08-18 Thread vivek sar
Hi, We use multi-core setup for Solr, where new cores are added dynamically to solr.xml. Only one core is active at a time. My question is how can the replication be done for multi-core - so every core is replicated on the slave? I went over the wiki,

Re: Boosting for most recent documents

2009-08-03 Thread vivek sar
Hi, Related question to getting the latest records first. After trying few suggested ways (function query, index time boosting) of getting the latest first I settled for simple sort parameter, sort=field+asc As per wiki, http://wiki.apache.org/solr/SchemaDesign?highlight=(sort), Lucene

Re: Boosting for most recent documents

2009-07-15 Thread vivek sar
: http://wiki.apache.org/solr/SolrRelevancyFAQ#head-b1b1cdedcb9cd9bfd9c994709b4d7e540359b1fd Bill On Thu, Jul 9, 2009 at 5:58 PM, vivek sar vivex...@gmail.com wrote: How do we sort by internal doc id (say on one index only) using Solr? I saw couple of threads saying it (Sort.INDEXORDER

Re: Boosting for most recent documents

2009-07-10 Thread vivek sar
/SolrRelevancyFAQ#head-b1b1cdedcb9cd9bfd9c994709b4d7e540359b1fd Bill On Thu, Jul 9, 2009 at 5:58 PM, vivek sar vivex...@gmail.com wrote: How do we sort by internal doc id (say on one index only) using Solr? I saw couple of threads saying it (Sort.INDEXORDER) was not supported in Solr, http

Re: Boosting for most recent documents

2009-07-09 Thread vivek sar
: vivek sar vivex...@gmail.com To: solr-user solr-user@lucene.apache.org Sent: Wednesday, July 8, 2009 8:34:16 PM Subject: Boosting for most recent documents Hi,   I'm trying to find a way to get the most recent entry for the searched word. For ex., if I have a document with field name user

Re: Boosting for most recent documents

2009-07-09 Thread vivek sar
with the timestamp approach. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Thursday, July 9, 2009 1:13:54 PM Subject: Re: Boosting for most recent documents Thanks Otis. I

Boosting for most recent documents

2009-07-08 Thread vivek sar
Hi, I'm trying to find a way to get the most recent entry for the searched word. For ex., if I have a document with field name user. If I search for user:vivek, I want to get the document that was indexed most recently. Two ways I could think of, 1) Sort by some time stamp field - but with

Re: Servlet filter for Solr

2009-06-10 Thread vivek sar
I've tried both url-pattern (/*) and servlet-name in the filter mapping , but none of it seem to intercept the call. If I put (/*) only up to /solr gets intercepted. Since, I'm using multicore - calls like /solr/core0 don't get intercepted. I want both select and update to be monitored. Any ideas?

Servlet filter for Solr

2009-06-09 Thread vivek sar
Hi, I've to intercept every request to solr (search and update) and log some performance numbers. In order to do so I tried a Servlet filter and added this to Solr's web.xml, filter filter-nameIndexFilter/filter-name

Re: Defining DataDir in Multi-Core

2009-05-19 Thread vivek sar
(in solrconfig.xml file) and create the core via REST call. It should work!!! Thanks regards Prabhu.K vivek sar wrote: Hi,   I tried the latest nightly build (04-01-09) - it takes the dataDir property now, but it's creating the Data dir at the wrong location. For ex., I've the following

Re: Defining DataDir in Multi-Core

2009-05-19 Thread vivek sar
specific examples that shows the way you used the create statement to register new cores on the fly. Thank you . --KK On Tue, May 19, 2009 at 1:17 PM, vivek sar vivex...@gmail.com wrote: Yeah, it was sometime back - it did work. Thanks for following up. On Tue, May 19, 2009 at 12:34 AM

Re: Solr memory requirements?

2009-05-15 Thread vivek sar
if that can cause any problem. I do use range queries for dates - would that have any effect? Any other ideas? Thanks, -vivek On Thu, May 14, 2009 at 8:38 PM, vivek sar vivex...@gmail.com wrote: Thanks Mark. I checked all the items you mentioned, 1) I've omitnorms=true for all my indexed fields

Re: Solr memory requirements?

2009-05-14 Thread vivek sar
and you said this is a large index?  That doesn't smell right... Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Wednesday, May 13, 2009 5:12:00 PM Subject: Re: Solr

Re: Solr memory requirements?

2009-05-14 Thread vivek sar
Erick On Wed, May 13, 2009 at 4:42 PM, vivek sar vivex...@gmail.com wrote: Thanks Otis. Our use case doesn't require any sorting or faceting. I'm wondering if I've configured anything wrong. I got total of 25 fields (15 are indexed and stored, other 10 are just stored). All my fields are basic

Re: Solr memory requirements?

2009-05-14 Thread vivek sar
be holding on and how can we change that behavior? Thanks, -vivek On Thu, May 14, 2009 at 11:33 AM, vivek sar vivex...@gmail.com wrote: I don't know if field type has any impact on the memory usage - does it? Our use cases require complete matches, thus there is no need of any analysis in most

Re: Solr memory requirements?

2009-05-14 Thread vivek sar
interval and raise it. Drop on deck searchers setting. Even then, 800 million...time to distribute I'd think. vivek sar wrote: Some update on this issue, 1) I attached jconsole to my app and monitored the memory usage. During indexing the memory usage goes up and down, which I think

Re: Commits taking too long

2009-05-13 Thread vivek sar
while the commit is happening. We are using Solr 1.4 (nightly build from 3/29/09). Thanks, -vivek On Wed, Apr 15, 2009 at 11:41 AM, Mark Miller markrmil...@gmail.com wrote: vivek sar wrote: Hi,  I've index where I commit every 50K records (using Solrj). Usually this commit takes 20sec

Solr memory requirements?

2009-05-13 Thread vivek sar
Hi, I'm pretty sure this has been asked before, but I couldn't find a complete answer in the forum archive. Here are my questions, 1) When solr starts up what does it loads up in the memory? Let's say I've 4 cores with each core 50G in size. When Solr comes up how much of it would be loaded in

Re: Solr memory requirements?

2009-05-13 Thread vivek sar
- Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Wednesday, May 13, 2009 3:04:46 PM Subject: Solr memory requirements? Hi,   I'm pretty sure this has been asked before, but I couldn't find a complete answer in the forum archive. Here are my

Re: Solr memory requirements?

2009-05-13 Thread vivek sar
, not a characteristic of a field. :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Wednesday, May 13, 2009 4:42:16 PM Subject: Re: Solr memory requirements? Thanks Otis

Re: Solr memory requirements?

2009-05-13 Thread vivek sar
the trick - at least the heap size is not growing as soon as Solr starts up. I ran some searches and they all came out fine. Index rate is also pretty good. Would there be any impact of disabling these listeners? Thanks, -vivek On Wed, May 13, 2009 at 2:12 PM, vivek sar vivex...@gmail.com wrote: Otis

Re: Solr memory requirements?

2009-05-13 Thread vivek sar
and have specified 8G heap size. Any other suggestion on what can I do to control the Solr memory consumption? Thanks, -vivek On Wed, May 13, 2009 at 2:53 PM, vivek sar vivex...@gmail.com wrote: Just an update on the memory issue - might be useful for others. I read the following,  http

Re: Solr memory requirements?

2009-05-13 Thread vivek sar
, Jack Godwin god...@gmail.com wrote: Have you checked the maxBufferedDocs?  I had to drop mine down to 1000 with 3 million docs. Jack On Wed, May 13, 2009 at 6:53 PM, vivek sar vivex...@gmail.com wrote: Disabling first/new searchers did help for the initial load time, but after 10-15 min

Re: Control segment size

2009-05-11 Thread vivek sar
which is not limited by any property - is that true? Is there any work around to limit the index size, beside limiting the index itself? Thanks, -vivek On Fri, May 8, 2009 at 10:02 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Fri, May 8, 2009 at 1:30 AM, vivek sar vivex...@gmail.com

Re: Control segment size

2009-05-07 Thread vivek sar
...@yahoo.com wrote: Hi, You are looking for maxMergeDocs, I believe. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Thursday, April 23, 2009 1:08:20 PM Subject: Control

Delete complete core without stopping Solr

2009-05-06 Thread vivek sar
Hi, I'm using multi-core feature of Solr. Each Solr instance maintains multiple-core - each core of size 100G. I would like to delete older cores directory completely after 2 weeks (using file.delete). Currently, Solr loads all the cores that are listed in solr.xml. I was thinking of following,

Using UUID for unique key

2009-05-05 Thread vivek sar
Hi, I've a distributed Solr instances. I'm using Java's UUID (UUID.randomUUID()) to generate the unique id for my documents. Before adding unique key I was able to commit 50K records in 15sec (pretty constant over the growing index), after adding unique key it's taking over 35 sec for 50k and

Re: Using UUID for unique key

2009-05-05 Thread vivek sar
://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Tuesday, May 5, 2009 1:49:21 PM Subject: Using UUID for unique key Hi, I've a distributed Solr instances. I'm using Java's UUID (UUID.randomUUID

Multiple Solr-instance share same solr.home

2009-04-19 Thread vivek sar
Hi, Is it possible to have two solr instances share the same solr.home? I've two Solr instances running on the same box and I was wondering if I can configure them to have the same solr.home. I tried it, but looks like the second instance overwrites the first one's value in the solr.xml (I'm

Re: Multiple Solr-instance share same solr.home

2009-04-19 Thread vivek sar
trouble - only 1 writer can write to a specific index at a time. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Sunday, April 19, 2009 4:33:00 AM Subject: Multiple Solr

Re: Using CSV for indexing ... Remote Streaming disabled

2009-04-16 Thread vivek sar
Any help on this? Could this error be because of something else (not remote streaming issue)? Thanks. On Wed, Apr 15, 2009 at 10:04 AM, vivek sar vivex...@gmail.com wrote: Hi,  I'm trying using CSV (Solr 1.4, 03/29) for indexing following wiki (http://wiki.apache.org/solr/UpdateCSV). I've

Re: Solr Search Error

2009-04-16 Thread vivek sar
Hi, I'm using the Solr 1.4 (03/29 nightly build) and when searching on a large index (40G) I get the same exception as in this thread, HTTP Status 500 - 13724 java.lang.ArrayIndexOutOfBoundsException: 13724 at org.apache.lucene.search.TermScorer.score(TermScorer.java:74) at

Using CSV for indexing ... Remote Streaming disabled

2009-04-15 Thread vivek sar
Hi, I'm trying using CSV (Solr 1.4, 03/29) for indexing following wiki (http://wiki.apache.org/solr/UpdateCSV). I've updated the solrconfig.xml to have this lines, requestDispatcher handleSelect=true requestParsers enableRemoteStreaming=true multipartUploadLimitInKB=20480 /

Commits taking too long

2009-04-15 Thread vivek sar
Hi, I've index where I commit every 50K records (using Solrj). Usually this commit takes 20sec to complete, but every now and then the commit takes way too long - from 10 min to 30 min. I see more delays as the index size continues to grow - once it gets over 5G I start seeing long commit

Re: Question on StreamingUpdateSolrServer

2009-04-15 Thread vivek sar
.  Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Friday, April 10, 2009 5:59:37 PM Subject: Re: Question on StreamingUpdateSolrServer I also noticed that the Solr app has

Re: Question on StreamingUpdateSolrServer

2009-04-14 Thread vivek sar
The machine's ulimit is set to 9000 and the OS has upper limit of 12000 on files. What would explain this? Has anyone tried Solr with 25 cores on the same Solr instance? Thanks, -vivek 2009/4/13 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@gmail.com: On Tue, Apr 14, 2009 at 7:14 AM, vivek sar vivex

Re: Question on StreamingUpdateSolrServer

2009-04-13 Thread vivek sar
. getting a decent search perf w/o autowarming is not easy . autowarmCount is an attribute of a cache .see here http://wiki.apache.org/solr/SolrCaching On Mon, Apr 13, 2009 at 3:32 AM, vivek sar vivex...@gmail.com wrote: Thanks Shalin. I noticed couple more things. As I index around 100 million

Re: Question on StreamingUpdateSolrServer

2009-04-13 Thread vivek sar
, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Mon, Apr 13, 2009 at 12:36 PM, vivek sar vivex...@gmail.com wrote: I index in 10K batches and commit after 5 index cyles (after 50K). Is there any limitation that I can't search during commit or auto-warming? I got 8 CPU cores and only 2 were

Re: Question on StreamingUpdateSolrServer

2009-04-13 Thread vivek sar
help is very much appreciated. Thanks, -vivek On Mon, Apr 13, 2009 at 10:52 AM, vivek sar vivex...@gmail.com wrote: Here is some more information about my setup, Solr - v1.4 (nightly build 03/29/09) Servlet Container - Tomcat 6.0.18 JVM - 1.6.0 (64 bit) OS -  Mac OS X Server 10.5.6

Re: Question on StreamingUpdateSolrServer

2009-04-12 Thread vivek sar
want to make sure that whenever it happens it is fast enough and returns result (instead of exception or a blank screen). Thanks for all the help. -vivek On Sat, Apr 11, 2009 at 1:48 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Sun, Apr 12, 2009 at 2:15 AM, vivek sar vivex

Re: Question on StreamingUpdateSolrServer

2009-04-11 Thread vivek sar
, Apr 11, 2009 at 3:29 AM, vivek sar vivex...@gmail.com wrote: I also noticed that the Solr app has over 6000 file handles open -    lsof | grep solr | wc -l   - shows 6455 I've 10 cores (using multi-core) managed by the same Solr instance. As soon as start up the Tomcat the open file count

Re: Question on Solr Distributed Search

2009-04-10 Thread vivek sar
, vivek sar vivex...@gmail.com wrote: Just an update. I changed the schema to store the unique id field, but I still get the connection reset exception. I did notice that if there is no data in the core then it returns the 0 result (no exception), but if there is data and you search using shards

Question on StreamingUpdateSolrServer

2009-04-10 Thread vivek sar
Hi, I was using CommonsHttpSolrServer for indexing, but having two threads writing (10K batches) at the same time was throwing, ProtocolException: Unbuffered entity enclosing request can not be repeated. I switched to StreamingUpdateSolrServer (using addBeans) and I don't see the problem

Re: Question on StreamingUpdateSolrServer

2009-04-10 Thread vivek sar
the segments from all the cores - is it because of auto-warmer? 2) How can I reduce the open file count? 3) Is there a way to stop the auto-warmer? 4) Could this be related to Tomcat returning blank page for every request? Any ideas? Thanks, -vivek On Fri, Apr 10, 2009 at 1:48 PM, vivek sar

Re: httpclient.ProtocolException using Solrj

2009-04-09 Thread vivek sar
the MultiThreadedHttpConnectionManager when creating the HttpClient instance? On Wed, Apr 8, 2009 at 10:13 PM, vivek sar vivex...@gmail.com wrote: single thread everything works fine. Two threads are fine too for a while and all the sudden problem starts happening. I tried indexing using REST services as well (instead

Re: Searching on mulit-core Solr

2009-04-09 Thread vivek sar
, vivek sar vivex...@gmail.com wrote: Any help on this issue? Would distributed search on multi-core on same Solr instance even work? Does it has to be different Solr instances altogether (separate shards)? I'm kind of stuck at this point right now. Keep getting one of the two errors (when

Re: Searching on mulit-core Solr

2009-04-09 Thread vivek sar
$Worker.run(JIoEndpoint.java:447) at java.lang.Thread.run(Thread.java:637) Any tips on how can I search on multicore on same solr instance? Thanks, -vivek On Thu, Apr 9, 2009 at 2:56 AM, Erik Hatcher e...@ehatchersolutions.com wrote: On Apr 9, 2009, at 3:00 AM, vivek sar wrote:  Can

Re: Searching on mulit-core Solr

2009-04-09 Thread vivek sar
else can I do this? Thanks, -vivek On Fri, Apr 3, 2009 at 1:02 PM, vivek sar vivex...@gmail.com wrote: Hi,  I've a multi-core system (one core per day), so there would be around 30 cores in a month on a box running one Solr instance. We have two boxes running the Solr instance and input data

Re: httpclient.ProtocolException using Solrj

2009-04-09 Thread vivek sar
to be running in the same jvm with Solr webapp? Thanks, -vivek 2009/4/9 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@gmail.com: how many documents are you inserting ? may be you can create multiple instances of CommonshttpSolrServer and upload in parallel On Thu, Apr 9, 2009 at 11:58 AM, vivek sar

Re: httpclient.ProtocolException using Solrj

2009-04-09 Thread vivek sar
#head-2046bbaba3759b6efd0e33e93f5502038c01ac65 I could index at the rate of 10,000 docs/sec using this and BinaryRequestWriter On Thu, Apr 9, 2009 at 10:36 PM, vivek sar vivex...@gmail.com wrote: I'm inserting 10K in a batch (using addBeans method). I read somewhere in the wiki that it's

Question on Solr Distributed Search

2009-04-09 Thread vivek sar
Hi, I've another thread on multi-core distributed search, but just wanted to put a simple question here on distributed search to get some response. I've a search query, http://etsx19.co.com:8080/solr/20090409_9/select?q=usa - returns with 10 result now if I add shards parameter to it,

Re: Question on Solr Distributed Search

2009-04-09 Thread vivek sar
) at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583) at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447) at java.lang.Thread.run(Thread.java:637) On Thu, Apr 9, 2009 at 5:01 PM, vivek sar vivex...@gmail.com wrote: Hi,  I've another

Re: Question on Solr Distributed Search

2009-04-09 Thread vivek sar
) at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:398) On Thu, Apr 9, 2009 at 6:51 PM, vivek sar vivex...@gmail.com wrote: I think the reason behind the connection reset is. Looking at the code it points to QueryComponent.mergeIds() resultIds.put

Re: httpclient.ProtocolException using Solrj

2009-04-08 Thread vivek sar
) at org.apache.solr.client.solrj.request.UpdateRequest.process(UpdateRequest.java:259) at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:48) at org.apache.solr.client.solrj.SolrServer.addBeans(SolrServer.java:57) Thanks, -vivek On Sat, Apr 4, 2009 at 1:07 AM, vivek sar vivex...@gmail.com wrote: Hi

Re: Searching on mulit-core Solr

2009-04-08 Thread vivek sar
,localhost:8085/solr/core1,10.4.x.x:8080/solr/core0,10.4.x.x:8085/solr/core0,10.4.x.x:8080/solr/core1,10.4.x.x:8085/solr/core1indent=trueq=vivek+japan I get 404 error. Is this the right URL construction for my setup? How else can I do this? Thanks, -vivek On Fri, Apr 3, 2009 at 1:02 PM, vivek sar vivex

Re: httpclient.ProtocolException using Solrj

2009-04-08 Thread vivek sar
thread? what is the version of SolrJ that you use? On Wed, Apr 8, 2009 at 1:19 PM, vivek sar vivex...@gmail.com wrote: Hi,  Any ideas on this issue? I ran into this again - once it starts happening it keeps happening. One of the thread keeps failing. Here are my SolrServer settings

Re: Searching on mulit-core Solr

2009-04-08 Thread vivek sar
work fine) as mentioned in this thread earlier. Thanks, -vivek On Wed, Apr 8, 2009 at 1:57 AM, vivek sar vivex...@gmail.com wrote: Thanks Fergus. I'm still having problem with multicore search. I tried the following with two cores (they both share the same schema and solrconfig.xml

Re: Searching on mulit-core Solr

2009-04-06 Thread vivek sar
:8085/solr/core1,10.4.x.x:8080/solr/core0,10.4.x.x:8085/solr/core0,10.4.x.x:8080/solr/core1,10.4.x.x:8085/solr/core1indent=trueq=vivek+japan I get 404 error. Is this the right URL construction for my setup? How else can I do this? Thanks, -vivek On Fri, Apr 3, 2009 at 1:02 PM, vivek sar vivex

httpclient.ProtocolException using Solrj

2009-04-04 Thread vivek sar
Hi, I'm sending 15K records at once using Solrj (server.addBeans(...)) and have two threads writing to same index. One thread goes fine, but the second thread always fails with, org.apache.solr.client.solrj.SolrServerException: org.apache.commons.httpclient.ProtocolException: Unbuffered entity

Searching on mulit-core Solr

2009-04-03 Thread vivek sar
Hi, I've a multi-core system (one core per day), so there would be around 30 cores in a month on a box running one Solr instance. We have two boxes running the Solr instance and input data is feeded to them in round-robin fashion. Each box can have up to 30 cores in a month. Here are questions,

Solr 1.4 (nightly build) seem hung under load

2009-04-03 Thread vivek sar
Hi, I'm using Solr 1.4 (nightly build - 03/29/09). I'm stress testing my application with Solr. My app uses Solrj to write to remote Solr (on same box, but different JVM). The stress test sends over 2 million records (1 record = 500 bytes, with each record having 10 fields) within 5 minutes.

Re: Solr 1.4 (nightly build) seem hung under load

2009-04-03 Thread vivek sar
not running any optimize command. What could cause Solr to hang for 80 min? Thanks, -vivek On Fri, Apr 3, 2009 at 1:55 PM, vivek sar vivex...@gmail.com wrote: Hi,  I'm using Solr 1.4 (nightly build - 03/29/09). I'm stress testing my application with Solr. My app uses Solrj to write to remote Solr

Re: Runtime exception when adding documents using solrj

2009-04-02 Thread vivek sar
system. 6) The use solrj to add beans to the recently created core On Wed, Apr 1, 2009 at 8:26 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Thu, Apr 2, 2009 at 2:34 AM, vivek sar vivex...@gmail.com wrote: Thanks Shalin. I added that in the solrconfig.xml, but now I get

Re: java.lang.ClassCastException: java.lang.Long using Solrj

2009-04-02 Thread vivek sar
://wiki.apache.org/solr/Solrj#head-12c26b2d7806432c88b26cf66e236e9bd6e91849 On Thu, Apr 2, 2009 at 4:21 AM, vivek sar vivex...@gmail.com wrote: Hi,  I'm using solrj (released v 1.3) to add my POJO objects (server.addbeans(...)), but I'm getting this exception, java.lang.ClassCastException

Re: Defining DataDir in Multi-Core

2009-04-01 Thread vivek sar
, you can pass the dataDir as an extra parameter? On Wed, Apr 1, 2009 at 7:41 AM, vivek sar vivex...@gmail.com wrote: Hi,  I'm trying to set up cores dynamically. I want to use the same schema.xml and solrconfig.xml for all the created cores, so plan to pass the same instance directory

Re: Merging Solr Indexes

2009-04-01 Thread vivek sar
approach below, but without the headache of managing multiple cores and index merging (not yet possible to do programatically). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org

Re: Defining DataDir in Multi-Core

2009-04-01 Thread vivek sar
Mangar shalinman...@gmail.com wrote: On Wed, Apr 1, 2009 at 1:48 PM, vivek sar vivex...@gmail.com wrote: I'm using the latest released one - Solr 1.3. The wiki says passing dataDir to CREATE action (web service) should work, but that doesn't seem to be working. That is a Solr 1.4 feature

Re: Defining DataDir in Multi-Core

2009-04-01 Thread vivek sar
, -vivek On Wed, Apr 1, 2009 at 9:45 AM, vivek sar vivex...@gmail.com wrote: Thanks Shalin. Is it available in the latest nightly build? Is there any other way I can create cores dynamically (using CREATE service) which will use the same schema.xml and solrconfig.xml, but write to different data

Re: Runtime exception when adding documents using solrj

2009-04-01 Thread vivek sar
Hi, I'm trying to add the list of POJO objects (using annotations) using solrj, but the server.addBeans(...) is throwing this exception, org.apache.solr.common.SolrException: Bad Request Bad Request request: http://localhost:8080/solr/core0/update?wt=javabinversion=2.2 Note, I'm using

Re: Runtime exception when adding documents using solrj

2009-04-01 Thread vivek sar
at 1:13 AM, vivek sar vivex...@gmail.com wrote: Hi,  I'm trying to add the list of POJO objects (using annotations) using solrj, but the server.addBeans(...) is throwing this exception, org.apache.solr.common.SolrException: Bad Request Bad Request request: http://localhost:8080/solr/core0

java.lang.ClassCastException: java.lang.Long using Solrj

2009-04-01 Thread vivek sar
Hi, I'm using solrj (released v 1.3) to add my POJO objects (server.addbeans(...)), but I'm getting this exception, java.lang.ClassCastException: java.lang.Long at org.apache.solr.common.util.NamedListCodec.unmarshal(NamedListCodec.java:89) at

Merging Solr Indexes

2009-03-31 Thread vivek sar
Hi, As part of speeding up the index process I'm thinking of spawning multiple threads which will write to different temporary SolrCores. Once the index process is done I want to merge all the indexes in temporary cores to a master core. For ex., if I want one SolrCore per day then every index

Defining DataDir in Multi-Core

2009-03-31 Thread vivek sar
Hi, I'm trying to set up cores dynamically. I want to use the same schema.xml and solrconfig.xml for all the created cores, so plan to pass the same instance directory, but different dir directory. Here is what I got in solr.xml by default (I didn't want define any core here, but looks like we

Re: OOM at MultiSegmentReader.norms

2009-03-30 Thread vivek sar
use of that nice ramBufferSizeMB setting. :) Also, use omitNorms=true for fields that don't need norms (if their types don't already do that). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr

Re: How to optimize Index Process?

2009-03-28 Thread vivek sar
Thanks Otis. This is very useful. I'll try all your suggestions and post my findings (and improvements). Thanks, -vivek On Fri, Mar 27, 2009 at 7:08 PM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi, Answers inlined. -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

How to optimize Index Process?

2009-03-27 Thread vivek sar
Hi, We have a distributed Solr system (2-3 boxes with each running 2 instances of Solr and each Solr instance can write to multiple cores). Our use case is high index volume - we can get up to 100 million records (1 record = 500 bytes) per day, but very low query traffic (only administrators

OOM at MultiSegmentReader.norms

2009-03-27 Thread vivek sar
Hi, I've index of size 50G (around 100 million documents) and growing - around 2000 records (1 rec = 500 byes) are being written every second continuously. If I make any search on this index I get OOM. I'm using default cache settings (512,512,256) in the solrconfig.xml. The search is using

Re: Partition index by time using Solr

2009-03-26 Thread vivek sar
up with a Solr Proxy component that abstract some/all of this and pretends to be Solr. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: vivek sar vivex...@gmail.com To: solr-user@lucene.apache.org Sent: Wednesday, March 25, 2009 3:52:11

Re: Partition index by time using Solr

2009-03-26 Thread vivek sar
Thanks again Otis. Few more questions, 1) My app currently is a stand-alone java app (not part of Solr JVM) that simply calls update webservice on Solr (running in a separate web container) passing 10k documents at once. In your example you mentioned getting list of Indexers and adding document

Partition index by time using Solr

2009-03-25 Thread vivek sar
Hi, I've used Lucene before, but new to Solr. I've gone through the mailing list, but unable to find any clear idea on how to partition Solr indexes. Here is what we want, 1) Be able to partition indexes by timestamp - basically partition per day (create a new index directory every day)