Re: Requesting to add into a Contributor Group

2013-05-05 Thread Robert Muir
done. let us know if you have any problems. On Sat, May 4, 2013 at 10:12 AM, Krunal jariwalakru...@gmail.com wrote: Dear Sir, Kindly add me to the contributor group to help me contribute to the Solr wiki. My Email id: jariwalakru...@gmail.com Login Name: Krunal Specific changes I would

Re: Why is SolrCloud doing a full copy of the index?

2013-05-05 Thread Erick Erickson
Second the thanks Erick On Sat, May 4, 2013 at 6:08 PM, Lance Norskog goks...@gmail.com wrote: Great! Thank you very much Shawn. On 05/04/2013 10:55 AM, Shawn Heisey wrote: On 5/4/2013 11:45 AM, Shawn Heisey wrote: Advance warning: this is a long reply. I have condensed some

Re: How to get solr synonyms in result set.

2013-05-05 Thread Erick Erickson
Sure, you can specify a separate synonyms list at query time, just define an index and query time analysis chain one without the synonym filter factory and one without. Be aware that index-time and query-time have some different characteristics, especially around multi-word synonyms see:

Re: Why is SolrCloud doing a full copy of the index?

2013-05-05 Thread Kumar Limbu
Thanks for the replies. It is really appreciated. Based on the replies it seems like upgrading to the latest version of Solr is something that will probably resolve this issue. We are also update quite frequently. We update every 5 minutes. We will try and set this to higher interval and see if

Re: custom tokenizer error

2013-05-05 Thread Jack Krupansky
I didn't notice any call to the reset method for your base tokenizer. Is there any reason that you didn't just use char filters to replace colon and periods with spaces? -- Jack Krupansky -Original Message- From: Sarita Nair Sent: Friday, May 03, 2013 2:43 PM To:

zookeeper errors

2013-05-05 Thread Michael Della Bitta
Hi, I'm running a Solr 4.2.1 cloud with an external three-node Zookeeper 2.4.5 setup. I'm seeing a lot of these errors in the zookeeper logs: 2013-05-05 15:06:22,863 - WARN [NIOServerCxn.Factory: 0.0.0.0/0.0.0.0:2181:NIOServerCnxn@349] - caught end of stream exception Also some of these:

Re: disaster recovery scenarios for solr cloud and zookeeper

2013-05-05 Thread Jack Krupansky
From the wiki: SolrCloud can continue to serve results without interruption as long as at least one server hosts every shard. You can demonstrate this by judiciously shutting down various instances and looking for results. If you have killed all of the servers for a particular shard, requests

Re: disaster recovery scenarios for solr cloud and zookeeper

2013-05-05 Thread Mark Miller
When Solr loses it's connection to ZooKeeper, updates will start being rejected. Read requests will continue as normal. This is regardless of how long ZooKeeper is down. So it's pretty simple - when you lost the ability to talk to ZK, everything keeps working based on the most recent

Re: zookeeper errors

2013-05-05 Thread Mark Miller
It sounds like you probably need to raise the default 15 sec zk client timeout. We have it default to a fairly aggressive setting. For high load envs, you may have to bring it to 30 or 45 seconds. Also, be sure you are not using a stop the world gc collector - CMS (the concurrent low pause

Re: zookeeper errors

2013-05-05 Thread Michael Della Bitta
Mark, I'm definitely using CMS, so I'll look into the zk client timeout. Thanks! Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Sun, May 5, 2013 at 2:21

Re: disaster recovery scenarios for solr cloud and zookeeper

2013-05-05 Thread Jack Krupansky
Is soul retrieval possible when ZooKeeper is down? -- Jack Krupansky -Original Message- From: Mark Miller Sent: Sunday, May 05, 2013 2:19 PM To: solr-user@lucene.apache.org Subject: Re: disaster recovery scenarios for solr cloud and zookeeper When Solr loses it's connection to

Re: Why is SolrCloud doing a full copy of the index?

2013-05-05 Thread Kristopher Kane
Advance warning: this is a long reply. Awesome Shawn. Thanks!

iterate through each document in Solr

2013-05-05 Thread Mingfeng Yang
Dear Solr Users, Does anyone know what is the best way to iterate through each document in a Solr index with billion entries? I tried to use select?q=*:*start=xxrows=500 to get 500 docs each time and then change start value, but it got very slow after getting through about 10 million docs.

Re: iterate through each document in Solr

2013-05-05 Thread Michael Sokolov
On 5/5/13 7:48 PM, Mingfeng Yang wrote: Dear Solr Users, Does anyone know what is the best way to iterate through each document in a Solr index with billion entries? I tried to use select?q=*:*start=xxrows=500 to get 500 docs each time and then change start value, but it got very slow after

Re: How to get solr synonyms in result set.

2013-05-05 Thread varun srivastava
Hi Suneel, After discovering that only query time synonym work with solr I found a good article on pros and cons of query and index time synonyms . It may help you http://nolanlawson.com/2012/10/31/better-synonym-handling-in-solr/ Regards Varun On Sun, May 5, 2013 at 9:20 AM, Erick Erickson

Re: How to get solr synonyms in result set.

2013-05-05 Thread Upayavira
There is no way to identify *which* synonyms triggered in your search output. You could implement a synonyms search component, that looks in the stored values of configured fields for synonyms, and adds another block of XML to the output. This old be useful component. Upayavira On Mon, May 6,