Re: Solr Deleted Docs Issue

2015-03-19 Thread vicky desai
Hi, Thanks Erick and Shawn for the reply. Just wanted to clarify that the commit size of 10 was only an example; in production, commits are handled via the auto-commit feature of Solr. The requirement we have is to store around 20-30 lakh (2-3 million) docs, out of which around 5-6 lakh docs get updated daily. What I

Solr Deleted Docs Issue

2015-03-16 Thread vicky desai
Hi, I am having an issue with my Solr setup. In my Solr config I have set the following property: <mergeFactor>10</mergeFactor>. Now consider the following situation. I have *200* documents in my index. I need to update all the 200 docs. If the total number of commit operations I issue is *20*, i.e. I update in batches of 10
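
The scenario above (200 documents, updated in 20 batches of 10) can be put into numbers with a small sketch. It relies on the fact that Lucene implements an update as a delete of the old version plus an add of the new one, and it assumes, as a simplification, that no segment merge reclaims deleted documents along the way. With mergeFactor set to 10, real merges would purge some of them earlier, so the figures are an upper bound rather than what Solr would actually report. The class and method names are hypothetical.

```java
// Hypothetical helper: upper bound on deleted docs after a round of updates,
// ASSUMING no segment merge runs while the updates are applied.
final class DeletedDocsEstimate {

    /** Returns {numDocs, deletedDocs, maxDoc} after all batches are applied. */
    static int[] afterUpdates(int liveDocs, int batches, int batchSize) {
        int deleted = 0;
        for (int b = 0; b < batches; b++) {
            // Each updated document leaves its old version behind as a deleted doc;
            // the number of live documents does not change.
            deleted += batchSize;
        }
        return new int[] {liveDocs, deleted, liveDocs + deleted};
    }

    public static void main(String[] args) {
        int[] r = afterUpdates(200, 20, 10);
        System.out.println("numDocs=" + r[0] + " deletedDocs=" + r[1] + " maxDoc=" + r[2]);
    }
}
```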

Facing issue while implementing connection pooling with solr

2014-07-25 Thread vicky desai
I have this requirement where I want to limit the number of concurrent calls to Solr, say to 50. So I am trying to implement connection pooling in the HTTP client, which is then used in the Solr object HttpSolrServer. Please find the code below: HttpClient httpclient = new
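
Independently of how the HTTP connection pool is configured, a cap on concurrent calls can also be enforced on the caller's side. The following is a minimal sketch of that alternative using a plain java.util.concurrent.Semaphore; it is not the pooling code from the original message, and the class name ConcurrencyLimiter is hypothetical. The Solr request itself would go inside the supplied task.

```java
import java.util.concurrent.Semaphore;
import java.util.function.Supplier;

// Hypothetical wrapper: at most maxConcurrent tasks run at the same time.
final class ConcurrencyLimiter {
    private final Semaphore permits;

    ConcurrencyLimiter(int maxConcurrent) {
        this.permits = new Semaphore(maxConcurrent);
    }

    /** Blocks until a permit is free, runs the task, then returns the permit. */
    <T> T call(Supplier<T> task) {
        permits.acquireUninterruptibly();
        try {
            return task.get();
        } finally {
            permits.release();
        }
    }

    /** Number of calls that could start right now without blocking. */
    int availablePermits() {
        return permits.availablePermits();
    }

    public static void main(String[] args) {
        ConcurrencyLimiter limiter = new ConcurrencyLimiter(50);
        // Placeholder task; a real caller would issue the Solr query here.
        String result = limiter.call(() -> "query result placeholder");
        System.out.println(result);
    }
}
```

A caller that exceeds the limit simply waits, so this bounds the load on Solr without dropping requests.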

Issues faced after docValues migration

2013-11-20 Thread vicky desai
Hi, I am using Solr version 4.3. I am planning to use the docValues feature introduced in Solr 4.2. Although I see a significant improvement in facet and group queries, there is a degradation in group.facet and group.ngroups queries. Has anybody faced a similar issue? Any workarounds?

Re: Solr cloud view shows core as down after using reload action

2013-11-18 Thread vicky desai
Hi Shalin, There is only one shard for this core. I even tried reloading it using the Collections API, but in the cloud view it is still shown as orange. However, as I said, the core is working perfectly fine, so there is no exception in the logs. It is just the cloud view that is bothering me.

Re: Solr cloud view shows core as down after using reload action

2013-11-18 Thread vicky desai
Hi Shalin, I am using Solr 4.3. The cluster state is as follows: {"znode":{ "path":"/clusterstate.json","prop":{ "version":186, "aversion":0, "children_count":0, "ctime":"Mon Nov 18 12:30:53 IST 2013 (1384758053678)", "cversion":0, "czxid":39, "dataLength":34472,

Re: Solr cloud view shows core as down after using reload action

2013-11-18 Thread vicky desai
Hi Shalin, Core ONL-3117132084 is the one which is reloaded -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-cloud-view-shows-core-as-down-after-using-reload-action-tp4101625p4101651.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr cloud view shows core as down after using reload action

2013-11-18 Thread vicky desai
Hi Shalin, Thanks a lot. Yes, it is the same issue. Due to some restrictions I cannot migrate my product to Solr 4.5 as of now, but I will test it offline and post the reply here.

Solr cloud view shows core as down after using reload action

2013-11-17 Thread vicky desai
Hi, I have a Solr setup using an external ZooKeeper. Whenever there are schema changes to be made, I make those changes and upload the new config via cloud-scripts. I then reload the core using the action http://localhost:8190/solr/admin/cores?action=RELOAD&core=coreName Everything works fine

Group.ngroup query slower with docValues

2013-11-06 Thread vicky desai
Hi, I am planning to use the docValues feature of Solr. I have added the docValues="true" parameter to a few fields in my schema on which heavy faceting and grouping queries are run. While I noticed a considerable improvement in faceting queries, I didn't get any improvement in grouping query

Re: Group.ngroup query slower with docValues

2013-11-06 Thread vicky desai
Hi Shawn, Thanks for the reply. But the issue you pointed out talks about the general performance issue of ngroups. What I noticed, however, is that after using docValues the performance of group.ngroups degraded about 2-3 times. This is stopping me from using docValues, which otherwise in case of

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-20 Thread vicky desai
Hi All, There were two fixes for the issue I was facing: 1. changing the version in the schema from *1.1* to *1.5*, OR 2. keeping the version at 1.1 and adding *autoGeneratePhraseQueries*=false to the field type. However, the issue is not completely resolved yet: on searching for content:speedPost the

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-20 Thread vicky desai
Hi Erik, I was going to come to that. Now if I have the word *speedpost* in the index and I don't use catenation at the query end, then a query for the word speedPost won't fetch me the results. It might then make sense to remove the entire WDFF from the query and search for a few possible

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-20 Thread vicky desai
Hi Jack, As mentioned earlier, a part of the issue was resolved by the two fixes I mentioned above, and for the query you mentioned I am getting the same result as yours. What is not working though is the query *q=content:"speedPost"* with the text enclosed in inverted commas.

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-20 Thread vicky desai
Hi Jack, Thanks for the explanation -- View this message in context: http://lucene.472066.n3.nabble.com/struggling-with-solr-WordDelimiterFilterFactory-tp4085021p4085661.html

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-19 Thread vicky desai
Hi, I have created a new index, so reindexing shouldn't be the issue. The analysis page shows me the correct result, and a match should be found as per the analysis page, but there is no output on the actual query. The output of the debug query is as follows: <str name="rawquerystring">content:speedPost</str> <str

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-19 Thread vicky desai
Hi Erik, These are the request handlers defined in solrconfig.xml: <requestHandler name="/analysis/field" class="solr.FieldAnalysisRequestHandler" /> <requestHandler name="standard" class="solr.StandardRequestHandler" default="true" /> <requestHandler name="/update"

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-19 Thread vicky desai
Hi, Another observation while testing: docs having the following values for the content field 1. content:speedPost 2. content:sPeedpost 3. content:speEdpost 4. content:speedposT match the query q=content:speedPost. So basically if in the entire word there is one letter that is camel-cased then it
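
To make the camel-case behaviour discussed in this thread easier to follow, here is a rough sketch of splitting a word at each lowercase-to-uppercase transition, which is the kind of boundary WordDelimiterFilterFactory reacts to when splitting on case change is enabled. This is an illustration only, not Lucene's implementation: it ignores digits, hyphens, catenation and every other option of the real filter, and the class name CaseChangeSplit is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of splitting on lower-to-upper case changes.
final class CaseChangeSplit {

    /** Splits the word before every uppercase letter that follows a lowercase letter. */
    static List<String> split(String word) {
        List<String> parts = new ArrayList<>();
        int start = 0;
        for (int i = 1; i < word.length(); i++) {
            boolean boundary = Character.isLowerCase(word.charAt(i - 1))
                    && Character.isUpperCase(word.charAt(i));
            if (boundary) {
                parts.add(word.substring(start, i));
                start = i;
            }
        }
        parts.add(word.substring(start)); // remaining tail (the whole word if no boundary)
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("speedPost")); // two parts
        System.out.println(split("speedpost")); // one part, no case change
    }
}
```

Under this simplified model, a word with a single case change is broken into two sub-words while an all-lowercase word stays whole, which is why the query side and the index side have to agree on how the parts are split or catenated.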

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-19 Thread vicky desai
Hi Aloke, I have multiple fields in my schema which are of type text. I tried the same case on all the fields, and it is not working for me on any of them. If possible, can you please post your dummy solrconfig.xml and schema.xml? I can replace them and check.

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-19 Thread vicky desai
Hi Aloke, After taking the schema.xml and solrconfig.xml with the changes you mentioned, it worked fine. However, simply making these changes in schema.xml doesn't work. So it seems like there is an issue in some configuration in solrconfig.xml. I will figure that out and post it here. Anyways thanks a

struggling with solr.WordDelimiterFilterFactory

2013-08-16 Thread vicky desai
Hi All, I have a query regarding the use of WordDelimiterFilterFactory. My schema definition for the text field is as follows: <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-16 Thread vicky desai
Hi Aloke, I am using the same analyzer for indexing as well as querying, so LowerCaseFilterFactory should work for both, right? -- View this message in context: http://lucene.472066.n3.nabble.com/struggling-with-solr-WordDelimiterFilterFactory-tp4085021p4085025.html

Re: struggling with solr.WordDelimiterFilterFactory

2013-08-16 Thread vicky desai
Hi, Another example I found is that q=Content:wi-fi doesn't match documents with the word wifi. I think it is not catenating the query keywords correctly -- View this message in context: http://lucene.472066.n3.nabble.com/struggling-with-solr-WordDelimiterFilterFactory-tp4085021p4085030.html

Re: Querying a specific core in solr cloud

2013-08-04 Thread vicky desai
Hi Erik, I did check the logs, and the request is going to ip1 if the core is not present on ip2. This should be a bug, right? -- View this message in context: http://lucene.472066.n3.nabble.com/Querying-a-specific-core-in-solr-cloud-tp4079964p4082454.html

Exception on solr Unload

2013-08-04 Thread vicky desai
Hi All, I am getting an exception on unloading a core from Solr. This happens only in the case where the core name and the collection name are the same. I am getting the below-mentioned exception when using SolrJ as well as the Solr admin UI. I have a configuration of 1 leader and 1 replica with the core

Re: Querying a specific core in solr cloud

2013-07-26 Thread vicky desai
Hi Erick, First of all, sorry for the late reply. The scenario is as follows: 1. Create a Solr setup on two machines (say ip1 and ip2) with shard=1 and an external ZooKeeper. 2. Now if I create a core x on the machine with ip1 only and use the query http://ip1:port1/solr/x/select?q=*:*&distrib=false
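
The query URL in step 2 needs an & between the q and distrib parameters. A small sketch of assembling such a non-distributed select URL with only the JDK is shown below; the class name, the method name, and the port 8983 used in the example call are placeholders rather than values from the original message.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper that builds a select URL restricted to the local core.
final class SelectUrl {

    /** Builds http://host:port/solr/core/select?q=...&distrib=false with q URL-encoded. */
    static String localOnly(String host, int port, String core, String query) {
        String q = URLEncoder.encode(query, StandardCharsets.UTF_8);
        return "http://" + host + ":" + port + "/solr/" + core
                + "/select?q=" + q + "&distrib=false";
    }

    public static void main(String[] args) {
        // "ip1" and "x" follow the scenario above; 8983 is an assumed port.
        System.out.println(localOnly("ip1", 8983, "x", "*:*"));
    }
}
```

Encoding the query value keeps characters such as the colon in *:* from being misread when the URL is parsed.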

Re: Querying a specific core in solr cloud

2013-07-25 Thread vicky desai
Hi Erik, Thanks for the reply. But does distrib=true work for replicas as well? As I mentioned earlier, I have a setup of 1 leader and 1 replica. If a core is up on either of the instances, querying both the instances gives me results even with distrib=false.

Re: Querying a specific core in solr cloud

2013-07-25 Thread vicky desai
Hi, I have also noticed that once I bring the core up on both the machines, distrib=false works well. Could this be a possible bug, that when a core is down on one instance distrib=false doesn't work?

Querying a specific core in solr cloud

2013-07-24 Thread vicky desai
Hi, I had a requirement wherein I wanted to query a specific core on a specific Solr instance. I found the following content in the Solr wiki: "Explicitly specify the addresses of shards you want to query: http://localhost:7574/solr/collection1/select?shards=localhost:7574/solr/collection1/" Now

Re: Querying a specific core in solr cloud

2013-07-24 Thread vicky desai
I am not bothered about the leader. I just want to check if a particular core is up on a particular Solr instance. My use case is as follows: I have to create a core on one instance and then there is some DB code. If after creating the core the DB action fails, then the entire task is repeated

Re: Is it possible to find a leader from a list of cores in solr via java code

2013-07-15 Thread vicky desai
Hi, I got the solution to the above problem. Sharing the code so that it could help people in future: PoolingClientConnectionManager poolingClientConnectionManager = new PoolingClientConnectionManager(); poolingClientConnectionManager.setMaxTotal(2);

Re: Is it possible to find a leader from a list of cores in solr via java code

2013-07-12 Thread vicky desai
Hi, As per the suggestions above, I shifted my focus to using CloudSolrServer. In terms of sending updates to the leaders and reducing network traffic it works great. But one problem I faced in using CloudSolrServer is that it opens too many connections, as many as five thousand. My

Re: Is it possible to find a leader from a list of cores in solr via java code

2013-07-06 Thread vicky desai
Hi Erik, I just wanted to clarify whether you got my concern right. If I send some documents to the replica core, won't it first have to send the documents to the leader core, which in turn would send them back to the replica cores? If yes, then this will lead to additional network traffic which can be

Is it possible to find a leader from a list of cores in solr via java code

2013-07-03 Thread vicky desai
Hi, I have a setup of 1 leader and 1 replica, and I have a requirement wherein I need to find the leader core from the collection. Is there an API in SolrJ by means of which this can be achieved?

Re: Is it possible to find a leader from a list of cores in solr via java code

2013-07-03 Thread vicky desai
Hi, I have a requirement wherein I want to write to the leader and read from the replica. The reason is that if a write request is sent to the replica, it relays it to the leader, and then the leader relays it to all the replicas. This will help me in saving some network traffic as my application

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
Hi Sandeep, I made the changes you mentioned and tested again for the same set of docs, but unfortunately the commit time increased. -- View this message in context: http://lucene.472066.n3.nabble.com/commit-in-solr4-takes-a-longer-time-tp4060396p4060622.html

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
Hi Gopal, I added the openSearcher parameter as mentioned by you, but on checking the logs I found that openSearcher was still true on commit. It is only when I removed the autoSoftCommit parameter that the openSearcher parameter worked and provided faster updates as well. However, I require soft commit in

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
My solrconfig.xml is as follows: <?xml version="1.0" encoding="UTF-8" ?> <config> <luceneMatchVersion>LUCENE_40</luceneMatchVersion> <indexConfig> <maxFieldLength>2147483647</maxFieldLength> <lockType>simple</lockType>

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
Hi All, the solution of setting the openSearcher flag to true worked, and it gave me a visible improvement in commit time. One thing to note is that while using the SolrJ client we have to call server.commit(false,false), which I was doing incorrectly and hence was not able to see the improvement earlier.

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
Hi, After using the following config: <updateHandler class="solr.DirectUpdateHandler2"> <autoSoftCommit> <maxDocs>500</maxDocs> <maxTime>1000</maxTime> </autoSoftCommit> <autoCommit>

Re: commit in solr4 takes a longer time

2013-05-03 Thread vicky desai
Hi, When an auto-commit operation is fired I am getting the following logs: INFO: start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false} Setting openSearcher to false definitely gave me a lot of performance improvement but

commit in solr4 takes a longer time

2013-05-02 Thread vicky desai
Hi all, I have recently migrated from Solr 3.6 to Solr 4.0. The documents in my core are getting constantly updated, so I issue a commit from code after every 10 thousand docs. However, moving from 3.6 to 4.0, I have noticed that for the same core size it takes about twice the time to commit in

Re: commit in solr4 takes a longer time

2013-05-02 Thread vicky desai
Hi, I am using 1 shard and two replicas. The document count is around 6 lakh (600,000). My solrconfig.xml is as follows: <?xml version="1.0" encoding="UTF-8" ?> <config> <luceneMatchVersion>LUCENE_40</luceneMatchVersion> <indexConfig> <maxFieldLength>2147483647</maxFieldLength>

Maximum number of facet queries in a single query

2013-04-30 Thread vicky desai
Hi, Is there any upper limit on the number of facet queries I can include in a single query? Also, is there any performance hit if I include too many facet queries in a single query? Any help would be appreciated.

Re: is phrase search possible in solr

2013-04-21 Thread vicky desai
Hi, Agreed it is a typo. And yes I can use one set of analyzers and tokenizers for query as well as indexing but that too will not solve my problem -- View this message in context: http://lucene.472066.n3.nabble.com/is-phrase-search-possible-in-solr-tp4057312p4057802.html

Re: is phrase search possible in solr

2013-04-21 Thread vicky desai
Hi Jack, Making a change in the schema, either the keyword tokenizer or the copy field option which you suggested, would require reindexing the entire data. Is there an option wherein, if I have a query in double quotes, it simply ignores all the tokenizers and analyzers?

Re: is phrase search possible in solr

2013-04-21 Thread vicky desai
Hi, If I use shinglingFilter then all types of queries will be impacted. I want queries within double quotes to be an exact search, but for queries without double quotes all analyzers and tokenizers should be applied. Is there a setting or a configuration in schema.xml which can cater to this

is phrase search possible in solr

2013-04-19 Thread vicky desai
I want to do a phrase search in Solr without analyzers being applied to it, e.g. if I search for *"DelhiDareDevil"* (i.e. with inverted commas) it should search for the exact text and not apply any analyzers or tokenizers on this field. However, if I search for *DelhiDareDevil* it should use tokenizers

Re: updateLog in Solr 4.2

2013-04-17 Thread vicky desai
If the updateLog tag is mandatory, then why is it given as a parameter in solrconfig.xml? I mean, by default it should always be writing update logs in my data directory even if I don't use the updateLog parameter in the config file. Also, the same config file works for Solr 4.0 but not Solr 4.2. I will be

updateLog in Solr 4.2

2013-04-12 Thread vicky desai
If I disable the update log in Solr 4.2 then I get the following exception: SEVERE: :java.lang.NullPointerException at org.apache.solr.cloud.ShardLeaderElectionContext.runLeaderProcess(ElectionContext.java:190) at

Solr Replication

2013-03-14 Thread vicky desai
Hi, I am using a Solr 4 setup. For backup purposes, once a day I start one additional Tomcat server with cores having empty data folders, which acts as a slave server. However, it does not replicate data from the master unless there is a commit on the master. Is there a possibility to pull

Re: Solr Replication

2013-03-14 Thread vicky desai
Hi, I have a multi-core setup and there are continuous updates going on in each core. Hence I don't prefer a backup, as it would either cause downtime or, if there is write activity during a backup, my backup will be corrupted. Can you please suggest if there is a cleaner way to handle this?