Re: SolR InvalidTokenOffsetsException with Highlighter and Synonyms

2013-04-16 Thread juancesarvillalba
Hi, I am using the stander highlighting. http://wiki.apache.org/solr/HighlightingParameters Cheers -- View this message in context: http://lucene.472066.n3.nabble.com/SolR-InvalidTokenOffsetsException-with-Highlighter-and-Synonyms-tp4053988p4056240.html Sent from the Solr - User mailing

Re: Usage of CloudSolrServer?

2013-04-16 Thread Furkan KAMACI
Hi Shawn; I am sorry but what kind of Load Balancing is that? I mean does it check whether some leaders are using much CPU or RAM etc.? I think a problem may occur at such kind of scenario: if some of leaders getting more documents than other leaders (I don't know how it is decided that into

Re: Empty Solr 4.2.1 can not create Collection

2013-04-16 Thread A.Eibner
Hi, sorry for pushing, but I just replayed the steps with solr 4.0 where everything works fine. Then I switched to solr 4.2.1 and replayed the exact same steps and the collection won't start and no leader will be elected. Any clues ? Should I try it on the developer mailing list, maybe it's a

Is cache useful for my scenario?

2013-04-16 Thread samabhiK
Hi, I am new in Solr and wish to use version 4.2.x for my app in production. I want to show hundreds and thousands of markers on a map with contents coming from Solr. As the user moves around the map and pans, the browser will fetch data/markers using a BBOX filter (based on the maps' viewport

Re: Usage of CloudSolrServer?

2013-04-16 Thread Upayavira
If you are accessing Solr from Java code, you will likely use the SolrJ client to do so. If your users are hitting Solr directly, you should think about whether this is wise - as well as providing them with direct search access, you are also providing them with the ability to delete your entire

first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
Hi, when we search with any new keyword at first time then solr 4.2.1 take to much time to give the result. we have 506 document is index in solr and it's size is 400GB. now when We search for keyword test it will take 1 min to give the response for 1 rows. we fire the query from the

SEVERE: shard update error StdNode on SolrCloud 4.2.1

2013-04-16 Thread Steve Woodcock
Hi We have a simple SolrCloud setup (4.2.1) running with a single shard and two nodes, and it's working fine except whenever we send an update request, the leader logs this error: SEVERE: shard update error StdNode: http://10.20.10.42:8080/solr/ts/:org.apache.solr.common.SolrException: Server at

Re: SolR InvalidTokenOffsetsException with Highlighter and Synonyms

2013-04-16 Thread Dmitry Kan
Could be a bug in the higlighter. But before claiming that, I would still play around different options, like hl.fragSize, hl.highlightMultiTerm. Also, have you considered storing synonyms in the index? On Tue, Apr 16, 2013 at 9:42 AM, juancesarvillalba juancesarvilla...@gmail.com wrote:

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Dmitry Kan
Hi, Things to google ;) 1. warmup queries 2. solr cache How much RAM does you index take now? Dmitry On Tue, Apr 16, 2013 at 1:22 PM, Montu v Boda montu.b...@highqsolutions.com wrote: Hi, when we search with any new keyword at first time then solr 4.2.1 take to much time to give the

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
Hi currently, my solr is deploy in tomcat1 and we have given 4GB memory of that tomcat Thanks Regards Montu v Boda -- View this message in context: http://lucene.472066.n3.nabble.com/first-time-with-new-keyword-solr-take-to-much-time-to-give-the-result-tp4056254p4056261.html Sent from the

Re: Storing Solr Index on NFS

2013-04-16 Thread Furkan KAMACI
Hi Walter; You said: It is not safe to share Solr index files between two Solr servers. Why do you think like that? 2013/4/16 Tim Vaillancourt t...@elementspace.com If centralization of storage is your goal by choosing NFS, iSCSI works reasonably well with SOLR indexes, although good

Re: Usage of CloudSolrServer?

2013-04-16 Thread Furkan KAMACI
Thanks for your detailed explanation. However you said: It will then choose one of those hosts/cores for each shard, and send a request to them as a distributed search request. Is there any document that explains of distributed search? What is the criteria for it? 2013/4/16 Upayavira

Re: Storing Solr Index on NFS

2013-04-16 Thread Yago Riveiro
Furkan, see this post. http://grokbase.com/t/lucene/solr-user/117t1eswyk/multiple-solr-servers-and-a-shared-index-again Cumprimentos -- Yago Riveiro Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Tuesday, April 16, 2013 at 12:15 PM, Furkan KAMACI wrote: Hi Walter; You said:

Solr 4.2.1 sorting by distance to polygon centre.

2013-04-16 Thread Guido Medina
Hi, I got everything in place, my polygons are indexing properly, I played a bit with LSP which helped me a lot, now, I have JTS 1.13 inside solr.war; here is my challenge: I have big polygon (A) which contains smaller polygons (B and C), B and C have some intersection, so if I search for a

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Toke Eskildsen
On Tue, 2013-04-16 at 12:22 +0200, Montu v Boda wrote: we have 506 document is index in solr and it's size is 400GB. now when We search for keyword test it will take 1 min to give the response for 1 rows. At this point, you have searched for other keywords before you measure on

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Dmitry Kan
In the admin page you can monitor the cache parameters, like eviction. If you cache evicts too much, you can increase its capacity. NOTE: this will affect on RAM consumption, so you would need to change the tomcat config too. On Tue, Apr 16, 2013 at 2:08 PM, Montu v Boda

Re: Some Questions About Using Solr as Cloud

2013-04-16 Thread Erick Erickson
Yes. Every node is really self-contained. When you send a doc to a cluster where each shard has a replica, the raw doc is sent to each node of that shard and indexed independently. About old docs, it's the same as Solr 3.6. Data associated with docs stays around in the index until it's merged

Re: Some Questions About Using Solr as Cloud

2013-04-16 Thread Furkan KAMACI
Hi Erick; Thanks for the explanation. You said: You cannot transfer just the indexed form of a document from one core to another, you have to re-index the doc. why do you think like that? 2013/4/16 Erick Erickson erickerick...@gmail.com Yes. Every node is really self-contained. When you send

Re: SolrException parsing error

2013-04-16 Thread Marc des Garets
Did you find anything? I have the same problem but it's on update requests only. The error comes from the solrj client indeed. It is solrj logging this error. There is nothing in solr itself and it does the update correctly. It's fairly small simple documents being updated. On 04/15/2013

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
Hi Thanks for info. we did the same thing but no effect for first time. what to do for first time query with new keyword? how we can make the query faster for first time with new keyword? say for ex if i try to search the text key word test first time then it will take to much time to

Re: Query Parser OR AND and NOT

2013-04-16 Thread Erick Erickson
The query language is NOT pure boolean. Hoss wrote this up: http://searchhub.org/2011/12/28/why-not-and-or-and-not/ Best Erick On Mon, Apr 15, 2013 at 12:54 PM, Roman Chyla roman.ch...@gmail.com wrote: Oh, sorry, I have assumed lucene query parser. I think SOLR qp must be different then,

how to display groups along with matching terms in solr auto-suggestion?

2013-04-16 Thread sharmila thapa
Hi, I have used Terms for auto-suggestion. But it just list the terms that matches terms.prefix from index , along with these term suggestions, I have to display the product groups that matches with the input prefix. Is it possible in solr auto-suggest? Somebody could please help me on this

SolrCloud Leader Response Mechanism

2013-04-16 Thread Furkan KAMACI
When a leader responses for a query, does it says that: If I have the data what I am looking for, I should build response with it, otherwise I should find it anywhere. Because it may be long to search it? or does it says I only index the data, I will tell it to other guys to build up the response

Function Query performance in combination with filters

2013-04-16 Thread Rogalon
Hi, I am using pretty complex function queries to completely customize (not only boost) the score of my result documents that are retrieved from an index of approx 10e7 documents. To get to an acceptable level of performance I combine my query with filters in the following way (very short

Re: Usage of CloudSolrServer?

2013-04-16 Thread Upayavira
I cannot say that I have researched it, but I have always taken it to be random. Upayavira On Tue, Apr 16, 2013, at 12:23 PM, Furkan KAMACI wrote: Thanks for your detailed explanation. However you said: It will then choose one of those hosts/cores for each shard, and send a request to them

terms starting with multilingual character don't list on solr auto-suggestion list

2013-04-16 Thread sharmila thapa
Hi, I have used /terms for solr auto-suggestion list. It works fine for English words. But I have problem on multi-language index words, I have tested for Russian language. If there is Russian charcter in between the word, then it gets displayed on suggesstion list like if I type 'кар', it list

Re: SolrException parsing error

2013-04-16 Thread Luis Lebolo
Turns out I spoke too soon. I was *not* sending the query via POST. Changing the method to POST solved the issue for me (maybe I was hitting a GET limit somewhere?). -Luis On Tue, Apr 16, 2013 at 7:38 AM, Marc des Garets m...@ttux.net wrote: Did you find anything? I have the same problem but

Solr 4.2 Startup Detects Corrupt Log And is Really Slow to Start

2013-04-16 Thread Umesh Prasad
Hi, We are migrating to Solr 4.2 from Solr 3.6 and Solr 4.2 is throwing Exception on Restart. What is More, it take a hell lot of Time ( More than one hour to get Up and Running) THE exception After Restart ... = Apr 16, 2013 4:47:31 PM

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Duncan Irvine
Are you actually trying to return 10,000 records, or is that the number of hits, and you're only retrieving the top 10? Cheers, Duncan. On 16 April 2013 12:39, Montu v Boda montu.b...@highqsolutions.com wrote: Hi Thanks for info. we did the same thing but no effect for first time.

Re: Function Query performance in combination with filters

2013-04-16 Thread Yonik Seeley
On Tue, Apr 16, 2013 at 7:51 AM, Rogalon nico.beche...@me.com wrote: Hi, I am using pretty complex function queries to completely customize (not only boost) the score of my result documents that are retrieved from an index of approx 10e7 documents. To get to an acceptable level of performance

Re: terms starting with multilingual character don't list on solr auto-suggestion list

2013-04-16 Thread Jack Krupansky
Can you share your auto-complete/suggestor configuration parameters? Including the search component. It sounds as if there is a field type with an analyzer that is mapping characters. -- Jack Krupansky -Original Message- From: sharmila thapa Sent: Tuesday, April 16, 2013 7:54 AM

Re: SolR InvalidTokenOffsetsException with Highlighter and Synonyms

2013-04-16 Thread juancesarvillalba
Hi, At moment, I am not considering store synonyms in the index, although is something that I have to do some time. Is strange that something common like multi-word synonyms have a bug with highligting but I couldn't find any solution. Thanks for your help. -- View this message in

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
hi we are trying to return 10,000 rows it is necessary to return 1 rows because from that 1, we are pick only top 100 record based on the user permission and permission is stored in database not on solr. and if we try to return 100 rows then it may possible that from the 100 rows, user

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Ahmet Arslan
Hi Montu, Regarding permissions, you may find this solution more elegant: http://searchhub.org/2012/02/22/custom-security-filtering-in-solr/ http://hokiesuns.blogspot.com/2012/11/using-solrs-postfiltering-to-collect.html --- On Tue, 4/16/13, Montu v Boda montu.b...@highqsolutions.com wrote:

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Raymond Wiker
On Tue, Apr 16, 2013 at 3:13 PM, Montu v Boda montu.b...@highqsolutions.com wrote: hi we are trying to return 10,000 rows it is necessary to return 1 rows because from that 1, we are pick only top 100 record based on the user permission and permission is stored in database not on

Re: Function Query performance in combination with filters

2013-04-16 Thread Rogalon
Am 16. April 2013 um 14:46 schrieb Yonik Seeley-4 [via Lucene] ml-node+s472066n4056299...@n3.nabble.com: On Tue, Apr 16, 2013 at 7:51 AM, Rogalon [hidden email] wrote: Hi, I am using pretty complex function queries to completely customize (not only boost) the score of my result

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Jack Krupansky
Why not just add a filter query for user permissions? -- Jack Krupansky -Original Message- From: Montu v Boda Sent: Tuesday, April 16, 2013 9:13 AM To: solr-user@lucene.apache.org Subject: Re: first time with new keyword, solr take to much time to give the result hi we are trying

Re: SolrException parsing error

2013-04-16 Thread Marc Des Garets
Problem solved for me as well. The client is running in tomcat and the connector had compression=true. I removed it and now it seems to work fine. On 04/16/2013 02:28 PM, Luis Lebolo wrote: Turns out I spoke too soon. I was *not* sending the query via POST. Changing the method to POST solved

Same Shards at Different Machines

2013-04-16 Thread Furkan KAMACI
Is it possible to use same shards at different machines at SolrCloud?

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
Hi problem is that the permission is frequently update in our system so that we have to update the index in the same manner other wise it will give wrong result. in that case i think the cache will get effect and the performance may be reduced. Thanks Regards Montu v Boda -- View this

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Montu v Boda
Hi problem is that the permission is frequently update in our system so that we have to update the index in the same manner other wise it will give wrong result. in that case i think the cache will get effect and the performance may be reduced. Thanks Regards Montu v Boda -- View this

Re: Solr 4.2.1 sorting by distance to polygon centre.

2013-04-16 Thread Smiley, David W.
Guido, The field type solr.SpatialRecursivePrefixTreeFieldType can only participate in distance reporting for indexed points, not other shapes. In fact, I recommend not attempting to get the distance if the field isn't purely indexed points, as it may get confused if it seems some small shapes.

Re: using maven to deploy solr on tomcat

2013-04-16 Thread Adeel Qureshi
the problem is i need to deploy it on servers where i dont know what the absolute path will be .. basically my goal is to load solr with a different set of configuration files based on the environment its in. Is there a a better different way to do this On Mon, Apr 15, 2013 at 11:29 PM, Shawn

Re: Solr 4.2.1 sorting by distance to polygon centre.

2013-04-16 Thread Guido Medina
David, I have been following your stackoverflow posts, I understand what you say, we decided to change the criteria and index an extra field (close to your suggestion), so the sorting will happen now by polygon area desc (Which induced another problem, calculation of polygon area on a

Re: Dynamic data model design questions

2013-04-16 Thread Marko Asplund
Shawn Heisey wrote: Solr does have some *very* limited capability for doing joins between indexes, but generally speaking, you need to flatten the data. thanks! So, using a dynamic schema I'd flatten the following JSON object graph { 'id':'xyz123', 'obj1': { 'child1': { 'prop1':

JavaScript transform switch statement during Data Import

2013-04-16 Thread paulblyth
Hello - I'm trying to add a switch statement into a JavaScript function that we use during an import; it's to replace an if else block that is becoming increasingly large. Bizarrely, the switch block is ignore entirely, and it doesn't have any effect whatsoever. Our version info: Solr

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Otis Gospodnetic
Hi, Have you considered ManifoldCF? Otis -- SOLR Performance Monitoring - http://sematext.com/spm/index.html On Tue, Apr 16, 2013 at 10:02 AM, Montu v Boda montu.b...@highqsolutions.com wrote: Hi problem is that the permission is frequently update in our system so that we have to update

Re: Solr 4.2.1 sorting by distance to polygon centre.

2013-04-16 Thread Guido Medina
David, I just peak it at github, the method will estimate well for our purpose, but depends on JTS which we included in our Solr server only, but we don't want LGPL libraries (v3) in our main project, kind of a show stopper, I understand is needed for spatial4j, Lucene and Solr in general,

Re: using maven to deploy solr on tomcat

2013-04-16 Thread Shawn Heisey
On 4/16/2013 8:47 AM, Adeel Qureshi wrote: the problem is i need to deploy it on servers where i dont know what the absolute path will be .. basically my goal is to load solr with a different set of configuration files based on the environment its in. Is there a a better different way to do this

Re: updateLog in Solr 4.2

2013-04-16 Thread Chris Hostetter
: : If i disable update log in solr 4.2 then i get the following exception : SEVERE: :java.lang.NullPointerException : at : org.apache.solr.cloud.ShardLeaderElectionContext.runLeaderProcess(ElectionContext.java:190) Hmmm.. if you don't have updateLog and you run in SolrCloud mode, solr

Re: Dynamic data model design questions

2013-04-16 Thread Jack Krupansky
'obj1/child1/prop1_ss' Try to stick to names that follow Java naming conventions: letter or underscore followed by letters, digits, and underscores. There are place in Solr which have limited rules for names because they support additional syntax. In this case, replace your slashes with

Re: Dynamic data model design questions

2013-04-16 Thread Shawn Heisey
On 4/16/2013 9:17 AM, Marko Asplund wrote: Shawn Heisey wrote: So, using a dynamic schema I'd flatten the following JSON object graph { 'id':'xyz123', 'obj1': { 'child1': { 'prop1': ['val1', 'val2', 'val3'] 'prop2': 123 } 'prop3': 'val4' }, 'obj2': {

Re: Solr restart is taking more than 1 hour

2013-04-16 Thread gpssolr2020
Thanks for detailed explanation. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-restart-is-taking-more-than-1-hour-tp4054165p4056355.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr 4.2.x replication events on slaves

2013-04-16 Thread Chris Hostetter
: In Solr 3.x, I was relying on a postCommit call to a listener in the update : handler to perform data update to caches, this data was used to perform : 'realtime' filtering on the documents. I can't find it at the moment, but IIRC this was a side effect of how snapshots are now loaded on

Re: CloudSolrServer vs ConcurrentUpdateSolrServer for indexing

2013-04-16 Thread J Mohamed Zahoor
It sure increased the performance . Thanks for the input. ./zahoor On 14-Apr-2013, at 10:13 PM, J Mohamed Zahoor zah...@indix.com wrote: Thanks.. Will try multithreading with CloudSolrServer. ./zahoor On 13-Apr-2013, at 9:11 PM, Mark Miller markrmil...@gmail.com wrote: On Apr 13,

Re: Troubles with solr replication

2013-04-16 Thread Chris Hostetter
: Also when I checked the solr log. : : [org.apache.solr.handler.SnapPuller] Master at: : http://192.168.2.204:8080/solr/replication is not available. Index fetch : failed. Exception: Connection refused : : : BTW, I was able to fetch the replication file with wget directly. Are you certian

zkState changes too often

2013-04-16 Thread J Mohamed Zahoor
Hi I am using SolrCloud (4.1) with 6 nodes. When i index the documents from the mapper and as the load increases.. i see these messages in my mapper logs… WHich looks like it is slowing down my indexing speed. 2013-04-16 06:04:18,013 INFO org.apache.solr.common.cloud.ZkStateReader: Updating

Document Missing from Share in Solr cloud

2013-04-16 Thread Cool Techi
Hi, We noticed a strange behavior in our solr cloud setup, we are using solr4.2 with 1:3 replication setting. We noticed that some of the documents were showing up in search sometimes and not at other, the reason being the document was not present in all the shards. We have restarted

Re: Document Missing from Share in Solr cloud

2013-04-16 Thread Timothy Potter
If you are using the default doc router for indexing in SolrCloud, then a document only exists in a single shard but can be replicated in that shard to any number of replicas. Can you clarify your question as it sounds like you're saying that the document is not replicated across all the replicas

Re: zkState changes too often

2013-04-16 Thread Mark Miller
Are you using a the concurrent low pause garbage collector or perhaps G1? Are you able to use something like visualvm to pinpoint what the bottleneck might be? Otherwise, keep raising the timeout. This means Solr and Zk are not able to talk for that much time - either something needs to be

RE: Document Missing from Share in Solr cloud

2013-04-16 Thread Cool Techi
That's what I am trying to say, the document is not replicated across all the replicas for a specific shard, hence the query show different results on every refresh. Date: Tue, 16 Apr 2013 11:34:18 -0600 Subject: Re: Document Missing from Share in Solr cloud From: thelabd...@gmail.com To:

Re: Solr 4.2.1 sorting by distance to polygon centre.

2013-04-16 Thread Smiley, David W.
Guido, I encourage you to try to open-source the shape-related code you have to Spatial4j. I realize that for some organizations, that can be really difficult. ~ David On 4/16/13 11:55 AM, Guido Medina guido.med...@temetra.com wrote: David, I just peak it at github, the method will

Re: Document Missing from Share in Solr cloud

2013-04-16 Thread Timothy Potter
Ok, that makes more sense and is definitely cause for concern. Do you have a sense for whether this is ongoing or happened a few times unexpectedly in the past? If ongoing, then will probably be easier to track down the root cause. On Tue, Apr 16, 2013 at 12:08 PM, Cool Techi

Re: updateLog in Solr 4.2

2013-04-16 Thread Mark Miller
Can you file a JIRA issue? - minimum you should get a better error. - Mark On Apr 12, 2013, at 9:17 AM, vicky desai vicky.de...@germinait.com wrote: If i disable update log in solr 4.2 then i get the following exception SEVERE: :java.lang.NullPointerException at

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Mark Miller
Leaders don't have much to do with querying - the node that you query will determine what other nodes it has to query to search the whole index and do a scatter/gather for you. (Though in some cases that request can be proxied to another node) - Mark On Apr 16, 2013, at 7:48 AM, Furkan KAMACI

solr 3.5 core rename issue

2013-04-16 Thread Jie Sun
We just tried to use .../solr/admin/cores?action=RENAMEcore=core0other=core5 to rename a core 'old' to 'new'. After the request is done, the solr.xml has new core name, and the solr admin shows the new core name in the list. But the index dir still has the old name as the directory name. I

Re: Document Missing from Share in Solr cloud

2013-04-16 Thread Timothy Potter
btw ... what is the field type of your unique ID field? On Tue, Apr 16, 2013 at 12:34 PM, Timothy Potter thelabd...@gmail.comwrote: Ok, that makes more sense and is definitely cause for concern. Do you have a sense for whether this is ongoing or happened a few times unexpectedly in the past?

Re: solr 3.5 core rename issue

2013-04-16 Thread Shawn Heisey
On 4/16/2013 2:02 PM, Jie Sun wrote: We just tried to use .../solr/admin/cores?action=RENAMEcore=core0other=core5 to rename a core 'old' to 'new'. After the request is done, the solr.xml has new core name, and the solr admin shows the new core name in the list. But the index dir still has the

Re: solr 3.5 core rename issue

2013-04-16 Thread Jie Sun
Hi Shawn, I do have persistent=true in my solr.xml: ?xml version=1.0 encoding=UTF-8 ? solr persistent=true cores adminPath=/admin/cores core name=default instanceDir=.// core name=413a instanceDir=.// core name=blah instanceDir=.// ... /cores /solr the command I ran was to rename

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Furkan KAMACI
Hi Mark; When I speak with proper terms I want to ask that: is there a data locality of spatial locality ( http://www.roguewave.com/portals/0/products/threadspotter/docs/2011.2/manual_html_linux/manual_html/ch_intro_locality.html - I mean if you have data on your machine, use it and don't search

Why indexing and querying performance is better at SolrCloud compared to older versions of Solr?

2013-04-16 Thread Furkan KAMACI
Is there any document that describes why indexing and querying performance is better at SolrCloud compared to older versions of Solr? I was examining that architecture to use: there will be a cloud of Solr that just do indexing and there will be another cloud that copies that indexes into them

Re: Pointing to Hbase for Docuements or Directly Saving Documents at Hbase

2013-04-16 Thread Furkan KAMACI
Hi Otis and Jack; I have made a research about highlights and debugged code. I see that highlight are query dependent and not stored. Why Solr uses Lucene for storing text, I mean i.e. content of a web page. Is there any comparison about to store texts at Hbase or any other databases versus

Re: Empty Solr 4.2.1 can not create Collection

2013-04-16 Thread Chris Hostetter
: sorry for pushing, but I just replayed the steps with solr 4.0 where : everything works fine. : Then I switched to solr 4.2.1 and replayed the exact same steps and the : collection won't start and no leader will be elected. : : Any clues ? : Should I try it on the developer mailing list, maybe

When a search query comes to a replica what happens?

2013-04-16 Thread Furkan KAMACI
I want to make it clear in my mind: When a search query comes to a replica what happens? -Does it forwards the search query to leader and leader collects all the data and prepares response (this will cause a performance issue because leader is responsible for indexing at same time) or - replica

How SolrCloud Balance Number of Documents at each Shard?

2013-04-16 Thread Furkan KAMACI
Is it possible that different shards have different number of documents or does SolrCloud balance them? I ask this question because I want to learn the mechanism behind how Solr calculete hash value of the identifier of the document. Is it possible that hash function produces more documents into

Re: When a search query comes to a replica what happens?

2013-04-16 Thread Otis Gospodnetic
Hi, No, I believe redirect from replica to leader would happen only at index time, so a doc first gets indexed to leader and from there it's replicated to non-leader shards. At query time there is no redirect to leader, I imagine, as that would quickly turn leaders into hotspots. Otis -- Solr

Re: How SolrCloud Balance Number of Documents at each Shard?

2013-04-16 Thread Otis Gospodnetic
They won't be exact, but should be close. Are you seeing some *big* differences? Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 6:11 PM, Furkan KAMACI furkankam...@gmail.com wrote: Is it possible that different shards have different number of documents or

Re: Pointing to Hbase for Docuements or Directly Saving Documents at Hbase

2013-04-16 Thread Otis Gospodnetic
People do use other data stores to retrieve data sometimes. e.g. Mongo is popular for that. Like I hinted in another email, I wouldn't necessarily recommend this for common cases. Don't do it unless you really know you need it. Otherwise, just store in Solr. Otis -- Solr ElasticSearch Support

Re: Why indexing and querying performance is better at SolrCloud compared to older versions of Solr?

2013-04-16 Thread Otis Gospodnetic
Correct. With SolrCloud you typically don't need to make this separation (with ElasticSearch one can designate some nodes as non-data nodes). SolrCloud won't necessarily always be faster because it typically involves sharding and thus a distributed search, while some non-SolrCloud setups can

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Otis Gospodnetic
If query comes to shard X on some node and this shard X is NOT a leader, but HAS data, it will just execute the query. If it needs to query shards on other nodes, it will have the info about which shards to query and will just do that and aggregate the results. It doesn't have to ask leader for

Re: When a search query comes to a replica what happens?

2013-04-16 Thread Furkan KAMACI
All in all will replica ask to its leader about where is remaining of data or it directly asks to Zookeper? 2013/4/17 Otis Gospodnetic otis.gospodne...@gmail.com Hi, No, I believe redirect from replica to leader would happen only at index time, so a doc first gets indexed to leader and from

Re: When a search query comes to a replica what happens?

2013-04-16 Thread Otis Gospodnetic
No. Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 6:23 PM, Furkan KAMACI furkankam...@gmail.com wrote: All in all will replica ask to its leader about where is remaining of data or it directly asks to Zookeper? 2013/4/17 Otis Gospodnetic

Re: How SolrCloud Balance Number of Documents at each Shard?

2013-04-16 Thread Furkan KAMACI
Hi Otis; Firstly thanks for your answers. So do you mean that hashing mechanism will randomly route a document into a randomly shard? I want to ask it because I consider about putting a load balancer in front of my SolrCloud and manually route some documents into some other shards to avoid

Re: Push/pull model between leader and replica in one shard

2013-04-16 Thread Otis Gospodnetic
Hi, Replication when everything is working well is push: * request comes to any node, ideally leader * doc is indexed on leader * doc is copied to replicas If replica falls too far behind (not exactly sure what the too far threshold is), it uses pull to replicate the whole index from leader.

Re: Pointing to Hbase for Docuements or Directly Saving Documents at Hbase

2013-04-16 Thread Furkan KAMACI
Thanks again for your answer. If I find any document about such comparisons that I would like to read. By the way, is there any advantage for using Lucene instead of anything else as like that: Using Lucene is naturally supported at Solr and if I use anything else I may face with some

Re: How SolrCloud Balance Number of Documents at each Shard?

2013-04-16 Thread Otis Gospodnetic
Hi, Routing is not random... have a look at https://issues.apache.org/jira/browse/SOLR-2341 . In short, you shouldn't have to route manually from your app. Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 6:26 PM, Furkan KAMACI furkankam...@gmail.com wrote:

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Furkan KAMACI
Hi Otis; You said: It can just do it because it knows where things are. Does it learn it from Zookeeper? 2013/4/17 Otis Gospodnetic otis.gospodne...@gmail.com If query comes to shard X on some node and this shard X is NOT a leader, but HAS data, it will just execute the query. If it needs

Re: Pointing to Hbase for Docuements or Directly Saving Documents at Hbase

2013-04-16 Thread Otis Gospodnetic
Use Solr. It's pretty clear you don't yet have any problems that would make you think about alternatives. Using Solr to store and not just index will make your life simpler (and your app simpler and likely faster). Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16,

Re: How do I recover the position and offset a highlight for solr (4.1/4.2)?

2013-04-16 Thread P Williams
Hi, It doesn't have the offset information, but checkout my patch https://issues.apache.org/jira/browse/SOLR-4722 which outputs the position of each term that's been matched. I'm eager to get some feedback on this approach and any improvements that might be suggested. Cheers, Tricia On Wed,

Re: how to display groups along with matching terms in solr auto-suggestion?

2013-04-16 Thread Otis Gospodnetic
Hi, Try Solr Suggester, though I'm not sure if you can group with it. tried http://search-lucene.com/?q=suggester+groupfc_project=Solr but it doesn't seem to yield much. If you need to group suggestions like what you see on http://search-lucene.com/ for example, we use our own AC from

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Otis Gospodnetic
Oui, ZK holds the map. Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 6:33 PM, Furkan KAMACI furkankam...@gmail.com wrote: Hi Otis; You said: It can just do it because it knows where things are. Does it learn it from Zookeeper? 2013/4/17 Otis

Re: Some Questions About Using Solr as Cloud

2013-04-16 Thread Otis Gospodnetic
See https://issues.apache.org/jira/browse/SOLR-4532 https://issues.apache.org/jira/browse/SOLR-1535 https://issues.apache.org/jira/browse/SOLR-4619 Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 7:37 AM, Furkan KAMACI furkankam...@gmail.com wrote: Hi Erick;

Re: SolrCloud Leader Response Mechanism

2013-04-16 Thread Furkan KAMACI
Replica asks to Zookeper and Leader does not do anything. Thanks for your answer Otis. 2013/4/17 Otis Gospodnetic otis.gospodne...@gmail.com Oui, ZK holds the map. Otis -- Solr ElasticSearch Support http://sematext.com/ On Tue, Apr 16, 2013 at 6:33 PM, Furkan KAMACI

Re: Storing Solr Index on NFS

2013-04-16 Thread Otis Gospodnetic
Yesterday, we spent 1 hour with a client looking at their cluster's performance metrics SPM, their indexing logs, etc. trying to figure out why some indexing was slower than it should have been. We traced issues to network hickups, to VMs that would move from host to host, etc. Really fancy and

Re: Is cache useful for my scenario?

2013-04-16 Thread Otis Gospodnetic
Hi Sam, Sounds like you may want to disable caches, yes. But instead of guessing, just look at the stats and based on that configure your caches. You can get stats from Solr Admin page or, if you need long-term stats and performance patterns, use SPM for Solr or something similar. Otis -- Solr

Re: solr 3.5 core rename issue

2013-04-16 Thread Shawn Heisey
On 4/16/2013 2:39 PM, Jie Sun wrote: Hi Shawn, I do have persistent=true in my solr.xml: ?xml version=1.0 encoding=UTF-8 ? solr persistent=true cores adminPath=/admin/cores core name=default instanceDir=.// core name=413a instanceDir=.// core name=blah instanceDir=.// ...

Re: Storing Solr Index on NFS

2013-04-16 Thread Furkan KAMACI
I don't want to bother but I try to understand that part: When yo perform a commit in solr you have (for an instant) two versions of the index. The commit produces new segments (with new documents, new deletions, etc). After creating these new segments a new index searcher is created and its

Re: Push/pull model between leader and replica in one shard

2013-04-16 Thread Mark Miller
On Apr 16, 2013, at 1:36 AM, SuoNayi suonayi2...@163.com wrote: Hi, can someone explain more details about what model is used to sync docs between the lead and replica in the shard? The model can be push or pull.Supposing I have only one shard that has 1 leader and 2 replicas, when the

Re: Push/pull model between leader and replica in one shard

2013-04-16 Thread Furkan KAMACI
Really nice presentation. 2013/4/17 Mark Miller markrmil...@gmail.com On Apr 16, 2013, at 1:36 AM, SuoNayi suonayi2...@163.com wrote: Hi, can someone explain more details about what model is used to sync docs between the lead and replica in the shard? The model can be push or

Re: Is cache useful for my scenario?

2013-04-16 Thread Chris Hostetter
: There will be a lot of data that will be indexed in Solr. My question is, : does caching help in my case? As the filter queries will vary for almost all : users ( because the viewport latitude/longitude would vary), in what ways : can I use Caching to increase performance. Should I completely

  1   2   >