Re: Performance on faceting using docValues

2015-03-05 Thread Mikhail Khludnev
Hello, I have one thought off the top of my head: would you mind showing a brief snapshot from a sampler? On Thu, Mar 5, 2015 at 10:18 PM, lei simpl...@gmail.com wrote: Hi there, I'm testing facet performance with vs without docValues in Solr 4.7, and found that on first request, performance

Re: Performance on faceting using docValues

2015-03-05 Thread lei
Here are the specs for an example query faceting on three fields (all string type): first call: 1+ sec (with docValues) vs. 4+ sec (w/o docValues); subsequent calls: 30+ ms (with docValues) vs. 100+ ms (w/o docValues) consistently; the total # of docs returned is around 600,000. On Thu, Mar 5,

Performance on faceting using docValues

2015-03-05 Thread lei
Hi there, I'm testing facet performance with vs without docValues in Solr 4.7, and found that on first request, performance with docValues is much faster than non-docValues. However, for subsequent requests (where the queries are cached), the performance is slower for docValues than

Re: Performance on faceting using docValues

2015-03-05 Thread lei
There was a mistake in the previous email. Here are the specs for an example query faceting on three fields (all string type): first call: 1+ sec (with docValues) vs. 4+ sec (w/o docValues); subsequent calls: 100+ ms (with docValues) vs. 30+ ms (w/o docValues) consistently; the total # of docs returned is
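The cold-versus-warm comparison reported in this thread can be reproduced with a small timing helper. This is only a sketch: `time_calls` and `speedup` are hypothetical names, and the callable passed in stands for whatever issues the facet request against your own Solr instance.

```python
import statistics
import time


def time_calls(fn, repeats=5):
    """Return (first_call_seconds, median_seconds_of_later_calls).

    The first call approximates the cold (uncached) request; the median of
    the remaining calls approximates the warm (cached) request.
    """
    if repeats < 2:
        raise ValueError("need at least two calls to compare cold and warm timings")
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return durations[0], statistics.median(durations[1:])


def speedup(baseline_seconds, candidate_seconds):
    """How many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds
```

Taking the rounded figures from the corrected message as rough lower bounds, `speedup(4.0, 1.0)` gives 4.0 for the first call (docValues about four times faster), while `speedup(0.030, 0.100)` gives about 0.3 for subsequent calls (docValues roughly three times slower).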

RE: Performance on faceting using docValues

2015-03-05 Thread Ryan, Michael F. (LNG-DAY)
This is consistent with my experience. DocValues is faster for the first call (compared to UnInvertedField, which is what is used when there are no DocValues), but is slower on subsequent calls. I'm curious about this as well, since I haven't heard anyone else mention it before you. I

Re: Solrcloud Index corruption

2015-03-05 Thread Martin de Vries
Hi Erick, Thank you for your detailed reply. You say in our case some docs didn't make it to the node, but that's not really true: the docs can be found on the corrupted nodes when I search on ID. The docs are also complete. The problem is that the docs do not appear when I filter on certain

Re: How to start solr in solr cloud mode using external zookeeper ?

2015-03-05 Thread shamik
The other way you can do that is to specify the startup parameters in solr.in.sh. Example: SOLR_MODE=solrcloud ZK_HOST=zoohost1:2181,zoohost2:2181,zoohost3:2181 SOLR_PORT=4567 You can then simply start Solr by running ./solr start

RE: Solrcloud Index corruption

2015-03-05 Thread Garth Grimm
For updates, the document will always get routed to the leader of the appropriate shard, no matter what server first receives the request. -Original Message- From: Martin de Vries [mailto:mar...@downnotifier.com] Sent: Thursday, March 05, 2015 4:14 PM To: solr-user@lucene.apache.org

Re: solr cloud does not start with many collections

2015-03-05 Thread Damien Kamerman
I've tried a few variations, with 3 x ZK, 6 X nodes, solr 4.10.3, solr 5.0 without any success and no real difference. There is a tipping point at around 3,000-4,000 cores (varies depending on hardware) from where I can restart the cloud OK within ~4min, to the cloud not working and continuous

Re: Solrcloud Index corruption

2015-03-05 Thread Mark Miller
If you google "replication can cause index corruption", there are two JIRA issues that are the most likely cause of corruption in a SolrCloud env. - Mark On Mar 5, 2015, at 2:20 PM, Garth Grimm garthgr...@averyranchconsulting.com wrote: For updates, the document will always get routed to

Re: Solrcloud Index corruption

2015-03-05 Thread Shawn Heisey
On 3/5/2015 3:13 PM, Martin de Vries wrote: I understand there is not a master in SolrCloud. In our case we use haproxy as a load balancer for every request. So when indexing every document will be sent to a different solr server, immediately after each other. Maybe SolrCloud is not able to

Re: Admin UI doesn't show logs?

2015-03-05 Thread Shawn Heisey
On 3/5/2015 6:01 PM, Jakov Sosic wrote: I'm running 4.10.3 under tomcat 7, and I have an issue with Admin UI. When I click on a Logging - I don't see actual entries but only: No Events available The logging tab in the admin UI only shows log entries where the severity of the log is at

Admin UI doesn't show logs?

2015-03-05 Thread Jakov Sosic
Hi, I'm running 4.10.3 under Tomcat 7, and I have an issue with the Admin UI. When I click on Logging, I don't see actual entries but only "No Events available" and a round icon spinning nonstop. When I click on Level, I see the same icon and the message "Loading". Is there a hint or

Re: Admin UI doesn't show logs?

2015-03-05 Thread Alexandre Rafalovitch
And given that you configured it under Tomcat, I'd check that the logs are generated at all first. Just as a sanity check. Regards, Alex. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 5 March 2015 at 20:15, Shawn Heisey

Re: How to start solr in solr cloud mode using external zookeeper ?

2015-03-05 Thread Aman Tandon
Thanks shamik :) With Regards Aman Tandon On Fri, Mar 6, 2015 at 3:30 AM, shamik sham...@gmail.com wrote: The other way you can do that is to specify the startup parameters in solr.in.sh. Example : SOLR_MODE=solrcloud ZK_HOST=zoohost1:2181,zoohost2:2181,zoohost3:2181 SOLR_PORT=4567

Re: SOLR query parameters

2015-03-05 Thread Erick Erickson
Whew! I was afraid that my memory was failing since I'd no memory of ever seeing anything remotely like that! Erick On Thu, Mar 5, 2015 at 6:04 AM, phi...@free.fr wrote: Please ignore my question. These are form field names which I created a couple of months ago, not SOLR query parameters.

Re: Help needed to understand zookeeper in solrcloud

2015-03-05 Thread Julian Perry
I start out with 5 zk's. All good. One zk fails - I'm left with four. Are they guaranteed to split 4/0 or 3/1 - because if they split 2/2 I'm screwed, right? Surely to start with 5 zk's (or in fact any odd number - it could be 21 even), and from a single failure you drop to an even number -

Re: Help needed to understand zookeeper in solrcloud

2015-03-05 Thread svante karlsson
The network will only split if you get errors on your network hardware (or fiddle with iptables). Let's say you placed your zookeepers in separate racks and someone pulls the network cable between them - that will leave you with 5 working servers but they can't reach each other. This is split brain
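The arithmetic behind this question can be sketched as follows, assuming ZooKeeper's standard majority quorum over the configured ensemble size (no weighted or hierarchical quorums). The function names are hypothetical and only illustrate the counting.

```python
def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def has_quorum(reachable, ensemble_size):
    """True if a group of mutually reachable servers can keep serving."""
    return reachable >= quorum_size(ensemble_size)


def partitions_with_quorum(partition_sizes, ensemble_size):
    """Return the partition sizes that still hold a majority.

    The majority is counted against the configured ensemble size,
    not against the number of servers that happen to be alive.
    """
    return [size for size in partition_sizes if has_quorum(size, ensemble_size)]
```

With 5 configured servers the quorum is 3, and it stays 3 after one server fails. A 4/0 or 3/1 split of the remaining four leaves one side with a quorum, while a 2/2 split leaves neither side with one: the ensemble stops serving until connectivity returns, but two sides can never both hold a majority.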

Re: How to start solr in solr cloud mode using external zookeeper ?

2015-03-05 Thread Aman Tandon
Thanks Erick. For anyone else who gets stuck in the same situation, here is the solution. If you are able to run the remote/local ZooKeeper ensemble, then you can create the Solr cluster by the following method. Suppose you have a ZooKeeper ensemble of 3 ZooKeeper servers running on three

Re: [ANNOUNCE] Apache Solr 4.10.4 released

2015-03-05 Thread Oded Sofer
Hello Mike, How are you? This is Oded Sofer from IBM Guardium. We have moved to SolrCloud, and I thought you might be able to help me find something. Facet search is very slow, and I do not know how to check the size of our facets (GB / count). Do you know how I can check it? On

Re: Issue while enabling clustering/integrating carrot2 with solr 4.4.0 and tomcat under ubuntu

2015-03-05 Thread Erick Erickson
Class cast exceptions are usually the result of having a mix of old and new jars in your classpath, or even of having the same jar in two different places. Is this possible here? Best, Erick On Wed, Mar 4, 2015 at 6:44 PM, sthita sthit...@gmail.com wrote: 1.My solr.xml ?xml version=1.0

Re: Solrcloud Index corruption

2015-03-05 Thread Erick Erickson
Wait up. There's no master index in SolrCloud. Raw documents are forwarded to each replica, indexed and put in the local tlog. If a replica falls too far out of synch (say you take it offline), then the entire index _can_ be replicated from the leader and, if the leader's index was incomplete then

Labels for facets on Velocity

2015-03-05 Thread Henrique O. Santos
Hello, I've been trying to have a pretty name for my facets on Velocity Response Writer. Do you know how I can do that? For example, suppose that I am faceting field1. My query returns 3 facets: uglyfacet1, uglyfacet2 and uglyfacet3. I want to show them to the user with a pretty name, like Pretty

RE: Cores and and ranking (search quality)

2015-03-05 Thread Markus Jelsma
Hello - faceting will be the same, and distributed MoreLikeThis is also possible since 5.0, and there is a working patch for 4.10.3. Regular search will work as well since 5.0 because of distributed IDF, which you need to enable manually. Behaviour will not be the same if you rely on average

SOLR query parameters

2015-03-05 Thread phiroc
Hello, could someone please explain what these SOLR query parameter keywords stand for: - ppcdb - srbycb - as For instance, http://searcharchives.iht.com:8983/solr/inytapdf0/browse?ppdcb=&srbycb=&as=&q=kaiser&sort= I could not find them in the SOLR documentation. Many thanks. Philippe

Re: SOLR query parameters

2015-03-05 Thread phiroc
Please ignore my question. These are form field names which I created a couple of months ago, not SOLR query parameters. Philippe - Original Message - From: phi...@free.fr To: solr-user@lucene.apache.org Sent: Thursday, 5 March 2015 14:54:26 Subject: SOLR query parameters Hello, could someone

Re: Cores and and ranking (search quality)

2015-03-05 Thread Toke Eskildsen
On Thu, 2015-03-05 at 14:34 +0100, johnmu...@aol.com wrote: My question is this: if I put my data in multiple cores and use distributed search will the ranking be different if I had all my data in a single core? Yes, it will be different. The practical impact depends on how homogeneous your

[ANNOUNCE] Apache Solr 4.10.4 released

2015-03-05 Thread Michael McCandless
October 2014, Apache Solr™ 4.10.4 available The Lucene PMC is pleased to announce the release of Apache Solr 4.10.4 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-05 Thread Erick Erickson
I would, BTW, either just get rid of the maxBufferedDocs altogether or make it much higher, i.e. 10. I don't think this is really your problem, but you're creating a lot of segments here. But I'm kind of at a loss as to what would be different about your setup. Is there _any_ chance that

Parsing cluster result's docs

2015-03-05 Thread Jorge Lazo
Hi, I have a Solr instance using the clustering component (with the Lingo algorithm) working perfectly. However, when I get back the cluster results, only the IDs of the documents come back with them. What is the easiest way to retrieve full documents instead? Should I parse these IDs into a new query

Solrcloud Index corruption

2015-03-05 Thread Martin de Vries
Hi, We have index corruption on some cores on our Solrcloud running version 4.8.1. The index is corrupt on several servers. (for example: when we do an fq search we get results on some servers, on other servers we don't, while the stored document contains the field on all servers). A full

Re: Solrcloud Index corruption

2015-03-05 Thread Andrew Butkus
We had a similar issue. When this happened, we did a fetchindex on each out-of-sync core to put them right again. On 5 Mar 2015, at 14:40, Martin de Vries mar...@downnotifier.com wrote: Hi, We have index corruption on some cores on our Solrcloud running version

RE: Solrcloud Index corruption

2015-03-05 Thread Andrew Butkus
Force a fetchindex on slave from master command: http://slave_host:port/solr/replication?command=fetchindex - from http://wiki.apache.org/solr/SolrReplication The above command will download the whole index from master to slave, there are configuration options in solr to make this problem
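The replication command quoted above can be assembled programmatically. This is a minimal sketch: `fetchindex_url` is a hypothetical helper, and the default handler path simply mirrors the URL in the message; a multi-core setup would normally include the core name in that path, which is an assumption to adjust for your deployment.

```python
from urllib.parse import urlencode


def fetchindex_url(host, port, handler_path="/solr/replication"):
    """Build the URL that asks a slave to pull the index from its master.

    handler_path is an assumption taken from the URL quoted in the thread;
    change it if your replication handler lives under a core name.
    """
    query = urlencode({"command": "fetchindex"})
    return f"http://{host}:{port}{handler_path}?{query}"
```

For example, `fetchindex_url("slave_host", 8983)` yields `http://slave_host:8983/solr/replication?command=fetchindex`, which can then be requested with `urllib.request.urlopen` or any HTTP client.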

Cores and and ranking (search quality)

2015-03-05 Thread johnmunir
Hi, I have data that I will index and search on. This data is well defined, such that I can index it into a single core or multiple cores like so: core_1:Jan2015, core_2:Feb2015, core_3:Mar2015, etc. My question is this: if I put my data in multiple cores and use distributed search will the

RE: Solrcloud Index corruption

2015-03-05 Thread Martin de Vries
Hi Andrew, Even our master index is corrupt, so I'm afraid this won't help in our case. Martin Andrew Butkus wrote on 05.03.2015 16:45: Force a fetchindex on slave from master command: http://slave_host:port/solr/replication?command=fetchindex - from

Re: Performance on faceting using docValues

2015-03-05 Thread Toke Eskildsen
On Thu, 2015-03-05 at 21:14 +0100, lei wrote: You present a very interesting observation. I have not noticed what you describe, but on the other hand we have not done comparative speed tests. q=*:*&fq=country:US&fq=category:112 First observation: Your query is '*:*', which is a magic query.
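The request discussed here combines a match-all q with two fq filters (country and category). A small sketch of how such a query string can be built with repeated fq parameters follows; `build_query` is a hypothetical helper, and the facet parameters from the original test are not shown in the thread, so they are left out.

```python
from urllib.parse import urlencode


def build_query(q, filters):
    """Build a Solr query string with one fq parameter per filter.

    A list of pairs is used instead of a dict so that the fq key can repeat.
    '*' and ':' are kept unescaped only for readability.
    """
    params = [("q", q)] + [("fq", f) for f in filters]
    return urlencode(params, safe="*:")
```

Calling `build_query("*:*", ["country:US", "category:112"])` returns `q=*:*&fq=country:US&fq=category:112`.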

Re: problem with tutorial

2015-03-05 Thread gaohang wang
Do you publish your Solr in Tomcat? Which is the Tomcat port? 2014-12-16 15:45 GMT+08:00 Xin Cai xincai2...@gmail.com: hi Everyone I am a complete noob when it comes to Solr and when I try to follow the tutorial and run Solr I get the error message Waiting to see Solr listening on port 8983