Re: Run Solr 5.3.0 as a Service on Windows using NSSM

2015-10-06 Thread Zheng Lin Edwin Yeo
Hi Adrian, I've waited for more than 5 minutes and most of the time when I refresh it says that the page cannot be found. Got one or twice the main Admin page is loaded, but none of the cores are loaded. I have 20 cores which I'm loading. The core are of various sizes, but the maximum one is

Re: Pressed optimize and now SOLR is not indexing while optimize is going on

2015-10-06 Thread Siddhartha Singh Sandhu
Nice. Will port it onto an SSD. A have a few questions about optimize. Is the search index fully searchable after a commit? How much time does one have to wait in case of a hard commit for the index to be available? I have an index of 180G. Do I need to hit the optimize on this chunk. This is

Solr cross core join special condition

2015-10-06 Thread Ali Nazemian
I was wondering how can I overcome this query requirement in Solr 5.2.1: I have two different Solr cores refer as "core1" and "core2". core1 has some fields such as field1, field2 and field3 and core2 has some other fields such as field1, field4 and field5. I am looking for Solr query which can

Re: Solr cross core join special condition

2015-10-06 Thread Mikhail Khludnev
Hello, Why do you need sibling core fields? do you facet? or just want to enrich result page with them? On Tue, Oct 6, 2015 at 6:04 PM, Ali Nazemian wrote: > I was wondering how can I overcome this query requirement in Solr 5.2.1: > > I have two different Solr cores

Re: indexing data to solrcloud with "implicit" is not distributing across cluster.

2015-10-06 Thread Shawn Heisey
On 10/6/2015 7:58 AM, Steve wrote: > I’ve been unable to get solrcloud to distribute data across 4 solr nodes > with the “route.name=implicit” feature of the collections API. > > The nodes are live, and the graphs are green. All the data (the “Films” > example data) shows up on one node, the

Re: Cannot connect to a zookeeper 3.4.6 instance via zkCli.cmd

2015-10-06 Thread Shawn Heisey
On 10/6/2015 3:38 AM, Adrian Liew wrote: > Thanks for the reply. Looks like this has been resolved by manually starting > the Zookeeper services on each server promptly so that the tickTime value > does not timeout too quickly to heartbeat other peers. Hence, I increased the > tickTime value to

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Bill Dueber
Just to add...my informal tests show that batching has way more effect than solrj vs json. I haven't look at CUSC in a while, last time I looked it was impossible to do anything smart about error handling, so check that out before you get too deeply into it. We use a strategy of sending a

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Alessandro Benedetti
mm one broken document in a batch should not break the entire batch , right ( whatever approach used) ? Are you referring to the fact that you want to programmatically re-index the broken docs ? Would be interesting to return the id of the broken docs along with the solr update response!

Re: Solr cross core join special condition

2015-10-06 Thread Ali Nazemian
Dear Mikhail, Hi, I want to enrich the result. Regards On Oct 6, 2015 7:07 PM, "Mikhail Khludnev" wrote: > Hello, > > Why do you need sibling core fields? do you facet? or just want to enrich > result page with them? > > On Tue, Oct 6, 2015 at 6:04 PM, Ali Nazemian

Re: Filter first-components result in solr.SearchHandler

2015-10-06 Thread Erik Hatcher
Seems like Solr’s QueryElevationComponent is what would suit your needs here. Or, perhaps, adding something like this to your request: bq={!terms f=id}3,5,6,8,9 — Erik Hatcher, Senior Solutions Architect http://www.lucidworks.com > On Oct 6, 2015, at 7:45 AM,

Re: Solr cross core join special condition

2015-10-06 Thread Mikhail Khludnev
On Wed, Oct 7, 2015 at 7:05 AM, Ali Nazemian wrote: > it > seems there is not any way to do that right now and it should be developed > somehow. Am I right? > yep -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics

hi

2015-10-06 Thread John
please unsubscribe me

Run Solr 5.3.0 as a Service on Windows using NSSM

2015-10-06 Thread Zheng Lin Edwin Yeo
Hi, I tried to follow this to start my Solr as a service using NSSM. http://www.norconex.com/how-to-run-solr5-as-a-service-on-windows/ Everything is fine when I start the services under Component Services. However, when I tried to point to the Solr Admin page, it says that the page cannot be

RE: Cannot connect to a zookeeper 3.4.6 instance via zkCli.cmd

2015-10-06 Thread Adrian Liew
Hi Shawn, Thanks for the reply. Understood your comments and will revert back to the defaults. However, I raised this issue because I realized that Zookeeper becomes impatient if it cannot heartbeat its other peers in time. So for example, if 1 ZK server goes down out of 3 ZK servers, the 1 ZK

Re: Solr cross core join special condition

2015-10-06 Thread Ali Nazemian
Yeah, but child document transformer is used for nested document inside single core but I am looking for multiple core result joining. Then it seems there is not any way to do that right now and it should be developed somehow. Am I right? Regards. On Oct 6, 2015 9:53 PM, "Mikhail Khludnev"

RE: Run Solr 5.3.0 as a Service on Windows using NSSM

2015-10-06 Thread Adrian Liew
Hi Edwin, I have setup NSSM on Solr 5.3.0 in an Azure VM and can start up Solr with a base standalone installation. You may have to give Solr some time to bootstrap things and wait for the page to reload. Are you still seeing the page after 1 minute or so? What are your core sizes? And how

If zookeeper is down, SolrCloud nodes will not start correctly, even if zookeeper is started later

2015-10-06 Thread Adrian Liew
Changing subject header. I am encountering this issue in Solr 5.3.0 whereby I am getting haywired leader election using SolrCloud. I am using NSSM 2.24 to startup my solr services with zookeeper set as a dependency. For example, if I have three servers marked as 10.0.0.4, 10.0.0.5 and

Re: Solr cross core join special condition

2015-10-06 Thread Mikhail Khludnev
thus, something like [child] https://cwiki.apache.org/confluence/display/solr/Transforming+Result+Documents can be developed. On Tue, Oct 6, 2015 at 6:45 PM, Ali Nazemian wrote: > Dear Mikhail, > Hi, > I want to enrich the result. > Regards > On Oct 6, 2015 7:07 PM,

Re: Pressed optimize and now SOLR is not indexing while optimize is going on

2015-10-06 Thread Shawn Heisey
On 10/6/2015 8:18 AM, Siddhartha Singh Sandhu wrote: > A have a few questions about optimize. Is the search index fully searchable > after a commit? If openSearcher is true on the commit, then changes to the index (additions, replacements, deletions) will be visible when the commit completes. >

Query to count matching terms and disable 'coord' multiplication

2015-10-06 Thread Tim Hearn
Hello everyone, I have two questions 1) Is there a way to query solr to rank results based purely on the amount of terms in the query which are contained in the document? Example: doc1: 'foo bar poo car foo' q1: 'foo, car, two, start' score(doc1, q1) = 2 (since both foo and car both occur in

Re: Pivot facets

2015-10-06 Thread Chris Hostetter
It's not entirely clear what your queries/data look like, orwhat results you are expecting to get back, please consider asking your question again with more details... https://wiki.apache.org/solr/UsingMailingLists ...in the mean time the best guess i can make is that perhaps you aren't

efficient sort by title (multi word field)

2015-10-06 Thread Gili Nachum
Hi, wanted to make sure I'm implementing sort in an efficient way... I need to allow users to sort by documents' title field. A title can contain 1-20 words. Title examples: "new project meeting minutes - Oct 2015 - new chance on the horizon" or "how to create a wonderful presentation". I'm

RE: Solr 5.2.1 and spatial polygon searches

2015-10-06 Thread Lee Duhl
We were able to resolve this issue by installing the JTS library on the server and updating the solr schema.xml to remap the "solr. SpatialRecursivePrefixTreeFieldType" class to the "JtsSpatialContextFactory" Thank You Lee V. Duhl Realcomp II Ltd. Phone: (248) 699-9133 www.realcomp.com

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Gili Nachum
CloudSolrServer Beyond sending documents to the right leader shard, it also do this in *parallel *(for a batch), employing its own thread pool, with a connection per shard. On Tue, Oct 6, 2015 at 8:15 PM, Walter Underwood

Re: indexing data to solrcloud with "implicit" is not distributing across cluster.

2015-10-06 Thread Shawn Heisey
On 10/6/2015 10:02 AM, Steve wrote: > Thanks Shawn, that fixed it ! > > The documentation int the Collections API says "The value can be ... > *implicit*, which uses an internal default hash". Thank you for pointing out this error in the documentation. I did not know it was there. I have

Solr 5.2.1 - ReplicationHandler - No route to a host that is long gone

2015-10-06 Thread Eric Torti
Hey guys! We have a deploy of SolrCloud 5.2.1 that is composed of 5 to 8 amazon linux ec2 c3.2xlarge instances. Our main core is composed of 4M docs (6GB) and we serve an average of 70 req/s per machine. We are using zookeeper 3.4.6 to provide cluster synchronization. The thing is we are

Filter first-components result in solr.SearchHandler

2015-10-06 Thread aniljayanti
Hi All, I am workng on solr 5.2.1. I wrote my own component to get employee id's from first-component. I am trying to pass these id's to normal solr.SearchHandler () to filter the employee id's. relevant request handler in solrconfig.xml file : explicit 100 text

Re: Recovery Thread Blocked

2015-10-06 Thread Rallavagu
Mark - currently 5.3 is being evaluated for upgrade purposes and hopefully get there sooner. Meanwhile, following exception is noted from logs during updates ERROR org.apache.solr.update.CommitTracker – auto commit error...:java.lang.IllegalStateException: this writer hit an

EdgeNGramFilterFactory question

2015-10-06 Thread vit
I have Solr 4.2 1) Is it possible to somehow use EdgeNGramFilterFactory ignoring white spaces in n-grams? 2) Is it possible to use EdgeNGramFilterFactory in combination with stemming ? Say applying this to "look for close hotel" instead of "looking for closest hotels" -- View this message

Re: Pressed optimize and now SOLR is not indexing while optimize is going on

2015-10-06 Thread Siddhartha Singh Sandhu
Thank you for helping out. Further inquiry: I am committing records to my solr implementation and they are not getting showing up in my search. I am search on the default id. Is this related to the fact that I dont have enough memory so my SOLR is taking a lot of time to actually making the

Re: Recovery Thread Blocked

2015-10-06 Thread Mark Miller
That amount of RAM can easily be eaten up depending on your sorting, faceting, data. Do you have gc logging enabled? That should describe what is happening with the heap. - Mark On Tue, Oct 6, 2015 at 4:04 PM Rallavagu wrote: > Mark - currently 5.3 is being evaluated for

Re: Recovery Thread Blocked

2015-10-06 Thread Rallavagu
GC logging shows normal. The "OutOfMemoryError" appears to be pertaining to a thread but not to JVM. On 10/6/15 1:07 PM, Mark Miller wrote: That amount of RAM can easily be eaten up depending on your sorting, faceting, data. Do you have gc logging enabled? That should describe what is

Re: indexing data to solrcloud with "implicit" is not distributing across cluster.

2015-10-06 Thread Chris Hostetter
: The documentation int the Collections API says "The value can be ... : *implicit*, which uses an internal default hash". : I think most people would assume the "hash" would be used to route the : data. : Meanwhile the description of CompositID in the "Document Routing" section : only discusses

Re: Facet queries blow out the filterCache

2015-10-06 Thread Chris Hostetter
: So, no SolrCloud, default example config, about as basic as you get. I : didn’t even bother indexing any docs. Then I issued this query: : : http://localhost:8983/solr/techproducts/select?q=name:foo=1=true : =popularity=0=-1 : This still causes an insert into the filterCache. the faceting

MailEntityProcessor

2015-10-06 Thread Gaurav Gupta
Hello Guys, I am trying MailEntityProcessor in Solr 5 Below is my configuration : Issue I am facing : 1. transformers not working i.e. *template* 2. Looks like delta-import not supported ? 3. If I am doing full-import then its considering dataimport.properties i.e. doing

RE: Implementing AbstractFullDistribZkTestBase

2015-10-06 Thread Markus Jelsma
A crap, i didn't spot the httpS! I have added SSL supression, so far the tests roll fine. Thanks! Markus -Original message- > From:Mark Miller > Sent: Tuesday 6th October 2015 2:27 > To: solr-user@lucene.apache.org > Subject: Re: Implementing

Re: Numeric Sorting with 0 and NULL Values

2015-10-06 Thread Todd Long
Chris Hostetter-3 wrote > ...i mention this as being a workarround for floats/doubles because the > functions are evaluated as doubles (no "casting" or "forced integer > context" type support at the moment), so with integer/float fields there > would be some loss of precision. Excellent, thank

Re: Recovery Thread Blocked

2015-10-06 Thread Mark Miller
If it's a thread and you have plenty of RAM and the heap is fine, have you checked raising OS thread limits? - Mark On Tue, Oct 6, 2015 at 4:54 PM Rallavagu wrote: > GC logging shows normal. The "OutOfMemoryError" appears to be pertaining > to a thread but not to JVM. > >

Re: Recovery Thread Blocked

2015-10-06 Thread Rallavagu
It is java thread though. Does it need increasing OS level threads? On 10/6/15 6:21 PM, Mark Miller wrote: If it's a thread and you have plenty of RAM and the heap is fine, have you checked raising OS thread limits? - Mark On Tue, Oct 6, 2015 at 4:54 PM Rallavagu wrote:

RE: Cannot connect to a zookeeper 3.4.6 instance via zkCli.cmd

2015-10-06 Thread Adrian Liew
Hi Edwin, Thanks for the reply. Looks like this has been resolved by manually starting the Zookeeper services on each server promptly so that the tickTime value does not timeout too quickly to heartbeat other peers. Hence, I increased the tickTime value to about 5 minutes to give some time for

Re: Filter first-components result in solr.SearchHandler

2015-10-06 Thread Erik Hatcher
Could you also provide an example of the type of request you want the client to make? Note that `qf` is a (e)dismax query parser parameter, in case that’s conflicting for you. — Erik Hatcher, Senior Solutions Architect http://www.lucidworks.com > On Oct 6,

Zookeeper HA with 3x ZK with Solr server nodes

2015-10-06 Thread Adrian Liew
Hi there, I have 3 Solr server Azure VM nodes participating in SolrCloud with ZK installed on each of these nodes (to avoid a single point of failure with ZK for leader election). Each Solr server is hosted in a Windows Server 2012 R2 environment. I was told by my peer that if one zookeeper

Re: Facet queries blow out the filterCache

2015-10-06 Thread Jeff Wartes
I dug far enough yesterday to find the GET_DOCSET, but not far enough to find why. Thanks, a little context is really helpful sometimes. So, starting with an empty filterCache... http://localhost:8983/solr/techproducts/select?q=name:foo=1=true =popularity New values: lookups: 0,

Re: FieldCache?

2015-10-06 Thread Alessandro Benedetti
We should make some precision here, When dealing with faceting , there are currently 2 main approaches : 1) *Enum Algorithm* - best for low cardinality value fields, it is based on retrieving the term enum for all the terms in the index, and then intersecting the related posting list with the

Re: FieldCache?

2015-10-06 Thread Alessandro Benedetti
For completeness this is the related issue : https://issues.apache.org/jira/browse/SOLR-8096 Cheers 2015-10-06 11:21 GMT+01:00 Alessandro Benedetti : > We should make some precision here, > When dealing with faceting , there are currently 2 main approaches : > > 1)

Re: Pressed optimize and now SOLR is not indexing while optimize is going on

2015-10-06 Thread Toke Eskildsen
On Mon, 2015-10-05 at 17:26 -0400, Siddhartha Singh Sandhu wrote: > Following up on that: Would having an SSD make considerable difference in > speed? Yes, but only to a point. The UK Web Archive has done some tests on optimizing indexes on both spinning drives and SSDs:

Re:

2015-10-06 Thread Alessandro Benedetti
>From Jetty documentation : acceptQueueSizeThe size of the pending connection backlog. The exact interpretation is JVM and operating system specific and you can ignore it. Higher values allow more connections to wait pending an acceptor thread. Because the exact interpretation is deployment

indexing data to solrcloud with "implicit" is not distributing across cluster.

2015-10-06 Thread Steve
I’ve been unable to get solrcloud to distribute data across 4 solr nodes with the “route.name=implicit” feature of the collections API. The nodes are live, and the graphs are green. All the data (the “Films” example data) shows up on one node, the node that received the CREATE command. My

Solr 5.2.1 and spatial polygon searches

2015-10-06 Thread Lee Duhl
The following query runs fine on Solr 4.x, but errors with a "Couldn't parse shape " error message in Solr 5.2.1 geoloc:"INTERSECTS(POLYGON((-83.38434219360353 42.51412013568205,-83.3474349975586 42.51196902987156,-83.3561897277832 42.495390378152244, -83.4001350402832

Re: Solr 5.2.1 and spatial polygon searches

2015-10-06 Thread Alessandro Benedetti
Hi lee, shot in the dark, have you tried using the *WKT *syntax with range spatial approach*?* for example : q=geoloc:["0 18" TO "18 100”] . I am using it in 5.3 Cheers On 6 October 2015 at 14:22, Lee Duhl wrote: > The following query runs fine on Solr 4.x, but

Re: Zookeeper HA with 3x ZK with Solr server nodes

2015-10-06 Thread Alessandro Benedetti
When you have a ZK Ensemble a quorum of active nodes is necessary to have the entire Ensemble to work ( elect leaders, manage the cluster topology etc etc) . The quorum is 50% living nodes +1 . If you have an ensemble of 3 nodes, the quorum is 3/2 +1 = 2 nodes . With an ensemble of 3 nodes, you

Re: ??

2015-10-06 Thread Alessandro Benedetti
I would suggest you to write proper mail to this mailing list to get better answers ... Even the mail subject is a set of mystery ??? ... The first thing I could suggest is to take a look to the youtube presentation related : https://www.youtube.com/watch?v=8JADOLMazs4 Now I can not take a look

Re: Filter first-components result in solr.SearchHandler

2015-10-06 Thread aniljayanti
Hi Erik, thanks for your response, let me explain briefly. i wanted to make 5 employee id's as a priority id's. so every time when i am searching with specific keyword, then i want to append these 5 employee id's as first 5 results to the search results. example : let's take 3,5,6,8,9 are

RE: Solr 5.2.1 and spatial polygon searches

2015-10-06 Thread Lee Duhl
Alessandro Thanks for the reply. I'm not familiar with WKT syntax however the query sample you supplied below errors on both my 4.x and 5.2.1 servers with the following errors: "error": { "msg": "org.apache.solr.search.SyntaxError: Cannot parse 'geoloc:[\"0 18\" TO \"18 100”] ':

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Walter Underwood
This is at Chegg. One of our indexes is textbooks. These are expensive and don’t change very often. It is better to keep yesterday’s index than to drop a few important books. We have occasionally had an error that happens with every book, like a new field that is not in the Solr schema. If we

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Alessandro Benedetti
Hi Walter, can you explain better your use case ? You index a batch of e-commerce products ( Solr documents) if one fails, you want to stop and invalidate the entire batch ( using the almost never used solr rollback, or manual deletion ?) And then log the exception indexing size. To then re-index

Re: Best Indexing Approaches - To max the throughput

2015-10-06 Thread Walter Underwood
It depends on the document. In a e-commerce search, you might want to fail immediately and be notified. That is what we do, fail, rollback, and notify. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 6, 2015, at 7:58 AM, Alessandro Benedetti

Re: indexing data to solrcloud with "implicit" is not distributing across cluster.

2015-10-06 Thread Steve
Thanks Shawn, that fixed it ! The documentation int the Collections API says "The value can be ... *implicit*, which uses an internal default hash". I think most people would assume the "hash" would be used to route the data. Meanwhile the description of CompositID in the "Document Routing"