Re: Solr Json Facet

2018-05-08 Thread Shawn Heisey
On 5/8/2018 11:36 AM, Kojo wrote: > If I tag the fq query and I query for a simple word it works fine too. But > if query a multi word with space in the middle it breaks: > > {'q':'*:*', 'fl': '*', > 'fq':'{!tag=city_colaboration_tag}city_colaboration:"College > Station"', 'json.facet':

Re: Solr Json Facet

2018-05-08 Thread Mikhail Khludnev
Single backslash escaping works for me. On Tue, May 8, 2018 at 8:36 PM, Kojo wrote: > Hello, > recently I have changed the way I get facet data from Solr. I was using GET > method on request but due to the limit of the query I changed to POST > method. > > Bellow is a sample

Rule based replica placement solr cloud 6.2.1

2018-05-08 Thread Natarajan, Rajeswari
Hi, Would like to have below rule set up in solr cloud 6.2.1. Not sure how to model this with default snitch. Any suggestions? Don’t assign more than 1 replica of this collection to a host Regards, Rajeswari

Re: Solr Json Facet

2018-05-08 Thread Yonik Seeley
On Tue, May 8, 2018 at 1:36 PM, Kojo wrote: > If I tag the fq query and I query for a simple word it works fine too. But > if query a multi word with space in the middle it breaks: Most likely the full query is not getting to Solr because of an HTTP protocol error (i.e. the

Re: Must clause with filter queries

2018-05-08 Thread Shawn Heisey
On 5/8/2018 9:58 AM, root23 wrote: > In case of frange query how do we specify the Must clause ? Looking at how frange works, I'm pretty sure that all queries with frange are going to be effectively single-clause.  So you don't need to specify MUST -- it's implied. > the reason we are using

managed resources and SolrJ

2018-05-08 Thread Hendrik Haddorp
Hi, we are looking into using manged resources for synonyms via the ManagedSynonymGraphFilterFactory. It seems like there is no SolrJ API for that. I would be especially interested in one via the CloudSolrClient. I found

Re: Solr Json Facet

2018-05-08 Thread Asher Shih
unsubscribe On Tue, May 8, 2018 at 9:19 PM, Kojo wrote: > Everything working now. The code is not that clean and I am rewriting, so I > don't know exactly what was wrong, but something malformed. > > I would like to ask another question regarding json facet. > > With GET

LTR performance issues

2018-05-08 Thread ilayaraja
LTR with grouping results in very high latency (3x) even while re-ranking 24 top groups. How is re-ranking implemented in Solr? Is it expected that it would result in 3x more query time. Need clarifications on: 1. How many top groups are actually re-ranked, is it exactly what we pass in

How to do multi-threading indexing on huge volume of JSON files?

2018-05-08 Thread Raymond Xie
I have a huge amount of JSON files to be indexed in Solr, it costs me 22 minutes to index 300,000 JSON files which were generated from 1 single bz2 file, this is only 0.25% of the total amount of data from the same business flow, there are 100+ business flow to be index'ed. I absolutely need a

Re: Solr Json Facet

2018-05-08 Thread Kojo
Thank you all. I tried escaping but still not working Yonik, I am using Python Requests. It works if my fq is a single word, even if I use double quotes on this single word without escaping. This is the HTTP response: response.content '\n\n400 Bad Request\n\nBad Request\nYour browser sent a

Re: Solr Json Facet

2018-05-08 Thread Kojo
Everything working now. The code is not that clean and I am rewriting, so I don't know exactly what was wrong, but something malformed. I would like to ask another question regarding json facet. With GET method, i was used to use many fq on the same query, each one with it's own tag. It was

Re: Solr Json Facet

2018-05-08 Thread Yonik Seeley
Looks like some sort of proxy server inbetween the python client and solr server. I would still check first if the output from the python client is correctly escaped/encoded HTTP. One easy way is to use netcat to pretend to be a server: $ nc -l 8983 And then send point the python client at that

How to do indexing on remote location

2018-05-08 Thread Raymond Xie
Please take this as no joking! Any suggestion is welcome and appreciated. I have data on remote WORM drive on a cluster that include 3 hosts, each host contains same copy of data. I have Solr server on a different host and need to do the indexing on the WORM drive. It is said the indexing can

Re: Solr Json Facet

2018-05-08 Thread Erick Erickson
Follow the instructions here: http://lucene.apache.org/solr/community.html#mailing-lists-irc. You must use the _exact_ same e-mail as you used to subscribe. If the initial try doesn't work and following the suggestions at the "problems" link doesn't work for you, let us know. But note you need to

Re: How to do multi-threading indexing on huge volume of JSON files?

2018-05-08 Thread Erick Erickson
I'd seriously consider a SolrJ program rather than posting, posting files is really intended to be a simple way to get started, when it comes to indexing large volumes it's not very efficient. As a comparison, I index 3-4K docs/second (Wikipedia dump) on my macbook pro. Note that if each of your

Re: Must clause with filter queries

2018-05-08 Thread Shawn Heisey
On 5/7/2018 9:51 AM, manuj singh wrote: > I am kind of confused how must clause(+) behaves with the filter queries. > e.g i have below query: > q=*:*=+{!frange cost=200 l=NOW-179DAYS u=NOW/DAY+1DAY incl=true > incu=false}date > > So i am filtering documents which are less then 179 old days. > So

Re: Async exceptions during distributed update

2018-05-08 Thread Emir Arnautović
Hi Jay, This is low ingestion rate. What is the size of your index? What is heap size? I am guessing that this is not a huge index, so I am leaning toward what Shawn mentioned - some combination of DBQ/merge/commit/optimise that is blocking indexing. Though, it is strange that it is happening

Re: Howto disable PrintGCTimeStamps in Solr

2018-05-08 Thread Shawn Heisey
On 5/7/2018 8:22 AM, Bernd Fehling wrote: > thanks for asking, I figured it out this morning. > If setting -Xloggc= the option -XX:+PrintGCTimeStamps will be set > as default and can't be disabled. It's inside JAVA. > > Currently using Solr 6.4.2 with > Java HotSpot(TM) 64-Bit Server VM

Re: Determine Solr Core Creation Timestamp

2018-05-08 Thread Shawn Heisey
On 5/7/2018 3:50 PM, Atita Arora wrote: > I noticed the same and hence overruled the idea to use it. > Further , while exploring the V2 api (as we're currently in Solr 6.6 and > will soon be on Solr 7.X) ,I came across the shards API which has > "property.index.version": "1525453818563" > > Which

Filter Must/Must not clauses and parenthesis

2018-05-08 Thread Alfonso Noriega
Hi everyone, I found solr 5.5.4 is doing some unexpected behavior (at least unexpected for me) when using Must and Must not operator and parenthesis for filtering and it would be great if someone can confirm if this is unexpected or not and why. To clarify I will write an example: The following

Re: Determine Solr Core Creation Timestamp

2018-05-08 Thread Atita Arora
Thank you Shawn for looking into this to such a depth. Let me try getting hold of someway to grab this information and use it and I may reach back to you or list for further thoughts. Thanks again, Atita On Tue, May 8, 2018, 3:11 PM Shawn Heisey wrote: > On 5/7/2018 3:50

Re: Solr Slave failed to initialize collection

2018-05-08 Thread Aji Viswanadhan
Hi Shawn , Thanks for the info!! As I mentioned master index was fine, only for one of the collection in salve index was corrupted. Yes, we fixed the issue by removing corrupted index and replicated again. The error message shared we have received from Admin UI of Solr. Replication strategy

Re: Howto disable PrintGCTimeStamps in Solr

2018-05-08 Thread Bernd Fehling
Hi Shawn, the goal is that some GCviewer get confused if both DateStamps and TimeStamps are present in solr_gc.log file. And _not_ to reduce the GC log size, that would be stupid. Now I have a Perl-Script which will remove the TimeStamps (and only leaf the DateStamps) for Analysis of solr_gc.log

Re:LTR performance issues

2018-05-08 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
Hello ilayaraja, I think it would be good to move this discussion on the Jira item: https://issues.apache.org/jira/browse/SOLR-8776?attachmentOrder=asc You can add your comments there, and also in the page I explained how it works. On the performance you are right: at the moment it is slow.

Re: Filter Must/Must not clauses and parenthesis

2018-05-08 Thread Erick Erickson
Just skimmed, but perhaps related to : https://issues.apache.org/jira/browse/SOLR-12212? Best, Erick On Tue, May 8, 2018 at 3:02 AM, Alfonso Noriega wrote: > Hi everyone, > I found solr 5.5.4 is doing some unexpected behavior (at least unexpected > for me) when

Re: Solr Slave failed to initialize collection

2018-05-08 Thread Shawn Heisey
On 5/8/2018 4:32 AM, Aji Viswanadhan wrote: Is this issue happened due to the size of the index? or any recommendations to not happen in future. Please let me know. I have no idea why it happened.  Running out of disk space could cause any number of problems.  Program operation becomes

Re: Async exceptions during distributed update

2018-05-08 Thread Jay Potharaju
Hi Emir, I was seeing this error as long as the indexing was running. Once I stopped the indexing the errors also stopped. Yes, we do monitor both hosts & solr but have not seen anything out of the ordinary except for a small network blip. In my experience solr generally recovers after a network

Re: Filter Must/Must not clauses and parenthesis

2018-05-08 Thread Shawn Heisey
On 5/8/2018 4:02 AM, Alfonso Noriega wrote: I found solr 5.5.4 is doing some unexpected behavior (at least unexpected for me) when using Must and Must not operator and parenthesis for filtering and it would be great if someone can confirm if this is unexpected or not and why. Do you have

Re: Filter Must/Must not clauses and parenthesis

2018-05-08 Thread Alfonso Noriega
Thanks Shawn! I was not thinking of it as a subtraction but it makes all the sense put like that. On 8 May 2018 at 17:55, Shawn Heisey wrote: > On 5/8/2018 4:02 AM, Alfonso Noriega wrote: > >> I found solr 5.5.4 is doing some unexpected behavior (at least >> unexpected

Re: Must clause with filter queries

2018-05-08 Thread root23
Hi Shawn, Thanks for the repsonse. We have multiple clauses. I was just giving an bare bone example. Usually all our queries will have more then one clause. In case of frange query how do we specify the Must clause ? the reason we are using frange instead of the normal syntax is that we need to

Solr Json Facet

2018-05-08 Thread Kojo
Hello, recently I have changed the way I get facet data from Solr. I was using GET method on request but due to the limit of the query I changed to POST method. Bellow is a sample of the data I send to Solr, in order to get facets. But there is something here that I don´t understand. If I do not