Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-24 Thread Alexandre Rafalovitch
Yes, swap will switch which core the name points to. For non Cloud setup. Just remember that your directory name does not get renamed, when you are deleting the old one. Just the core name in core.properties file. Regards, Alex On 24 Sep 2016 10:28 AM, "slee" wrote:

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-23 Thread slee
Erick / Alex, I want to thank you both. Your hints got me understand SOLR a bit better. I ended up with reversewildcard, and it speeds up performance a lot. That's what I'm expecting from SOLR... I also no longer experience the huge memory hog. The only down-side I can think of is, you need to

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-23 Thread Alexandre Rafalovitch
But if "SEF" and "OFF" are known to be searched for and especially if they are well-delimited, they could just be pulled-out into a separate field and just checked with an FQ. In the end, there may be no need for either EdgeNGram or wildcards. Just twisting the data during _indexing_ to represent

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread Erick Erickson
If you can break these up into tokens somehow, that's clearly best. But from the patterns you show it's not likely. WordDelimiterFactory won't quite work since it wouldn't be able to separate ASEF into the token SEF. You'll have a _lot_ fewer terms if you don't use edgengram. Try just using

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread Alexandre Rafalovitch
Not fully clear still, but perhaps you need several fields, at least one of which just contains your SEF and OFF values serving effectively as binary switches (FQ matches). And then maybe you strip the leading IDs that you are not matching on. Remember your Solr data shape does not need to match

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread slee
Alex, You do have a point with EdgeNGramFilterFactory. As requested, I've attached a sample screenshotfor your review. Erick, Here's my use-case. Assume I have the following term stored in global_Value as such: -

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread Erick Erickson
I totally missed EdgeNGram. Good catch Alex! Yeah, that's a killer. My shot in the dark here is that your analysis chain isn't the best choice to support your use-case and you're shooting yourself in the foot. So let's back up and talk about your use-case and maybe re-define your analysis chain

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread Alexandre Rafalovitch
Well, I am guessing this is the line that's causing the problem: Run your real sample for that field against your indexing definition in Admin UI and see how many tokens you end up with. You may have 50 tokens, but if each of them generates up to 47 representations.. Regards, Alex.

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread Erick Erickson
Wait: Are you really doing leading wildcard queries? If so, that's likely the root of the problem. Unless you add ReverseWildcardFilterFactory to your analysis chain, Lucene has to enumerate your entire set of terms to find likely candidates, which takes a lot of resources. What happens if you use

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-22 Thread slee
Here's what I have define in my schema: This is what I send in the query (2 values): q=global_Value:*mas+AND+global_Value:*sef=text=5=2.2=explicit=global_Value In addition, memory is taking way over 90%, given the heap

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-21 Thread Alexandre Rafalovitch
dering if multivalue fields cannot exceed certain terms? I only have 54 > to 60 terms. > > > Original Message > From: arafa...@gmail.com > Sent: September 21, 2016 7:40 PM > To: solr-user@lucene.apache.org > Reply-to: solr-user@lucene.apache.org > Subject: Re: Per

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-21 Thread Stan Lee
To: solr-user@lucene.apache.org Reply-to: solr-user@lucene.apache.org Subject: Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0] Do you _return_ the same set of fields in both queries? Is the difference truly just which field you search against? Regards,     Alex On 22 Sep 2016 3

Re: Performance Issue when querying Multivalued fields [SOLR 6.1.0]

2016-09-21 Thread Alexandre Rafalovitch
Do you _return_ the same set of fields in both queries? Is the difference truly just which field you search against? Regards, Alex On 22 Sep 2016 3:03 AM, "slee" wrote: > I've been doing a lot of reading on this forum with regards to performance > on > multivalued