Re: NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
e being OR'd together > and your search term is effectively > > fac OR ace > > Best, > Erick > > On Mon, Jul 2, 2018 at 8:01 AM, Kudrettin Güleryüz > wrote: > > Hi, > > > > When using NgramTokenizerFactory with settings min ngram size=3 and

Re: NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
ex > > On Mon, Jul 2, 2018, 11:01 AM Kudrettin Güleryüz, > wrote: > > > Hi, > > > > When using NgramTokenizerFactory with settings min ngram size=3 and max > > ngram size=3 I get the following behaviour. > > > > Assume that search term is, face >

NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
Hi, When using NgramTokenizerFactory with settings min ngram size=3 and max ngram size=3 I get the following behaviour. Assume that search term is, face I expect the results to show documents with strings: * interface or * face or * faceted but not * ace or * fac Why would I get the matches
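
A minimal sketch of the behaviour asked about, in plain Python rather than Solr. It assumes the same min=3/max=3 n-gram analysis runs at both index and query time and that the resulting query grams are OR'd together, as the reply in this thread describes. `char_ngrams` is a hypothetical helper, not a Solr API.

```python
def char_ngrams(text, min_size, max_size):
    """Return the character n-grams of text, ordered by start position, then length."""
    grams = []
    for start in range(len(text)):
        for size in range(min_size, max_size + 1):
            if start + size <= len(text):
                grams.append(text[start:start + size])
    return grams


# With min=3 and max=3 the query "face" is broken into two grams.
query_grams = char_ngrams("face", 3, 3)
print(query_grams)

# A document matches if it shares ANY gram with the query (OR semantics).
for doc in ["interface", "faceted", "ace", "fac", "fob"]:
    doc_grams = set(char_ngrams(doc, 3, 3))
    print(doc, any(gram in doc_grams for gram in query_grams))
```

Under these assumptions "ace" and "fac" match because each equals one of the two query grams, even though neither contains the whole term "face".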

cloud disk space utilization

2018-08-27 Thread Kudrettin Güleryüz
Hi, We have six Solr nodes with ~1TiB disk space on each mounted as ext4. The indexers sometimes update the collections and create new ones if update wouldn't be faster than scratch indexing. (up to around 5 million documents are indexed for each collection) On average there are around 130

Re: list of collections

2018-07-17 Thread Kudrettin Güleryüz
at 12:12 PM, Kudrettin Güleryüz > wrote: > > > Hi, > > > > What is the suggested way to get list of collections from a solr Cloud > with > > a ZKhost? > > > > Thank you > > > > -- > > > This message and any attachment are confide

list of collections

2018-07-17 Thread Kudrettin Güleryüz
Hi, What is the suggested way to get list of collections from a solr Cloud with a ZKhost? Thank you

about precision preference in autoscaling API

2018-09-05 Thread Kudrettin Güleryüz
Hi, Can you please clarify the definition of the "precision" preference? For instance, is the freedisk "precision" honored when comparing the max and the min freedisk of all nodes in the sortedNodes list? Or is it between any consecutive nodes in the sortedNodes list? I think it may be the latter but then

Re: node sort order and policy

2018-09-05 Thread Kudrettin Güleryüz
l come first and the last node will be the one with the > most available disk space. > > On Wed, Sep 5, 2018 at 3:01 AM Kudrettin Güleryüz > wrote: > > > Given exhibit A which shows the current sort order on a 7.3.1 cloud > > installation, why would the sort order b

node sort order and policy

2018-09-04 Thread Kudrettin Güleryüz
Given exhibit A which shows the current sort order on a 7.3.1 cloud installation, why would the sort order be from less freedisk to more freedisk? Shouldn't it be the reverse since the number of cores is the same for all nodes? preferences and policies are listed in exhibit B. Thank you,

Re: cloud disk space utilization

2018-08-29 Thread Kudrettin Güleryüz
"freedisk":347.91234970092773, "sysLoadAvg":86.0}, { "node":"test-44:8983_solr", "cores":137, "freedisk":341.1301383972168, "sysLoadAvg":160.0}, { "node":"

Re: NgramTokenizerFactory question

2018-07-05 Thread Kudrettin Güleryüz
r maximum possible string. > > So with index-only min=3 and max=4, you will get: > 123 > 1234 > 234 > 2345 > 345 > 3456 > 456 > > Then 2349, not being ngrammed will not match anything, but 2345 will. > > Again, Admin UI will show that to you. > > Regards,
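
The index-only case quoted above can be sketched in plain Python. The indexed value "123456" is an assumption inferred from the listed grams (the snippet is truncated), and `index_ngrams` is a hypothetical helper, not a Solr API. The point is that the query term is looked up whole, without being n-grammed.

```python
def index_ngrams(text, min_size, max_size):
    """Character n-grams of text, ordered by start position, then length."""
    return [
        text[start:start + size]
        for start in range(len(text))
        for size in range(min_size, max_size + 1)
        if start + size <= len(text)
    ]


# Index time only: the stored value is split into 3- and 4-grams.
indexed_terms = index_ngrams("123456", 3, 4)
print(indexed_terms)

# Query time: the term is NOT n-grammed, so it must equal an indexed gram.
for query in ["2345", "2349"]:
    print(query, query in set(indexed_terms))
```

Here "2345" is itself one of the indexed grams, while "2349" is not, which is the distinction the reply draws.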

matches missing highlight information

2018-09-27 Thread Kudrettin Güleryüz
Hi, For some queries, the response object returns matches without any highlight information. The Solr node doesn't report any errors in the Solr log. The query term is g12312 and the number of matches is 7, but only 4 of them get highlight snippets. Any suggestions? "highlighting":{ ".../sources/test.cpp":{

Re: matches missing highlight information

2018-10-01 Thread Kudrettin Güleryüz
t only return 4 of them? > Or is it that there are 4 documents that matches the query term, but there > are 7 occurrences of the query term in these 4 documents? > > Regards, > Edwin > > On Fri, 28 Sep 2018 at 22:47, Kudrettin Güleryüz > wrote: > > > Hi Ed

Re: cloud disk space utilization

2018-08-30 Thread Kudrettin Güleryüz
uster policy also exists. We hope to > fix it in the next release (Solr 7.5). See > https://issues.apache.org/jira/browse/SOLR-12648 > > You may also be interested in > https://issues.apache.org/jira/browse/SOLR-12592 > > > On Tue, Aug 28, 2018 at 2:47 AM Kudrettin Güleryüz

Re: matches missing highlight information

2018-09-28 Thread Kudrettin Güleryüz
ret, > > What is your configuration for your /highlight requestHandler in > solrconfig.xml? > And also the query that you used when you get your above output? > > Regards, > Edwin > > On Fri, 28 Sep 2018 at 07:33, Kudrettin Güleryüz > wrote: > > > Hi, >

Re: Edismax leading wildcard search

2018-12-07 Thread Kudrettin Güleryüz
Hi, I am also wondering how to disable leading wildcards in Solr. Can you please suggest how to disable leading wildcards in Solr? I know in Lucene it is a flag that's set to false by default. > Do it on the client side. Just don't allow leading asterisks or question marks in your query term.
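
The client-side suggestion quoted above could look roughly like this. It is a sketch under simple assumptions: the query is split on whitespace, an optional `field:` prefix is ignored, and quoted phrases or nested syntax are not handled. `reject_leading_wildcards` is a hypothetical helper name, not part of Solr or any client library.

```python
def has_leading_wildcard(term):
    """True if the term starts with an asterisk or a question mark."""
    return term.startswith(("*", "?"))


def reject_leading_wildcards(query):
    """Raise ValueError if any whitespace-separated term starts with * or ?."""
    for token in query.split():
        # Ignore an optional "field:" prefix such as title:*foo
        term = token.rsplit(":", 1)[-1]
        if has_leading_wildcard(term):
            raise ValueError(f"leading wildcard not allowed: {token!r}")
    return query


print(reject_leading_wildcards("face* title:inter?ace"))  # trailing/inner wildcards pass

try:
    reject_leading_wildcards("title:*face")
except ValueError as err:
    print(err)
```

Validating before the request is sent keeps the check independent of the query parser configured on the server.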

one node too busy

2018-11-27 Thread Kudrettin Güleryüz
Hi, How can I debug what is causing occasional hiccup of our Solr cloud instance? When this issue happens, I can see that one of the nodes is too busy and the others are just doing fine. We use 6 nodes, 6 shards (1 shard per node), 1 replica for each collection. Can you please suggest tools to

different numFound value /select vs. /export

2019-06-28 Thread Kudrettin Güleryüz
Hi, I'd like to give my website users ability to export a field for the full search result set. Specifying a very large pageSize seems to perform very poorly for this purpose. Therefore, considering using export requestHandler for exporting search results. When I play with a core, I noticed that

Re: different numFound value /select vs. /export

2019-06-28 Thread Kudrettin Güleryüz
lr-b49111a417e3 > which has them wrongly formatted > > On Fri, 28 Jun 2019 at 18:00, Kudrettin Güleryüz > wrote: > > > Thank you for responding. > > > > I didn't go through the parsers involved, I assume they'd be the defaults. > > > > I did notice

Re: different numFound value /select vs. /export

2019-06-28 Thread Kudrettin Güleryüz
olrCloud I’d expect major differences, which you haven’t told us about, > off by 5? 10,000?. > > Third, there was a bug at one point where export would leave off the last > packet IIRC, what version of Solr are you using? > > Best, > Erick > > > On Jun 28, 2019, at 7:11 A

Multiple versions of Solr nodes in the same cloud

2020-04-06 Thread Kudrettin Güleryüz
Hi, Can I run, let's say, Solr 8.5.0 together with Solr 7.3.1 nodes while replacing older nodes with 8.5.0 one by one? We have a cloud with rather large disk space utilization. What is the suggested way to handle this kind of situation? Building indexes takes around 3 days to complete.

Re: All shards placed on the same node

2020-04-06 Thread Kudrettin Güleryüz
Please change the precision in cluster preference for core to 1 instead of > 10 and then give a try. > > With current settings, 2 nodes are not treated different till they have a > difference of 10 cores. > > Thanks, > > > On Mon, Apr 6, 2020, 2:09 AM Kudrettin Güleryüz > wrote:
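
The effect of precision described in this reply can be illustrated with a simplified comparison. This is not Solr's implementation; it only assumes, per the reply, that two nodes count as tied on a preference until their values differ by at least the precision. `compare_with_precision` is a hypothetical name.

```python
def compare_with_precision(a, b, precision):
    """Return 0 if a and b differ by less than precision, else -1 or 1."""
    if abs(a - b) < precision:
        return 0
    return -1 if a < b else 1


# Core counts on two hypothetical nodes.
busy_node, idle_node = 6, 0

# precision=10: a difference of 6 cores is treated as a tie.
print(compare_with_precision(busy_node, idle_node, 10))

# precision=1: the same difference is now significant.
print(compare_with_precision(busy_node, idle_node, 1))
```

Under this simplified model, a node holding six more cores than another still looks equally attractive with precision 10, which would be consistent with all six shards landing on one node.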

All shards placed on the same node

2020-04-05 Thread Kudrettin Güleryüz
Hi, Running 7.3.1 on an 8 node Solr cloud. Why would solr create all 6 shards on the same node? I don't want to restrict Solr to create up to x number of shards per node but creating all shards on the same node doesn't look right to me. Will Solr use all space on one node before using another

Autoscaling question

2020-03-26 Thread Kudrettin Güleryüz
Hi, I'd like to balance freedisk and cores across eight nodes. Here is my cluster-preferences and cluster-policy: { "responseHeader":{ "status":0, "QTime":0}, "cluster-preferences":[{ "precision":10, "maximize":"freedisk"} ,{ "minimize":"cores",