Re: Solr 8.5.1 cannot add replica using solr admin UI

2020-06-24 Thread ChienHuaWang
Use Solr 8.5.1, no issue to add replica/collection from my side.
What's the step you use? and for sure logs would be helpful



--
Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Constant score and stopwords strange behaviour

2020-06-24 Thread dbourassa
Hi,

I'm working on a Solr core where we don't want to use TF-IDF (BM25).
We rank documents with boost based on popularity, exact match, phrase match,
etc.

To bypass TF-IDF, we use constant score like this "q=harry^=0.5 potter^=0.5"
(score is always 1 before boost)
We have just noticed a strange behaviour with this method.
With "q=a cat", the stopword 'a' is automatically removed by the query
analyzer.
But with "q=a^0.5 cat^0.5", the stopword 'a' is not removed. 

We also tried something like "q=(a AND cat)^=1" but the problem still.

Someone have an idea or a better solution to bypass TF-IDF ?

relevant info in solrconfig :
...
edismax
590%
true
...

relevant info in schema :

...

...


Thanks



--
Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Developing update processor/Query Parser

2020-06-24 Thread Mikhail Khludnev
Hello, Vincenzo.
Presumably you can introduce a component which just holds a config data,
and then this component might be lookedup from QParser and UpdateFactory.
Overall, it seems like embedding logic into Solr core, which rarely works
well.

On Wed, Jun 24, 2020 at 8:00 PM Vincenzo D'Amore  wrote:

> Hi all,
>
> I've started to work on a couple of components very tight together.
> An update processor that writes few fields in the solr index and a Query
> Parser that, well, then reads such fields from the index.
>
> Such components share few configuration parameters together, I'm asking if
> there is a pattern, a draft, a sample, some guidelines or best practices
> that explains how to properly save configuration parameters.
>
> The configuration is written into the solrconfig.xml file, for example:
>
>
>  
>x1
>x2
>  
>
>
> And then query parser :
>
>  class="com.example.query.MyCustomQueryParserPlugin" />
>
> I'm struggling because the change of configuration on the updated processor
> has an impact on the query parser.
> For example the configuration info shared between those two components can
> be overwritten during a core reload.
> Basically, during an update or a core reload, there is a query parser that
> is serving requests while some other component is updating the index.
> So I suppose there should be a pattern, an approach, a common solution when
> a piece of configuration has to be loaded at boot, or when the core is
> loaded.
> Or when, after an update a new searcher is created and a new query parser
> is created.
>
> Any suggestion is really appreciated.
>
> Best regards,
> Vincenzo
>
>
>
> --
> Vincenzo D'Amore
>


-- 
Sincerely yours
Mikhail Khludnev


Solr 8.5.2: DataImportHandler failed to instantiate org.apache.solr.request.SolrRequestHandler

2020-06-24 Thread Peter van de Kerk
I'm migrating from solr v4.3.1 to v8.5.2 and using this guide: 
https://lucene.apache.org/solr/guide/8_5/installing-solr.html
I can't get data-import handler to work. I wanted to create a new core and then 
copy my old data-config fields into it to get me a quick start.

1. I ran command `solr create -c mytest` to create a new core.
2. I started solr: `C:\solr-8.5.2\bin\solr start -p 8983`
3. I look at http://localhost:8983/solr/#/, where I see my "mytest" core

On disk I see a `managed-schema` file, which I think matches Solr 4.3.1's 
`schema.xml`
I moved my data-config.xml to the new location 
C:\solr-8.5.2\server\solr\mytest\conf

In my old solrconfig.xml I had this code referring to the above data-config:

   

data-config.xml

  

This code was missing from the  
C:\solr-8.5.2\server\solr\mytest\conf\solrconfig.xml so I added it.
Then when I try to run 
`http://localhost:8983/solr/mytest/dataimport?command=full-import` I get the 
error:

> Error loading class 'org.apache.solr.handler.dataimport.DataImportHandler'

So I copied files from C:\solr-8.5.2\dist to C:\solr-8.5.2\server\lib

But then I get error

> org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: 
> Error Instantiating requestHandler, 
> org.apache.solr.handler.dataimport.DataImportHandler failed to instantiate 
> org.apache.solr.request.SolrRequestHandler

What can I do?


Re: solr fq with contains not returning any results

2020-06-24 Thread yaswanth kumar
Thanks Erick,

I have now added =query and found a diff between old solr and new solr

new solr (8.2) which is not giving results is as follows

"debug":{
"rawquerystring":"*:*",
"querystring":"*:*",
"parsedquery":"MatchAllDocsQuery(*:*)",
"parsedquery_toString":"*:*",
"explain":{},
"QParser":"LuceneQParser",
"filter_queries":["auto_nsallschools:*bostonschool*"],
"parsed_filter_queries":["auto_nsallschools:_star_bostonschool_star_"],

Where as solr 5.5 which is getting me the results is as follows

"debug":{
"rawquerystring":"*:*",
"querystring":"*:*",
"parsedquery":"MatchAllDocsQuery(*:*)",
"parsedquery_toString":"*:*",
"explain":{},
"QParser":"LuceneQParser",
"filter_queries":["auto_nsallschools:*bostonschool*"],
"parsed_filter_queries":["auto_nsallschools:*bostonschool*"],

I know in schema there are analyzer against this field but not getting on
why its making differences here.

Thanks,

On Wed, Jun 24, 2020 at 9:24 AM Erick Erickson 
wrote:

> You need to do several things to track down why.
>
> First, use something (admin UI, terms query, etc) to see
> exactly what’s in your index. The admin/analysis screen is useful here.
>
> Second, aldd =query to the query on both machines and
> see what the actual parsed query looks like.
>
> Comparing those should give you a clue.
>
> Best,
> Erick
>
> > On Jun 24, 2020, at 9:20 AM, yaswanth kumar 
> wrote:
> >
> > "nsallschools":["BostonSchool"]
> >
> > That's how the data is stored against the field.
> >
> > We have a functionality where we can do "Starts with, Contains, Ends
> with";
> > Also if you look at the above schema we are using
> >
> >  > synonyms="punctuation-whitelist.txt" ignoreCase="true" expand="false"/>
> >
> >
> >
> >
> > Also the strange part is that its working fine in Solr 5.5 but not in
> Solr
> > 8.2 any thoughts??
> >
> > Thanks,
> >
> > On Wed, Jun 24, 2020 at 3:15 AM Jörn Franke 
> wrote:
> >
> >> I don’t know your data, but could it be that you tokenize differently ?
> >>
> >> Why do you do the wildcard search at all? Maybe a different tokenizing
> >> strategy can bring you more effieciently results? Depends on what you
> need
> >> to achieve of course ...
> >>
> >>> Am 24.06.2020 um 05:37 schrieb yaswanth kumar :
> >>>
> >>> I am using solr 8.2
> >>>
> >>> And when trying to do fq=auto_nsallschools:*bostonschool*, the data is
> >> not
> >>> being returned. But if I do the same in solr 5.5 (which I already have
> >> and
> >>> we are in process of migrating to 8.2 ) its returning results.
> >>>
> >>> if I do fq=auto_nsallschools:bostonschool
> >>> or
> >>> fq=auto_nsallschools:bostonschool* its returning results but when I try
> >>> with contains like described above or
> fq=auto_nsallschools:*bostonschool
> >>> (ends with) it's not returning any results.
> >>>
> >>> The field which we are already using is a copy field and multi valued,
> >> am I
> >>> doing something wrong? or does 8.2 need some adjustment in the configs?
> >>>
> >>> Here is the schema
> >>>
> >>>  >> stored="true"
> >>> multiValued="true"/>
> >>>  indexed="true"
> >>> stored="false" multiValued="true"/>
> >>> 
> >>>
> >>>
> >>>  >>> positionIncrementGap="100">
> >>> 
> >>>   
> >>>   
> >>>   
> >>> 
> >>>   
> >>>
> >>>
> >>>  >>> positionIncrementGap="100">
> >>> 
> >>>>>> pattern="(\)" replacement="_and_" />
> >>>>>> pattern="(\$)" replacement="_dollar_" />
> >>>>>> pattern="(\*)" replacement="_star_" />
> >>>>>> pattern="(\+)" replacement="_plus_" />
> >>>>>> pattern="(\-)" replacement="_minus_" />
> >>>>>> pattern="(\#)" replacement="_sharp_" />
> >>>>>> pattern="(\%)" replacement="_percent_" />
> >>>>>> pattern="(\=)" replacement="_equal_" />
> >>>>>> pattern="(\)" replacement="_lessthan_" />
> >>>>>> pattern="(\)" replacement="_greaterthan_" />
> >>>>>> pattern="(\€)" replacement="_euro_" />
> >>>>>> pattern="(\¢)" replacement="_cent_" />
> >>>>>> pattern="(\£)" replacement="_pound_" />
> >>>>>> pattern="(\¥)" replacement="_yuan_" />
> >>>>>> pattern="(\©)" replacement="_copyright_" />
> >>>>>> pattern="(\®)" replacement="_registered_" />
> >>>>>> pattern="(\|)" replacement="_pipe_" />
> >>>>>> pattern="(\^)" replacement="_caret_" />
> >>>>>> pattern="(\~)" replacement="_tilt_" />
> >>>>>> pattern="(\™)" replacement="_treadmark_" />
> >>>>>> pattern="(\@)" replacement="_at_" />
> >>>>>> pattern="(\)" replacement=" _doublequote_ " />
> >>>>>> pattern="(\()" replacement=" _leftparentheses_ " />
> >>>>>> pattern="(\))" replacement=" _rightparentheses_ " />
> >>>>>> pattern="(\{)" replacement="_leftcurlybracket_" />
> >>>>>> pattern="(\})" replacement="_rightcurlybracket_" />
> >>>>>> pattern="(\[)" replacement="_leftsquarebracket_" />
> 

SOLR CDCR fails with JWT authorization configuration

2020-06-24 Thread Phatkar, Swapnil (Contractor)
Hi Team ,

I am trying to configure CDCR for SOLR 8.4.1 .
With the provided configuration I can able to replicate the indexes from Source 
server to Target server. This setup even works with SSL configuration using 
Https protocol.
But the moment I have introduced JWT authorization by enforcing security.json 
on both the server. I got an error at Target server side as shown below.
Due to which the index were not getting replicated at target server.

ERROR :

0200623 12:29:55.956 [ERROR] {qtp892083096-82} [   ] 
[org.apache.solr.security.PKIAuthenticationPlugin, 119] |
Could not decipher a header :8983_solr
$$$. No principal set

 Caused by: java.util.concurrent.ExecutionException: 
org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
Error from server at https://:8983/solr/express_shard1_replica_n3: 
Expected mime type application/octet-stream but got text/html. 


Error 401 Require authentication

HTTP ERROR 401
Problem accessing /solr/express_shard1_replica_n3/cdcr. Reason:
Require authentication




Caused by: 
org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error 
from server at
https://:8983/solr/express_shard1_replica_n3: Expected mime type 
application/octet-stream but got text/html. 


Error 401 Require authentication

HTTP ERROR 401
Problem accessing /solr/express_shard1_replica_n3/cdcr. Reason:
Require authentication



at 
org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:629)
at 
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:265)
at 
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:248)
at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1290)
at 
org.apache.solr.handler.CdcrRequestHandler$SliceCheckpointCallable.call(CdcrRequestHandler.java:868)
at 
org.apache.solr.handler.CdcrRequestHandler$SliceCheckpointCallable.call(CdcrRequestHandler.java:845)


Thanks and Regards,
Swapnil Phatkar
9167320216



Developing update processor/Query Parser

2020-06-24 Thread Vincenzo D'Amore
Hi all,

I've started to work on a couple of components very tight together.
An update processor that writes few fields in the solr index and a Query
Parser that, well, then reads such fields from the index.

Such components share few configuration parameters together, I'm asking if
there is a pattern, a draft, a sample, some guidelines or best practices
that explains how to properly save configuration parameters.

The configuration is written into the solrconfig.xml file, for example:

   
 
   x1
   x2
 
   

And then query parser :



I'm struggling because the change of configuration on the updated processor
has an impact on the query parser.
For example the configuration info shared between those two components can
be overwritten during a core reload.
Basically, during an update or a core reload, there is a query parser that
is serving requests while some other component is updating the index.
So I suppose there should be a pattern, an approach, a common solution when
a piece of configuration has to be loaded at boot, or when the core is
loaded.
Or when, after an update a new searcher is created and a new query parser
is created.

Any suggestion is really appreciated.

Best regards,
Vincenzo



-- 
Vincenzo D'Amore


Re: Solr 8.5.1 cannot add replica using solr admin UI

2020-06-24 Thread Colvin Cowie
Have you checked the logs to see what is happening? What error messages are
you getting?

On Wed, 24 Jun 2020 at 15:23, Shreyas Kothiya <
shreyas.koth...@archivesocial.com> wrote:

> Hello
>
> I was trying to add replica using solr admin UI. but it seems not working
> on solr  8.5.1 .
>
> I was not able to find any bug around this. is this a known bug or do I
> need to change any-settings to enable this,
>
> Thanks


Re: Solr caches per node or per core

2020-06-24 Thread Odysci
Thanks!

Reinaldo

On Wed, Jun 24, 2020 at 11:47 AM Emir Arnautović <
emir.arnauto...@sematext.com> wrote:

> Hi Reinaldo,
> It is per core. Single node can have cores from different collections,
> each configured differently. When you size caches from memory consumption
> point of view, you have to take into account how many cores will be placed
> on each node. Of course, you have to count replicas as well.
>
> HTH,
> Emir
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>
>
>
> > On 24 Jun 2020, at 16:38, Odysci  wrote:
> >
> > Hi,
> >
> > I have a Solrcloud configuration with 2 nodes and 2 shards/2 replicas.
> > I configure the sizes of the solr caches on solrconfig.xml, which I
> > believe apply to nodes.
> >
> > But when I look at the caches in the Solr UI, they are shown per core
> > (e.g., shard1_replica_N1). Are the cache sizes defined in the
> > solrconfig.xml the total size (adding up the caches for all cores in the
> > node)? or are the cache sizes defined in the solrconfig.xm applied to
> each
> > core separately?
> > Thanks
> >
> > Reinaldo
>
>


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Mike Drob
Brend,

I appreciate that you are trying to examine this issue from multiple sides
and consider future implications, but I don’t think that is a stirring
argument. By analogy, if we are out of eggs and my wife asks me to go to
the store to get some, refusing to do so on the basis that she might call
me while I’m there and also ask me to get milk would not be reasonable.

What will come next may be an interesting question philosophically, but we
are not discussing abstract concepts here. There is a concrete issue
identified, and we’re soliciting input in how best to address it.

Thank you for the suggestion of "guide/follower"

Mike

On Wed, Jun 24, 2020 at 6:30 AM Bernd Fehling <
bernd.fehl...@uni-bielefeld.de> wrote:

> I'm following this thread now for a while and I can understand
> the wish to change some naming/wording/speech in one or the other
> programs but I always get back to the one question:
> "Is it the weapon which kills people or the hand controlled by
> the mind which fires the weapon?"
>
> The thread started with slave - slavery, then turned over to master
> and followed by leader (for me as a german... you know).
> What will come next?
>
> And more over, we now discuss about changes in the source code and
> due to this there need to be changes to the documentation.
> What about the books people wrote about this programs and source code,
> should we force this authors to rewrite their books?
> May be we should file a request to all web search engines to reject
> all stored content about these "banned" words?
> And contact all web hosters about providing bad content.
>
> To sum things up, within my 40 years of computer science and writing
> programs I have never had a nanosecond any thoughts about words
> like master, slave, leader, ... other than thinking about computers
> and programming.
>
> Just my 2 cents.
>
> For what it is worth, I tend to guide/follower if there "must be" any
> changes.
>
> Bernd
>


Re: Solr caches per node or per core

2020-06-24 Thread Emir Arnautović
Hi Reinaldo,
It is per core. Single node can have cores from different collections, each 
configured differently. When you size caches from memory consumption point of 
view, you have to take into account how many cores will be placed on each node. 
Of course, you have to count replicas as well.

HTH,
Emir
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/



> On 24 Jun 2020, at 16:38, Odysci  wrote:
> 
> Hi,
> 
> I have a Solrcloud configuration with 2 nodes and 2 shards/2 replicas.
> I configure the sizes of the solr caches on solrconfig.xml, which I
> believe apply to nodes.
> 
> But when I look at the caches in the Solr UI, they are shown per core
> (e.g., shard1_replica_N1). Are the cache sizes defined in the
> solrconfig.xml the total size (adding up the caches for all cores in the
> node)? or are the cache sizes defined in the solrconfig.xm applied to each
> core separately?
> Thanks
> 
> Reinaldo



Re: CDCR stress-test issues

2020-06-24 Thread matthew sporleder
On Wed, Jun 24, 2020 at 9:46 AM Oakley, Craig (NIH/NLM/NCBI) [C]
 wrote:
>
> In attempting to stress-test CDCR (running Solr 7.4), I am running into a 
> couple of issues.
>
> One is that the tlog files keep accumulating for some nodes in the CDCR 
> system, particularly for the non-Leader nodes in the Source SolrCloud. No 
> quantity of hard commits seem to cause any of these tlog files to be 
> released. This can become a problem upon reboot if there are hundreds of 
> thousands of tlog files, and Solr fails to start (complaining that there are 
> too many open files).
>
> The tlogs had been accumulating on all the nodes of the CDCR set of 
> SolrClouds until I added these two lines to the solrconfig.xml file (for 
> testing purposes, using numbers much lower than in the examples):
> 5
> 2
> Since then, it is mostly the non-Leader nodes of the Source SolrCloud which 
> accumulates tlog files (the Target SolrCloud does seem to have a tendency to 
> clean up the tlog files, as does the Leader of the Source SolrCloud). If I 
> use ADDREPLICAPROP and REBALANCELEADERS to change which node is the Leader, 
> and if I then start adding more data, the tlogs on the new Leader sometimes 
> will go away, but then the old Leader begins accumulating tlog files. I am 
> dubious whether frequent reassignment of Leadership would be a practical 
> solution.
>
> I also have several times attempted to simulate a production environment by 
> running several loops simultaneously, each of which inserts multiple records 
> on each iteration of the loop. Several times, I end up with a dozen records 
> on (both replicas of) the Source which never make it to (either replica of) 
> the Target. The Target has thousands of records which were inserted before 
> the missing records, and thousands of records which were inserted after the 
> missing records (and all these records, the replicated and the missing, were 
> inserted by curl commands which only differed in sequential numbers 
> incorporated into the values being inserted).
>
> I also have a question regarding SOLR-13141: the 11/Feb/19 comment says that 
> the fix for Solr 7.3 had a problem; and the header says "Affects Version/s: 
> 7.5, 7.6": does that indicate that Solr 7.4 is not affected?
>
> Are  there any suggestions?
>
> Thanks

Just going to "me too" where i've had (non cdcr) installs accumulate
tlogs until eventual rebuilds or crashes.


Solr caches per node or per core

2020-06-24 Thread Odysci
Hi,

I have a Solrcloud configuration with 2 nodes and 2 shards/2 replicas.
I configure the sizes of the solr caches on solrconfig.xml, which I
believe apply to nodes.

But when I look at the caches in the Solr UI, they are shown per core
(e.g., shard1_replica_N1). Are the cache sizes defined in the
solrconfig.xml the total size (adding up the caches for all cores in the
node)? or are the cache sizes defined in the solrconfig.xm applied to each
core separately?
Thanks

Reinaldo


Solr 8.5.1 cannot add replica using solr admin UI

2020-06-24 Thread Shreyas Kothiya
Hello

I was trying to add replica using solr admin UI. but it seems not working on 
solr  8.5.1 . 

I was not able to find any bug around this. is this a known bug or do I need to 
change any-settings to enable this,

Thanks

RE: Restored collection cluster status rendering some values as Long (as opposed to String for other collections)

2020-06-24 Thread Aliaksandr Asiptsou
Sorry I forgot to mention: we use Solr 8.3.1

Best regards,
Aliaksandr Asiptsou
From: Aliaksandr Asiptsou
Sent: Wednesday, June 24, 2020 12:44 AM
To: solr-user@lucene.apache.org
Subject: Restored collection cluster status rendering some values as Long (as 
opposed to String for other collections)

Hello Solr experts,

Our team noticed the below behavior:

1. A collection is restored from a backup, and a replication factor is 
specified within the restore command:

/solr/admin/collections?action=RESTORE=backup_name=/backups/solr=collection_name=config_name=1=1

2. Collection restored successfully, but looking into cluster status we see 
several values are rendered as Long for this particular collection:

/solr/admin/collections?action=clusterstatus=xml

0
1
1
false
1
0
138

Whereas for all the other collections pullReplicas, replicationFactor, 
nrtReplicas and tlogReplicas are Strings.

Please advise whether it is known and expected or it needs to be fixed (if so, 
is there a Jira ticket already for this or should we create one)?

Best regards,
Aliaksandr Asiptsou


CDCR stress-test issues

2020-06-24 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
In attempting to stress-test CDCR (running Solr 7.4), I am running into a 
couple of issues.

One is that the tlog files keep accumulating for some nodes in the CDCR system, 
particularly for the non-Leader nodes in the Source SolrCloud. No quantity of 
hard commits seem to cause any of these tlog files to be released. This can 
become a problem upon reboot if there are hundreds of thousands of tlog files, 
and Solr fails to start (complaining that there are too many open files).

The tlogs had been accumulating on all the nodes of the CDCR set of SolrClouds 
until I added these two lines to the solrconfig.xml file (for testing purposes, 
using numbers much lower than in the examples):
5
2
Since then, it is mostly the non-Leader nodes of the Source SolrCloud which 
accumulates tlog files (the Target SolrCloud does seem to have a tendency to 
clean up the tlog files, as does the Leader of the Source SolrCloud). If I use 
ADDREPLICAPROP and REBALANCELEADERS to change which node is the Leader, and if 
I then start adding more data, the tlogs on the new Leader sometimes will go 
away, but then the old Leader begins accumulating tlog files. I am dubious 
whether frequent reassignment of Leadership would be a practical solution.

I also have several times attempted to simulate a production environment by 
running several loops simultaneously, each of which inserts multiple records on 
each iteration of the loop. Several times, I end up with a dozen records on 
(both replicas of) the Source which never make it to (either replica of) the 
Target. The Target has thousands of records which were inserted before the 
missing records, and thousands of records which were inserted after the missing 
records (and all these records, the replicated and the missing, were inserted 
by curl commands which only differed in sequential numbers incorporated into 
the values being inserted).

I also have a question regarding SOLR-13141: the 11/Feb/19 comment says that 
the fix for Solr 7.3 had a problem; and the header says "Affects Version/s: 
7.5, 7.6": does that indicate that Solr 7.4 is not affected?

Are  there any suggestions?

Thanks


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Emir Arnautović
Hi all,
Here is how I see it and explain to others that are not too familiar with Solr: 
Solr comes in two flavours - Cloud and Standalone. In any mode Solr writes to 
primary core(s). There is option to have different types of replicas, but in 
Standalone mode one can only have pull replica. In addition to different types 
of replicas, in SolrCloud mode multiple cores can be shards of a singe 
collection and primary is not fixed.

Emir
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/



> On 24 Jun 2020, at 15:19, Mark H. Wood  wrote:
> 
> On Wed, Jun 24, 2020 at 12:45:25PM +0200, Jan Høydahl wrote:
>> Master/slave and standalone are used interchangably to mean zk-less Solr. I 
>> have a feeling that master/slave is the more popular of the two, but 
>> personally I have been using both.
> 
> I've been trying to stay quiet and let the new-terminology issue
> settle, but I had a thought.  Someone has already pointed out that the
> so-called master/slave cluster is misnamed:  the so-called "master"
> node doesn't order the "slaves" about and indeed has no notion of
> being a master in any sense.  It acts as a servant to the "slave"
> nodes, which are in charge of keeping themselves updated.
> 
> So, it's kind of odd, but I could get used to calling this mode a
> "client/server cluster".
> 
> That leaves the question of what to call Solr Cloud mode, in which no
> node is permanently special.  I could see calling it a "herd" or
> suchlike.
> 
> Now I'll try to shut up again. :-)
> 
> -- 
> Mark H. Wood
> Lead Technology Analyst
> 
> University Library
> Indiana University - Purdue University Indianapolis
> 755 W. Michigan Street
> Indianapolis, IN 46202
> 317-274-0749
> www.ulib.iupui.edu



Re: solr fq with contains not returning any results

2020-06-24 Thread Erick Erickson
You need to do several things to track down why.

First, use something (admin UI, terms query, etc) to see
exactly what’s in your index. The admin/analysis screen is useful here.

Second, aldd =query to the query on both machines and
see what the actual parsed query looks like.

Comparing those should give you a clue.

Best,
Erick

> On Jun 24, 2020, at 9:20 AM, yaswanth kumar  wrote:
> 
> "nsallschools":["BostonSchool"]
> 
> That's how the data is stored against the field.
> 
> We have a functionality where we can do "Starts with, Contains, Ends with";
> Also if you look at the above schema we are using
> 
>  synonyms="punctuation-whitelist.txt" ignoreCase="true" expand="false"/>
>
>
>
> 
> Also the strange part is that its working fine in Solr 5.5 but not in Solr
> 8.2 any thoughts??
> 
> Thanks,
> 
> On Wed, Jun 24, 2020 at 3:15 AM Jörn Franke  wrote:
> 
>> I don’t know your data, but could it be that you tokenize differently ?
>> 
>> Why do you do the wildcard search at all? Maybe a different tokenizing
>> strategy can bring you more effieciently results? Depends on what you need
>> to achieve of course ...
>> 
>>> Am 24.06.2020 um 05:37 schrieb yaswanth kumar :
>>> 
>>> I am using solr 8.2
>>> 
>>> And when trying to do fq=auto_nsallschools:*bostonschool*, the data is
>> not
>>> being returned. But if I do the same in solr 5.5 (which I already have
>> and
>>> we are in process of migrating to 8.2 ) its returning results.
>>> 
>>> if I do fq=auto_nsallschools:bostonschool
>>> or
>>> fq=auto_nsallschools:bostonschool* its returning results but when I try
>>> with contains like described above or fq=auto_nsallschools:*bostonschool
>>> (ends with) it's not returning any results.
>>> 
>>> The field which we are already using is a copy field and multi valued,
>> am I
>>> doing something wrong? or does 8.2 need some adjustment in the configs?
>>> 
>>> Here is the schema
>>> 
>>> > stored="true"
>>> multiValued="true"/>
>>> >> stored="false" multiValued="true"/>
>>> 
>>> 
>>> 
>>> >> positionIncrementGap="100">
>>> 
>>>   
>>>   
>>>   
>>> 
>>>   
>>> 
>>> 
>>> >> positionIncrementGap="100">
>>> 
>>>   >> pattern="(\)" replacement="_and_" />
>>>   >> pattern="(\$)" replacement="_dollar_" />
>>>   >> pattern="(\*)" replacement="_star_" />
>>>   >> pattern="(\+)" replacement="_plus_" />
>>>   >> pattern="(\-)" replacement="_minus_" />
>>>   >> pattern="(\#)" replacement="_sharp_" />
>>>   >> pattern="(\%)" replacement="_percent_" />
>>>   >> pattern="(\=)" replacement="_equal_" />
>>>   >> pattern="(\)" replacement="_lessthan_" />
>>>   >> pattern="(\)" replacement="_greaterthan_" />
>>>   >> pattern="(\€)" replacement="_euro_" />
>>>   >> pattern="(\¢)" replacement="_cent_" />
>>>   >> pattern="(\£)" replacement="_pound_" />
>>>   >> pattern="(\¥)" replacement="_yuan_" />
>>>   >> pattern="(\©)" replacement="_copyright_" />
>>>   >> pattern="(\®)" replacement="_registered_" />
>>>   >> pattern="(\|)" replacement="_pipe_" />
>>>   >> pattern="(\^)" replacement="_caret_" />
>>>   >> pattern="(\~)" replacement="_tilt_" />
>>>   >> pattern="(\™)" replacement="_treadmark_" />
>>>   >> pattern="(\@)" replacement="_at_" />
>>>   >> pattern="(\)" replacement=" _doublequote_ " />
>>>   >> pattern="(\()" replacement=" _leftparentheses_ " />
>>>   >> pattern="(\))" replacement=" _rightparentheses_ " />
>>>   >> pattern="(\{)" replacement="_leftcurlybracket_" />
>>>   >> pattern="(\})" replacement="_rightcurlybracket_" />
>>>   >> pattern="(\[)" replacement="_leftsquarebracket_" />
>>>   >> pattern="(\])" replacement="_rightsquarebracket_" />
>>>   >> synonyms="punctuation-whitelist.txt" ignoreCase="true" expand="false"/>
>>>   
>>>   
>>>   
>>> 
>>>   
>>> 
>>> Thanks,
>>> 
>>> --
>>> Thanks & Regards,
>>> Yaswanth Kumar Konathala.
>>> yaswanth...@gmail.com
>> 
> 
> 
> -- 
> Thanks & Regards,
> Yaswanth Kumar Konathala.
> yaswanth...@gmail.com



Re: solr fq with contains not returning any results

2020-06-24 Thread yaswanth kumar
"nsallschools":["BostonSchool"]

That's how the data is stored against the field.

We have a functionality where we can do "Starts with, Contains, Ends with";
Also if you look at the above schema we are using






Also the strange part is that its working fine in Solr 5.5 but not in Solr
8.2 any thoughts??

Thanks,

On Wed, Jun 24, 2020 at 3:15 AM Jörn Franke  wrote:

> I don’t know your data, but could it be that you tokenize differently ?
>
> Why do you do the wildcard search at all? Maybe a different tokenizing
> strategy can bring you more effieciently results? Depends on what you need
> to achieve of course ...
>
> > Am 24.06.2020 um 05:37 schrieb yaswanth kumar :
> >
> > I am using solr 8.2
> >
> > And when trying to do fq=auto_nsallschools:*bostonschool*, the data is
> not
> > being returned. But if I do the same in solr 5.5 (which I already have
> and
> > we are in process of migrating to 8.2 ) its returning results.
> >
> > if I do fq=auto_nsallschools:bostonschool
> > or
> > fq=auto_nsallschools:bostonschool* its returning results but when I try
> > with contains like described above or fq=auto_nsallschools:*bostonschool
> > (ends with) it's not returning any results.
> >
> > The field which we are already using is a copy field and multi valued,
> am I
> > doing something wrong? or does 8.2 need some adjustment in the configs?
> >
> > Here is the schema
> >
> >  stored="true"
> > multiValued="true"/>
> >  > stored="false" multiValued="true"/>
> > 
> >
> >
> >  > positionIncrementGap="100">
> >  
> >
> >
> >
> >  
> >
> >
> >
> >  > positionIncrementGap="100">
> >  
> > > pattern="(\)" replacement="_and_" />
> > > pattern="(\$)" replacement="_dollar_" />
> > > pattern="(\*)" replacement="_star_" />
> > > pattern="(\+)" replacement="_plus_" />
> > > pattern="(\-)" replacement="_minus_" />
> > > pattern="(\#)" replacement="_sharp_" />
> > > pattern="(\%)" replacement="_percent_" />
> > > pattern="(\=)" replacement="_equal_" />
> > > pattern="(\)" replacement="_lessthan_" />
> > > pattern="(\)" replacement="_greaterthan_" />
> > > pattern="(\€)" replacement="_euro_" />
> > > pattern="(\¢)" replacement="_cent_" />
> > > pattern="(\£)" replacement="_pound_" />
> > > pattern="(\¥)" replacement="_yuan_" />
> > > pattern="(\©)" replacement="_copyright_" />
> > > pattern="(\®)" replacement="_registered_" />
> > > pattern="(\|)" replacement="_pipe_" />
> > > pattern="(\^)" replacement="_caret_" />
> > > pattern="(\~)" replacement="_tilt_" />
> > > pattern="(\™)" replacement="_treadmark_" />
> > > pattern="(\@)" replacement="_at_" />
> > > pattern="(\)" replacement=" _doublequote_ " />
> > > pattern="(\()" replacement=" _leftparentheses_ " />
> > > pattern="(\))" replacement=" _rightparentheses_ " />
> > > pattern="(\{)" replacement="_leftcurlybracket_" />
> > > pattern="(\})" replacement="_rightcurlybracket_" />
> > > pattern="(\[)" replacement="_leftsquarebracket_" />
> > > pattern="(\])" replacement="_rightsquarebracket_" />
> > > synonyms="punctuation-whitelist.txt" ignoreCase="true" expand="false"/>
> >
> >
> >
> >  
> >
> >
> > Thanks,
> >
> > --
> > Thanks & Regards,
> > Yaswanth Kumar Konathala.
> > yaswanth...@gmail.com
>


-- 
Thanks & Regards,
Yaswanth Kumar Konathala.
yaswanth...@gmail.com


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Mark H. Wood
On Wed, Jun 24, 2020 at 12:45:25PM +0200, Jan Høydahl wrote:
> Master/slave and standalone are used interchangably to mean zk-less Solr. I 
> have a feeling that master/slave is the more popular of the two, but 
> personally I have been using both.

I've been trying to stay quiet and let the new-terminology issue
settle, but I had a thought.  Someone has already pointed out that the
so-called master/slave cluster is misnamed:  the so-called "master"
node doesn't order the "slaves" about and indeed has no notion of
being a master in any sense.  It acts as a servant to the "slave"
nodes, which are in charge of keeping themselves updated.

So, it's kind of odd, but I could get used to calling this mode a
"client/server cluster".

That leaves the question of what to call Solr Cloud mode, in which no
node is permanently special.  I could see calling it a "herd" or
suchlike.

Now I'll try to shut up again. :-)

-- 
Mark H. Wood
Lead Technology Analyst

University Library
Indiana University - Purdue University Indianapolis
755 W. Michigan Street
Indianapolis, IN 46202
317-274-0749
www.ulib.iupui.edu


signature.asc
Description: PGP signature


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Jörn Franke
I agree with Bernd. I believe also that change is natural so eventually one 
needs to evolve the terminology or create a complete new product. To evolve the 
terminology one can write a page in the ref guide for translating it and over 
time adapt it in Solr etc.


> Am 24.06.2020 um 13:30 schrieb Bernd Fehling :
> 
> I'm following this thread now for a while and I can understand
> the wish to change some naming/wording/speech in one or the other
> programs but I always get back to the one question:
> "Is it the weapon which kills people or the hand controlled by
> the mind which fires the weapon?"
> 
> The thread started with slave - slavery, then turned over to master
> and followed by leader (for me as a german... you know).
> What will come next?
> 
> And more over, we now discuss about changes in the source code and
> due to this there need to be changes to the documentation.
> What about the books people wrote about this programs and source code,
> should we force this authors to rewrite their books?
> May be we should file a request to all web search engines to reject
> all stored content about these "banned" words?
> And contact all web hosters about providing bad content.
> 
> To sum things up, within my 40 years of computer science and writing
> programs I have never had a nanosecond any thoughts about words
> like master, slave, leader, ... other than thinking about computers
> and programming.
> 
> Just my 2 cents.
> 
> For what it is worth, I tend to guide/follower if there "must be" any changes.
> 
> Bernd


Re: Unexpected results using Block Join Parent Query Parser

2020-06-24 Thread Mikhail Khludnev
Jan, thanks for the clarification.
Sure you can use {!parent which=class:section} for return children, which
has a garndchildren matching subordinate query.
Note: there's something about named scopes, which I didn't get into yet,
but it might be relevant to the problem.

On Wed, Jun 24, 2020 at 1:43 PM Jan Høydahl  wrote:

> I guess the key question here is whether «parent» in BlockJoin is strictly
> top-level parent/root, i.e. class:composition for the example in this
> tread? Or can {!parent} parser also be used to select the «child» level in
> a child/grandchild relationship inside a block?
>
> Jan
>
> > 24. jun. 2020 kl. 11:36 skrev Tor-Magne Stien Hagen :
> >
> > Thanks for your answer,
> >
> > What kind of rules exists for the which clause? In other words, how can
> you identify parents without using some sort of filtering?
> >
> > Tor-Magne Stien Hagen
> >
> > -Original Message-
> > From: Mikhail Khludnev 
> > Sent: Wednesday, June 24, 2020 10:01 AM
> > To: solr-user 
> > Subject: Re: Unexpected results using Block Join Parent Query Parser
> >
> > Hello,
> >
> > Please check warning box titled Using which
> >
> https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flucene.apache.org%2Fsolr%2Fguide%2F8_5%2Fother-parsers.html%23block-join-parent-query-parserdata=02%7C01%7Ctsh%40dips.no%7C5fef4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C0%7C637285825378470570sdata=rB356EBZuDmFsTHT3ULcvr47zCcr%2F29XYaGA7%2BJ5HrI%3Dreserved=0
> >
> > On Wed, Jun 24, 2020 at 10:01 AM Tor-Magne Stien Hagen 
> wrote:
> >
> >> Hi,
> >>
> >> I have indexed the following nested document in Solr:
> >>
> >> {
> >>"id": "1",
> >>"class": "composition",
> >>"children": [
> >>{
> >>"id": "2",
> >>"class": "section",
> >>"children": [
> >>{
> >>"id": "3",
> >>"class": "observation"
> >>}
> >>]
> >>},
> >>{
> >>"id": "4",
> >>"class": "section",
> >>"children": [
> >>{
> >>"id": "5",
> >>"class": "instruction"
> >>}
> >>]
> >>}
> >>]
> >> }
> >>
> >> Given the following query:
> >>
> >> {!parent which='id:4'}id:3
> >>
> >> I expect the result to be empty as document 3 is not a child document
> >> of document 4.
> >>
> >> To reproduce, use the docker container provided here:
> >> https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> >> ub.com%2Ftormsh%2FSolr-Exampledata=02%7C01%7Ctsh%40dips.no%7C5fef
> >> 4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C
> >> 0%7C637285825378470570sdata=OyjBalFeXfb0W2euL76L%2BNyRDg9ukvT8TNI
> >> aODCmV30%3Dreserved=0
> >>
> >> Have I misunderstood something regarding the Block Join Parent Query
> >> Parser?
> >>
> >> Tor-Magne Stien Hagen
> >>
> >>
> >
> > --
> > Sincerely yours
> > Mikhail Khludnev
>
>

-- 
Sincerely yours
Mikhail Khludnev


Re: replica deleted but directory remains

2020-06-24 Thread Odysci
Thanks!
Reinaldo

On Tue, Jun 23, 2020 at 6:27 PM Erick Erickson 
wrote:

> In a word, “yes”. What it looks like is that the information in
> Zookeeper has been updated to reflect the deletion. But since
> node for some mysterious reason wasn’t available when the replica
> was deleted, the data couldn’t be removed.
>
> Best,
> Erick
>
> > On Jun 23, 2020, at 12:58 PM, Odysci  wrote:
> >
> > Hi,
> > I've got a solrcloud configuration with 2 shards and 2 replicas each.
> > For some unknown reason, one of the replicas was on "recovery" mode
> > forever, so I decided to create another replica, which went fine.
> > Then I proceeded to delete the old replica (using the SOlr UI). After a
> > while the interface gave me a msg about not being able to connect to the
> > solr node. But once i refreshed it, the old replica was no longer showing
> > in the interface, and the new replica was active.
> > However, the directory in disk for the old replica is still there (and
> it's
> > size is larger than originally).
> > In a previous time when I did this in the exact the same way, the
> directory
> > was removed.
> >
> > My quesion is, can I manually delete the directory for the old replica?
> > Or is there a solr command that will do this cleanly?
> > Thanks
> >
> > Reinaldo
>
>


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Bernd Fehling
I'm following this thread now for a while and I can understand
the wish to change some naming/wording/speech in one or the other
programs but I always get back to the one question:
"Is it the weapon which kills people or the hand controlled by
the mind which fires the weapon?"

The thread started with slave - slavery, then turned over to master
and followed by leader (for me as a german... you know).
What will come next?

And more over, we now discuss about changes in the source code and
due to this there need to be changes to the documentation.
What about the books people wrote about this programs and source code,
should we force this authors to rewrite their books?
May be we should file a request to all web search engines to reject
all stored content about these "banned" words?
And contact all web hosters about providing bad content.

To sum things up, within my 40 years of computer science and writing
programs I have never had a nanosecond any thoughts about words
like master, slave, leader, ... other than thinking about computers
and programming.

Just my 2 cents.

For what it is worth, I tend to guide/follower if there "must be" any changes.

Bernd


Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Jan Høydahl
Master/slave and standalone are used interchangably to mean zk-less Solr. I 
have a feeling that master/slave is the more popular of the two, but personally 
I have been using both.

Jan

> 24. jun. 2020 kl. 06:34 skrev Noble Paul :
> 
> Do we even call it the master/slave mode? I thought we had 2 modes
> 
> * Standalone mode
> * SolrCloud mode
> 
> On Wed, Jun 24, 2020 at 3:00 AM Tomás Fernández Löbbe
>  wrote:
>> 
>> I agree in general with what Trey and Jan said and have suggested. I
>> personally like to use "leader/follower". It's true that somewhat collides
>> with SolrCloud terminology, but that's not a problem IMO, now that replica
>> types exist, the “role” of the replica (leader vs. non-leader/follower)
>> doesn’t specify the internals of how they behave, the replica type defines
>> that. So, in a non-SolrCloud world, they would still be leader/followers
>> regardless of how they perform that role.
>> 
>> I also agree that the name of the role is not that important, more the
>> "mode" of the architecture needs to be renamed. We tend to refer to
>> "SolrCloud mode" and "Master/Slave mode", the main part in all this (IMO)
>> is to change that "mode" name. I kind of like Trey's suggestion of "Managed
>> Clustering" vs. "Manual Clustering" Mode (Or "managed" vs "manual"), but
>> still haven't made up my mind (especially the fact that "manual" usually
>> doesn't really mean "manual", is just "you build your tools”)…
>> 
>> On Fri, Jun 19, 2020 at 1:38 PM Walter Underwood 
>> wrote:
>> 
 On Jun 19, 2020, at 7:48 AM, Phill Campbell
>>>  wrote:
 
 Delegator - Handler
 
 A common pattern we are all aware of. Pretty simple.
>>> 
>>> The Solr master does not delegate and the slave does not handle.
>>> The master is a server that handles replication requests from the
>>> slave.
>>> 
>>> Delegator/handler is a common pattern, but it is not the pattern
>>> that describes traditional Solr replication.
>>> 
>>> wunder
>>> Walter Underwood
>>> wun...@wunderwood.org
>>> http://observer.wunderwood.org/  (my blog)
>>> 
>>> 
> 
> 
> 
> -- 
> -
> Noble Paul



Re: Unexpected results using Block Join Parent Query Parser

2020-06-24 Thread Jan Høydahl
I guess the key question here is whether «parent» in BlockJoin is strictly 
top-level parent/root, i.e. class:composition for the example in this tread? Or 
can {!parent} parser also be used to select the «child» level in a 
child/grandchild relationship inside a block?

Jan

> 24. jun. 2020 kl. 11:36 skrev Tor-Magne Stien Hagen :
> 
> Thanks for your answer,
> 
> What kind of rules exists for the which clause? In other words, how can you 
> identify parents without using some sort of filtering?
> 
> Tor-Magne Stien Hagen
> 
> -Original Message-
> From: Mikhail Khludnev  
> Sent: Wednesday, June 24, 2020 10:01 AM
> To: solr-user 
> Subject: Re: Unexpected results using Block Join Parent Query Parser
> 
> Hello,
> 
> Please check warning box titled Using which
> https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flucene.apache.org%2Fsolr%2Fguide%2F8_5%2Fother-parsers.html%23block-join-parent-query-parserdata=02%7C01%7Ctsh%40dips.no%7C5fef4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C0%7C637285825378470570sdata=rB356EBZuDmFsTHT3ULcvr47zCcr%2F29XYaGA7%2BJ5HrI%3Dreserved=0
> 
> On Wed, Jun 24, 2020 at 10:01 AM Tor-Magne Stien Hagen  wrote:
> 
>> Hi,
>> 
>> I have indexed the following nested document in Solr:
>> 
>> {
>>"id": "1",
>>"class": "composition",
>>"children": [
>>{
>>"id": "2",
>>"class": "section",
>>"children": [
>>{
>>"id": "3",
>>"class": "observation"
>>}
>>]
>>},
>>{
>>"id": "4",
>>"class": "section",
>>"children": [
>>{
>>"id": "5",
>>"class": "instruction"
>>}
>>]
>>}
>>]
>> }
>> 
>> Given the following query:
>> 
>> {!parent which='id:4'}id:3
>> 
>> I expect the result to be empty as document 3 is not a child document 
>> of document 4.
>> 
>> To reproduce, use the docker container provided here:
>> https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
>> ub.com%2Ftormsh%2FSolr-Exampledata=02%7C01%7Ctsh%40dips.no%7C5fef
>> 4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C
>> 0%7C637285825378470570sdata=OyjBalFeXfb0W2euL76L%2BNyRDg9ukvT8TNI
>> aODCmV30%3Dreserved=0
>> 
>> Have I misunderstood something regarding the Block Join Parent Query 
>> Parser?
>> 
>> Tor-Magne Stien Hagen
>> 
>> 
> 
> --
> Sincerely yours
> Mikhail Khludnev



Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Paras Lehana
Distributer/Fetcher?

On Wed, 24 Jun 2020 at 10:04, Noble Paul  wrote:

> Do we even call it the master/slave mode? I thought we had 2 modes
>
> * Standalone mode
> * SolrCloud mode
>
> On Wed, Jun 24, 2020 at 3:00 AM Tomás Fernández Löbbe
>  wrote:
> >
> > I agree in general with what Trey and Jan said and have suggested. I
> > personally like to use "leader/follower". It's true that somewhat
> collides
> > with SolrCloud terminology, but that's not a problem IMO, now that
> replica
> > types exist, the “role” of the replica (leader vs. non-leader/follower)
> > doesn’t specify the internals of how they behave, the replica type
> defines
> > that. So, in a non-SolrCloud world, they would still be leader/followers
> > regardless of how they perform that role.
> >
> > I also agree that the name of the role is not that important, more the
> > "mode" of the architecture needs to be renamed. We tend to refer to
> > "SolrCloud mode" and "Master/Slave mode", the main part in all this (IMO)
> > is to change that "mode" name. I kind of like Trey's suggestion of
> "Managed
> > Clustering" vs. "Manual Clustering" Mode (Or "managed" vs "manual"), but
> > still haven't made up my mind (especially the fact that "manual" usually
> > doesn't really mean "manual", is just "you build your tools”)…
> >
> > On Fri, Jun 19, 2020 at 1:38 PM Walter Underwood 
> > wrote:
> >
> > > > On Jun 19, 2020, at 7:48 AM, Phill Campbell
> > >  wrote:
> > > >
> > > > Delegator - Handler
> > > >
> > > > A common pattern we are all aware of. Pretty simple.
> > >
> > > The Solr master does not delegate and the slave does not handle.
> > > The master is a server that handles replication requests from the
> > > slave.
> > >
> > > Delegator/handler is a common pattern, but it is not the pattern
> > > that describes traditional Solr replication.
> > >
> > > wunder
> > > Walter Underwood
> > > wun...@wunderwood.org
> > > http://observer.wunderwood.org/  (my blog)
> > >
> > >
>
>
>
> --
> -
> Noble Paul
>


-- 
-- 
Regards,

*Paras Lehana* [65871]
Development Engineer, *Auto-Suggest*,
IndiaMART InterMESH Ltd,

11th Floor, Tower 2, Assotech Business Cresterra,
Plot No. 22, Sector 135, Noida, Uttar Pradesh, India 201305

Mob.: +91-9560911996
Work: 0120-4056700 | Extn:
*1196*

-- 
*
*

 


RE: Unexpected results using Block Join Parent Query Parser

2020-06-24 Thread Tor-Magne Stien Hagen
Thanks for your answer,

What kind of rules exists for the which clause? In other words, how can you 
identify parents without using some sort of filtering?

Tor-Magne Stien Hagen

-Original Message-
From: Mikhail Khludnev  
Sent: Wednesday, June 24, 2020 10:01 AM
To: solr-user 
Subject: Re: Unexpected results using Block Join Parent Query Parser

Hello,

Please check warning box titled Using which
https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flucene.apache.org%2Fsolr%2Fguide%2F8_5%2Fother-parsers.html%23block-join-parent-query-parserdata=02%7C01%7Ctsh%40dips.no%7C5fef4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C0%7C637285825378470570sdata=rB356EBZuDmFsTHT3ULcvr47zCcr%2F29XYaGA7%2BJ5HrI%3Dreserved=0

On Wed, Jun 24, 2020 at 10:01 AM Tor-Magne Stien Hagen  wrote:

> Hi,
>
> I have indexed the following nested document in Solr:
>
> {
> "id": "1",
> "class": "composition",
> "children": [
> {
> "id": "2",
> "class": "section",
> "children": [
> {
> "id": "3",
> "class": "observation"
> }
> ]
> },
> {
> "id": "4",
> "class": "section",
> "children": [
> {
> "id": "5",
> "class": "instruction"
> }
> ]
> }
> ]
> }
>
> Given the following query:
>
> {!parent which='id:4'}id:3
>
> I expect the result to be empty as document 3 is not a child document 
> of document 4.
>
> To reproduce, use the docker container provided here:
> https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> ub.com%2Ftormsh%2FSolr-Exampledata=02%7C01%7Ctsh%40dips.no%7C5fef
> 4e9a68cc41c72fd208d81814e93e%7C2f46c9197c11446584b2e354fb809979%7C0%7C
> 0%7C637285825378470570sdata=OyjBalFeXfb0W2euL76L%2BNyRDg9ukvT8TNI
> aODCmV30%3Dreserved=0
>
> Have I misunderstood something regarding the Block Join Parent Query 
> Parser?
>
> Tor-Magne Stien Hagen
>
>

--
Sincerely yours
Mikhail Khludnev


RE: Java - setting multi-valued fields

2020-06-24 Thread Eivind Hodneland
Hi,

Thanks for your input.
However, this approach did not work either, it gave the same result as 
previously.

Is there perhaps a different approach that could be used, other methods etc. ?


Uptime Consulting | Eivind Hodneland | Senior Consultant | Munchs gate 7, 
NO-0165 Oslo, Norway
Tel: +47 22 33 71 00 | Mob: +47 971 76 083 | 
eivind.hodnel...@uptimeconsulting.no  | www.uptimeconsulting.no
--
Search and Big Data solutions
Software Development
IT outsourcing services and consultancy
 




-Original Message-
From: kumar gaurav  
Sent: onsdag 17. juni 2020 19:02
To: solr-user@lucene.apache.org
Subject: Re: Java - setting multi-valued fields

HI

Example:

String[] values = new String[] {“value 1”, “value 2” };

inputDoc.setField (multiFieldName, values);


Can you try once to change the array to list ?

List values = new ArrayList<>();

values.add("value 1");

values.add("value 2");

inputDoc.setField (multiFieldName, values);



regards

Kumar Gaurav







On Wed, Jun 17, 2020 at 8:33 PM Eivind Hodneland < 
eivind.hodnel...@uptimeconsulting.no> wrote:

> Hi,
>
>
>
> My customer has a Solr index with a large amount of fields, many of 
> these are multivalued (type=”string”, multiValued=”true”).
>
>
>
> I am having problems with setting the values for these fields in my 
> Java update processors.
>
> Example:
>
> String[] values = new String[] {“value 1”, “value 2” };
>
> inputDoc.setField (multiFieldName, values);
>
>
>
> However, only “value 1” is present in the index after updating.
>
> What is the best / correct way to make this work?
>
>
>
>
>
>
>
> Uptime Consulting | Eivind Hodneland | Senior Consultant | Munchs gate 
> 7,
> NO-0165 Oslo, Norway
>
> Tel: +47 22 33 71 00 | Mob: +47 971 76 083 | 
> eivind.hodnel...@uptimeconsulting.no  | www.uptimeconsulting.no
>
> --
>
> Search and Big Data solutions
>
> Software Development
>
> IT outsourcing services and consultancy
>
>
>
> [image: 4180EEB7]
>
>
>


Re: Sorting in other collection in Solr 8.5.1

2020-06-24 Thread vishal patel
Forget this URL : 
https://drive.google.com/file/d/1Pw94topEJfarEA_P5JeiDOsHlvyACPK0/view

My listing looks like : 
https://drive.google.com/file/d/1K51qUrrdbTV65Qh83ZQD5Dm94XNte7xa/view
[https://lh3.googleusercontent.com/9JUR-mlCdbBTuqPtZa4sI_5njATXp28ERE4jmc_nBr5s9mHfAiJeHd1VI2t85OY=w1200-h630-p]
listing.PNG
drive.google.com


Sent from Outlook

From: vishal patel 
Sent: Wednesday, June 24, 2020 1:33 PM
To: solr-user@lucene.apache.org 
Subject: Re: Sorting in other collection in Solr 8.5.1

My listing looks like : 
https://drive.google.com/file/d/1Pw94topEJfarEA_P5JeiDOsHlvyACPK0/view
ID, Form Title and Created Date are coming from form collection while My Tasks 
is coming from actionscomments collection.
My listing fields are mapped with schema fields below:
collection name Listing Fields  Scehma Fields
forms   ID  id
Form Title  title
Created Dateform_creation_date
actionscomments ID  id
My Taskstitle

My form collection has forms related data and actionscomments collection has 
action related data. One form has many actions. So, we put a separate 
collection.

My storage like below:
actionscomments collections data
 {
"id":"ACTC10589345_656644",
"title":"Respond"
"form_id":"10252140",
"project_id":"102578"
 },
  {
"id":"ACTC10589345_709647",
"title":"For Information"
"form_id":"10252141",
"project_id":"102578"
 }

form collection data
  {
"id":"RFI019(2)",
"title":"QA Test 01",
"form_creation_date":"25-Jun-2018"
"form_id":"10252140",
"project_id":"102578"
  },
  {
"id":"RFI011(3)",
"title":"Test_RFI_Bolgs",
"form_creation_date":"29-Nov-2017"
"form_id":"10252141",
"project_id":"102578"
  },
  {
"id":"RFI015",
"title":"Check",
"form_creation_date":"20-Nov-2017"
"form_id":"10252142",
"project_id":"102579"
  }


 As of now one form has only one action but possible one form has many actions.
  For getting listing of project_id:102578, First we sort in ascending in 
actionscomments with field title so we get form_id in order 10252141 and 
10252140.
  Base on that ordered form_id we want same order data in form collection for 
project_id:102578

I am expecting listing like first form "Test_RFI_Bolgs" and second "QA Test 01".

 How can we achieve this?

Regards,
Vishal Patel

From: Erick Erickson 
Sent: Tuesday, June 23, 2020 5:07 PM
To: solr-user@lucene.apache.org 
Subject: Re: Sorting in other collection in Solr 8.5.1

You have two separate collections with dissimilar data, so what
does “sorting them in the same order” mean? Your example
sorts on title, so why can’t you sort them both on title? That won’t
work of course for any field that isn’t identical in both
collections.

These are actually pretty small collections. It sounds like you’re
doing what in SQL terms would be a sub-select. Have you considered
putting all the records (with different types) in the same collection
and using something like join queries or RerankQParser?

Don’t know how that fits into your model….

Best,
Erick

> On Jun 23, 2020, at 2:06 AM, vishal patel  
> wrote:
>
> Hi
>
> I am upgrading Solr 8.5.1. I have created 2 shards and each has one replica.
> I have created 2 collection one is form and second is actionscomment.forms 
> related data are stored in form collection and actions of that forms are 
> stored in actionscomment collection.
> There are 10 lakh documents in form and 50 lakh documents in actionscomment 
> collection.
>
> form schema.xml
>  multiValued="false" docValues="true"/>
>  docValues="true"/>
>  docValues="true"/>
>  omitNorms="true"/>
>  docValues="true"/>
>
> actionscomment schema.xml
>  multiValued="false" docValues="true"/>
>  docValues="true"/>
>  docValues="true"/>
> 
>  omitNorms="true"/>
>  docValues="true"/>
>  docValues="true"/>
>
>
>
> We are showing form listing using form and actionscomment collection. We are 
> showing only 250 records in form listing page. Our form listing columns are 
> id,title,form created date and action names. id,title,form created date and 
> action names come from form collection and action names come from 
> actionscomment collection. We want to give the sorting functionality for all 
> columns.It is easy to sort id, title and form created date because it is in 
> same collection.
>
> For action name sorting, I execute 2 query. First I execute query in 
> actionscomment collection with sort field title and get the form_id list and 
> using those form_ids I execute in form collection. But I do not get the 
> proper sorting. Sometimes I got so many form ids and my second query length 
> becomes larger.
> How can I get data from form 

Re: Sorting in other collection in Solr 8.5.1

2020-06-24 Thread vishal patel
My listing looks like : 
https://drive.google.com/file/d/1Pw94topEJfarEA_P5JeiDOsHlvyACPK0/view
ID, Form Title and Created Date are coming from form collection while My Tasks 
is coming from actionscomments collection.
My listing fields are mapped with schema fields below:
collection name Listing Fields  Scehma Fields
forms   ID  id
Form Title  title
Created Dateform_creation_date
actionscomments ID  id
My Taskstitle

My form collection has forms related data and actionscomments collection has 
action related data. One form has many actions. So, we put a separate 
collection.

My storage like below:
actionscomments collections data
 {
"id":"ACTC10589345_656644",
"title":"Respond"
"form_id":"10252140",
"project_id":"102578"
 },
  {
"id":"ACTC10589345_709647",
"title":"For Information"
"form_id":"10252141",
"project_id":"102578"
 }

form collection data
  {
"id":"RFI019(2)",
"title":"QA Test 01",
"form_creation_date":"25-Jun-2018"
"form_id":"10252140",
"project_id":"102578"
  },
  {
"id":"RFI011(3)",
"title":"Test_RFI_Bolgs",
"form_creation_date":"29-Nov-2017"
"form_id":"10252141",
"project_id":"102578"
  },
  {
"id":"RFI015",
"title":"Check",
"form_creation_date":"20-Nov-2017"
"form_id":"10252142",
"project_id":"102579"
  }


 As of now one form has only one action but possible one form has many actions.
  For getting listing of project_id:102578, First we sort in ascending in 
actionscomments with field title so we get form_id in order 10252141 and 
10252140.
  Base on that ordered form_id we want same order data in form collection for 
project_id:102578

I am expecting listing like first form "Test_RFI_Bolgs" and second "QA Test 01".

 How can we achieve this?

Regards,
Vishal Patel

From: Erick Erickson 
Sent: Tuesday, June 23, 2020 5:07 PM
To: solr-user@lucene.apache.org 
Subject: Re: Sorting in other collection in Solr 8.5.1

You have two separate collections with dissimilar data, so what
does “sorting them in the same order” mean? Your example
sorts on title, so why can’t you sort them both on title? That won’t
work of course for any field that isn’t identical in both
collections.

These are actually pretty small collections. It sounds like you’re
doing what in SQL terms would be a sub-select. Have you considered
putting all the records (with different types) in the same collection
and using something like join queries or RerankQParser?

Don’t know how that fits into your model….

Best,
Erick

> On Jun 23, 2020, at 2:06 AM, vishal patel  
> wrote:
>
> Hi
>
> I am upgrading Solr 8.5.1. I have created 2 shards and each has one replica.
> I have created 2 collection one is form and second is actionscomment.forms 
> related data are stored in form collection and actions of that forms are 
> stored in actionscomment collection.
> There are 10 lakh documents in form and 50 lakh documents in actionscomment 
> collection.
>
> form schema.xml
>  multiValued="false" docValues="true"/>
>  docValues="true"/>
>  docValues="true"/>
>  omitNorms="true"/>
>  docValues="true"/>
>
> actionscomment schema.xml
>  multiValued="false" docValues="true"/>
>  docValues="true"/>
>  docValues="true"/>
> 
>  omitNorms="true"/>
>  docValues="true"/>
>  docValues="true"/>
>
>
>
> We are showing form listing using form and actionscomment collection. We are 
> showing only 250 records in form listing page. Our form listing columns are 
> id,title,form created date and action names. id,title,form created date and 
> action names come from form collection and action names come from 
> actionscomment collection. We want to give the sorting functionality for all 
> columns.It is easy to sort id, title and form created date because it is in 
> same collection.
>
> For action name sorting, I execute 2 query. First I execute query in 
> actionscomment collection with sort field title and get the form_id list and 
> using those form_ids I execute in form collection. But I do not get the 
> proper sorting. Sometimes I got so many form ids and my second query length 
> becomes larger.
> How can I get data from form collection same as order of form id list came 
> from actionscomment?
>
> Regards,
> Vishal Patel
> 



Re: Unexpected results using Block Join Parent Query Parser

2020-06-24 Thread Mikhail Khludnev
Hello,

Please check warning box titled Using which
https://lucene.apache.org/solr/guide/8_5/other-parsers.html#block-join-parent-query-parser

On Wed, Jun 24, 2020 at 10:01 AM Tor-Magne Stien Hagen  wrote:

> Hi,
>
> I have indexed the following nested document in Solr:
>
> {
> "id": "1",
> "class": "composition",
> "children": [
> {
> "id": "2",
> "class": "section",
> "children": [
> {
> "id": "3",
> "class": "observation"
> }
> ]
> },
> {
> "id": "4",
> "class": "section",
> "children": [
> {
> "id": "5",
> "class": "instruction"
> }
> ]
> }
> ]
> }
>
> Given the following query:
>
> {!parent which='id:4'}id:3
>
> I expect the result to be empty as document 3 is not a child document of
> document 4.
>
> To reproduce, use the docker container provided here:
> https://github.com/tormsh/Solr-Example
>
> Have I misunderstood something regarding the Block Join Parent Query
> Parser?
>
> Tor-Magne Stien Hagen
>
>

-- 
Sincerely yours
Mikhail Khludnev


Re: solr fq with contains not returning any results

2020-06-24 Thread Jörn Franke
I don’t know your data, but could it be that you tokenize differently ?

Why do you do the wildcard search at all? Maybe a different tokenizing strategy 
can bring you more effieciently results? Depends on what you need to achieve of 
course ... 

> Am 24.06.2020 um 05:37 schrieb yaswanth kumar :
> 
> I am using solr 8.2
> 
> And when trying to do fq=auto_nsallschools:*bostonschool*, the data is not
> being returned. But if I do the same in solr 5.5 (which I already have and
> we are in process of migrating to 8.2 ) its returning results.
> 
> if I do fq=auto_nsallschools:bostonschool
> or
> fq=auto_nsallschools:bostonschool* its returning results but when I try
> with contains like described above or fq=auto_nsallschools:*bostonschool
> (ends with) it's not returning any results.
> 
> The field which we are already using is a copy field and multi valued, am I
> doing something wrong? or does 8.2 need some adjustment in the configs?
> 
> Here is the schema
> 
>  multiValued="true"/>
>  stored="false" multiValued="true"/>
> 
> 
> 
>  positionIncrementGap="100">
>  
>
>
>
>  
>
> 
> 
>  positionIncrementGap="100">
>  
> pattern="(\)" replacement="_and_" />
> pattern="(\$)" replacement="_dollar_" />
> pattern="(\*)" replacement="_star_" />
> pattern="(\+)" replacement="_plus_" />
> pattern="(\-)" replacement="_minus_" />
> pattern="(\#)" replacement="_sharp_" />
> pattern="(\%)" replacement="_percent_" />
> pattern="(\=)" replacement="_equal_" />
> pattern="(\)" replacement="_lessthan_" />
> pattern="(\)" replacement="_greaterthan_" />
> pattern="(\€)" replacement="_euro_" />
> pattern="(\¢)" replacement="_cent_" />
> pattern="(\£)" replacement="_pound_" />
> pattern="(\¥)" replacement="_yuan_" />
> pattern="(\©)" replacement="_copyright_" />
> pattern="(\®)" replacement="_registered_" />
> pattern="(\|)" replacement="_pipe_" />
> pattern="(\^)" replacement="_caret_" />
> pattern="(\~)" replacement="_tilt_" />
> pattern="(\™)" replacement="_treadmark_" />
> pattern="(\@)" replacement="_at_" />
> pattern="(\)" replacement=" _doublequote_ " />
> pattern="(\()" replacement=" _leftparentheses_ " />
> pattern="(\))" replacement=" _rightparentheses_ " />
> pattern="(\{)" replacement="_leftcurlybracket_" />
> pattern="(\})" replacement="_rightcurlybracket_" />
> pattern="(\[)" replacement="_leftsquarebracket_" />
> pattern="(\])" replacement="_rightsquarebracket_" />
> synonyms="punctuation-whitelist.txt" ignoreCase="true" expand="false"/>
>
>
>
>  
>
> 
> Thanks,
> 
> -- 
> Thanks & Regards,
> Yaswanth Kumar Konathala.
> yaswanth...@gmail.com


Unexpected results using Block Join Parent Query Parser

2020-06-24 Thread Tor-Magne Stien Hagen
Hi,

I have indexed the following nested document in Solr:

{
"id": "1",
"class": "composition",
"children": [
{
"id": "2",
"class": "section",
"children": [
{
"id": "3",
"class": "observation"
}
]
},
{
"id": "4",
"class": "section",
"children": [
{
"id": "5",
"class": "instruction"
}
]
}
]
}

Given the following query:

{!parent which='id:4'}id:3

I expect the result to be empty as document 3 is not a child document of 
document 4.

To reproduce, use the docker container provided here:
https://github.com/tormsh/Solr-Example

Have I misunderstood something regarding the Block Join Parent Query Parser?

Tor-Magne Stien Hagen