RE: solr url control

2018-03-05 Thread Becky Bonner
Thank you for your response. Our dev instance is not running SolrCloud, but we 
will be implementing SolrCloud in our staging and production environments. I was 
afraid you were going to tell me that the substructure was not supported. I was 
hoping that core autodiscovery would keep the path. Thanks for your help. 




Re: solr url control

2018-03-02 Thread Shawn Heisey
On 3/2/2018 10:29 AM, Becky Bonner wrote:
> We are trying to set up one Solr server for several applications, each with a 
> different collection.  Is there a way to have 2 collections under one 
> folder and the url be something like this:
> http://mysolrinstance.com/solr/myParent1/collection1
> http://mysolrinstance.com/solr/myParent1/collection2
> http://mysolrinstance.com/solr/myParent2
> http://mysolrinstance.com/solr/myParent3

No. I am not aware of any way to set up a hierarchy like this. 
Collections and cores have one identifier for their names.  You could
use myparent1_collection1 as a name.
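
A minimal sketch of that naming convention, assuming the client builds its
own URLs in Python. The helpers `flat_core_name` and `core_url` are
hypothetical, and the lowercase-plus-underscore rule is only one possible
convention; all Solr needs is a single flat identifier.

```python
from typing import Optional

BASE_URL = "http://mysolrinstance.com/solr"  # host taken from the question


def flat_core_name(parent: str, collection: Optional[str] = None) -> str:
    """Fold a parent/collection pair into one flat identifier (hypothetical rule)."""
    parts = [parent] if collection is None else [parent, collection]
    return "_".join(p.strip().lower() for p in parts)


def core_url(parent: str, collection: Optional[str] = None,
             handler: str = "select") -> str:
    """Build the request URL for the flat name; there is no nested path segment."""
    return f"{BASE_URL}/{flat_core_name(parent, collection)}/{handler}"


print(core_url("myParent1", "collection1"))
print(core_url("myParent2"))
```

The application name stays visible in the core name, so it is still clear
which application each index belongs to.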

Implementing a hierarchy like this would likely be difficult for the
dev team, and would probably be a large source of bugs for several
releases after it first became available.  I don't think a feature like
this is likely to happen.

Later, you said "We would not want the data from one collection to ever
show up in another collection query."  That's not ever going to happen
unless the software making the query explicitly requests it, and it will
need to know details about the indexes in your Solr server to be able to
do it successfully.  FYI: People who cannot be trusted shouldn't ever
have direct access to your Solr installation.
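
To illustrate what "explicitly requests it" can look like: in SolrCloud, a
request may name additional collections through the `collection` request
parameter. The sketch below only builds and inspects URLs; the helper names
and the collection names are hypothetical, and nothing is sent to a server.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

BASE_URL = "http://mysolrinstance.com/solr"  # host taken from the question


def select_url(collection, query, also_search=()):
    """Build a /select URL; other collections are included only when asked for."""
    params = {"q": query}
    if also_search:
        # The client has to know the other collection names to list them here.
        params["collection"] = ",".join([collection, *also_search])
    return f"{BASE_URL}/{collection}/select?{urlencode(params)}"


def searched_collections(url):
    """Report which collections a URL built by select_url would search."""
    parts = urlsplit(url)
    default = parts.path.split("/")[2]  # path is /solr/<collection>/select
    explicit = parse_qs(parts.query).get("collection")
    return explicit[0].split(",") if explicit else [default]


print(searched_collections(select_url("collection1", "title:solr")))
print(searched_collections(
    select_url("collection1", "title:solr", also_search=["collection2"])))
```

Without the extra parameter, the request stays inside the one collection
named in the URL path.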

Are you running SolrCloud?  I ask because if you're not, then the
terminology for each index isn't a "collection" ... it's a core.  This
is a pedantic statement, but you'll get better answers if your
terminology is correct.

If you were running SolrCloud, it would be extremely unlikely for you to
have a directory structure like you describe.  SolrCloud normally
handles all core creation behind the scenes and isn't going to set up a
directory structure like that.

Information about how core discovery works:

https://wiki.apache.org/solr/Core%20Discovery%20%284.4%20and%20beyond%29#Finding_cores
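
A rough Python imitation of the discovery rule, to show why the parent
folder never reaches the URL. This is a sketch, not Solr's code: it assumes
that every directory containing a core.properties file is a core, that the
core name is the `name` property or else the directory's own name, and that
the search does not descend into a core once one is found. The function
names and the demo layout are made up for illustration.

```python
import os
import tempfile
from pathlib import Path


def read_properties(path: Path) -> dict:
    """Parse simple key=value lines, skipping blanks and # comments."""
    props = {}
    for line in path.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def discover_cores(solr_home: Path) -> dict:
    """Map core name -> core directory for every core.properties under solr_home."""
    cores = {}
    for dirpath, dirnames, filenames in os.walk(solr_home):
        if "core.properties" in filenames:
            core_dir = Path(dirpath)
            props = read_properties(core_dir / "core.properties")
            # Only the last path component (or an explicit name) is used.
            cores[props.get("name", core_dir.name)] = core_dir
            dirnames.clear()  # do not look for cores nested inside a core
    return cores


def demo() -> list:
    """Recreate the layout from the question in a temporary directory."""
    with tempfile.TemporaryDirectory() as tmp:
        home = Path(tmp)
        for rel in ("myParent1/collection1", "myParent1/collection2", "myParent2"):
            core_dir = home / rel
            core_dir.mkdir(parents=True)
            (core_dir / "core.properties").write_text("")
        return sorted(discover_cores(home))


print(demo())  # ['collection1', 'collection2', 'myParent2']
```

The cores under myParent1 are found, but "myParent1" itself is dropped from
their names, which matches the URLs described in the original question.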

Thanks,
Shawn



RE: solr url control

2018-03-02 Thread Becky Bonner
So the thing is ... these collections all have distinct schemas, and the data 
are unrelated to each other.  We also do a lot of field queries on the content.  
We would not want the data from one collection to ever show up in another 
collection query.  They are used by different audiences and have different 
security requirements as well.  We want to keep them separated.  

While it is not required that the URLs include the myParentX ... it would be 
consistent with our current implementation, which we are upgrading from 4.6 to 
7.2.  This was a very simple task under Apache, but I can't figure out how to 
do it in Solr 7.



RE: solr url control

2018-03-02 Thread Becky Bonner
Sorry Webster - I meant to make this a new question ... but accidentally sent 
it as a reply. You wrote:
From: Webster Homer [mailto:webster.ho...@sial.com] 
Sent: Friday, March 2, 2018 12:20 PM
To: solr-user@lucene.apache.org
Subject: Re: NRT replicas miss hits and return duplicate hits when paging 
solrcloud searches

Becky,
This should have been its own question.

SolrCloud is different from standalone Solr: the configurations live in 
ZooKeeper and the index is created under SOLR_HOME. You might want to rethink 
your solution. What problem are you trying to solve with that layout? Would it 
be solved by creating the Parent1 collection with 2 shards?

-Original Message-
From: Becky Bonner 
Sent: Friday, March 2, 2018 11:29 AM
To: solr-user@lucene.apache.org
Subject: solr url control

We are trying to set up one Solr server for several applications, each with a 
different collection.  Is there a way to have 2 collections under one 
folder and the url be something like this:
http://mysolrinstance.com/solr/myParent1/collection1
http://mysolrinstance.com/solr/myParent1/collection2
http://mysolrinstance.com/solr/myParent2
http://mysolrinstance.com/solr/myParent3


We organized it like that under the solr folder but the URLs to the collections 
do not include the "myParent1".
This makes the names of my collections more confusing because you can't tell 
what application they belong to.  It wasn’t a problem until we had 2 
collections for one of the apps.