RE: solr url control
Thank you for your response. Our dev instance is not a cloud but we will be implementing cloud in our staging and production environments. I was afraid you were going to tell me that the substructure was not supported. I was hoping that in the core autodiscovery, it would keep the path. Thanks for your help. -Original Message- From: Shawn Heisey [mailto:apa...@elyograg.org] Sent: Friday, March 2, 2018 6:45 PM To: solr-user@lucene.apache.org Subject: Re: solr url control On 3/2/2018 10:29 AM, Becky Bonner wrote: > We are trying to setup one solr server for several applications each with a > different collection. Is there a way to have have 2 collections under one > folder and the url be something like this: > http://mysolrinstance.com/solr/myParent1/collection1 > http://mysolrinstance.com/solr/myParent1/collection2 > http://mysolrinstance.com/solr/myParent2 > http://mysolrinstance.com/solr/myParent3 No. I am not aware of any way to set up a hierarchy like this. Collections and cores have one identifier for their names. You could use myparent1_collection1 as a name. Implementing such a hierarchy like this would likely be difficult for the dev team, and would probably be a large source of bugs for several releases after it first became available. I don't think a feature like this is likely to happen. Later, you said "We would not want the data from one collection to ever show up in another collection query." That's not ever going to happen unless the software making the query explicitly requests it, and it will need to know details about the indexes in your Solr server to be able to do it successfully. FYI: People who cannot be trusted shouldn't ever have direct access to your Solr installation. Are you running SolrCloud? I ask because if you're not, then the terminology for each index isn't a "collection" ... it's a core. This is a pedantic statement, but you'll get better answers if your terminology is correct. If you were running SolrCloud, it would be extremely unlikely for you to have a directory structure like you describe. SolrCloud normally handles all core creation behind the scenes and isn't going to set up a directory structure like that. Information about how core discovery works: https://wiki.apache.org/solr/Core%20Discovery%20%284.4%20and%20beyond%29#Finding_cores Thanks, Shawn
Re: solr url control
On 3/2/2018 10:29 AM, Becky Bonner wrote: > We are trying to setup one solr server for several applications each with a > different collection. Is there a way to have have 2 collections under one > folder and the url be something like this: > http://mysolrinstance.com/solr/myParent1/collection1 > http://mysolrinstance.com/solr/myParent1/collection2 > http://mysolrinstance.com/solr/myParent2 > http://mysolrinstance.com/solr/myParent3 No. I am not aware of any way to set up a hierarchy like this. Collections and cores have one identifier for their names. You could use myparent1_collection1 as a name. Implementing such a hierarchy like this would likely be difficult for the dev team, and would probably be a large source of bugs for several releases after it first became available. I don't think a feature like this is likely to happen. Later, you said "We would not want the data from one collection to ever show up in another collection query." That's not ever going to happen unless the software making the query explicitly requests it, and it will need to know details about the indexes in your Solr server to be able to do it successfully. FYI: People who cannot be trusted shouldn't ever have direct access to your Solr installation. Are you running SolrCloud? I ask because if you're not, then the terminology for each index isn't a "collection" ... it's a core. This is a pedantic statement, but you'll get better answers if your terminology is correct. If you were running SolrCloud, it would be extremely unlikely for you to have a directory structure like you describe. SolrCloud normally handles all core creation behind the scenes and isn't going to set up a directory structure like that. Information about how core discovery works: https://wiki.apache.org/solr/Core%20Discovery%20%284.4%20and%20beyond%29#Finding_cores Thanks, Shawn
RE: solr url control
So the thing is ... these collections all have very unique schemas and the data are unrelated to each other. And we do a lot of field queries on the content. We would not want the data from one collection to ever show up in another collection query. They are used by different audiences and securities as well. We want to keep them separated. While it is not required that the urls include the myParentX ... it would be consistent with our current implementation that we are upgrading from 4.6 to 7.2. this was a very simple task under apache but I cant figure out how to do this in solr 7 -Original Message- From: Becky Bonner Sent: Friday, March 2, 2018 1:11 PM To: 'solr-user@lucene.apache.org' <solr-user@lucene.apache.org> Subject: RE: solr url control Sorry Webster - I meant to make this a new question ... but accidentally sent it. You wrote From: Webster Homer [mailto:webster.ho...@sial.com] Sent: Friday, March 2, 2018 12:20 PM To: solr-user@lucene.apache.org Subject: Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches Becky, This should have been its own question. Solrcloud is different from standalone solr, the configurations live in Zookeeper and the index is created under SOLR_HOME. You might want to rethink your solution, What problem are you trying to solve with that layout? Would it be solved by creating the Parent1 collection with 2 shards? -Original Message- From: Becky Bonner Sent: Friday, March 2, 2018 11:29 AM To: solr-user@lucene.apache.org Subject: solr url control We are trying to setup one solr server for several applications each with a different collection. Is there a way to have have 2 collections under one folder and the url be something like this: http://mysolrinstance.com/solr/myParent1/collection1 http://mysolrinstance.com/solr/myParent1/collection2 http://mysolrinstance.com/solr/myParent2 http://mysolrinstance.com/solr/myParent3 We organized it like that under the solr folder but the URLs to the collections do not include the "myParent1". This makes the names of my collections more confusing because you can't tell what application they belong to. It wasn’t a problem until we had 2 collections for one of the apps.
RE: solr url control
Sorry Webster - I meant to make this a new question ... but accidentally sent it. You wrote From: Webster Homer [mailto:webster.ho...@sial.com] Sent: Friday, March 2, 2018 12:20 PM To: solr-user@lucene.apache.org Subject: Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches Becky, This should have been its own question. Solrcloud is different from standalone solr, the configurations live in Zookeeper and the index is created under SOLR_HOME. You might want to rethink your solution, What problem are you trying to solve with that layout? Would it be solved by creating the Parent1 collection with 2 shards? -Original Message- From: Becky Bonner Sent: Friday, March 2, 2018 11:29 AM To: solr-user@lucene.apache.org Subject: solr url control We are trying to setup one solr server for several applications each with a different collection. Is there a way to have have 2 collections under one folder and the url be something like this: http://mysolrinstance.com/solr/myParent1/collection1 http://mysolrinstance.com/solr/myParent1/collection2 http://mysolrinstance.com/solr/myParent2 http://mysolrinstance.com/solr/myParent3 We organized it like that under the solr folder but the URLs to the collections do not include the "myParent1". This makes the names of my collections more confusing because you can't tell what application they belong to. It wasn’t a problem until we had 2 collections for one of the apps.