[ 
https://issues.apache.org/jira/browse/SDAP-127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Frank Greguska updated SDAP-127:
--------------------------------
    Component/s: nexus

> ID should be unique in SOLR schema
> ----------------------------------
>
>                 Key: SDAP-127
>                 URL: https://issues.apache.org/jira/browse/SDAP-127
>             Project: Apache Science Data Analytics Platform
>          Issue Type: Improvement
>          Components: nexus
>            Reporter: Frank Greguska
>            Priority: Major
>
> The "solr_id_s" field is currently the "uniqueKey" for the schema:
> https://github.com/apache/incubator-sdap-nexus/blob/107438af45b479348ffb75a667b276ee3c81f9da/data-access/config/schemas/solr/nexustiles/conf/managed-schema#L200
> This is fine but a lot of the algorithms depend on the simple "id" field for 
> working with tiles (the id field is the same as solr_id_s but without the 
> prefix used for document routing):
> https://github.com/apache/incubator-sdap-nexus/blob/107438af45b479348ffb75a667b276ee3c81f9da/data-access/config/schemas/solr/nexustiles/conf/managed-schema#L120
> If possible, the "id" field should also be marked as unique so that it is 
> impossible to generate tiles with identical "id"s.
> This problem was found because of SLCP ice shelf data where 2 variables from 
> the same granule were being ingested. The ID is generated from the granule 
> name and section spec and an optional 'salt' value. In this case no salt was 
> used (incorrectly) so the tiles were generated with identical "id"s but no 
> error occurred because they had different dataset names which caused the 
> "solr_id_s" field to be unique.
> Not sure if it is possible to have more than one unique field in a SOLR 
> schema.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to