[ 
https://issues.apache.org/jira/browse/SOLR-9055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15275315#comment-15275315
 ] 

Mark Miller commented on SOLR-9055:
-----------------------------------

bq. But it looks like during "restore" operation we need to use Lucene API to 
compute the file checksum.

I think it's reading it from the index files, not calculating it? But yeah, I'm 
sure you could recalculate it, though some files may not even have it.

It doesn't seem like a full Directory implementation is required - just the 
ability to read the checksums for a file in the repo (read a header and see if 
it matches a checksum). And of course an impl could just do the same checksum 
computation over itself. It should just be used to be sure we don't treat a 
file with the same name and size as another file the same when the data is 
actually different. This can happen fairly easily with different Lucene indexes.



> Make collection backup/restore extensible
> -----------------------------------------
>
>                 Key: SOLR-9055
>                 URL: https://issues.apache.org/jira/browse/SOLR-9055
>             Project: Solr
>          Issue Type: Task
>            Reporter: Hrishikesh Gadre
>            Assignee: Mark Miller
>         Attachments: SOLR-9055.patch, SOLR-9055.patch
>
>
> SOLR-5750 implemented backup/restore API for Solr. This JIRA is to track the 
> code cleanup/refactoring. Specifically following improvements should be made,
> - Add Solr/Lucene version to check the compatibility between the backup 
> version and the version of Solr on which it is being restored.
> - Add a backup implementation version to check the compatibility between the 
> "restore" implementation and backup format.
> - Introduce a Strategy interface to define how the Solr index data is backed 
> up (e.g. using file copy approach).
> - Introduce a Repository interface to define the file-system used to store 
> the backup data. (currently works only with local file system but can be 
> extended). This should be enhanced to introduce support for "registering" 
> repositories (e.g. HDFS, S3 etc.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to