ahanikel commented on PR #1296:
URL: https://github.com/apache/jackrabbit-oak/pull/1296#issuecomment-1946357721

   > Another issue could be if the segment is not in the tar archive that is 
   > being recovered. To find the segment in that case, we need TarReader to check 
   > other archives since we will get a cache miss.
   
   @smiroslav I think that shouldn't happen because the blob reference should 
be either in the same segment or in a segment written before, which should also 
be in the cache.
   
   But I agree that the cache size is a problem. The cache would need to keep not 
only the segments of the current tar file but those of all the tar files. But 
then, that is also true for a separate Map. Wouldn't that Map then grow 
exceptionally big for large repositories?
   
   Can we perhaps find a way to initialise `FileStore#tarFiles` without the 
recovery and run the recovery process after that?


