Re: [PR] Enable background refresh for the scan server tablet metadata cache [accumulo]

via GitHub Sat, 11 May 2024 13:19:07 -0700


ben-manes commented on code in PR #4551:
URL: https://github.com/apache/accumulo/pull/4551#discussion_r1597502374



##########
server/tserver/src/main/java/org/apache/accumulo/tserver/ScanServer.java:
##########
@@ -246,9 +246,30 @@ public ScanServer(ScanServerOpts opts, String[] args) {
         LOG.warn(
             "Tablet metadata caching less than one minute, may cause excessive 
scans on metadata table.");
       }
-      tabletMetadataCache =
-          Caffeine.newBuilder().expireAfterWrite(cacheExpiration, 
TimeUnit.MILLISECONDS)
-              
.scheduler(Scheduler.systemScheduler()).recordStats().build(tabletMetadataLoader);
+
+      // Get the cache refresh percentage property
+      // Value must be less than 100% as 100 or over would effectively disable 
it
+      double cacheRefreshPercentage =
+          
getConfiguration().getFraction(Property.SSERV_CACHED_TABLET_METADATA_REFRESH_PERCENT);
+      Preconditions.checkArgument(cacheRefreshPercentage < cacheExpiration,
+          "Tablet metadata cache refresh percentage is '%s' but must be less 
than 1",
+          cacheRefreshPercentage);
+
+      var builder = Caffeine.newBuilder().expireAfterWrite(cacheExpiration, 
TimeUnit.MILLISECONDS)

Review Comment:
   The idea is to aggregate individual refreshes over a time window, perform 
the bulk operation, and complete the respective `CompletableFuture` with the 
value for the given key. Caffeine doesn't know it is being batched, which 
wouldn't help since most refreshes are triggered from distinct lookups (unlike, 
for instance, getAll). Since it is an async task the cache merely thinks it 
took longer to execute, the worst case is slightly skewed stats (but custom 
stats fixes that). It is a little bit of code wrangling if using 
java.util.concurrent, but very trivial if you can glue together with a reactive 
stream library.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] Enable background refresh for the scan server tablet metadata cache [accumulo]

Reply via email to