clintropolis commented on a change in pull request #6988: [Improvement] 
historical fast restart by lazy load columns metadata(20X faster)
URL: https://github.com/apache/incubator-druid/pull/6988#discussion_r350568250
 
 

 ##########
 File path: docs/configuration/index.md
 ##########
 @@ -1359,6 +1359,7 @@ These Historical configurations can be defined in the 
`historical/runtime.proper
 |`druid.segmentCache.announceIntervalMillis`|How frequently to announce 
segments while segments are loading from cache. Set this value to zero to wait 
for all segments to be loaded before announcing.|5000 (5 seconds)|
 |`druid.segmentCache.numLoadingThreads`|How many segments to drop or load 
concurrently from deep storage. Note that the work of loading segments involves 
downloading segments from deep storage, decompressing them and loading them to 
a memory mapped location. So the work is not all I/O Bound. Depending on CPU 
and network load, one could possibly increase this config to a higher 
value.|Number of cores|
 |`druid.segmentCache.numBootstrapThreads`|How many segments to load 
concurrently during historical startup.|`druid.segmentCache.numLoadingThreads`|
+|`druid.segmentCache.lazyLoadOnStart`|Whether or not to load segment columns 
metadata lazily during historical startup. Set this value to true will have a 
definite improvement during historical startup with HDD(may be 20 times 
faster). This features is unnecessary for historical with SSD, because it is 
fast enough.|false|
 
 Review comment:
   >...(may be 20 times faster)...
   
   I don't think we should make any promises about performance here. Mentioning 
a dramatic improvement in Historical start-up time is enough, and I think we 
can also leave out the part about SSDs. Instead, we should mention the 
implication that the first query to process a lazily loaded segment will incur 
the cost that was previously included in the start-up time. How about 
something like:
   
   > Whether or not to load segment columns metadata lazily during historical 
startup. When set to true, Historical startup time will be dramatically 
improved by deferring segment loading until the first time that segment takes 
part in a query, which will incur this cost instead.
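   
   For illustration only, enabling the option on a Historical would look 
roughly like the fragment below. This is a sketch that assumes the property 
name stays exactly as written in this diff and that the default remains 
`false`; it belongs in the Historical runtime properties file named in the 
(truncated) diff header above.
   
   ```properties
   # Assumption: property name and default are taken from the diff under review.
   # Defer loading of segment column metadata from startup to first query.
   druid.segmentCache.lazyLoadOnStart=true
   ```
   
   With this set, the start-up cost is not removed, only moved: the first 
query that touches each lazily loaded segment pays it instead.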
   
   

