Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

via GitHub Mon, 01 Sep 2025 05:49:22 -0700


shubhamvishu commented on PR #14963:
URL: https://github.com/apache/lucene/pull/14963#issuecomment-3242215130


   OK, I ran the `luceneutil` benchmarks and I see a huge improvement in the 
indexing throughput with this PR compared to baseline(without this change). I 
see an almost **`4x`** improvement in the indexing time here.
   
   *Note : The improvement in the latency or CPUTime seems to be driven by the 
less no. of segments which also very slightly impact the recall as we know.*
   
   CC - @benwtrent @msokolov @jpountz @vigyasharma 
   
   #### With `HNSW_GRAPH_THRESHOLD` = 10
   
   **Baseline**
   ```
   Results:
   recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  
vec_disk(MB)  vec_RAM(MB)  indexType
    0.515       11.084  11.071        0.999  500000   100      50       64      
  250     4 bits    120.51       4149.14             7         1690.12      
1649.857      185.013       HNSW
    0.875        9.965   9.946        0.998  500000   100      50       64      
  250     7 bits    119.18       4195.23             3         1871.00      
1832.962      368.118       HNSW
    0.978       19.637  19.621        0.999  500000   100      50       64      
  250         no    127.63       3917.51             8         1501.98      
1464.844     1464.844       HNSW
    ```
    
   **Candidate**
   ```
   Results:
   recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  
vec_disk(MB)  vec_RAM(MB)  indexType
    0.517        5.933   5.914        0.997  500000   100      50       64      
  250     4 bits     30.34      16482.07             2         1694.34      
1649.857      185.013       HNSW
    0.871        9.655   9.635        0.998  500000   100      50       64      
  250     7 bits     27.86      17945.59             3         1869.52      
1832.962      368.118       HNSW
    0.961       11.280  11.269        0.999  500000   100      50       64      
  250         no     31.16      16046.73             3         1503.46      
1464.844     1464.844       HNSW
    ```
    
    --------------------------
    
    #### With `HNSW_GRAPH_THRESHOLD` = 100
    
   **Baseline**
   ```
   Results:
   recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  
vec_disk(MB)  vec_RAM(MB)  indexType
    0.515       11.101  11.074        0.998  500000   100      50       64      
  250     4 bits    118.33       4225.40             7         1690.02      
1649.857      185.013       HNSW
    0.874       10.199  10.176        0.998  500000   100      50       64      
  250     7 bits    118.18       4230.83             3         1871.16      
1832.962      368.118       HNSW
    0.977       19.990  19.979        0.999  500000   100      50       64      
  250         no    126.92       3939.61             8         1501.95      
1464.844     1464.844       HNSW
   ```
   **Candidate**
   ```
   Results:
   recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  
vec_disk(MB)  vec_RAM(MB)  indexType
    0.517        5.900   5.882        0.997  500000   100      50       64      
  250     4 bits     30.13      16596.96             2         1694.14      
1649.857      185.013       HNSW
    0.872        9.680   9.660        0.998  500000   100      50       64      
  250     7 bits     28.58      17495.98             3         1869.05      
1832.962      368.118       HNSW
    0.964       11.640  11.619        0.998  500000   100      50       64      
  250         no     27.25      18349.97             3         1502.64      
1464.844     1464.844       HNSW
    ```
    


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

Reply via email to