shubhamvishu commented on PR #14963: URL: https://github.com/apache/lucene/pull/14963#issuecomment-3242215130
OK, I ran the `luceneutil` benchmarks and I see a huge improvement in indexing throughput with this PR compared to the baseline (without this change): an almost **`4x`** improvement in indexing time.

*Note: The improvement in latency and CPU time seems to be driven by the smaller number of segments, which also very slightly impacts recall, as we know.*

CC - @benwtrent @msokolov @jpountz @vigyasharma

#### With `HNSW_GRAPH_THRESHOLD` = 10

**Baseline**

```
Results:
recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
 0.515       11.084  11.071        0.999  500000   100      50       64        250     4 bits    120.51       4149.14             7         1690.12      1649.857      185.013       HNSW
 0.875        9.965   9.946        0.998  500000   100      50       64        250     7 bits    119.18       4195.23             3         1871.00      1832.962      368.118       HNSW
 0.978       19.637  19.621        0.999  500000   100      50       64        250         no    127.63       3917.51             8         1501.98      1464.844     1464.844       HNSW
```

**Candidate**

```
Results:
recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
 0.517        5.933   5.914        0.997  500000   100      50       64        250     4 bits     30.34      16482.07             2         1694.34      1649.857      185.013       HNSW
 0.871        9.655   9.635        0.998  500000   100      50       64        250     7 bits     27.86      17945.59             3         1869.52      1832.962      368.118       HNSW
 0.961       11.280  11.269        0.999  500000   100      50       64        250         no     31.16      16046.73             3         1503.46      1464.844     1464.844       HNSW
```

--------------------------

#### With `HNSW_GRAPH_THRESHOLD` = 100

**Baseline**

```
Results:
recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
 0.515       11.101  11.074        0.998  500000   100      50       64        250     4 bits    118.33       4225.40             7         1690.02      1649.857      185.013       HNSW
 0.874       10.199  10.176        0.998  500000   100      50       64        250     7 bits    118.18       4230.83             3         1871.16      1832.962      368.118       HNSW
 0.977       19.990  19.979        0.999  500000   100      50       64        250         no    126.92       3939.61             8         1501.95      1464.844     1464.844       HNSW
```

**Candidate**

```
Results:
recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
 0.517        5.900   5.882        0.997  500000   100      50       64        250     4 bits     30.13      16596.96             2         1694.14      1649.857      185.013       HNSW
 0.872        9.680   9.660        0.998  500000   100      50       64        250     7 bits     28.58      17495.98             3         1869.05      1832.962      368.118       HNSW
 0.964       11.640  11.619        0.998  500000   100      50       64        250         no     27.25      18349.97             3         1502.64      1464.844     1464.844       HNSW
```
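As a quick check on the "almost 4x" figure, here is a minimal sketch that divides the baseline `index(s)` value by the candidate value for each run. The numbers are copied from the `Results` tables in this comment; the dictionary layout and the `speedups` helper are illustrative names only and are not part of `luceneutil`.

```python
# index(s) values copied from the Results tables in this comment.
# Keys are (HNSW_GRAPH_THRESHOLD, quantization); this layout is an
# assumption made for illustration, not a luceneutil output format.
baseline = {
    (10, "4 bits"): 120.51,
    (10, "7 bits"): 119.18,
    (10, "no"): 127.63,
    (100, "4 bits"): 118.33,
    (100, "7 bits"): 118.18,
    (100, "no"): 126.92,
}
candidate = {
    (10, "4 bits"): 30.34,
    (10, "7 bits"): 27.86,
    (10, "no"): 31.16,
    (100, "4 bits"): 30.13,
    (100, "7 bits"): 28.58,
    (100, "no"): 27.25,
}


def speedups(base, cand):
    """Return baseline/candidate indexing-time ratio for each run."""
    return {key: base[key] / cand[key] for key in base}


ratios = speedups(baseline, candidate)
for (threshold, quantized), ratio in sorted(ratios.items()):
    print(f"threshold={threshold:>3} quantized={quantized:<6} speedup={ratio:.2f}x")
```

With these inputs every ratio falls between roughly 3.9x and 4.7x, which is consistent with the ~4x indexing-time improvement reported.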