Ignacio Vera created LUCENE-8452:
------------------------------------

             Summary: BKD-based shape indexing benchmarks
                 Key: LUCENE-8452
                 URL: https://issues.apache.org/jira/browse/LUCENE-8452
             Project: Lucene - Core
          Issue Type: Improvement
          Components: modules/sandbox
            Reporter: Ignacio Vera


Initial benchmarking of the new BKD-based shape indexing suggests that searches 
can underperform. I am opening this ticket to share the findings 
and to start a discussion on how to speed up the solution.

 

The first benchmark uses the existing benchmark in luceneutil for 
indexing points and searching by bounding box. We would expect {{LatLonShape}} to 
be slower than {{LatLonPoint}} but still perform well. The results 
of running this benchmark on my machine are:

 

LatLonPoint:

89.717239531 sec to index

INDEX SIZE: 0.5087761553004384 GB

READER MB: 0.6098232269287109

maxDoc=60844404

totHits=221118844

BEST M hits/sec: 72.91056132596746

BEST QPS: 74.19031323419311 

 

LatLonShape:

89.388678805 sec to index

INDEX SIZE: 1.3028179928660393 GB

READER MB: 0.8827085494995117

maxDoc=60844404

totHits=221118844

BEST M hits/sec: 1.0053836784184809

BEST QPS: 1.0230305276205143
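
To put the two runs side by side, the ratios can be worked out with a few lines of Python. This is only a sketch over the figures reported above; the dictionary layout and the helper name {{ratios}} are illustrative and are not part of luceneutil.

```python
# Figures copied from the first benchmark above (bounding-box search over points).
# The dictionary layout and the helper name are illustrative assumptions.
lat_lon_point = {"index_sec": 89.717239531, "index_gb": 0.5087761553004384,
                 "m_hits_per_sec": 72.91056132596746, "qps": 74.19031323419311}
lat_lon_shape = {"index_sec": 89.388678805, "index_gb": 1.3028179928660393,
                 "m_hits_per_sec": 1.0053836784184809, "qps": 1.0230305276205143}


def ratios(baseline, candidate):
    """Return candidate / baseline for every metric in the baseline run."""
    return {key: candidate[key] / baseline[key] for key in baseline}


shape_vs_point = ratios(lat_lon_point, lat_lon_shape)
for metric, value in shape_vs_point.items():
    print(f"{metric}: LatLonShape is {value:.3f}x LatLonPoint")

# Query slowdown expressed the other way round (point QPS / shape QPS).
qps_slowdown = 1.0 / shape_vs_point["qps"]
print(f"QPS slowdown: {qps_slowdown:.1f}x")
```

On these numbers the indexing time is nearly identical, the index is about 2.6 times larger, and query throughput is roughly 72 times lower for {{LatLonShape}}.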

 

A second benchmark indexes around 10 million 4-sided 
polygons and around 3 million points. Searches are performed using bounding 
boxes. The results are compared with spatial tree alternatives. The spatial trees 
use a composite strategy, precision=0.001 degrees and distErrPct=0.25:

 

s2 (Geo3d):

1191.732124301 sec to index part 0

INDEX SIZE: 3.2086284114047885 GB

READER MB: 19.453557014465332

maxDoc=12949519

totHits=705758537

BEST M hits/sec: 13.311369588840462

BEST QPS: 4.243743434150063

 

quad (JTS):

3252.62925159 sec to index part 0

INDEX SIZE: 4.5238002222031355 GB

READER MB: 41.15725612640381

maxDoc=12949519

totHits=705758357

BEST M hits/sec: 35.54591930673003

BEST QPS: 11.332252412866938

 

LatLonShape:

30.32712009 sec to index part 0

INDEX SIZE: 0.5627057952806354 GB

READER MB: 0.29498958587646484

maxDoc=12949519

totHits=705758228

BEST M hits/sec: 3.4130465326433357

BEST QPS: 1.0880999177593018
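
The trade-off in the second benchmark is easier to see when each spatial tree is expressed relative to {{LatLonShape}}. The sketch below only rearranges the figures reported above; the names {{runs}} and {{relative_to}} are illustrative assumptions.

```python
# Figures copied from the second benchmark above; names are illustrative.
runs = {
    "s2 (Geo3d)": {"index_sec": 1191.732124301, "index_gb": 3.2086284114047885,
                   "reader_mb": 19.453557014465332, "qps": 4.243743434150063,
                   "tot_hits": 705758537},
    "quad (JTS)": {"index_sec": 3252.62925159, "index_gb": 4.5238002222031355,
                   "reader_mb": 41.15725612640381, "qps": 11.332252412866938,
                   "tot_hits": 705758357},
    "LatLonShape": {"index_sec": 30.32712009, "index_gb": 0.5627057952806354,
                    "reader_mb": 0.29498958587646484, "qps": 1.0880999177593018,
                    "tot_hits": 705758228},
}

METRICS = ("index_sec", "index_gb", "reader_mb", "qps")


def relative_to(all_runs, reference):
    """Divide every other run's metrics by the reference run's metrics."""
    ref = all_runs[reference]
    return {
        name: {metric: run[metric] / ref[metric] for metric in METRICS}
        for name, run in all_runs.items()
        if name != reference
    }


relative = relative_to(runs, "LatLonShape")
for name, row in relative.items():
    cells = ", ".join(f"{metric}={value:.2f}x" for metric, value in row.items())
    print(f"{name} vs LatLonShape: {cells}")

# The three runs do not report exactly the same number of hits.
hit_counts = [run["tot_hits"] for run in runs.values()]
hit_spread = max(hit_counts) - min(hit_counts)
print(f"totHits spread across runs: {hit_spread}")
```

On these numbers {{LatLonShape}} indexes roughly 39 and 107 times faster than the s2 and quad trees and produces a much smaller index, while the trees answer roughly 4 and 10 times more queries per second. The reported totHits values also differ slightly between the three runs.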

 


