[ 
https://issues.apache.org/jira/browse/LUCENE-8521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637420#comment-16637420
 ] 

Nicholas Knize commented on LUCENE-8521:
----------------------------------------

>From LUCENE-8496 I ran two benchmarks using 20 Million shape documents 
>(linestring, polygon, multipolygon) from the 
>[planetOSM|https://www.openstreetmap.org/stats/data_stats.html] corpus of 
>data. The results are as follows:

6 dimension LatLonShape w/o selective indexing:
—
INDEX SIZE: 4.944349942728877 GB
READER MB: 2.7018051147460938
maxDoc=20000000
BEST M hits/sec: 0.9616386761834046
BEST QPS: 5.816462716249435

7 dimension LatLonShape w/ selective indexing (provided patch):
—
INDEX SIZE: 8.462444095872343 GB
READER MB: 2.5924673080444336
maxDoc=20000000
BEST M hits/sec: 1.5636273064182318
BEST QPS: 9.457585426978618

This gives about a 70% performance boost at search time with the tradeoff of a 
larger index.

> Change LatLonShape encoding to use selective indexing
> -----------------------------------------------------
>
>                 Key: LUCENE-8521
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8521
>             Project: Lucene - Core
>          Issue Type: New Feature
>            Reporter: Nicholas Knize
>            Priority: Major
>         Attachments: LUCENE-8521.patch
>
>
> LUCENE-8496 allows for selecting the first n dimensions to be used for 
> building the index and the remaining dimensions to be used as data 
> dimensions. This feature changes {{LatLonShape}} encoding to a 7 dimension 
> encoding instead of 6; where the first 4 are index dimensions defining the 
> bounding box of the {{LatLonShape.Triangle}} and the remaining 3 data 
> dimensions defining the vertices of the triangle.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to