[
https://issues.apache.org/jira/browse/LUCENE-8521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637420#comment-16637420
]
Nicholas Knize commented on LUCENE-8521:
----------------------------------------
From LUCENE-8496 I ran two benchmarks using 20 million shape documents
(linestring, polygon, multipolygon) from the
[planetOSM|https://www.openstreetmap.org/stats/data_stats.html] corpus of
data. The results are as follows:
6-dimension LatLonShape without selective indexing:
----
INDEX SIZE: 4.944349942728877 GB
READER MB: 2.7018051147460938
maxDoc=20000000
BEST M hits/sec: 0.9616386761834046
BEST QPS: 5.816462716249435
7-dimension LatLonShape with selective indexing (provided patch):
----
INDEX SIZE: 8.462444095872343 GB
READER MB: 2.5924673080444336
maxDoc=20000000
BEST M hits/sec: 1.5636273064182318
BEST QPS: 9.457585426978618
This gives about a 63% performance boost at search time (5.82 to 9.46 QPS),
with the tradeoff of an index roughly 71% larger (4.94 GB to 8.46 GB).
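
As a rough check on the figures reported above, the relative changes can be recomputed directly from the posted numbers. This is only a sketch in Python; the dictionary keys and the `pct_change` helper are illustrative names, not part of the benchmark tooling.

```python
# Figures copied from the two benchmark runs reported in this comment.
baseline = {  # 6 dimensions, no selective indexing
    "index_gb": 4.944349942728877,
    "m_hits_per_sec": 0.9616386761834046,
    "qps": 5.816462716249435,
}
patched = {  # 7 dimensions, selective indexing
    "index_gb": 8.462444095872343,
    "m_hits_per_sec": 1.5636273064182318,
    "qps": 9.457585426978618,
}


def pct_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


for key in baseline:
    print(f"{key}: {pct_change(baseline[key], patched[key]):+.1f}%")
```

Both throughput metrics change by the same ratio, while the index size grows by a somewhat larger ratio.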
> Change LatLonShape encoding to use selective indexing
> -----------------------------------------------------
>
> Key: LUCENE-8521
> URL: https://issues.apache.org/jira/browse/LUCENE-8521
> Project: Lucene - Core
> Issue Type: New Feature
> Reporter: Nicholas Knize
> Priority: Major
> Attachments: LUCENE-8521.patch
>
>
> LUCENE-8496 allows the first n dimensions to be selected for building the
> index, with the remaining dimensions used as data dimensions. This feature
> changes the {{LatLonShape}} encoding from 6 dimensions to 7: the first 4 are
> index dimensions that define the bounding box of the
> {{LatLonShape.Triangle}}, and the remaining 3 are data dimensions that
> define the vertices of the triangle.
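
The split between index and data dimensions described above can be illustrated with a small conceptual sketch. It is written in Python rather than Lucene's Java, and everything in it is an assumption made for illustration: the 7-slot tuple layout, the function names, and the linear scan standing in for the BKD tree. It does not reproduce the byte-level encoding in the patch.

```python
from typing import List, Tuple

Vertex = Tuple[float, float]  # (lat, lon)
NUM_INDEX_DIMS = 4  # the bounding-box slots; the remaining 3 slots are data


def encode_triangle(a: Vertex, b: Vertex, c: Vertex) -> tuple:
    """Hypothetical layout: 4 index dims (bounding box) + 3 data dims (vertices)."""
    lats = (a[0], b[0], c[0])
    lons = (a[1], b[1], c[1])
    return (min(lats), min(lons), max(lats), max(lons), a, b, c)


def bbox_intersects(entry: tuple, min_lat, min_lon, max_lat, max_lon) -> bool:
    """Compare the query box against the index dimensions only."""
    e_min_lat, e_min_lon, e_max_lat, e_max_lon = entry[:NUM_INDEX_DIMS]
    return not (
        e_max_lat < min_lat or e_min_lat > max_lat
        or e_max_lon < min_lon or e_min_lon > max_lon
    )


def candidates(entries: List[tuple], query_box: tuple) -> List[tuple]:
    """Return the data dims (vertices) of entries whose bounding box matches.

    A real index would prune whole tree cells using the index dims; this
    linear scan only mimics which dimensions take part in the pruning.
    """
    return [e[NUM_INDEX_DIMS:] for e in entries if bbox_intersects(e, *query_box)]


entries = [
    encode_triangle((0, 0), (0, 2), (2, 0)),
    encode_triangle((10, 10), (10, 12), (12, 10)),
]
print(candidates(entries, (1, 1, 3, 3)))
```

A bounding-box match is only a candidate: the vertices carried in the data dimensions would still be needed for an exact triangle-versus-query test.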
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)