Nicholas Knize created LUCENE-6422:
--------------------------------------

             Summary: Add StreamingQuadPrefixTree
                 Key: LUCENE-6422
                 URL: https://issues.apache.org/jira/browse/LUCENE-6422
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/spatial
            Reporter: Nicholas Knize


To conform to Lucene's inverted index, SpatialStrategies use strings to 
represent QuadCells and GeoHash cells. Yielding 1 byte per QuadCell and 5 bits 
per GeoHash cell, respectively.  To create the terms representing a Shape, the 
BytesRefIteratorTokenStream first builds all of the terms into an ArrayList of 
Cells in memory, then passes the ArrayList.Iterator back to invert() which 
creates a second lexicographically sorted array of Terms. This doubles the 
memory consumption when indexing a shape.

This task introduces a PackedQuadPrefixTree that uses a StreamingStrategy to 
accomplish the following:

1.  Create a packed 8byte representation for a QuadCell
2.  Build the Packed cells 'on demand' when incrementToken is called

Improvements over this approach include the generation of the packed cells 
using an AutoPrefixAutomaton



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to