Re: Off-heap binary doc values

Chris Hegarty Thu, 05 Dec 2024 03:46:57 -0800

Hi Ignacio,

I completely agree with the idea of having a BytesRef-like thing that can be 
off-heap. For a while now I’ve been thinking about how we could evolve BytesRef 
so as to not expose its on-heap representation. Having a separate primitive is 
probably a better way to go.
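
To make the "separate primitive" idea a bit more concrete, here is a minimal
sketch of what such a type could look like. It is only an illustration under
stated assumptions, not the proposed Lucene API: PositionalBytes is a
hypothetical stand-in for a positional-read input like RandomAccessInput,
DirectBufferBytes is a made-up off-heap source, and the fields and methods on
RandomAccessInputRef (byteAt, toByteArray) are guesses at a plausible shape.

```java
import java.nio.ByteBuffer;
import java.util.Objects;

// Hypothetical stand-in for a positional-read input such as Lucene's
// RandomAccessInput; only the two methods this sketch needs.
interface PositionalBytes {
    byte readByte(long pos);

    long length();
}

// Illustrative off-heap source backed by a direct ByteBuffer.
final class DirectBufferBytes implements PositionalBytes {
    private final ByteBuffer buffer;

    DirectBufferBytes(ByteBuffer buffer) {
        this.buffer = buffer;
    }

    @Override
    public byte readByte(long pos) {
        // Absolute read: does not move the buffer's position.
        return buffer.get(Math.toIntExact(pos));
    }

    @Override
    public long length() {
        return buffer.capacity();
    }
}

// Sketch of an off-heap counterpart to BytesRef: a (source, offset, length)
// view that never materializes a byte[] unless the caller asks for one.
final class RandomAccessInputRef {
    final PositionalBytes bytes;
    final long offset;
    final int length;

    RandomAccessInputRef(PositionalBytes bytes, long offset, int length) {
        if (offset < 0 || length < 0 || offset + length > bytes.length()) {
            throw new IllegalArgumentException("slice out of bounds");
        }
        this.bytes = bytes;
        this.offset = offset;
        this.length = length;
    }

    // Read one byte of the value, relative to the start of the slice.
    byte byteAt(int index) {
        Objects.checkIndex(index, length);
        return bytes.readByte(offset + index);
    }

    // Explicit escape hatch for callers that really need an on-heap copy.
    byte[] toByteArray() {
        byte[] copy = new byte[length];
        for (int i = 0; i < length; i++) {
            copy[i] = bytes.readByte(offset + i);
        }
        return copy;
    }
}
```

A decoder would call byteAt (or richer positional reads) directly against the
slice, and only fall back to toByteArray when an on-heap copy is unavoidable.
Keeping the copy explicit is the point of the design: the on-heap
representation is no longer part of the type's contract.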


-Chris.

> On 5 Dec 2024, at 10:42, Ignacio Vera <iver...@gmail.com> wrote:
> 
> Hello,
> 
> I have been working on the idea of reading binary doc values
> off-heap for a while. The motivation is that binary doc values are
> often used for faceting, where structured data is encoded at write
> time and decoded at read time. It feels wasteful to have to read the
> data on-heap before decoding it when we can read it directly from the
> off-heap buffer.
> 
> The proposal is to evolve the API from an on-heap data structure
> (BytesRef) to an off-heap data structure (currently named
> RandomAccessInputRef). Because we already read the data into the
> buffer using a RandomAccessInput with an offset and a length, it
> feels very natural to create an off-heap equivalent to BytesRef that
> is backed by a RandomAccessInput.
> 
> I am hoping to move this idea forward. Since this is a change to a
> public API, I would love to hear feedback and other opinions.
> 
> Thank you!
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
> 


