askoa commented on issue #3520: URL: https://github.com/apache/arrow-rs/issues/3520#issuecomment-1398246267
> Providing an iterator abstraction, similar to TypedDictionaryArray, that downcasts the values and uses ArrayAccessor to "decode" the runs makes sense to me to help ergonomics

Yes, I am planning to model `ArrayAccessor` after `TypedDictionaryArray`.

> However, most kernels I imagine will need custom logic to handle RunEncodedArrays efficiently, e.g. take will need to parse the runs array and compute a new set of runs along with the take indices to apply to the values array. Filter will need to do something similar.

Does it make sense to add a function to `RunEndEncodedArray` that converts a logical index to a physical index? I think it could be used by functions that need to decode arbitrary indices.

> One thing to be extremely careful of, is to avoid generic code typed on both the key type and the value type in our kernels - this explodes codegen and has caused a lot of pain with dictionaries

I am planning to model it after `TypedDictionaryArray`, which has generics for both key and value. Would that be a problem?
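To make the logical-to-physical question concrete, here is a minimal standalone sketch of such a lookup. It does not use the arrow crate: the names `logical_to_physical` and `decode_at` are hypothetical, and it assumes the run ends are stored as a strictly increasing slice of non-negative `i32` values, each being the exclusive logical end of its run.

```rust
/// Maps a logical index to the physical index of the run that contains it.
///
/// Assumption: `run_ends` is strictly increasing and non-negative, and
/// `run_ends[i]` is the exclusive logical end of run `i`.
/// Returns `None` when `logical` is past the end of the last run.
pub fn logical_to_physical(run_ends: &[i32], logical: usize) -> Option<usize> {
    // Binary search for the first run whose end is strictly greater than
    // the logical index; all earlier runs end at or before it.
    let physical = run_ends.partition_point(|&end| (end as usize) <= logical);
    if physical < run_ends.len() {
        Some(physical)
    } else {
        None
    }
}

/// Decodes a single logical position by looking up its run and then
/// indexing into the values slice (one value per run).
pub fn decode_at<'a, T>(run_ends: &[i32], values: &'a [T], logical: usize) -> Option<&'a T> {
    logical_to_physical(run_ends, logical).and_then(|physical| values.get(physical))
}
```

With run ends `[2, 5, 6]`, logical indices 0 and 1 fall in run 0, indices 2 through 4 in run 1, and index 5 in run 2. Each lookup is a binary search over the run ends, so a kernel decoding many sorted indices would probably prefer a single linear pass over the runs.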
