jhorstmann opened a new pull request #8092:
URL: https://github.com/apache/arrow/pull/8092


   - sort_to_indices further splits up float64/float32 inputs into 
nulls/non-nan/nan, sorts the non-nan values and then concats those 3 slices 
according to the sort options. Nans are distinct from null and sort greater 
than any other valid value
   - implemented a sort method for dictionary arrays with string values. this 
kernel checks the is_ordered flag and sorts just by the keys if it is set, it 
will look up the string values otherwise
   - for the lexical sort use case the above kernel are not used, instead the 
OrdArray trait is used. To make that more flexible and allow wrapping arrays 
with differend ordering behavior I will make it no longer extend Array and 
instead only contain the cmp_value method
   - string dictionary sorting is then implemented with a wrapper struct 
StringDictionaryArrayAsOrdArray which implements OrdArray
   - NaN aware sorting of floats is then also implemented with a wrapper struct 
and trait implementation


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to