[ 
https://issues.apache.org/jira/browse/ARROW-14290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17428090#comment-17428090
 ] 

Antoine Pitrou commented on ARROW-14290:
----------------------------------------

bq. An interface that allows returning of a comparison may allow flexibility to 
adapt for different circumstances.  Arrow allows for flexible schema, so 
optimizations may be possible which are not possible with a regular column.

Which optimizations do you have in mind? I'm not sure that calling a function 
pointer O(n log n) times is generally better than computing an additional 
column in O(n) time.

Also, the function pointer has to be "dynamically typed" since its exact 
signature will depend on the datatype being sorted. Perhaps this is solvable 
using a base class and some virtual methods...

> [C++] String comparison in between ternary kernel
> -------------------------------------------------
>
>                 Key: ARROW-14290
>                 URL: https://issues.apache.org/jira/browse/ARROW-14290
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Benson Muite
>            Assignee: Benson Muite
>            Priority: Minor
>
> String comparisons in C++ will use order by unicode. This may not be suitable 
> in many language applications, for example when using characters from 
> languages that use more than ASCII.   Sorting algorithms can often allow for 
> the use of custom comparison functions.  It would be helpful to allow for 
> this for the between kernel as well.  Initial work on the between kernel is 
> being tracked in https://issues.apache.org/jira/browse/ARROW-9843



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to