[ 
https://issues.apache.org/jira/browse/ARROW-14290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17428038#comment-17428038
 ] 

Benson Muite commented on ARROW-14290:
--------------------------------------

Some of the issues on ordering are discussed in 
[https://en.wikipedia.org/wiki/Alphabetical_order#Similar_orderings]

An example sorting program for multilingual dictionaries is 
[Msort|http://www.billposer.org/Software/msort.html] available under GPL3, but 
the description should give some idea of flexibility that may be required.

> [C++] String comparison in between ternary kernel
> -------------------------------------------------
>
>                 Key: ARROW-14290
>                 URL: https://issues.apache.org/jira/browse/ARROW-14290
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Benson Muite
>            Assignee: Benson Muite
>            Priority: Minor
>
> String comparisons in C++ will use order by unicode. This may not be suitable 
> in many language applications, for example when using characters from 
> languages that use more than ASCII.   Sorting algorithms can often allow for 
> the use of custom comparison functions.  It would be helpful to allow for 
> this for the between kernel as well.  Initial work on the between kernel is 
> being tracked in https://issues.apache.org/jira/browse/ARROW-9843



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to