lidavidm opened a new pull request #10369:
URL: https://github.com/apache/arrow/pull/10369


   This uses RE2 to implement a case-insensitive substring search.
   
   Originally, I implemented this using utf8proc, but then found it was about 
an order of magnitude slower than RE2. (This isn't an apples-to-apples 
comparison; utf8proc does it more 'properly' and handles more Unicode corners.) 
So I switched to just doing it with RE2 instead, especially since the utf8proc 
approach was complicated. (You can still see it in the original commit here if 
you're curious.)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to