Re: Major performance problem with std.array.front()

Sarath Kodali Fri, 07 Mar 2014 15:16:28 -0800

On Friday, 7 March 2014 at 22:35:47 UTC, Sarath Kodali wrote:

+1
In Indian languages, a character consists of one or moreUNICODE code points. For example, in Sanskrit "ddhrya"http://en.wikipedia.org/wiki/File:JanaSanskritSans_ddhrya.svgconsists of 7 UNICODE code points. So to search for this char Ihave to use string search.
- Sarath


Oops, incomplete reply ...

Since a single "alphabet" in Indian languages can containmultiple code-points, iterating over single code-points is likeiterating over char[] for non English European languages. Sodecode is of no use other than decreasing the performance. A rawchar[] comparison is much faster.

And then there is this "unicode normalization" that makes it verydifficult for string searches or comparisons.


- Sarath

Re: Major performance problem with std.array.front()

Reply via email to