[pcre-dev] PCRE_UTF8 flag

Steven Gerrard Mon, 19 Oct 2009 10:14:27 -0700

Hi,
 
On your homepage it said that I could contact the active PCRE developers via 
this mail address, hope this is alright.
 
I've been using PCRE in a project of mine, a project of which I have recently 
started performing internationalization.
 
As far as I could understand, the way of using UTF8 strings with PCRE is by 
passing the PCRE_UTF8 option to pcre_compile.
 
Now, while I understand that passing this option flag to pcre_compile causes 
non-valid UTF8 strings to fail compilation, it seems that I can still use UTF8 
strings without passing this option to pcre_compile. This makes strings get 
treated like plain ASCII strings, thereby comparing English characters case 
insensitively, and the rest of the chart (for example, Hebrew characters 
represented by the 128-256 part of the chart) using a plain binary comparison. 
This, as far as I can see, works for me perfectly - this way I can pass both 
ASCII and UTF8 strings, which will be matched using case insensitive collation 
for English characters, and binary comparison for any other character.
 
Am I missing something? Does the PCRE_UTF8 benefit me in any other way I've 
managed to miss so far in my testing?
 
Thanks,
G.



      
-- 
## List details at http://lists.exim.org/mailman/listinfo/pcre-dev

[pcre-dev] PCRE_UTF8 flag

Reply via email to