Hi,
On your homepage it said that I could contact the active PCRE developers via
this mail address, hope this is alright.
I've been using PCRE in a project of mine, a project of which I have recently
started performing internationalization.
As far as I could understand, the way of using UTF8 strings with PCRE is by
passing the PCRE_UTF8 option to pcre_compile.
Now, while I understand that passing this option flag to pcre_compile causes
non-valid UTF8 strings to fail compilation, it seems that I can still use UTF8
strings without passing this option to pcre_compile. This makes strings get
treated like plain ASCII strings, thereby comparing English characters case
insensitively, and the rest of the chart (for example, Hebrew characters
represented by the 128-256 part of the chart) using a plain binary comparison.
This, as far as I can see, works for me perfectly - this way I can pass both
ASCII and UTF8 strings, which will be matched using case insensitive collation
for English characters, and binary comparison for any other character.
Am I missing something? Does the PCRE_UTF8 benefit me in any other way I've
managed to miss so far in my testing?
Thanks,
G.
--
## List details at http://lists.exim.org/mailman/listinfo/pcre-dev