Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

2019-08-24 Thread yury.t
Although this thread now might be offtopic, let me send a follow-up. By searching with C related terms, I found some articles about this issue.  It seems to be a common problem on regex + multibyte in C.  (e.g. https://stackoverflow.com/a/15895746 ) On Wed,

Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

2019-08-22 Thread Tomi Ollila
On Thu, Aug 22 2019, David Bremner wrote: > "yury.t" writes: > >> Thank you for your reply. >> I confirmed that the issue is reproduced in C program.  >> https://pastebin.com/5NaCM45G >> >> Sorry for bothering you... > > I'm not sure, but it might be a glibc bug.

Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

2019-08-22 Thread David Bremner
"yury.t" writes: > Thank you for your reply. > I confirmed that the issue is reproduced in C program.  > https://pastebin.com/5NaCM45G > > Sorry for bothering you... I'm not sure, but it might be a glibc bug. Since we are already using glib, maybe we should use

Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

2019-08-22 Thread yury.t
Thank you for your reply. I confirmed that the issue is reproduced in C program.  https://pastebin.com/5NaCM45G Sorry for bothering you... ___ notmuch mailing list notmuch@notmuchmail.org

Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

2019-08-21 Thread David Bremner
"yury.t" writes: > Some regular expression returns incorrect results if the pattern > contains multibyte characters in square brackets. The following > bracket expression matches subjects not starting with `[1-9]` and > returns more results than the parenthesis expression. We rely on POSIX.2