On Wed, 25 Mar 2009 17:08:11 +0100 Jennifer Pioch wrote:
> On 3/24/09, Glenn Fowler <[email protected]> wrote:
> >
> >  here's what the regexp page says:
> >
> >   The Simple Regular Expressions described below differ from the
> >   Internationalized Regular Expressions described on the regex(5) manual
> >   page in the following ways:
> >
> >     * only Basic Regular Expressions are supported
> >     * the Internationalization features--character class, equivalence class,
> >       and multi-character collation--are not supported.
> >
> >  if these are indeed the only differences then I can add a REG_NOI18N
> >  regcomp() flag -- but I need verification of exactly what that means
> >  does that mean that it is byte based, or does . match a multibyte char?

> It supports and matches multibyte characters, only supports Basic
> Regular Expressions and does not support the extended set of character
> *classes*.
> The name REG_NOI18N would be misleading, its better to call it
> REG_REGEXP (basic regexp).

I was concerned about multibyte because the regexp text refers to byte
instead of character in some places

can you verify how the regexp grep works in the C and multibyte locales
try with a file that has one line with one multibyte char
and try this pattern
        '^.$'

_______________________________________________
ast-users mailing list
[email protected]
https://mailman.research.att.com/mailman/listinfo/ast-users

Reply via email to