REG_REGEXP for <regexp.h> compatibility will be in the next release
REG_BINARY won't be needed
just set the C or POSIX locale for byte matching

On Wed, 22 Apr 2009 13:59:17 +0200 Jennifer Pioch wrote:
> On 4/8/09, Jennifer Pioch <[email protected]> wrote:
> > On 3/25/09, Glenn Fowler <[email protected]> wrote:
> >  >
> >  >  On Wed, 25 Mar 2009 17:08:11 +0100 Jennifer Pioch wrote:
> >  >  > On 3/24/09, Glenn Fowler <[email protected]> wrote:
> >  >  > >
> >  >  > >  here's what the regexp page says:
> >  >  > >
> >  >  > >   The Simple Regular Expressions described below differ from the
> >  >  > >   Internationalized Regular Expressions described on the regex(5) 
> > manual
> >  >  > >   page in the following ways:
> >  >  > >
> >  >  > >     * only Basic Regular Expressions are supported
> >  >  > >     * the Internationalization features--character class, 
> > equivalence class,
> >  >  > >       and multi-character collation--are not supported.
> >  >  > >
> >  >  > >  if these are indeed the only differences then I can add a 
> > REG_NOI18N
> >  >  > >  regcomp() flag -- but I need verification of exactly what that 
> > means
> >  >  > >  does that mean that it is byte based, or does . match a multibyte 
> > char?
> >  >
> >  >  > It supports and matches multibyte characters, only supports Basic
> >  >  > Regular Expressions and does not support the extended set of character
> >  >  > *classes*.
> >  >  > The name REG_NOI18N would be misleading, its better to call it
> >  >  > REG_REGEXP (basic regexp).
> >  >
> >  >
> >  > I was concerned about multibyte because the regexp text refers to byte
> >  >  instead of character in some places
> >
> >
> > Solaris regexp matches multibyte characters, but a REG_BINARY option
> >  to match in singlebyte characters may be useful for matching binary
> >  data.
> >
> >
> >  >  can you verify how the regexp grep works in the C and multibyte locales
> >  >  try with a file that has one line with one multibyte char
> >  >  and try this pattern
> >  >         '^.$'
> >
> >
> > $ printf "a\nä\nb\n" | /usr/bin/grep '^.$'
> >  a
> >  ä
> >
> > b

> Glenn, are you going to make the changes to regex?

> Jenny
> -- 
> Jennifer Pioch, Uni Frankfurt

_______________________________________________
ast-users mailing list
[email protected]
https://mailman.research.att.com/mailman/listinfo/ast-users

Reply via email to