Re: combinator parsers and XSLT

Manuel M. T. Chakravarty Wed, 27 Sep 2000 01:59:05 -0700

[EMAIL PROTECTED] (Marcin 'Qrczak' Kowalczyk) wrote,

> Wed, 27 Sep 2000 00:22:05 +1100, Manuel M. T. Chakravarty <[EMAIL PROTECTED]> 
>pisze:
> 
> > Hmm, this seems like a shortcoming in the Haskell spec.  We have all
> > these isAlpha, isDigit, etc functions, but I can't get at a list of,
> > say, all characters for which isAlpha is true.
> 
> You can: filter isAlpha ['\0'..'\xFFFF']
> (don't use maxBound here because it's too large and we know that
> currently there are no isAlpha characters outside this range).
> 
> Working on large explicit lists is inefficient. 45443 characters
> are isAlpha. A lexer should be designed to avoid using a full list.

You are right, just having a list of the characters is to
naive an approach.  But this re-enforces may point, we need
an _efficient_ way of getting at the unicode ranges for
certain character classes.  H98 is seems to be lacking some
features for practical use of unicode - the header to the
standard library `Char' actually admits that

  This module offers only a limited view of the full Unicode
  character set; the full set of Unicode character
  attributes is not accessible in this library.

Manuel

Re: combinator parsers and XSLT

Reply via email to