Re: [Haskell-cafe] Valid Haskell characters

Deborah Goldsmith Mon, 25 Aug 2008 18:31:37 -0700

You can't determine Unicode character properties by analyzing the names of the characters.


Read chapter 4 of the standard:
http://www.unicode.org/versions/Unicode5.0.0/ch04.pdf


and get the property values here:
http://www.unicode.org/Public/UNIDATA/DerivedCoreProperties.txt

It sounds like the properties you want are "Case" and "GeneralCategory". Maybe the spec should be more explicit on exactly how the definitions map onto Unicode properties, so there is no ambiguity.
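
As a rough illustration of classifying by General Category rather than by character name, here is a minimal Haskell sketch built on Data.Char.generalCategory from base. The HsClass type and the classify function are hypothetical names invented for this example, and the mapping from categories to uniWhite, uniSmall, uniLarge and uniSymbol is one plausible reading of the definitions quoted below, not something the report spells out. The results also depend on the Unicode tables shipped with your compiler.

```haskell
import Data.Char (GeneralCategory (..), generalCategory)

-- Hypothetical type mirroring the nonterminals quoted from the report.
data HsClass = UniWhite | UniSmall | UniLarge | UniSymbol | Other
  deriving (Eq, Show)

-- Classify a character by its Unicode General Category.
-- Assumption: uniWhite is read as category Zs (Space) only, and
-- uniSymbol as every symbol (S*) and punctuation (P*) category.
classify :: Char -> HsClass
classify c = case generalCategory c of
  Space                -> UniWhite
  LowercaseLetter      -> UniSmall
  UppercaseLetter      -> UniLarge
  TitlecaseLetter      -> UniLarge
  MathSymbol           -> UniSymbol
  CurrencySymbol       -> UniSymbol
  ModifierSymbol       -> UniSymbol
  OtherSymbol          -> UniSymbol
  ConnectorPunctuation -> UniSymbol
  DashPunctuation      -> UniSymbol
  OpenPunctuation      -> UniSymbol
  ClosePunctuation     -> UniSymbol
  InitialQuote         -> UniSymbol
  FinalQuote           -> UniSymbol
  OtherPunctuation     -> UniSymbol
  _                    -> Other
```

In GHCi, classify 'λ' evaluates to UniSmall and classify '\x01C5' (a titlecase letter) to UniLarge. A real lexer would still have to special-case characters such as the underscore, quotes and brackets, which this sketch does not attempt.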


Deborah

On Aug 25, 2008, at 6:15 PM, Maurício wrote:

Hi,

In the Haskell reference, I see the
following definitions:

uniWhite -> any Unicode character defined
as whitespace;

uniSmall -> any Unicode lowercase letter;

uniLarge -> any uppercase or titlecase
Unicode letter;

uniSymbol -> any Unicode symbol or
punctuation.

Where do I get lists for those
characters? My first attempt was to
check:

http://unicode.org/Public/UNIDATA/UnicodeData.txt

and consider large anything marked as
CAPITAL and small anything marked as SMALL. I
didn't know what to guess about the symbols.
Am I using the right reference? How can I
recognize (or get a list of) valid uppercase and
lowercase Unicode letters, as well as symbols
and punctuation?

Thanks for your help,
Maurício

_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe

