On 03/26/2012 02:39 AM, Gabriel Dos Reis wrote: > True, but should the language definition default to a string type > that is one the most unsuited for text processing in the 21st > century where global multilingualism abounds? Even C has qualms > about that. ... > I have no doubt believing that if all texts my students have to > process are US ASCII, [Char] is more than sufficient. So, I have > sympathy for your position. However, I doubt [Char] would be > adequate if I ask them to shared texts from their diverse cultures.
Uh, while a C char is (usually) just a byte (2^8 bits of information, like
Word8 in Haskell), a Haskell Char is a Unicode character (2^21 bits of
information). A single C char cannot contain arbitrary Unicode character,
while a Haskell Char can, and does. Hence [Char] is (efficiency issues
aside) perfectly adequate for dealing with texts written in arbitrary languages.
Best regards
Christian
[I first accidentally send this just to Gabriel, sorry.]
--
|------- Dr. Christian Siefkes ------- [email protected] -------
| Homepage: http://www.siefkes.net/ | Blog: http://www.keimform.de/
| Peer Production Everywhere: http://peerconomy.org/wiki/
|---------------------------------- OpenPGP Key ID: 0x346452D8 --
A bug is a test case you haven't written yet.
signature.asc
Description: OpenPGP digital signature
_______________________________________________ Haskell-prime mailing list [email protected] http://www.haskell.org/mailman/listinfo/haskell-prime
