Re: [Haskell-cafe] Re: PROPOSAL: New efficient Unicode string library.

Deborah Goldsmith Tue, 02 Oct 2007 15:20:12 -0700

On Oct 2, 2007, at 3:01 PM, Twan van Laarhoven wrote:

Lots of people wrote:
> I want a UTF-8 bikeshed!
> No, I want a UTF-16 bikeshed!
What the heck does it matter what encoding the library usesinternally? I expect the interface to be something like (from my ownCompactString library):
> fromByteString :: Encoding -> ByteString -> UnicodeString
> toByteString   :: Encoding -> UnicodeString -> ByteString


I agree, from an API perspective the internal encoding doesn't matter.


The only matter is efficiency for a particular encoding.


This matters a lot.

I would suggest that we get a working library first. Either UTF-8 orUTF-16 will do, as long as it works.
Even better would be to implement both (and perhaps more encodings),and then benchmark them to get a sensible default. Then the choicecan be made available to the user as well, in case someone hasspecifix needs. But again: get it working first!

The problem is that the internal encoding can have a big effect on theimplementation of the library. It's better not to have to do it overagain if the first choice is not optimal.

I'm just trying to share the experience of the Unicode Consortium, theICU library contributors, and Apple, with the Haskell community. They,and I personally, have many years of experience implementing supportfor Unicode.


Anyway, I think we're starting to repeat ourselves...

Deborah

_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Re: PROPOSAL: New efficient Unicode string library.

Reply via email to