Re: [Haskell-cafe] The question of ByteString

Bryan O'Sullivan Fri, 02 Nov 2007 14:11:36 -0800

Andrew Coppin wrote:

1. Why do I have to type "ByteString" in my code? Why isn't the compilerautomatically performing this optimisation for me?

One reason is that ByteString is stricter than String. Even lazyByteString operates on 64KB chunks. You can see how this might lead toproblems with a String like this:


"foo" ++ undefined

The first three elements of this list are well-defined, but if you touchthe fourth, you die.

2. ByteString makes text strings faster. But what about other kinds ofcollections? Can't we do something similar to them that makes them gofaster?

Not as easily. The big wins with ByteString are, as you observe, thatthe data are tiny, uniformly sized, and easily unboxed (though usingForeignPtr seems to be a significant win compared to UArray, too). Thisalso applies to other basic types like Int and Double, but leave thosebehind, and you get problems.

If your type is an instance of Storable, it's going to have a uniformsize, but it might be expensive to flatten and unflatten it, so whoknows whether or not it's truly beneficial. If it's not an instance ofStorable, you have to store an array of boxed values, and we know thatarrays of boxes have crummy locality of reference.

Spencer Janssen hacked up the ByteString code to produce StorableVectoras part of last year's SoC, but it never got finished off:


http://darcs.haskell.org/SoC/fps-soc/Data/StorableVector/

More recently, we've been pinning our hopes on the new list fusion stuffto give many of the locality of reference benefits of StorableVectorwith fewer restrictions, and all the heavy work done in a library.


        <b
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] The question of ByteString

Reply via email to