On Tue, Jan 5, 2010 at 12:32 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> "same representation" doesn't have to mean that the representation doesn't
> have magic internally.
>
> It just means that if you put the same content into three different kinds of
> vectors, you plausibly ought to see roughly the same thing go out the wire.
> This is subject to a few caveats like the fact that a dense vector doesn't
> really know if it has only a few non-zero elements. I would be happy if the
> serialized form decided that it had lots of non-zeros and thus could do away
> with writing all of the indexes.

Yeah, ok, I guess that is what I was getting at when Drew mentioned fully
decoupling the serialized form from the in-memory representation. That would
be ideal, but might be a little more work.

-jake
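
For illustration only, here is a rough sketch of what decoupling the serialized form from the in-memory representation could look like. It is not the project's actual code: the function names, the tag values, and the byte layout are all assumptions made up for this example. The writer looks only at the vector's size and its non-zero entries, then picks whichever encoding is smaller, so the same content produces the same bytes no matter which kind of vector held it.

```python
import struct

# Hypothetical format tags, invented for this sketch.
DENSE, SPARSE = 0, 1


def serialize(size, entries):
    """Encode a vector given its size and a dict of index -> value.

    The caller's in-memory type is irrelevant; only the content matters.
    """
    nonzero = {i: v for i, v in entries.items() if v != 0.0}
    # Sparse costs 12 bytes per entry (int index + double),
    # dense costs 8 bytes per slot (double only, no indexes).
    if 12 * len(nonzero) < 8 * size:
        body = b"".join(struct.pack(">id", i, nonzero[i]) for i in sorted(nonzero))
        return struct.pack(">bii", SPARSE, size, len(nonzero)) + body
    body = b"".join(struct.pack(">d", nonzero.get(i, 0.0)) for i in range(size))
    return struct.pack(">bi", DENSE, size) + body


def deserialize(data):
    """Decode bytes written by serialize() back into a plain list of floats."""
    tag, size = struct.unpack_from(">bi", data, 0)
    if tag == DENSE:
        return list(struct.unpack_from(">%dd" % size, data, 5))
    (count,) = struct.unpack_from(">i", data, 5)
    values = [0.0] * size
    for k in range(count):
        i, v = struct.unpack_from(">id", data, 9 + 12 * k)
        values[i] = v
    return values


def dense_entries(values):
    """View a dense list as (size, {index: value}) for serialize()."""
    return len(values), {i: v for i, v in enumerate(values)}


# A dense vector that happens to be mostly zeros and a sparse map with the
# same content serialize identically.
mostly_zero = [0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
same_bytes = serialize(*dense_entries(mostly_zero)) == serialize(8, {1: 3.0})
```

The 12-versus-8 byte comparison is just one plausible threshold; a real format would likely also account for header overhead and variable-length integer encoding.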