I checked: Rucksack writes dirty objects to disk on each commit, so it
should have the same performance profile as our existing backends, of
course with different constants...
Ian
On May 16, 2008, at 11:26 AM, Ian Eslick wrote:
I just wanted to clarify a few points in this interesting discussion.
> but if we are using a high-level language like Lisp, it's much more
> likely that CPU overhead will be large compared to memory latency.
> and since compression, even the simplest/fastest kind, will only add
> to CPU overhead, i'm very skeptical about its benefits.
I doubt that using a Lisp with a modern compiler affects CPU time in
any interesting way; I've observed that algorithm implementations
with reasonable data structure choices run within tens of percent of
the same algorithm in C. I don't care how fast your CPU is, it's
the data access patterns that dominate time unless you're working on
small or highly local datasets, in which case the inner loop speed
_might_ matter.
> by the way, an interesting fact: some time ago i found that even
> with the cache enabled in Elephant/Postmodern, slot reads are
> relatively slow -- on the scale of ten thousand per second or so.
> it was quite surprising that the culprit is the function
> form-slot-key (which is used in CL-SQL too):
> (defun form-slot-key (oid name)
>   (format nil "~A ~A" oid name))

At least in Allegro, the profiler only counts thread time, and much
of the delay shown by 'time' vs. the profiler is spent waiting on
cache-miss IO. It may be that thread time is only a handful of
percentage points of the total time?
Format is notoriously slow. Still, I find it hard to believe that
it matters that much. (See above)
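
For concreteness, a minimal sketch of a FORMAT-free key builder (the
name fast-slot-key and the exact idiom are my own illustration, not
Elephant's actual code):

;; Sketch: build the "OID NAME" key without going through FORMAT's
;; directive interpreter. PRINC-TO-STRING renders each part directly
;; and CONCATENATE assembles the result in one pass.
(defun fast-slot-key (oid name)
  (concatenate 'string
               (princ-to-string oid)
               " "
               (princ-to-string name)))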
> and even more surprising was to find out that there is no portable
> and fast way to convert an integer to a string. so i had to use
> SBCL's internal function sb-impl::quick-integer-to-string --
> apparently the SBCL developers knew about this problem, so they
> made this function for themselves.
For fun, I did a test with princ: for integers only it's 2x faster,
and for the above idiom it's only about 50% faster, but some of that
is the string-stream overhead. If you do it manually as they did and
write the digits into an array one character at a time, it can get
pretty fast, but probably only 3x-4x.
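
Roughly, the comparison looks like this (a sketch, not the exact
test I ran; int-to-string is an illustrative hand-rolled digit
writer, not SBCL's internal one):

;; Illustrative comparison: FORMAT vs. PRINC-TO-STRING vs. a manual
;; digit-at-a-time writer for non-negative integers.
(defun int-to-string (n)
  (if (zerop n)
      "0"
      (let* ((len (loop for m = n then (floor m 10)
                        while (plusp m)
                        count t))
             (s (make-string len)))
        ;; Fill the string from the rightmost digit backwards.
        (loop for i from (1- len) downto 0
              for m = n then (floor m 10)
              do (setf (char s i)
                       (code-char (+ (char-code #\0) (mod m 10)))))
        s)))

;; Each TIME report shows how much of the cost is FORMAT itself.
(dolist (fn (list (lambda (n) (format nil "~D" n))
                  #'princ-to-string
                  #'int-to-string))
  (time (dotimes (i 1000000) (funcall fn i))))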
> so i think that subtle effects like I/O bandwidth will only be seen
> if we hardcore-optimize Elephant and the applications themselves.
> but as it is now, the storage backend won't have a big influence on
> overall performance and just needs to be moderately good.
I agree with this in general. However, I feel like this whole
discussion is premature optimization being done while blindfolded.
I do think that over time it would be nice to give people tools like
compression to enable/disable for their particular applications, but
it's probably not worth all the debate until we solve some of the
bigger problems in our copious free time.
By the way, if you are caching your data in memory and using the
most conservative transaction model of BDB (blocking commits), you
are write-bandwidth limited. At the end of each little transaction,
you have to write a log entry to disk, then flush a 16KB page to
disk, then wait for the controller to say OK, then write the log
again, then return to your regularly scheduled programming. In
prevalence, you only write the log and occasionally dump to disk. I
suspect that the SQL backends function similarly: when you finish a
transaction, you have to wait for all of this to happen over a socket...
Prevalence is much, much faster because you don't have to flush data
structures on each commit, so cl-prevalence performance with
Elephant's data and transaction abstractions would be a really nice
design point. I wonder if we would get some of this benefit from a
Rucksack adaptation?
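
To make the contrast concrete, here is a minimal sketch of a
prevalence-style commit path (my own illustration, not
cl-prevalence's actual API): the only per-commit disk wait is
forcing one log record out, and data structures are only dumped at
snapshot time.

(defvar *tx-log* nil)

(defun open-tx-log (path)
  (setf *tx-log* (open path :direction :output
                            :if-exists :append
                            :if-does-not-exist :create)))

;; Commit = append one readable transaction record and force it out.
;; No data pages are flushed; recovery replays the log over the last
;; snapshot. (FINISH-OUTPUT flushes the stream buffer; a real system
;; would also fsync via implementation-specific code.)
(defun commit-tx (tx-form)
  (prin1 tx-form *tx-log*)
  (terpri *tx-log*)
  (finish-output *tx-log*))

So something like (commit-tx '(set-slot 42 name "foo")) returns as
soon as one log line is out, rather than after a page flush.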
Ian
_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel