On Apr 16, 2008, at 9:51 PM, YKY (Yan King Yin) wrote:
Typically we need to retrieve many nodes from the DB to do inference.
The nodes may be scattered around the DB.  So it may require *many*
disk accesses.  My impression is that most DBMS are optimized for
complex queries but not for large numbers of simple retrievals -- am I
correct about this?


No, you are not correct about this. All good database engines use a combination of clever adaptive cache replacement algorithms (read: they keep the data you are most likely to access next in RAM) and cost-based optimization (read: they adaptively select query execution algorithms based on measured resource access costs) to optimize performance across a broad range of use cases. For highly regular access patterns (read: similar query types and complexity), the engine will converge on very efficient access paths and resource management that match that usage. For irregular access patterns, it will attempt to dynamically select the best options given recent access history and resource cost statistics -- not always the best result (on occasion hand optimization could do better), but on average more likely to produce good results than simpler rule-based optimization.
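As a concrete, scaled-down illustration of the point-lookup question, here is a minimal sketch using Python's sqlite3 module -- a far simpler engine than the ones named below, but enough to show two things: the planner chooses an index search (not a table scan) for a point lookup on an indexed column, and many scattered retrievals can be batched into one IN-list query to amortize round trips. The table and column names are invented for the example.

```python
# Minimal sketch (SQLite stands in for a full engine here).
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE nodes (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany("INSERT INTO nodes VALUES (?, ?)",
                 [(i, f"node-{i}") for i in range(10_000)])

# The planner reports a SEARCH (index lookup), not a SCAN (full table scan),
# for a point lookup on the primary key -- no hand-tuning required.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT payload FROM nodes WHERE id = ?", (42,)
).fetchall()
print(plan)

# Batching scattered point lookups into one query amortizes per-query overhead.
ids = [3, 997, 4242, 8191]
placeholders = ",".join("?" * len(ids))
rows = conn.execute(
    f"SELECT id, payload FROM nodes WHERE id IN ({placeholders})", ids
).fetchall()
print(rows)
```

With a real client/server engine the batching matters even more, since each round trip also pays network latency, and the buffer cache keeps hot index and table pages in RAM across queries.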

Note that by "good database engine" I am talking about engines that actually support these kinds of tightly integrated and adaptive management features: Oracle, DB2, PostgreSQL, et al. This does *not* include MySQL, which is a naive and relatively non-adaptive engine, and which scales much worse and is generally slower than PostgreSQL anyway if you are looking for a free open source solution.


I would also point out that different engines are optimized for different use cases. For example, while Oracle and PostgreSQL share the same transaction model, Oracle's design decisions optimize for massive numbers of small concurrent update transactions, while PostgreSQL's optimize for massive numbers of small concurrent insert/delete transactions. Databases based on other transaction models, such as IBM's DB2, sacrifice extreme write concurrency for superior read-only performance. There are unavoidable tradeoffs with such things, so the market has a diverse ecology of engines that have chosen different sets of tradeoffs, and buyers should be aware of what these tradeoffs are if scalable performance is a criterion.
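The shared transaction model mentioned above is multiversion concurrency control (MVCC). A toy sketch (not any real engine's implementation) of the core idea: writers append new row versions instead of overwriting in place, and each reader sees the newest version no later than its own snapshot, so readers never block on writers.

```python
# Toy MVCC sketch: versioned key/value store with snapshot reads.
# All names here are invented for illustration.
import itertools

class MVCCStore:
    def __init__(self):
        self._versions = {}            # key -> list of (txn_id, value)
        self._clock = itertools.count(1)

    def begin(self):
        """A read snapshot is just the transaction id at which the reader started."""
        return next(self._clock)

    def write(self, key, value):
        # Append a new version; never overwrite older ones in place.
        txn = next(self._clock)
        self._versions.setdefault(key, []).append((txn, value))
        return txn

    def read(self, key, snapshot):
        """Return the newest version written at or before the snapshot."""
        for txn, value in reversed(self._versions.get(key, [])):
            if txn <= snapshot:
                return value
        return None

store = MVCCStore()
store.write("x", "v1")
snap = store.begin()           # reader takes its snapshot here
store.write("x", "v2")         # a concurrent writer commits a newer version
print(store.read("x", snap))   # the old reader still sees "v1"
print(store.read("x", store.begin()))  # a fresh snapshot sees "v2"
```

The tradeoff the paragraph above alludes to falls out of this design: keeping old versions around buys write concurrency but adds version-visibility checks and cleanup work that lock-based, read-optimized designs avoid.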


J. Andrew Rogers

-------------------------------------------
agi
Archives: http://www.listbox.com/member/archive/303/=now
RSS Feed: http://www.listbox.com/member/archive/rss/303/