Re: [elephant-devel] Re: Derived Indicies

Ian Eslick Sun, 23 Mar 2008 11:29:19 -0700

Another thought on this topic is that with a sufficiently efficientquery system, some of the indices you describe should becomeunnecessary. If you have messages indexed by time than you can simplywalk the index and filter by user until you have a web page worth ofmessages. A scan, even of an index, may not be fast enough so youcould do an intersection of user-to values with an ordered list ofrecent messages. This should perform similarly to a SQL engine whichmany ORM systems use for queries like this.

Nothing prevents you from making custom indices when you have to speedthings up, of course!


Ian

On Mar 23, 2008, at 11:12 AM, Alex Mizrahi wrote:

IE> :index t is not necessary - in fact it is ignored. :slot-deps
are
also not required, but the derived index is updated on any slotwriteif that slot is not transient, set-valued or an association.We canadd those last three slot types into the mix if necessary, butI'mtrying to avoid too much complex computation taking placeduring slot
     writes (self-deadlock, etc) for the time being.


seems to be fine..
actually we are using derived indices in quite special way -- to getindex
that is ordered in special way, to do efficient lookups of some kind.
suppose we have a messaging system with two persistent classes --user and
message:

(defpclass user ()
    ((username :index t)
    ...)

(defpclass message ()
   ((from-user :accessor from-user :index t)
    (to-user :accessor to-user :index t)
    (text :accessor text-of :index t)
    (modification-time :accessor modification-time-of :index t)))
suppose we'd like to get inbox and outbox views for user, i.e. listof 10
latest messages to user or from user.
with considerable amounts of users and messages it is not efficientto getthese latest messages from any of normal indices, as it requiresscanning
potentially large number of messages.
it's possible to make efficient lookups via derived indices --messages
ordered first by user, then my modification time. iirc cons sorting in
elephant has desired characteristics, but unfortunately postmodernbackend
does not support complex type sorting.
but we happily can reduce problem to sorting strings, e.g."13_31321433"where 13 is oid of from-user and 31321433 is modification time(universaltime). (actually we have user-id field instead of oid to preserveidentity
across multiple stores etc).
thus, via two derived indices we can efficiently list messages forinbox and
outbox. (via cursor operations, that's not very easy, but works fine).
as i understant you're going to make elephant more high level, andpossibly
such low level index operations can be replaced with some high level
concept.
are you planning something like these dual indices?
if there will be no special functionality for these, it would makesense tomake derived indices somewhat more flexible. for example, i suspectsome
uses might require "foreign slot dependency".
like if user definition above had group field, and we'd like to build
messages-by-group index. when user-group changes, all messages ofthis user
should update derived index.
probably there's no need in this stuff right now, but it would benice if it
will be possible to add it in future, so advanced indices can be built



_______________________________________________
elephant-devel site list
[email protected]
http://common-lisp.net/mailman/listinfo/elephant-devel


_______________________________________________
elephant-devel site list
[email protected]
http://common-lisp.net/mailman/listinfo/elephant-devel

Re: [elephant-devel] Re: Derived Indicies

Reply via email to