Comments on the original "out of memory problems" report:
- Running out of disk space for the transaction logs can also be a
problem, not just transaction size.
- The max number of locks needed will grow with the log of the DB size
and linearly with the number of concurrent transactions. You can run
db_stat to see what your actual resource usage is.
- When bulk loading, be sure to pull the data from the file into
memory before running the transaction if there might be any concurrent
transactions on that DB. (Think about what happens when the
transaction has to be restarted: the DB state is rolled back, but the
stream position and loop variables in memory are not. There's a short
sketch of the pattern just below.)
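Roughly, the pattern is the following (read-all-lines, movie-rating
and raw-line are made-up names for illustration, and the elephant
package is assumed to be in scope):

(defpclass movie-rating ()              ; hypothetical persistent class
  ((raw-line :initarg :raw-line :accessor raw-line)))

(defun read-all-lines (path)
  ;; Slurp the whole file into a list before any transaction starts.
  (with-open-file (in path :direction :input)
    (loop for line = (read-line in nil nil)
          while line collect line)))

(defun bulk-load (path)
  ;; If the transaction deadlocks and is retried, the in-memory list
  ;; can be walked again from the start; a half-consumed file stream
  ;; could not.
  (let ((lines (read-all-lines path)))
    (with-transaction ()
      (dolist (line lines)
        (make-instance 'movie-rating :raw-line line)))))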
Marc wrote:
... but not for larger logical transactions where they'd really be
called for. Raising the limits at best slightly lessens the pain, but
is not really a solution.
If raising the limits isn't a solution, then your application is
probably mismatched with Elephant/BDB. That said, there may be some
artificial reasons you are seeing these problems:
1) If you have a large DB with deep indices and highly concurrent
transactions, then you may start to hit lock limits as described
above. You should be able to set these limits high enough that you'll
never hit problems; fortunately, increasing the lock limits is a good
solution. max-concurrent-sessions * max-objects-per-session * 2, plus
20%, is a good number for max locks. You can use db_stat to see what
kind of locks-per-session numbers you're seeing, and an analysis of
your application should tell you the max-concurrent-sessions.
Max concurrent transactions is upper-bounded by your machine's
processing capacity, and the max number of locks is proportional to
the logarithm of the number of objects, which grows very slowly, so
both can be given practical upper bounds.
(Set the lock limits in config.sexp via :berkeley-db-max-locks and
:berkeley-db-max-objects.)
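For concreteness, the entries look something like this, assuming the
usual dotted-pair alist layout of config.sexp (merge the two entries
into your existing file rather than replacing it); the numbers assume
50 concurrent sessions touching up to 20 objects each:

;; 50 sessions * 20 objects * 2 = 2000, plus 20% headroom => 2400
((:berkeley-db-max-locks   . 2400)
 (:berkeley-db-max-objects . 2400))

;; The Berkeley DB utility "db_stat -c -h /path/to/your/db" reports
;; current and maximum lock usage, which is an easy sanity check on
;; these numbers.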
2) If you really are blowing out the BDB data cache in normal
operation (as opposed to bulk loads), then your application is either
not using transactions efficiently or is highly mismatched to any
transactional DB, whether that be BDB or some future Lisp
implementation.
If a logical transaction results in writing or modifying tens of
megabytes of data, then no transactional DB will ever work for you.
This is just physics. To get ACID properties you have to be able to
stage all side effects in memory to ensure that they are written as an
atomic unit. If you don't have enough memory to perform this staging,
then you can't implement a transaction.
There is a special case where you are manipulating large data objects
such as full-res images or uploaded files. I use a hybrid solution
for these where BDB keeps pointers to disk files which store just the
large items. If you can handle potential inconsistency between the on-
disk storage and the DB, then you can write the file to disk, keep a
pointer to it and do the bookkeeping in the DB inside a single
transaction. You'll run into serializer limits before you run into
cache size limits for large data items anyway.
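A minimal sketch of that hybrid scheme (the class and function names
are made up; the elephant package is assumed):

(defpclass stored-image ()
  ((title :initarg :title :accessor title :index t)
   (path  :initarg :path  :accessor path)))  ; pathname of the on-disk blob

(defun save-image (title bytes file)
  ;; Write the big payload outside the transaction; only the small
  ;; pointer record takes part in the atomic commit, so the BDB cache
  ;; never has to stage the image data itself.
  (with-open-file (out file :direction :output
                            :element-type '(unsigned-byte 8)
                            :if-exists :supersede)
    (write-sequence bytes out))
  (with-transaction ()
    (make-instance 'stored-image :title title :path file)))

If the Lisp dies between the file write and the commit, you end up
with an orphaned file on disk, which is exactly the potential
inconsistency mentioned above - something a periodic cleanup pass can
handle.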
The following may be obvious, but it's easy to get complacent and
forget that Elephant is backed by a transactional DB and that you
can't really ignore the implications of that in real-world
applications.
- The idea behind a transaction is to combine small DB operations
into a single atomic chunk. Sometimes you can wrap a small set of
such chunks together for performance, but you shouldn't be wrapping
lots of unrelated operations into a single transaction. Keeping
transactions small also helps you avoid concurrency conflicts.
For example, all composite operations inside Elephant use transactions
to ensure consistency of slot and index values. Slot writes to
indexed slots wrap both the write and the index updates into a single
transaction. Otherwise every sub-operation ends up in its own little
transaction and you can get inconsistent indices if you have an error
in between the slot write and index update.
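In user code the same idea looks like this (account is a made-up
class; the point is just that the two slot writes commit or roll back
together):

(defpclass account ()
  ((owner   :initarg :owner   :accessor owner :index t)
   (balance :initarg :balance :accessor balance)))

(defun transfer (from to amount)
  ;; A failure between the decf and the incf can never leave the store
  ;; half-updated: either both balances change or neither does.
  (with-transaction ()
    (decf (balance from) amount)
    (incf (balance to) amount)))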
- One downside of a transactional store on BDB is that every little
transaction results in a set of pages being flushed to disk (any
Btree index pages plus the leaf pages with the data). Flushing pages
to disk waits for a write-done from the driver, which can be very
expensive (tens of milliseconds, if I remember correctly). Flushing
one page takes about the same time as flushing a few dozen pages, so
you can save significant run time by wrapping your leaf operations in
a transaction - and you get all the usual ACID properties as well.
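So instead of letting each write commit on its own, batch them;
last-modified here is a hypothetical accessor on one of your own
persistent classes:

(defun touch-all (objects new-time)
  ;; Outside a transaction, each (setf (last-modified ...)) commits and
  ;; flushes its pages by itself, paying the write-done latency every
  ;; time. One wrapping transaction pays it once for the whole batch.
  (with-transaction ()
    (dolist (obj objects)
      (setf (last-modified obj) new-time))))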
In my web application (using weblocks) I wrap every ajax callback on
the server side in one big transaction, since I know a priori that
the operations within these callbacks aren't going to blow out 20 MB
of cache or thousands of locks. Typically I update a couple of
objects and indices, which amounts to a few dozen 4K or 8K pages.
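Concretely, that amounts to a trivial wrapper around the handler body
(the macro name and the usage below are hypothetical, and there is
nothing weblocks-specific about it):

(defmacro with-callback-transaction (&body body)
  ;; Commit everything a single ajax callback does as one unit.
  `(with-transaction ()
     ,@body))

;; e.g. in a handler:
;; (with-callback-transaction
;;   (setf (status order) :shipped)
;;   (incf (shipped-count (customer order))))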
Cheers,
Ian
On Sep 23, 2008, at 9:42 AM, Marc wrote:
Ian Eslick wrote:
You could be running out of cache or locks. I believe there are now
parameters in config.sexp you can set to raise the default limits.
The lack of robustness to allocation failures is a problem with
Berkeley DB.
Unfortunately, running recovery isn't a trivial process. You have to
guarantee that all other threads have released any Berkeley DB
resources (abort all active transactions) and don't try to request
any more (meaning no persistent slot reads/writes for Elephant), so
you essentially need to get inside the state of each process, abort
any transactions, and then suspend each thread. This isn't something
that you can canonicalize inside the Elephant library.
Chalk this up as another reason to someday implement a lisp-only
version of the library!
Indeed, that'd be a dream. For us, at least, this prevents us from
seriously using transactions at all in Elephant. We do use them to
speed up some bulk inserts when we know that the number of inserts in
one go won't be too big, but not for larger logical transactions
where they'd really be called for. Raising the limits at best
slightly lessens the pain, but is not really a solution.
Best regards,
Marc
On Sep 21, 2008, at 10:25 PM, Red Daly wrote:
I have recently run into "Cannot allocate memory" problems with
Elephant on a production server. Unfortunately, when one transaction
is too large, it seems to blow the database until a manual recover is
done.
The occasional failure is slightly worrisome, but the whole database
requiring a manual recover because of one extra-large transaction is
a scary thought for a live application with thousands of users.
Why does the memory allocation failure sour the whole database
instead of aborting a single transaction? I think that elephant
should try to encapsulate this failure, recovering the database or
whatever is necessary to make the store usable for the next
transaction.
Best,
Red Daly
On Sat, Jan 5, 2008 at 5:02 PM, Victor Kryukov
<[EMAIL PROTECTED]> wrote:
On Jan 4, 2008 2:54 AM, Ian Eslick <[EMAIL PROTECTED]> wrote:
Hi Victor,
Sounds like your transaction is blowing out the shared memory
allocated by Berkeley DB to store dirty pages. This is caused by
transactions that are too large; putting an entire file of data could
well accomplish this. (We really should change the error message to
be more informative in these cases.)
Try pushing with-transaction into the loop in import-movie as
follows:
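[The code sample did not survive in this copy of the thread; the
pattern being suggested is roughly the following, with read-rating and
store-rating standing in for whatever import-movie actually does:]

(defun import-movie (stream)
  ;; One transaction per record instead of one around the whole file,
  ;; so no single transaction has to stage the entire file's worth of
  ;; dirty pages in the BDB cache.
  (loop for rating = (read-rating stream)
        while rating
        do (with-transaction ()
             (store-rating rating))))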
Thanks for your suggestion, Ian - the problem was solved once I moved
with-transaction inside collect-rating-info.
_______________________________________________
elephant-devel site list
elephant-devel@common-lisp.net
http://common-lisp.net/mailman/listinfo/elephant-devel