Hi,

> 1. With the batch insert code turned on there are a number of regression
> tests that fail. They must all pass without errors prior to production
> release.
> Responsible: Eric
> Deadline: Roughly the end of March
I am working on it.

> 2. I am very concerned that the new batch insert code will under certain
> circumstances fail with an out of memory condition because the load placed
> on the SQL engine will be too large. Marc points out that the SQL engine
> should handle this, but my experience is that most users (including myself)
> install MySQL or PostgreSQL and do not understand the details of tuning the
> engine, and in its default mode, it can have a catastrophic failure (out of
> memory). This currently never happens within Bacula code because of a very
> conservative design, but it does happen in dbcheck. This happens when
> *all* the data is maintained in the engine. The particular command that
> fails is:
>
> "SELECT JobMedia.JobMediaId,Job.JobId FROM JobMedia "
> "LEFT OUTER JOIN Job ON (JobMedia.JobId=Job.JobId) "
> "WHERE Job.JobId IS NULL LIMIT 300000";
>
> and it only fails if I remove the "LIMIT 300000".
>
> The bottom line is that I believe that there are three measures that we
> should take to ensure that this does not happen, and if it does, the user
> will have a way to easily work around the problem without needing years of
> training as a DBA.
>
> - Write some test code inside Bacula that will create 10 million batch
> insert records (this item would be nice, but it is not required) so that we
> can test it on "default" database installations.

I'm creating a bbatch tool which will do that (and we will have a
benchmark tool for the Bacula database).

> - First, have a default limit on the number of records that will be
> inserted in any one batch request. This should guarantee that an out of
> memory problem will not normally occur.

That is the database's job... I have never seen a database (MySQL,
PostgreSQL or Oracle) say something like "Sorry, I'm out of memory". A
database takes only the memory you give it, never more.
> - Second, add a new directive to the Catalog resource that allows the user
> to change the default batch size (by setting the value to 0 or very large,
> you can effectively allow the batch to be arbitrarily large).

I can do that.

> - Third (this is optional, but very desirable), implement a new directive
> that allows the user to enable/disable the batch insert mechanism. This
> would require a bit of change in the subroutine names in the cats directory
> so that we can enable the new code and the old code at the same time, but
> select which is used at runtime for each DB based on the directive value.
> If the first and second items are implemented, the batch insert would be
> enabled by default, otherwise it will be turned off.
> Responsibility for above 3 (4) points: Eric (and possibly Marc)
> Deadline: roughly the end of March

It's possible, but I must write bbatch first; I think we will have
surprises...

> 3. Documentation
> Before putting this into production, I would like to see a bit more
> documentation about the new algorithms -- this is documentation that would
> be placed in the code. Marc has offered to answer questions and write
> some documentation, I offer to integrate it into the code, and continue to
> ask questions until it is clear. This item should be no problem to
> resolve.
> Responsible: Kern + Marc (with help from Eric if he wants)
> Deadline: Kern understands the algorithm by mid April.

Marc is writing it.

Bye

_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users