Re: FW: Advice on very badly performing query

Mike Matrigali Wed, 05 Dec 2007 09:39:48 -0800

Most of the "stat" information that Derby uses is automatically kept up
to date as part of underlying index and table maintenance.  This info
includes the count of rows in the table and the distribution of data in
indexes.


The one piece of info that is not kept up to date is the average number
of duplicates for columns in an index.  This stat is given a default and
is then updated whenever you create an index, run the discussed compress
option, and as a side effect of some of the alter table commands.  I
don't remember what the default is, something like 10%.
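
For anyone wanting to try the compress option from JDBC, here is a minimal sketch, not a definitive recipe. SYSCS_UTIL.SYSCS_COMPRESS_TABLE and SYS.SYSSTATISTICS are Derby's own; the class name DerbyStatsSketch and its method names are hypothetical, and the Connection is assumed to be an already-open embedded Derby connection.

```java
import java.sql.CallableStatement;
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

/** Hypothetical helper; class and method names are illustrative, not part of Derby. */
class DerbyStatsSketch {

    // Derby's built-in compress procedure: (schema name, table name, sequential flag).
    static final String COMPRESS_CALL = "CALL SYSCS_UTIL.SYSCS_COMPRESS_TABLE(?, ?, ?)";

    // System table that holds the stored statistics rows.
    static final String COUNT_STATS = "SELECT COUNT(*) FROM SYS.SYSSTATISTICS";

    /** Compresses one table; the statistics update happens as a side effect. */
    static void compressTable(Connection conn, String schema, String table)
            throws SQLException {
        try (CallableStatement cs = conn.prepareCall(COMPRESS_CALL)) {
            cs.setString(1, schema);
            cs.setString(2, table);
            cs.setShort(3, (short) 1); // non-zero = sequential mode: slower, less memory
            cs.execute();
        }
    }

    /** Returns the number of rows currently in SYS.SYSSTATISTICS. */
    static int countStatisticsRows(Connection conn) throws SQLException {
        try (Statement st = conn.createStatement();
             ResultSet rs = st.executeQuery(COUNT_STATS)) {
            rs.next(); // COUNT(*) always yields exactly one row
            return rs.getInt(1);
        }
    }
}
```

Something like compressTable(conn, "APP", "MYTABLE") would be run once the table holds representative data; the schema and table names here are placeholders and normally need to be upper case unless they were created as quoted identifiers. Comparing countStatisticsRows(conn) before and after is one way to see whether statistics rows were written.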


/mikem

Kim Haase wrote:

Do you think it might also help to put the information into the Tuning Guide under "Performance tips and tricks"?
http://db.apache.org/derby/docs/dev/tuning/tuning-single.html#ctunperf22457

Thanks,

Kim Haase

Matt Doran wrote:
[EMAIL PROTECTED] wrote:
Matt Doran <[EMAIL PROTECTED]> writes:
I had no idea
that derby didn't keep any stats up-to-date without performing that
operation explicitly.  Ideally it would keep this up-to-date itself.
The sys.sysstatistics didn't have any rows in it until I ran the
compress table operation.
Agreed, but strangely I cannot recall many users actually requesting
this. Maybe people just suffer silently?
We have hundreds if not thousands of customers using our product and not many of them have seen this pathological performance problem. So maybe the optimizer does a good enough job in 90% of cases. We just happened to hit an extremely bad case.
So maybe it's just not something that people notice often. Or they just think "oh it's an embedded java database, it probably doesn't perform that well. Let's just upgrade to a real database". That's what we did, and it's what other people probably do.
The beauty of the embedded DB is that it is self-maintaining. I suspect that if it maintained the statistics by itself, and there were performance benefits as a result ... it would improve people's perception that it performs well.
Anyway, thank you for what I would call an exemplary
bug-report/question! Even though you use Hibernate you took the time
to identify the actual SQL causing the problem, identified a minimal
repro and provided query plans.
Thanks. I had trouble understanding the behaviour ... so I thought that nobody would believe me unless I provided enough evidence.
It really needs to be made more prominent in the documentation.
i.e. once your database is loaded with representative data, perform
the compress op for optimal performance.
Agreed. Any thoughts on where it would be good to mention it? If
you want, you can file a Jira issue about this.
I'm not sure. No-one is ever going to read a whole manual. But I had read the ApacheCon performance presentations, and I don't remember them ever mentioning this. I think those presentations would be one of the first places people start when they have performance problems. I know you can't change these retrospectively ... but maybe making this clear in the wiki would be a start.
I'm not sure if this is the appropriate page, but it was the only thing that looked relevant to performance (http://wiki.apache.org/db-derby/PerformanceDiagnosisTips). It doesn't mention the stats/compress. Maybe some of the tips in those presentations should be distilled into some performance tips wiki pages ... and also make it clear that stats need to be updated.
Regards,
Matt
