Wogster makes a good point: SYSCS_COMPRESS_TABLE may help make the data
in tables more contiguous.
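For reference, compression is invoked through Derby's SYSCS_UTIL system procedure; something like the following (the schema and table names here are just placeholders):

```sql
-- Reclaims unused space in the table and rebuilds its indexes.
-- 'APP' and 'MYTABLE' are placeholder names; the third argument (1)
-- requests the sequential variant, which uses less memory and disk
-- at the cost of taking longer.
CALL SYSCS_UTIL.SYSCS_COMPRESS_TABLE('APP', 'MYTABLE', 1);
```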

The Wogster wrote:
Øystein Grøvlen wrote:

"TW" == The Wogster <[EMAIL PROTECTED]> writes:



    TW> Øystein Grøvlen wrote:

    >> Is this also true for B-tree indexes?  I would imagine that if you
    >> have an index on a monotonically increasing key (e.g., a timestamp)
    >> and where you regularly delete old records, there may be a lot of
    >> empty B-tree pages that will never be possible to reuse.
    >>

    TW> What happens in most databases is that the database has a fixed
    TW> page size, say 8K; when an index page is full, it splits that page
    TW> into two half pages.  When an index page is empty, it's dropped
    TW> from the index and added to the empty page pool.  Many will merge
    TW> almost empty neighbouring pages, but that doesn't matter for this
    TW> discussion.

I know this.  The reason I asked was because I have got the impression
that in Derby the only way to drop empty index pages is to do
compression.


Derby should work in a similar way to other databases; the techniques for developing a database were established years ago. When it comes to indexes, there are three technologies:

ISAM: The index is loaded into an array in memory; index members are added and deleted by moving pointers around, and the index is dumped back to disk when the database is closed. dBase worked this way at one time.

B-Tree: The traditional B-tree isn't used much anymore, because it's easy to end up with an unbalanced tree, which is very inefficient.

Balanced B-Tree: Most databases use this one; logic in the indexing code keeps shifting the tree around so that it stays in balance. It requires that the indexes at least be page based, so most page-based databases use it. It is the most efficient for reads, and can be slower for adds and deletes because of the shifting around, but in most databases there are 100 reads for every add or delete.
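The page behaviour described above (full pages split in two, empty pages go back to an empty page pool for reuse) can be sketched in a few lines. This is purely illustrative, not Derby's actual code: a real engine also maintains interior nodes, rebalancing, and latching, and its pages hold kilobytes rather than four keys.

```python
PAGE_CAPACITY = 4  # real engines use a byte size, e.g. 8K, not a key count

class Leaf:
    def __init__(self):
        self.keys = []

class Index:
    def __init__(self):
        self.leaves = [Leaf()]      # an ordered list stands in for the tree
        self.empty_page_pool = []   # freed pages, reusable without growing the file

    def insert(self, key):
        leaf = self._find(key)
        leaf.keys.append(key)
        leaf.keys.sort()
        if len(leaf.keys) > PAGE_CAPACITY:          # page full: split into two halves
            new = self.empty_page_pool.pop() if self.empty_page_pool else Leaf()
            mid = len(leaf.keys) // 2
            new.keys = leaf.keys[mid:]
            leaf.keys = leaf.keys[:mid]
            self.leaves.insert(self.leaves.index(leaf) + 1, new)

    def delete(self, key):
        leaf = self._find(key)
        leaf.keys.remove(key)
        if not leaf.keys and len(self.leaves) > 1:  # empty page: drop it, pool it
            self.leaves.remove(leaf)
            self.empty_page_pool.append(leaf)

    def _find(self, key):
        # Linear scan for the leaf covering this key (a real tree descends nodes).
        for leaf in self.leaves:
            if leaf.keys and key <= leaf.keys[-1]:
                return leaf
        return self.leaves[-1]
```

Note how a monotonically increasing key (the timestamp case above) always appends to the last leaf, so earlier leaves stay half full after splits; deleting old records then empties those leaves and returns them to the pool.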

For dropping index pages, look at the source code and see what it does on a delete; if it releases the page to the empty page pool (more like a cache, actually), then it doesn't matter whether you compress or not.

One thing to remember: shortening (where possible) or lengthening a file is an expensive operation, which is why most databases add a batch of pages to the empty page pool rather than a single page at a time. They also reuse empty pages within the file, because reusing an existing page only means changing a pointer, whereas adding to or deleting from the file means kernel operations, which are much slower.
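A toy allocator makes the trade-off concrete: the file is extended a chunk at a time (the slow kernel operation), and freed pages are reused from the pool before the file is ever grown again. Names and the chunk size are hypothetical, not Derby's actual allocator.

```python
CHUNK = 8  # pages added per file extension (hypothetical value)

class PageAllocator:
    def __init__(self):
        self.file_pages = 0      # current size of the on-disk file, in pages
        self.free_pool = []      # page numbers available for reuse
        self.extensions = 0      # count of (expensive) file-grow operations

    def alloc(self):
        if not self.free_pool:   # pool empty: pay for one file extension
            self.free_pool = list(range(self.file_pages,
                                        self.file_pages + CHUNK))
            self.file_pages += CHUNK
            self.extensions += 1
        return self.free_pool.pop(0)  # cheap: just a pointer change

    def free(self, page_no):
        # The file is never shrunk here; the page just becomes reusable.
        self.free_pool.append(page_no)
```

With this scheme, a burst of deletes followed by inserts touches only the in-memory pool and never the kernel, which is the point the paragraph above is making.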

Compression can be good, though. As a database ages, an index can have its first page at page 25, the next at page 234, the next at page 535, the next at page 43, and these pages can be scattered all over the disk volume as well, so you're moving the heads all over the place to find them. Compressing the database, followed by a disk defrag, can make it faster.

W
